九州大学-研究者情報 [若宮幸平 (助教) 芸術工学研究院音響設計部門]

若宮幸平（わかみやこうへい）

データ更新日：2024.04.25

助教　／　芸術工学研究院音響設計部門情報音響システム学

学会発表等

1.	加藤日花里,李庸學,鏑木時彦,若宮幸平, 高速度ディジタル撮像を用いたボーカルフライ声区における声帯振動の分析, 日本音響学会 2023年秋季研究発表会, 2023.09, HSDIを用いてボーカルフライ発声時の声帯振動を分析し、発声メカニズムの解明を試みた。低音域ボーカルフライでは長い閉鎖期とパルス状の開放期からなる声帯振動が観察され、振動部位は被験者によって異なった。高音域ボーカルフライでは非周期的な振動が観察された。いずれの場合もOQは地声声区より小さかったが、呼気発声時よりも吸気発声時に大きくなった。.
2.	橋本将史, 上田和夫, 竹市博臣, 若宮幸平, ジェラードB. レメイン, 断続伸長モザイク音声：時間分解精度と断続が了解度におよぼす効果の分離, 日本音響学会聴覚研究会, 2022.12, モザイク音声 (階段状の時間スペクトル包絡で雑音駆動した音声) を断続および伸長した音声の了解度について調べた。これまで, 区間長 80 ms の断続モザイク音声の了解度は 0%近くになるとされていた。しかし, このような区間長と了解度の関係がモザイク音声の時間分解精度を保った場合でも成り立つかどうかは不明であった。ここでは元のモザイク区間長を 20 ms で固定し, 断続の効果を独立させた。周波数帯域数は 20 に固定した。断続モザイク音声の了解度は断続区間長 160 ms までは緩やかに低下した。一方で, モザイク区間を伸長し空白を縮小すると, 最大で25%の了解度向上が見られた。本研究により, モザイク音声刺激における断続と伸長の効果がより明瞭になった。.
3.	加藤日花里, 李庸學, 若宮幸平, 鏑木時彦, 高速度ディジタル撮像に基づいたホイッスル声区の声帯振動パターンの分類, 日本音響学会 2022年秋季研究発表会, 2022.09, 歌唱声区の中で最も基本周波数の高いホイッスル声区における声帯振動メカニズムを明らかにするため、本研究では、10000 fps を超えるフレームレートの高速度ディジタル撮像（HSDI）によって、4 名の歌手の喉頭を観察し、声門面積波形とフォノバイブログラム(PVG)からスペクトルと声門開放率を算出して分析を行った。観測された声帯振動パターンは声門の閉鎖状態から(a）完全閉鎖、(b)不完全閉鎖、(c)部分閉鎖の３パターンに分類することができた。PVG は声帯長方向の声帯振動情報を表すことができ、声帯振動様態の分析に有効であった。先行研究ではそれぞれのパターンは単独で確認されていたが、本研究では、3 つの振動パターンは同時に存在することが明らかになり、ホイッスル声区は単なる超高音域の裏声ではなく、声帯音源の生成に多様性を持つ声区と考えられる。.
4.	日髙駿介，李庸學，中西萌，若宮幸平，中川尚志，鏑木時彦, 少量ラベル付きデータを用いた病的音声の声質自動評価, 日本音響学会 2022年秋季研究発表会, 2022.09, 音声医学の臨床で行われる病的音声の声質の聴覚印象評価(GRBAS尺度)には, 評者間・評者内変動が原因で再現性に欠ける問題がある. 機械学習に基づく声質評価は再現性の問題を解決するが, ラベル付きデータを大量に集めることは困難である. 本研究では, ラベル付きデータが少ない状況下での高精度なGRBAS自動判定の再現を試みた. 8名の専門家により付与されたGRBAS評価が付された300の病的音声サンプルからなるデータセットを構築し, ネットワークの性能評価を行ったところ, 最良の推定条件では, 自動推定の評価は, GRBAS全項目で専門家の評者管信頼性に匹敵し, 項目G, B, A, Sで専門家の評者内信頼性に匹敵する結果が得られた..
5.	Kazuo UEDA, Hiroshige TAKEICH, Kohei WAKAMIYA, Gerard B. REMIJN, Auditory grouping by stretching: Regaining intelligibility of interrupted mosaic speech stimuli, 日本音響学会 2022年秋季研究発表会, 2022.09, The intelligibility of interrupted speech stimuli has been known to be almost perfect when segment duration is shorter than 80ms, which means that the interrupted segments are perceptually organized into a coherent stream under this condition. However, why listeners can successfully group the interrupted segments into a coherent stream has been largely unknown. Here we show that the intelligibility for masaic speech, in which original speech was segmented in frequency and time, and noise-vocoded with the average power in each unit, was largely reduced by periodical interruption. The interruption was devastating for mosaic speech, very likely because the deprivation of priodicity and temporal fine structure with mosaicking prevented successful auditory grouping for the interrupted segments. At the same time, the intelligibility could be recovered by promoting auditory grouping of the interrupted segments with stretching the segments up to 40 ms and reducing the gaps, provided that the number of frequency bands was enough (≧ 4) and the original segment duration was equal to or less than 40 ms. These results sugget that a grouping cue may play an important role in the perception of normal speech under adverse conditions..
6.	Kazuo Ueda, Hiroshige Takeichi, Kohei Wakamiya, AUDITORY GROUPING FACILITATES UNDERSTANDING INTERRUPTED MOSAIC SPEECH STIMULI, 38th Annual Meeting of the International Society for Psychophysics, 2022.08, The intelligibility of interrupted speech stimuli has been known to be almost perfect when segment duration is shorter than 80 ms, which means that the interrupted segments are perceptually organized into a coherent stream under this condition. However, why listeners can successfully group the interrupted segments into a coherent stream has been largely unknown. Here we show that the intelligibility for mosaic speech, in which original speech was segmented in frequency and time, and noise-vocoded with the average power in each unit, was largely reduced by periodical interruption. At the same time, the intelligibility could be recovered by promoting auditory grouping of the interrupted segments with stretching the segments up to 40 ms and reducing the gaps, provided that the number of frequency bands was enough (≧ 4) and the original segment duration was equal to or less than 40 ms. The interruption was devastating for mosaic speech stimuli, very likely because a poor grouping cue, which resulted from the deprivation of periodicity and temporal fine structure with mosaicking, prevented successful auditory grouping for the interrupted segments. These results suggest that a grouping cue should play an important role in the perception of normal speech under adverse conditions..
7.	Shunsuke HIDAKA, Kohei WAKAMIYA, Tokihiko KABURAGI, AN INVESTIGATION OF THE EFFECTIVENESS OF PHASE FOR AUDIO CLASSIFICATION, 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2022), 2022.05, While log-amplitude mel-spectrogram has widely been used as the feature representation for processing speech based on deep learning, the effectiveness of another aspect of speech spectrum, i.e., phase information, was shown recently for tasks such as speech enhancement and source separation. In this study, we extensively investigated the effectiveness of including phase information of signals for eight audio classification tasks. We constructed a learnable front-end that can compute the phase and its derivatives based on a time-frequency representation with mel-like frequency axis. As a result, experimental results showed significant performance improvement for musical pitch detection, musical instrument detection, language identification, speaker identification, and birdsong detection. On the other hand, overfitting to the recording condition was observed for some tasks when the instantaneous frequency was used. The results implied that the relationship between the phase values of adjacent elements is more important than the phase itself in audio classification..
8.	日髙駿介, 若宮幸平, 鏑木時彦, 音分類課題において有効な位相情報の表現に関する検討, 日本音響学会 2021年秋季研究発表会, 2021.09, 音声強調などでは入力特徴量の位相情報が有用であることが確かめられている. 本研究では, 対数振幅メルスペクトログラムに類する時間周波数表現への位相情報の付与が性能向上に寄与するかどうかを調査した. 具体的には, 位相に関する情報として位相, 群遅延, 瞬時周波数を比較した. 調査のために, 音声や環境音など複数の分類課題を用いて, 性能を評価した. 音声の分類課題では位相情報, 特に群遅延が性能向上に寄与する傾向が見られた. その一方で, 環境音分類や鳥の存在判定では, 一部例外を除き, 位相情報の付与は性能低下に繋がった..
9.	＃中西萌, 日髙駿介, 李庸學, 若宮幸平, 中川尚志, 鏑木時彦, 聴覚心理的評価の再現性について, 日本音響学会 2021年秋季研究発表会, 2021.09, 声質に異常がある音声を嗄声と呼び，耳鼻咽喉科では GRBAS 尺度をはじめとする聴覚心理的評価法を用いて嗄声の程度を評価・記録する. これらの評価法は, 主観評価であるために再現性の問題がある. 評者の評定の信頼性の評価については, 再現性の評価に用いる統計量によって結果に違いが出てくる. 本研究では，単一の評者が複数回評価を行った時の一致の度合いを示す評者内信頼性を対象とし, 信頼性の評価については, 従来法よりも頑強とされている Gwet の AC2を用いて分析を行った. その結果, 評者の経験年数と職業，患者の性別による嗄声評価の再現性への影響を調べた結果，評者の経験年数による影響が最も大きかった．経験年数が長いほど再現性が高い傾向があり，短い評者は再現性の高さの個人差が大きかった．.
10.	Shunsuke Hidaka1, Yogaku Lee, Kohei Wakamiya, Takashi Nakagawa, Tokihiko Kaburagi, Automatic Estimation of Pathological Voice Quality based on Recurrent Neural Network using Amplitude and Phase Spectrogram, Interspeech 2020, 2020.10, Perceptual evaluation of voice quality is widely used in laryngological practice, but it lacks reproducibility caused by inter- and intra-rater variability. This problem can be solved by automatic estimation of voice quality using machine learning. In the previous studies, conventional acoustic features, such as jitter, have often been employed as inputs. However, many of them are vulnerable to severe hoarseness because they assume a quasiperiodicity of voice. This paper investigated non-parametric features derived from amplitude and phase spectrograms. We applied the instantaneous phase correction proposed by Yatabe et al. (2018) to extract features that could be interpreted as indicators of non-sinusoidality. Specifically, we compared log amplitude, temporal phase variation, temporal complex value variation, and mel-scale versions of them. A deep neural network with a bidirectional GRU was constructed for each item of GRBAS Scale, a hoarseness evaluation method. The dataset was composed of 2545 samples of sustained vowel /a/ with the GRBAS scores labeled by an otolaryngologist. The results showed that the Hz-mel conversion improved the performance in almost all the case. The best scores were obtained when using temporal phase variation along the mel scale for Grade, Rough, Breathy, and Strained, and when using log mel amplitude for Asthenic..
11.	日髙駿介, 李庸學, 若宮幸平, 中川尚志, 鏑木時彦 , 振幅および位相スペクトルを考慮した病的音声の声質評価, 日本音響学会2020年秋季研究発表会, 2020.09, スペクトル時系列を入力とし，GRBAS尺度の各項目について，その評点を分類問題として出力する DNN を構築した。実験により，振幅情報だけでなく位相情報も有効な入力特徴量になりうることを示した..
12.	Shunsuke Hidaka, Yogaku Lee, Kohei Wakamiya, Takashi Nakagawa, Tokihiko Kaburagi, Automatic Evaluation of Voice Severity using Deep Neural Network, The Voice Foundation's VIRTUAL VOICE SYMPOSIUM Care of the Professional Voice, 2020.05, Introduction: Perceptual evaluation of voice quality (e.g., the GRBAS scale or CAPE-V) is used widely in laryngological practice. However, this method suffers from the lack of reproducibility caused by inter- and intra-rater variability. To date, it has been a topic of discussion among clinicians how to improve the reliability of judgement. Objective: The purpose of this study was to solve the inevitable problem of perceptual evaluation by building an automatic evaluation system. Understandably, automatic evaluation is surely reproducible (i.e., reliable). Moreover, the system was required to output meaningful judgements (i.e., to be valid). Methods: We constructed a deep neural network (DNN) that estimated all the scores of the GRBAS scale. DNN was composed of Bidirectional GRUs and fully connected layers. As the acoustic feature, we compared spectrogram and mel-spectrogram of speech samples obtained using sustained vowel /a/. The dataset for supervised learning was composed of 3118 samples. All true labels were given by an otolaryngologist. Results: The performance of the system was measured in terms of accuracy and statistical agreement index Cohen’s linearly weighted Kappa. Five-fold cross validation showed the accuracy of 60% on average. The Kappa scores of GBAS were “moderate” and that of R was “fair.” For all the GRBAS, the performance was higher when using mel-spectrogram. Conclusions: Our study showed the feasibility of automatic evaluation. In order to indicate how valid the system performance is, future studies could investigate inter- and intra-rater variability for our dataset..
13.	日髙駿介, 李庸學, 若宮幸平, 中川尚志, 鏑木時彦, Deep Neural Networkを用いた病的音声の声質評価, 日本音響学会 2019年秋季研究発表会, 2019.09, 病的音声の声質の聴覚心理的評価法としてGRBAS尺度が国内外で広く使用されている. しかしながら, GRBAS尺度は, 評者間変動や評者内変動が原因となり, 再現性に欠けるという問題点がある. 本研究では, GRBAS尺度の評点をDeep Neural Network(DNN)を用いて出力することにより, この問題の解決を図る. 今回使用したDNNは, GRBAS各項目について, Bidirectional GRUと全結合層を縦続接続する形で構成した. 入力特徴量としては, 持続母音/a/の起声から終声までの対数振幅スペクトログラムおよび位相変化スペクトログラムを比較した, その結果, 項目GRBSについて，位相変化スペクトログラムが対数振幅スペクトログラムに対して優位性を示す実験結果が得られた. .
14.	河原一彦，高田正幸，尾本章，鏑木時彦，鮫島俊哉，山内勝也，若宮幸平 , 九州大学芸術工学部の施設公開事業における音響関連展示 -2019年の事例報告- , 日本音響学会 2019年秋季研究発表会, 2019.09, 九州大学芸術工学部は, 5月ごろに施設公開事業を行っており, 音響関連の展示を行っている. 本稿では, 2019年5月25日開催の施設公開事業「デザインのフシギ体験」から, 「音響樽で聞いてみよう」, 「楽器音の合成を体感」, 「音声の仕組みを探る」, 「ダミーヘッドとお話ししよう」, 「無響室へようこそ」の5つの音響関連の企画について概要を報告する..
15.	若宮幸平, 田口史朗, 渡辺莉子, 桂田浩一, 牧野武彦, 鏑木時彦, 大規模日本語調音・音声パラレルデータの収集, 音声研究会, 2019.06, 調音データを用いた自然な音声の合成やサイレントスピーチインタフェース構築等のための基礎的データとして, 日本語調音・音声パラレルデータの収集を行っているので報告する. 現在収集中のデータは, 3次元磁気センサシステム(3D-EMA)を用いて測定した調音運動データ, 並びに, それと同期した音声信号データ, EGG信号データ, 音素セグメントデータである. 発話者は1名, 発話内容はATR503文と新たに構築した1,298文の日本語音素バランス文セットの計1,811文, 発話時間は67分, モーラ数は29,611となっている..
16.	日高駿介, 李庸學, 若宮幸平, 中川尚志, 鏑木時彦, Deep Neural Networkを用いた嗄声度の自動推定, 日本音響学会聴覚研究会, 2018.12, 声質の聴覚心理評価は音声障害の臨床において広く用いられているが, 評者間変動や評者内変動が原因となり, 再現性に欠ける問題がある. この問題点は, 声質評価の自動推定により解決可能であると考えれれる. 本論では, 嗄声の評価基準として国内外で使用されているGRBAS尺度の全項目の評点を出力するDeep Neural Network(DNN)を構築した. DNNはBidirectional GRUと全結合層から構成される. 特徴量として持続母音/a:/の起声から終声までのスペクトログラムおよびメルスペクトログラムを比較した. 1名の耳鼻咽喉科医により, 全ての正解ラベルを付与したデータセットを作成した. 3118サンプルを含むデータセットに対して5分割交差検証を行った結果, GRBASの正解率は平均で60%となった..
17.	Kohei WAKAMIYA, Hidetsugu UCHIDA, Tokihiko Kaburagi, ALIGNMENT OF THE TRANSMITTER COILS IN THE THREE-DIMENSIONAL ELECTROMAGNETIC ARTICULOGRAPHY HAVING EIGHT TRANSMISSION CHANNELS , Youngnam-Kyushu Joint Conference on Acoustics 2017, 2017.02, The relationship between position-estimation performance and transmitter-coil alignment in three-dimensional electromagnetic articulography (3D-EMA) having eight transmission channels was investigated. 3D-EMA is a measurement system used to observe articulatory movements and consists of transmitter coils as magnetic ﬁeld generators and receiver coils as position markers. In this system, the state (position and orientation) of each receiver coil is estimated by minimizing the signal diﬀerence between the measured and predicted receiver signals. The magnetic ﬁeld distribution determined by the transmitter-coil alignment has a strong inﬂuence on the position-estimation performance. In a previous study, we proposed a method for transmitter-coil alignment using a criterion expressing the minimum signal diﬀerence between two points in the measurement region. We have here increased the number of transmission channels from six to eight and investigated the position-estimation performance for various combinations of transmitter-coil position using computer simulations. As a result, we found that two combinations of transmitter-coil positions produced good results for the position estimation. One combination allocated coils at the vertices of a rectangular parallelepiped; in another the transmitter coils were allocated at the vertices of a cube. For both, the spacing of the coils was reduced from the ordinary size. .
18.	若宮幸平, 内田秀継, 鏑木時彦, 8個の送信チャネルを持つ磁気センサシステムの検討, 日本音響学会 2016年秋季研究発表会, 2016.09, 調音観測用3次元磁気センサシステムの位置推定精度の改善のため, 送信チャネル数を6個から8個に増やした場合の送信チャネルの設置位置について, 位置推定シミュレーションにより検討を行った. その結果, サイズを縮小して送信コイルと測定領域を近接させる設定とすることで, 高精度の位置推定ができる可能性が示された..
19.	若宮幸平, 内田秀継, 鏑木時彦, 最適配置された送信コイルをもつ3 次元磁気センサシステムの構築, 日本音響学会 2016年春季研究発表会, 2016.03, 非決定性の問題を低減するように最適配置された送信コイルを持つ3次元磁気センサシステムを構築したので, 構築した3次元磁気センサシステムの構成, 調音データの観測法について述べた. 本システムを用いた観測実験の結果, 成人男性話者による連続母音/aiueo/の調音運動の観測結果は各母音の調音的特徴をよく反映したものであった. .
20.	Kohei WAKAMIYA, Hidetsugu UCHIDA, Tokihiko Kaburagi, The Effect of Additional Transmission Channels in Three-dimensional Electromagnetic Articurography, Kyushu-Youngnam Joint Conference on Acoustics 2015, 2015.01, The relations between the performance of the position estimation and the number of transmission channels in the three-dimensional electromagnatic articulography(3D-EMA), a measurement system used to observe articulatory movements, were investigated. The receiver coils of the 3D-EMA are used as position markers and are placed in alternating magnetic field produced by multiple transmitter coils. The state(position and orientation) of each receiver coil is estimated by minimizing the signal error between the measured and predicted reciever signals using the magnetic field model. In our previous study, we proposed an alignment method of transmitter coils using the criterion that the minimum value of the difference between predicted receiver signals for any two states in the measurement region. This method was developed to resolve the problem that the existence of the specific zone in the measurement region that the position estimation error was noticeable increased, irrespective of small signal error. In this study, we added additional transmitter coils to the 3D-EMA system, optimized the alignment of transmitter, and investigated the estimation accuracy by computer simulation. As a result, we know that if the number of transmission channels is increased, the estimation accuracy is improved, and the maximum estimation error might become less than 1mm when the number is 8..
21.	Hidetsugu UCHIDA, Kohei WAKAMIYA, Tokihiko Kaburagi, A Study on the Improvement of Measurement Accuracy of the Three-Dimensional Electromagnatic Articulography, Interspeech 2014, 2014.09, The alignment of the transmitter coils for the three-dimensional electromagnentic articulography(3D-EMA), an instrument used to measure articulatory movements, was studied. The receiver coils of the 3D-EMA are used as position markers and are placed in an alternating magnetic field produced by multiple transmitter coils. The estimation of state(the position and orientation) of each receiver coil is based on the minimization of signal error between the measured and predicted receiver signals using a model of the magnetic field. Previous studies report a noticeable increase in the position estimation error at a specific portion of the measurement region irrespective of small signal error values. The existence of non-uniqueness in the position estimation problem is hypothesized to be the cause of this problem. To resolve the problem, we optimized the alignment of tha transmitter coils by maximizing the difference between the receiver signals at any two states in the mesurement region and evaluated the alignment using a computer simulation and an experiment. As a result, a measurement accuracy of approximatley 0.4mm was obtained..
22.	若宮幸平, 内田秀継, 鏑木時彦, 3次元磁気センサシステムにおける送信チャネル数に関する検討, 日本音響学会 2014年秋季研究発表会, 2014.09, 3次元磁気センサシステム(3D-EMA)において, 複数の受信コイル状態(位置と傾き)の受信信号予測値がほぼ同一になることに起因する大きな位置推定誤差を解消するため, 領域内の任意の2点間の受信信号の差の最小値を評価基準として導入し, それを最大にする送信コイル配置が提案されているが, 受信コイルの向きがEMAの筐体に対して傾いており, 実用上の問題があった. 本研究では, 受信コイルの基準方向を筐体の軸に平行な方向に変更し, 上記評価基準により, コイルの取り付け角度を最適化した. また, 送信チャネルを増やした場合についても同様の最適化を行い, 得られた送信コイル配置に対して, 位置推定シミュレーション実験を行った. その結果, 送信チャネル数を増やすに従い, 位置推定精度は向上し, 送信チャネル数を8とした場合, 最大誤差が1mm以下となる可能性があることがわかった..
23.	Hidetsugu Uchida, Kohei Wakamiya, Tokihiko Kaburagi, A study on the improvement of measurement accuracy of the three-dimensional electromagnetic articulography, 15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014, 2014.01, The alignment of the transmitter coils for the three-dimensional electromagnetic articulography (3D-EMA), an instrument used to measure articulatory movements, was studied. The receiver coils of the 3D-EMA are used as position markers and are placed in an alternating magnetic field produced by multiple transmitter coils. The estimation of state (the position and orientation) of each receiver coil is based on the minimization of signal error between the measured and predicted receiver signals using a model of the magnetic field. Previous studies report a noticeable increase in the position estimation error at a specific portion of the measurement region irrespective of small signal error values. The existence of non-uniqueness in the position estimation problem is hypothesized to be the cause of this problem. To resolve the problem, we optimized the alignment of the transmitter coils by maximizing the difference between the receiver signals at any two states in the measurement region and evaluated the alignment using a computer simulation and an experiment. As a result, a measurement accuracy of approximately 0.4 mm was obtained..
24.	内田秀継, 若宮幸平, 鏑木時彦, 三次元磁気センサシステムにおける送信コイルの配置の最適化についての検討, 電子情報通信学会音声研究会, 2013.11, 三次元磁気センサシステムでは, 受信コイルから得られる受信信号が送信コイルとの位置関係により定まることを利用し位置推定を行うが, 受信コイルの位置が異なるにもかかわらず受信信号がほぼ等しくなる場合, 受信コイルの位置を受信信号から一意に決めることが出来ないという問題が発生する. 本研究では, 送信コイルの配置を最適化することでこの非決定性の問題を解決することを試みた. 測定領域内に受信コイルの位置と傾きに関する多数の標本点を設け, その任意の2点間の受信信号差が最大になるように送信コイル配置を最適化し, 位置推定シミュレーションを行ったところ, 非決定性の問題の緩和が確認された. さらに, その配置をシステムに実装し, 位置推定実験により推定制度を検証した結果, 平均位置誤差0.4mmの精度を得た..
25.	内田秀継, 若宮幸平, 鏑木時彦, 3次元磁気センサシステムにおける送信コイル配置の検討と精度評価, 音響学会秋季研究発表会, 2013.09, 3次元磁気センサシステムにおいて, 位置推定における非決定性の問題があることが明らかとなり, その対応策として, 信号分離度という指標を導入することで, 最適な送信コイルについて検討し, その位置推定精度を実測で検証した..
26.	内田秀継, 若宮幸平, 鏑木時彦, 3次元磁気センサシステムにおける送信コイル配置の検討, 音響学会秋季研究発表会, 2012.09, 3次元磁気センサシステムにおいて, 位置推定における非決定性の問題があることが明らかとなり, その対応策として, 信号分離度という指標を導入することで, 最適な送信コイルについて検討した..
27.	河原一彦, 若宮幸平, 佐賀県立宇宙科学館「平成15年春の企画展音と響のテクノロジー」の概要と展示計画支援, 音響学会秋季研究発表会, 2011.09, 2003年(平成15年)2月22日から4月20日に, 佐賀県立宇宙科学館で開催された「平成15年春の企画展音と響のテクノロジー」の概要と, 著者らが支援した展示計画について報告する..
28.	若宮幸平, 高田正幸, 河辺哲次, 鮫島俊哉, 中島祥好, 上田和夫, 鏑木時彦, [チュートリアル招待講演]高校生のための音響サイエンスキャンプ～先進的科学技術体験合宿プログラムによる音響入門講座～, 電子通信情報学会音声研究会, 2010.06.
29.	福冨隆朗, 仲田昌史, 鏑木時彦, 若宮幸平, 粒子法による声帯の連続体モデル, 電子通信情報学会音声研究会, 2007.10.
30.	福冨隆朗, 仲田昌史, 鏑木時彦, 若宮幸平, 粒子法を用いた生体連続体モデルの構築, 日本音響学会秋季研究発表会, 2007.09.
31.	鏑木時彦, 若宮幸平, 持田岳美, 3次元磁気センサシステムの精度評価, 日本音響学会秋季研究発表会, 2005.09.
32.	鏑木時彦, 若宮幸平, 持田岳美, 3次元磁気センサシステムの評価, 電子通信情報学会音声研究会, 2005.08.
33.	Kohei WAKAMIYA, Takuya TSUJI and Tokihiko KABURAGI, Estimation of the vocal tract spectrum from articulatory movements using phoneme-dependent neural networks, International conference on Spoken Langage Processing, 2004.10.
34.	若宮幸平, 辻拓哉, 鏑木時彦, 音素別ニューラルネットワークを用いた調音-音響マッピング ---パラメータ学習法の検討---, 電子情報通信学会音声研究会, 2004.10.
35.	金智之, 若宮幸平, 鏑木時彦, 調音運動モデルを用いた声道スペクトルの生成, 電子情報通信学会音声研究会, 2004.10.
36.	Kohei Wakamiya, Tokihiko Kaburagi, Takuya Tsuji, Jiji Kim, Estimation of the vocal tract spectrum from articulatory movements using phoneme-dependent neural networks, 8th International Conference on Spoken Language Processing, ICSLP 2004, 2004.01, This paper presents an estimation method of the vocal tract spectrum from articulatory movements. The method is based on the interpolation of spectra obtained by phoneme-dependent neural networks. Given the phonemic context and articulation timing corresponding to each phoneme, the proposed method first transforms articulator positions to phoneme-dependent spectra. Then the vocal tract spectrum is estimated by the interpolation of transformed spectra. This interpolation is based on the distance among the input articulator position and that of the preceding and succeeding phonemes. Also, a training procedure of the neural networks is presented while taking the spectral interpolation into account. Articulatory and acoustic data pairs collected by a simultaneous recording of articulator positions and speech were used as the training and test data. Finally, we showed an estimation result using the proposed method..
37.	Kohei WAKAMIYA, Tokihiko KABURAGI and Masaaki HONDA, An investigation of the measurement accuracy of the three-dimensional electromagnetic articulography, The 6th Internatinal Seminar on Speech Production, 2003.12.
38.	若宮幸平, 鏑木時彦, 誉田雅彰, 3次元磁気位置計測における解の収束性, 日本音響学会2003年秋季研究発表会, 2003.09.
39.	辻拓也, 金智之, 若宮幸平, 鏑木時彦, 音素別ニューラルネットワークを用いた調音-音響マッピング, 電子情報通信学会音声研究会, 2003.09.
40.	Tokihiko KABURAGI, Kohei WAKAMIYA and Masaaki HONDA, Three-dimensional electromagnetic articulograph based on a nonparametric representation of the magnetic field, International Conference on Spoken Language Processing, 2002.09.
41.	若宮幸平, 鏑木時彦, 澤田弘太郎, 誉田雅彰, 澤田雅司, 3次元磁気センサシステムの位置推定誤差要因, 日本音響学会2002年秋季研究発表会, 2002.09.
42.	若宮幸平, 鏑木時彦, 澤田弘太郎, 誉田雅彰, 澤田雅司, 3次元磁気センサシステムにおける推定誤差要因に関する検討, 電子情報通信学会音声研究会, 2002.08.
43.	若宮幸平, 鏑木時彦, 澤田弘太郎, 誉田雅彰, 3次元磁気センサシステムの位置推定精度の検討, 日本音響学会2002年春季研究発表会, 2002.03.
44.	鏑木時彦, 若宮幸平, 澤田弘太郎, 誉田雅彰, 磁界のスプライン表現を用いた3次元磁気センサシステムの検討, 電子情報通信学会音声研究会, 2002.03.
45.	Tokihiko Kaburagi, Kohei Wakamiya, Masaaki Honda, Three-dimensional electromagnetic articulograph based ona nonparametric representation of the magnetic field, 7th International Conference on Spoken Language Processing, ICSLP 2002, 2002.01, A measurement method of the three-dimensional electromagnetic articulograph system is presented to investigate the dynamic behavior of articulatory organs which can include lateral or rotational movements. To accurately represent the spatial pattern of the magnetic field, we use a multivariate B-spline function, which smoothly interpolates a given set of calibration data samples. The strength of the received signal is predicted based on the spline field function while considering the tilting effect of the receiver coil relative to the direction of the magnetic field. The position and orientation of the receiver coil are then estimated using an iterative procedure so that the difference between the measured and predicted signal strengths is minimized. Preliminary experiments showed that the mean estimation error of the receiver position is about 0.5 mm when the axis of the receiver coil is parallel with one of the axes of the coordinate system..
46.	若宮幸平, 鈴木俊行, 実測孤立再生波形を用いた高密度再生波形の推定限界, 電子情報通信学会磁気記録研究会, 2000.12.
47.	若宮幸平, 竹永和功, 鈴木俊行, 孤立再生波形の重畳に関する一検討, 平成12年度電気関係学会九州支部連合大会, 2000.09.
48.	若宮幸平, 鈴木俊行, 誤差制限学習法を用いたニューラルネットワーク等化の一検討, 平成11年度電気関係学会九州支部連合大会, 1999.10.
49.	田中輝光, 若宮幸平, 鈴木俊行, 薄膜ディスク記録特性に及ぼすヘッドエッジ磁界の影響, 第23回日本応用磁気学会, 1999.10.
50.	Terumitsu TANAKA, Kohei WAKAMIYA, Toshiyuki SUZUKI, Read/Write Track Fringe Width in Thin Film Disks, IEICE Magnetic Recording Conference, 1999.02.
51.	若宮幸平, 姜小明, 鈴木俊行, 仮想計測器による軟磁性用M-Hループトレーサの性能改善, 平成10年度電気関係学会九州支部連合大会, 1998.10.
52.	田中輝光, 若宮幸平, 鈴木俊行, 薄膜ディスクにおける記録・再生にじみの発生機構, 平成10年度電気関係学会九州支部連合大会, 1998.10.
53.	田中輝光, 若宮幸平, 鈴木俊行, 薄膜ディスクにおける記録・再生にじみに関する検討, 第22買い日本応用磁気学会学術講演会, 1998.09.

九大関連コンテンツ

pure2017年10月2日から、「九州大学研究者情報」を補完するデータベースとして、Elsevier社の「Pure」による研究業績の公開を開始しました。

QIR　九州大学学術情報リポジトリシステム情報科学研究院

留学生センター

ストーリー・マンガの作品を利用した、異文化間理解教育の教材とプロ ...


	※ 研究者情報全体を検索する場合は右端の「すべて」を選択して下さい。