Faculty Profiles - UEDA KAZUO

Information

写真a

UEDA KAZUO

Organization

Faculty of Design Department of Acoustic Design Associate Professor
Research and Development Center for Five-Sense Devices （Concurrent）
Research and Development Center for Five-Sense Devices （Concurrent）
Graduate School of Design Department of Design（Concurrent）

Profile

The current research themes are the following: (1) perception of degraded speech, including checkerboard speech, interrupted speech, locally time-reversed speech, noise-vocoded speech, and mosaic speech, (2) Irrelevant sound effect on short-term memory, (3) multivariate analyses of speech, (4) multivariate analyses of choral music, etc. Some are collaborative works with Technische Universitaet Darmstadt, Germany, and the University of British Columbia, Canada. He was a member of the Perceptual Psychology Unit of the governmental "Center of Excellence" (COE) program entitled "Design of Artificial Environments Based on Human Sensibility," Kyushu University, since 2003. He joined "The Center for Applied Perceptual Research" in 2010. The Center developed into "Research Center for Applied Perceptual Science" in 2013. He supervises some undergraduate and graduate students. He teaches Psychology of Hearing, Auditory Perception and Cognition, Perceptual Psychology, Science of Auditory and Visual Perception, etc. At Kyoto Prefectural University, he taught Psychology, Perceptual Psychology, Experimental Design, etc. He has experienced management in publishing and research meetings at the Acoustical Society of Japan and the Japanese Society for Music Perception and Cognition. He experienced one of the topic editors in Frontiers in Psychology from 2019 to 2020. From 2018 to 2019, he served as one of the Vice Presidents of the International Society for Psychophysics. From 2019 to 2022, he was the President of the International Society for Psychophysics. He has served as an Associate Editor for Auditory Perception & Cognition since 2021.

Homepage

External link

Research Areas

Humanities & Social Sciences / Experimental psychology

Degree

Ph.D.

Research History

昭和62.4.1〜平成2.3.31 株式会社 ATR 視聴覚機構研究所聴覚研究室　研修研究員平成2.4.1〜平成4.3.31 株式会社 ATR 視聴覚機構研究所聴覚研究室　奨励研究員

昭和62.4.1〜平成2.3.31 株式会社 ATR 視聴覚機構研究所聴覚研究室　研修研究員平成2.4.1〜平成4.3.31 株式会社 ATR 視聴覚機構研究所聴覚研究室　奨励研究員
平成4.4.1〜平成5.3.31 ボルドー第 2 大学音響心理学研究所　客員研究員平成5.4.1〜平成12.2.15 京都府立大学文学部講師平成9.4.1〜平成12.2.15 京都府立大学福祉社会学部講師を兼務

Education

Kyoto University 文学研究科博士後期課程

1987.4 - 1990.3

　 More details

Country：Japan
Kyoto University 文学研究科修士課程（心理学専攻）

1985.4 - 1987.3

　 More details

Country：Japan
Kyoto University Faculty of Letters 哲学科

1980.4 - 1984.3

　 More details

Country：Japan

Notes：心理学専攻

Research Interests・Research Keywords

Research theme：感覚・知覚

Keyword：感覚・知覚

Research period： 2024
Research theme：実験系心理学

Keyword：実験系心理学

Research period： 2024
Research theme： Perceptual restoration of interrupted speech

Keyword： perceptual restoration, interrupted speech

Research period： 2015.9
Research theme： Multilingual comparison about perception of locally time-reversed speech

Keyword： intelligibility, time reversal, time window

Research period： 2012.12 - 2017.5
Research theme： Irrelevant sound effects

Keyword： irrelevant sound effects, serial recall, speech intelligibility

Research period： 2009.10
Research theme： Perception of degraded speech

Keyword： checkerboard speech, interrupted speech, locally time-reversed speech, noise-vocoded speech, mosaic speech

Research period： 2007.4
Research theme： Factor analyses of critical-band-filtered speech and perception of noise-vocoded speech

Keyword： Power fluctuation, frequency bands, noise-vocoded speech

Research period： 2007.4
Research theme：雑音下の音声知覚と学習効果

Keyword：聴覚心理学，耐雑音性，SN 比，同定正答率，第二言語学習，非母語話者

Research period： 1999.4
Research theme： Short-term memory of speech and non-speech

Keyword： auditory short-term memory

Research period： 1992.4

Awards

JPA Distinguished Poster Presentation Award: International Division

2024.12 The Japanese Psychological Association Intelligibility of interrupted and checkerboard speech with two talkers

Kazuo Ueda, Jun Hasegawa, Hiroshige Takeichi, Gerard B. Remijn, Emi Hasuo

　More details

Award type：Award from Japanese society, conference, symposium, etc. Country：Japan

Winner of JPA Distinguished Poster Presentation Award -International Division
Presentation Number: 1A-078-PI
Title: Intelligibility of interrupted and checkerboard speech with two talkers Presenter: Kazuo Ueda

Selection Process
There were 34 studies presented in English. Based on the voting by the review committee, the study that received the highest number of votes (9) was selected for the "JPA Most Distinguished Poster Presentation Award – International Division." The study that received 7 votes was selected for the "JPA Distinguished Poster Presentation Award – International Division.
Twenty-five year awards

2013.6 the Acoustical Society of America Twenty-five year awards, the Acoustical Society of America, 5 June 2013.
粟屋　潔　学術奨励賞

1988.3 日本音響学会上田和夫，(1987.03). ”音色の表現語に階層構造は存在するか,” 昭和 62 年度春季研究発表会での講演発表.

Papers

Filling the blanks of checkerboard speech with noise: Evidence for phonemic restoration and masking Reviewed International journal

Munechika, K., Ueda, K., Takeichi, H., Hasuo, E., and Remijn, G. B.

The Journal of the Acoustical Society of America (in press) 2025.8

　More details

Authorship：Corresponding author Language：English Publishing type：Research paper (scientific journal)
Rivalry between pitch and timbre in auditory stream segregation Reviewed International journal

Jhang, G-Y, Ueda, K., Takeichi, H., Remijn, G., and Hasuo, E.

PLOS ONE 20 ( 6 ) e0323964 - e0323964 2025.6

　More details

Authorship：Corresponding author Language：English Publishing type：Research paper (scientific journal)

DOI： 10.1371/journal.pone.0323964
A speech compression method without utilizing signal prediction Reviewed International journal

Matsuo, I., Ueda, K., and Nakajima, Y.

i-Perception 16 ( 3 ) 1 - 5 2025.5 （ ISSN:2041-6695 ）

　More details

Language：English Publishing type：Research paper (scientific journal) Publisher：Sage

DOI： 10.1177/20416695251340236

Web of Science

researchmap

Repository Public URL： https://hdl.handle.net/2324/7361945
Band tones: Auditory stream segregation with alternating frequency bands Reviewed International journal

Jhang, G-Y, Ueda, K., Takeichi, H., Remijn, G. B., Hasuo, E.

Acoustics Australia 2025.3 （ ISSN:0814-6039 eISSN:1839-2571 ）

　More details

Language：English Publishing type：Research paper (scientific journal) Publisher：Springer Nature

An alternating tone sequence may be perceptually integrated into one stream or segregated into two streams based on pitch and timbre differences between the tones (sequential stream segregation). However, the effect of the spectral dispersion of harmonic complex tones on sequential stream segregation has been largely unexplored. We introduced band tones that were harmonic complex tones divided into several frequency bands, in which frequency components in every other frequency band were removed. Here, we show that segregation was reported more often with fewer frequency bands and larger separation in fundamental frequency. Listeners generally responded to 2–8-band stimuli as segregated most of the time. However, the percentages of segregation responses for 16-band stimuli were generally dominated by fundamental frequency separations and whether the movements of fundamental frequencies and band-like spectral patterns were congruent or incongruent. The results suggest that the auditory system cannot organize rapidly alternating frequency component blocks spanning a wide frequency range into one stream.

DOI： 10.1007/s40857-025-00348-0

DOI： 10.1007/s40857-025-00348-0

Web of Science

Scopus

researchmap

Repository Public URL： https://hdl.handle.net/2324/7361944
Interrupted mosaic speech revisited: Gain and loss in intelligibility by stretching Reviewed International journal

Kazuo Ueda, #Masashi Hashimoto, @Hiroshige Takeichi, and Kohei Wakamiya

The Journal of the Acoustical Society of America 155 ( 3 ) 1767 - 1779 2024.3 （ ISSN:0001-4966 eISSN:1520-8524 ）

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal) Publisher：Journal of the Acoustical Society of America

Our previous investigation on the effect of stretching spectrotemporally degraded and temporally interrupted speech stimuli showed remarkable intelligibility gains [Ueda, Takeichi, and Wakamiya (2022). J. Acoust. Soc. Am. 152(2), 970–980]. In this previous study, however, gap durations and temporal resolution were confounded. In the current investigation, we therefore observed the intelligibility of so-called mosaic speech while dissociating the effects of interruption and temporal resolution. The intelligibility of mosaic speech (20 frequency bands and 20 ms segment duration) declined from 95% to 78% and 33% by interrupting it with 20 and 80 ms gaps. Intelligibility improved, however, to 92% and 54% (14% and 21% gains for 20 and 80 ms gaps, respectively) by stretching mosaic segments to fill silent gaps (n = 21). By contrast, the intelligibility was impoverished to a minimum of 9% (7% loss) when stretching stimuli interrupted with 160 ms gaps. Explanations based on auditory grouping, modulation unmasking, or phonemic restoration may account for the intelligibility improvement by stretching, but not for the loss. The probability summation model accounted for “U”-shaped intelligibility curves and the gain and loss of intelligibility, suggesting that perceptual unit length and speech rate may affect the intelligibility of spectrotemporally degraded speech stimuli.

DOI： 10.1121/10.0025132

Web of Science

Scopus

PubMed

CiNii Research

researchmap

Other Link： https://doi.org/10.1121/10.0025132
Checkerboard and interrupted speech: Intelligibility contrasts related to factor-analysis-based frequency bands Reviewed International journal

Kazuo Ueda, #Linh Le Dieu Doan, and Hiroshige Takeichi

The Journal of the Acoustical Society of America 154 ( 4 ) 2010 - 2020 2023.10 （ ISSN:0001-4966 eISSN:1520-8524 ）

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal) Publisher：Acoustical Society of America (ASA)

It has been shown that the intelligibility of checkerboard speech stimuli, in which speech signals were periodically interrupted in time and frequency, drastically varied according to the combination of the number of frequency bands (2–20) and segment duration (20–320 ms). However, the effects of the number of frequency bands between 4 and 20 and the frequency division parameters on intelligibility have been largely unknown. Here, we show that speech intelligibility was lowest in four-band checkerboard speech stimuli, except for the 320-ms segment duration. Then, temporally interrupted speech stimuli and eight-band checkerboard speech stimuli came in this order (N = 19 and 20). At the same time, U-shaped intelligibility curves were observed for four-band and possibly eight-band checkerboard speech stimuli. Furthermore, different parameters of frequency division resulted in small but significant intelligibility differences at the 160- and 320-ms segment duration in four-band checkerboard speech stimuli. These results suggest that factor-analysis-based four frequency bands, representing groups of critical bands correlating with each other in speech power fluctuations, work as speech cue channels essential for speech perception. Moreover, a probability summation model for perceptual units, consisting of a sub-unit process and a supra-unit process that receives outputs of the speech cue channels, may account for the U-shaped intelligibility curves.

DOI： 10.1121/10.0021165

Web of Science

Scopus

PubMed

CiNii Research

researchmap

Other Link： https://doi.org/10.1121/10.0021165
Auditory grouping is necessary to understand interrupted mosaic speech stimuli Reviewed International journal

Ueda, K; Takeichi, H; Wakamiya, K

The Journal of the Acoustical Society of America 152 ( 2 ) 970 - 980 2022.8 （ ISSN:0001-4966 eISSN:1520-8524 ）

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal) Publisher：Acoustical Society of America ({ASA})

The intelligibility of interrupted speech stimuli has been known to be almost perfect when segment duration is shorter than 80 ms, which means that the interrupted segments are perceptually organized into a coherent stream under this condition. However, why listeners can successfully group the interrupted segments into a coherent stream has been largely unknown. Here, we show that the intelligibility for mosaic speech in which original speech was segmented in frequency and time and noise-vocoded with the average power in each unit was largely reduced by periodical interruption. At the same time, the intelligibility could be recovered by promoting auditory grouping of the interrupted segments by stretching the segments up to 40 ms and reducing the gaps, provided that the number of frequency bands was enough ([Formula: see text]) and the original segment duration was equal to or less than 40 ms. The interruption was devastating for mosaic speech stimuli, very likely because the deprivation of periodicity and temporal fine structure with mosaicking prevented successful auditory grouping for the interrupted segments.

DOI： 10.1121/10.0013425

Web of Science

Scopus

PubMed

researchmap

Other Link： https://doi.org/10.1121/10.0013425
The common limitations in auditory temporal processing for Mandarin Chinese and Japanese Reviewed International journal

Eguchi, H; Ueda, K; Remijn, GB; Nakajima, Y; Takeichi, H

Scientific Reports 12 ( 1 ) 3002 - 3002 2022.2 （ ISSN:2045-2322 eISSN:2045-2322 ）

　More details

Authorship：Corresponding author Language：English Publishing type：Research paper (scientific journal) Publisher：SpringerNature

The present investigation focused on how temporal degradation affected intelligibility in two types of languages, i.e., a tonal language (Mandarin Chinese) and a non-tonal language (Japanese). The temporal resolution of common daily-life sentences spoken by native speakers was systematically degraded with mosaicking (mosaicising), in which the power of original speech in each of regularly spaced time-frequency unit was averaged and temporal fine structure was removed. The results showed very similar patterns of variations in intelligibility for these two languages over a wide range of temporal resolution, implying that temporal degradation crucially affected speech cues other than tonal cues in degraded speech without temporal fine structure. Specifically, the intelligibility of both languages maintained a ceiling up to about the 40-ms segment duration, then the performance gradually declined with increasing segment duration, and reached a floor at about the 150-ms segment duration or longer. The same limitations for the ceiling performance up to 40 ms appeared for the other method of degradation, i.e., local time-reversal, implying that a common temporal processing mechanism was related to the limitations. The general tendency fitted to a dual time-window model of speech processing, in which a short (~ 20–30 ms) and a long (~ 200 ms) time-window run in parallel.

File： Eguchi_SciRep_2022.pdf

DOI： 10.1038/s41598-022-06925-x

Web of Science

Scopus

PubMed

researchmap

Other Link： https://www.nature.com/articles/s41598-022-06925-x

Repository Public URL： https://hdl.handle.net/2324/7161745
Checkerboard speech vs interrupted speech: Effects of spectrotemporal segmentation on intelligibility Reviewed International journal

Ueda, K., #Kawakami, R., and @Takeichi, H.

JASA Express Letters 1 ( 7 ) 1 - 7 2021.7

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)

The intelligibility of interrupted speech (interrupted over time) and checkerboard speech (interrupted over time-by-frequency), both of which retained a half of the original speech, was examined. The intelligibility of interrupted speech stimuli decreased as segment duration increased. 20-band checkerboard speech stimuli brought nearly 100% intelligibility irrespective of segment duration, whereas, with 2 and 4 frequency bands, a trough of 35%-40% appeared at the 160-ms segment duration. Mosaic speech stimuli (power was averaged over a time-frequency unit) yielded generally poor intelligibility (<= 10%). The results revealed the limitations of underlying auditory organization for speech cues scattered in a time-frequency domain.

DOI： 10.1121/10.0005600

Other Link： https://asa.scitation.org/doi/10.1121/10.0005600

Repository Public URL： http://hdl.handle.net/2324/4485661
Intelligibility of chimeric locally time-reversed speech: Relative contribution of four frequency bands Reviewed International journal

Kazuo Ueda and @Ikuo Matsuo

JASA Express Letters 1 ( 6 ) 1 - 6 2021.6

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)

Intelligibility of 4-band speech stimuli was investigated (n = 18), such that only one of the frequency bands was preserved, whereas other bands were locally time-reversed (segment duration: 75-300 ms), or vice versa. Intelligibility was best retained (82% at 75 ms) when the second lowest band (540-1700 Hz) was preserved. When the same band was degraded, the largest drop (10% at 300 ms) occurred. The lowest and second highest bands contributed similarly less strongly to intelligibility. The highest frequency band contributed least. A close connection between the second lowest frequency band and sonority was suggested

DOI： 10.1121/10.0005439

Other Link： https://asa.scitation.org/doi/10.1121/10.0005439

Repository Public URL： http://hdl.handle.net/2324/4485662
Phonemic restoration of interrupted locally time-reversed speech: Effects of segment duration and noise levels Reviewed International coauthorship International journal

Kazuo UEDA and @Valter CIOCCA

Attention, Perception, & Psychophysics 83 ( 5 ) 1928 - 1934 2021.6

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)

Intelligibility of temporally degraded speech was investigated with locally time-reversed speech (LTR) and its interrupted version (ILTR). Control stimuli comprising interrupted speech (I) were also included. Speech stimuli consisted of 200 Japanese meaningful sentences. In interrupted stimuli, speech segments were alternated with either silent gaps or pink noise bursts. The noise bursts had a level of -10, 0 or +10 dB relative to the speech level. Segment duration varied from 20 to 160 ms for ILTR sentences, but was fixed at 160 ms for I sentences. At segment durations between 40 and 80 ms, severe reductions in intelligibility were observed for ILTR sentences, compared with LTR sentences. A substantial improvement in intelligibility (30-33%) was observed when 40-ms silent gaps in ILTR were replaced with 0- and +10-dB noise. Noise with a level of -10 dB had no effect on the intelligibility. These findings show that the combined effects of interruptions and temporal reversal of speech segments on intelligibility are greater than the sum of each individual effect. The results also support the idea that illusory continuity induced by high-level noise bursts improves the intelligibility of ILTR and I sentences

DOI： 10.3758/s13414-021-02292-3

Other Link： http://link.springer.com/article/10.3758/s13414-021-02292-3

Repository Public URL： http://hdl.handle.net/2324/4377850
Intelligibility of chimeric locally time-reversed speech Reviewed International journal

@Ikuo Matsuo, Kazuo Ueda, and Yoshitaka Nakajima

The Journal of the Acoustical Society of America Express Letters 147 ( 6 ) EL523 - EL528 2020.6

　More details

Language：English Publishing type：Research paper (scientific journal)

The intelligibility of chimeric locally time-reversed speech was investigated. Both (1) the boundary frequency between the temporally degraded band and the non-degraded band and (2) the segment duration were varied. Japanese mora accuracy decreased if the width of the degraded band or the segment duration increased. Nevertheless, the chimeric stimuli were more intelligible than the locally time-reversed controls. The results imply that the auditory system can use both temporally degraded speech information and undamaged speech information over different frequency regions in the processing of the speech signal, if the amplitude envelope in the frequency range of 840–1600 Hz was preserved.
(C) 2020 Acoustical Society of America

DOI： 10.1121/10.0001414

Other Link： https://doi.org/10.1121/10.0001414

Repository Public URL： http://hdl.handle.net/2324/4066581
Irrelevant speech effects with locally time-reversed speech: Native vs non-native language Reviewed International coauthorship International journal

Kazuo Ueda, Yoshitaka Nakajima, @Florian Kattner, and @Wolfgang Ellermeier

The Journal of the Acoustical Society of America 145 ( 6 ) 3686 - 3694 2019.6

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)

Irrelevant speech is known to interfere with short-term memory of visually presented items. Here, this irrelevant speech effect was studied with a factorial combination of 3 variables: the participants' native language, the language the irrelevant speech was derived from, and the playback direction of the irrelevant speech. We used locally time-reversed speech as well to disentangle the contributions of local and global integrity. German and Japanese speech was presented to German (n = 79) and Japanese (n = 81) participants while they were performing a serial-recall task. In both groups, any kind of irrelevant speech impaired recall accuracy as compared to a pink-noise control condition. When the participants' native language was presented, normal speech and locally time-reversed speech with short segment duration, preserving intelligibility, was the most disruptive. Locally time-reversed speech with longer segment durations and normal or locally time-reversed speech played entirely backward, both lacking intelligibility, was less disruptive. When unfamiliar, incomprehensible signal was presented as irrelevant speech, no significant difference was found between locally time-reversed speech and its globally inverted version, suggesting that the effect of global inversion depends on the familiarity of the language.

DOI： 10.1121/1.5112774

Other Link： https://doi.org/10.1121/1.5112774

Repository Public URL： http://hdl.handle.net/2324/2320609
Frequency specificity of amplitude envelope patterns in noise-vocoded speech Reviewed International journal

Kazuo Ueda, #Tomoya Araki, Yoshitaka Nakajima

Hearing Research 367 169 - 181 2018.8

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)

We examined the frequency specificity of amplitude envelope patterns in 4 frequency bands, which universally appeared through factor analyses applied to power fluctuations of critical-band filtered speech sounds in 8 different languages/dialects [Ueda and Nakajima (2017). Sci. Rep., 7 (42468)]. A series of 3 perceptual experiments with noise-vocoded speech of Japanese sentences was conducted. Nearly perfect (92–94%) mora recognition was achieved, without any extensive training, in a control condition in which 4-band noise-vocoded speech was employed (Experiments 1–3). Blending amplitude envelope patterns of the frequency bands, which resulted in reducing the number of amplitude envelope patterns while keeping the average spectral levels unchanged, revealed a clear deteriorating effect on intelligibility (Experiment 1). Exchanging amplitude envelope patterns brought generally detrimental effects on intelligibility, especially when involving the 2 lowest bands (≲1850 Hz; Experiment 2). Exchanging spectral levels averaged in time had a small but significant deteriorating effect on intelligibility in a few conditions (Experiment 3). Frequency specificity in low-frequency-band envelope patterns thus turned out to be conspicuous in speech perception.

DOI： 10.1016/j.heares.2018.06.005

Other Link： https://doi.org/10.1016/j.heares.2018.06.005
Temporal Resolution Needed for Auditory Communication: Measurement with Mosaic Speech Reviewed International journal

Yoshitaka Nakajima, #Mizuki Matsuda, Kazuo Ueda, and Gerard B. Remijn

Frontiers in Human Neuroscience 12 ( 149 ) 2018.4

　More details

Language：English Publishing type：Research paper (scientific journal)

Temporal resolution needed for Japanese speech communication was measured. A new experimental paradigm that can reflect the spectro-temporal resolution necessary for healthy listeners to perceive speech is introduced. As a first step, we report listeners' intelligibility scores of Japanese speech with a systematically degraded temporal resolution, so-called "mosaic speech": speech mosaicized in the coordinates of time and frequency. The results of two experiments show that mosaic speech cut into short static segments was almost perfectly intelligible with a temporal resolution of 40 ms or finer. Intelligibility dropped for a temporal resolution of 80 ms, but was still around 50%-correct level. The data are in line with previous results showing that speech signals separated into short temporal segments of <100 ms can be remarkably robust in terms of linguistic-content perception against drastic manipulations in each segment, such as partial signal omission or temporal reversal. The human perceptual system thus can extract meaning from unexpectedly rough temporal information in speech. The process resembles that of the visual system stringing together static movie frames of ~40 ms into vivid motion.

DOI： 10.3389/fnhum.2018.00149

Other Link： https://www.frontiersin.org/article/10.3389/fnhum.2018.00149
Intelligibility of locally time-reversed speech: A multilingual comparison Reviewed International coauthorship International journal

Kazuo UEDA, Yoshitaka NAKAJIMA, @Wolfgang ELLERMEIER, @Florian KATTNER

Scientific Reports 7 2017.5

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)

A set of experiments was performed to make a cross-language comparison of intelligibility of locally time-reversed speech, employing a total of 117 native listeners of English, German, Japanese, and Mandarin Chinese. The experiments enabled to examine whether the languages of three types of timing---stress-, syllable-, and mora-timed languages---exhibit different trends in intelligibility, depending on the duration of the segments that were temporally reversed. The results showed a strikingly similar trend across languages, especially when the time axis of segment duration was normalised with respect to the deviation of a talker's speech rate from the average in each language.
This similarity is somewhat surprising given the systematic differences in vocalic proportions characterising the languages studied which had been shown in previous research and were largely replicated with the present speech material. These findings suggest that a universal temporal window shorter than 20--40~ms plays a crucial role in perceiving locally time-reversed speech by working as a buffer in which temporal reorganisation can take place with regard to lexical and semantic processing.

DOI： 10.1038/s41598-017-01831-z

Other Link： https://www.nature.com/articles/s41598-017-01831-z
English phonology and an acoustic language universal Reviewed International journal

Yoshitaka NAKAJIMA, Kazuo UEDA, #Shota FUJIMARU, #Hirotoshi MOTOMURA, #Yuki OHSAKA

Scientific Reports 7 ( 46049 ) 1 - 6 2017.4

　More details

Language：English Publishing type：Research paper (scientific journal)

Acoustic analyses of eight different languages/dialects had revealed a language universal: Three spectral factors consistently appeared in analyses of power fluctuations of spoken sentences divided by critical-band filters into narrow frequency bands. Examining linguistic implications of these factors seems important to understand how speech sounds carry linguistic information. Here we show the three general categories of the English phonemes, i.e., vowels, sonorant consonants, and obstruents, to be discriminable in the Cartesian space constructed by these factors: A factor related to frequency components above 3,300 Hz was associated only with obstruents (e.g., /k/ or /z/), and another factor related to frequency components around 1,100 Hz only with vowels (e.g., /a/ or /i/) and sonorant consonants (e.g., /w/, /r/, or /m/). The latter factor highly correlated with the hypothetical concept of sonority or aperture in phonology. These factors turned out to connect the linguistic and acoustic aspects of speech sounds systematically.

DOI： 10.1038/srep46049

Other Link： http://www.nature.com/articles/srep46049
An acoustic key to eight languages/dialects: Factor analyses of critical-band-filtered speech Reviewed International journal

Kazuo UEDA, Yoshitaka NAKAJIMA

Scientific Reports 7 ( 42468 ) 1 - 4 2017.2

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)

The peripheral auditory system functions like a frequency analyser, often modelled as a bank of non-overlapping band-pass filters called critical bands; 20 bands are necessary for simulating frequency resolution of the ear within an ordinary frequency range of speech (up to 7,000 Hz). A far smaller number of filters seemed sufficient, however, to re-synthesise intelligible speech sentences with power fluctuations of the speech signals passing through them; nevertheless, the number and frequency ranges of the frequency bands for efficient speech communication are yet unknown. We derived four common frequency bands---covering approximately 50--540, 540--1,700, 1,700--3,300, and above 3,300 Hz---from factor analyses of spectral fluctuations in eight different spoken languages/dialects. The analyses robustly led to three factors common to all languages investigated---the low & mid-high factor related to the two separate frequency ranges of 50--540 and 1,700--3,300 Hz, the mid-low factor the range of 540--1,700 Hz, and the high factor the range above 3,300 Hz---in these different languages/dialects, suggesting a language universal.

DOI： 10.1038/srep42468

Other Link： http://www.nature.com/articles/srep42468
Three Factors Are Critical in Order to Synthesize Intelligible Noise-Vocoded Japanese Speech Reviewed International journal

#Takuya KISHIDA, Yoshitaka NAKAJIMA, Kazuo UEDA, Gerard Remijn

Front. Psychol., 26 April 2016 7 ( 517 ) 1 - 9 2016.4

　More details

Language：English Publishing type：Research paper (scientific journal)

DOI： 10.3389/fpsyg.2016.00517

Other Link： https://doi.org/10.3389/fpsyg.2016.00517
Memory disruption by irrelevant noise-vocoded speech: Effects of native language and the number of frequency bands Reviewed International coauthorship International journal

@Wolfgang Ellermeier, @Florian Kattner, Kazuo UEDA, #Kana Doumoto, Yoshitaka NAKAJIMA

the Journal of the Acoustical Society of America 138 ( 3 ) 1561 - 1569 2015.9

　More details

Language：English Publishing type：Research paper (scientific journal)

To investigate the mechanisms by which unattended speech impairs short-term memory performance, speech samples were systematically degraded by means of a noise vocoder. For experiment 1, recordings of German and Japanese sentences were passed through a filter bank dividing the spectrum between 50 and 7000 Hz into 20 critical-band channels or combinations of those, yielding 20, 4, 2, or just 1 channel(s) of noise-vocoded speech. Listening tests conducted with native speakers of both languages showed a monotonic decrease in speech intelligibility as the number of frequency channels was reduced. For experiment 2, 40 native German and 40 native Japanese participants were exposed to speech processed in the same manner while trying to memorize visually presented sequences of digits in the correct order. Half of each sample received the German, the other half received the Japanese speech samples. The results show large irrelevant-speech effects increasing in magnitude with the number of frequency channels. The effects are slightly larger when subjects are exposed to their own native language. The results are neither predicted very well by the speech transmission index, nor by psychoacoustical fluctuation strength, most likely, since both metrics fail to disentangle amplitude and frequency modulations in the signals.
(C) 2015 Acoustical Society of America.

DOI： 10.1121/1.4928954

Other Link： http://dx.doi.org/10.1121/1.4928954
Auditory Grammar Invited Reviewed International journal

Yoshitaka NAKAJIMA, @Takayuki SASAKI, Kazuo UEDA, Gerard B. REMIJN

Acoustics Australia 42 ( 2 ) 97 - 101 2014.8

　More details

Language：English Publishing type：Research paper (scientific journal)
The occurrence of the filled duration illusion: A comparison of the method of adjustment with the method of magnitude estimation Reviewed International coauthorship International journal

#Emi HASUO, Yoshitaka NAKAJIMA, Erika TOMIMATSU, @Simon GRONDIN, Kazuo UEDA

Acta Psychologica 147 111 - 121 2014.2

　More details

Language：English Publishing type：Research paper (scientific journal)

A time interval between the onset and the offset of a continuous sound (filled interval) is often perceived to be longer than a time interval between two successive brief sounds (empty interval) of the same physical duration. The present study examined whether and how this phenomenon, sometimes called the filled duration illusion (FDI), occurs for short time intervals (40–520 ms). The investigation was conducted with the method of adjustment (Experiment 1) and the method of magnitude estimation (Experiment 2). When the method of adjustment was used, the FDI did not appear for the majority of the participants, but it appeared clearly for some participants. In the latter case, the amount of the FDI increased as the interval duration lengthened. The FDI was more likely to occur with magnitude estimation than with the method of adjustment. The participants who showed clear FDI with one method did not necessarily show such clear FDI with the other method.
Acoustic analyses of speech sounds and rhythms in Japanese- and English-learning infants Reviewed International coauthorship International journal

#Yuko Yamashita, Yoshitaka Nakajima, Kazuo Ueda, @Yohko Shimada, @David Hirsh, Takeharu Seno and @Benjamin Alexander Smith

Frontiers in Language Sciences 4 ( 57 ) 2013.2

　More details

Language：English Publishing type：Research paper (scientific journal)

The purpose of this study was to explore developmental changes, in terms of spectral fluctuations and temporal periodicity with Japanese- and English-learning infants. Three age groups (15, 20, and 24 months) were selected, because infants diversify phonetic inventories with age. Natural speech of the infants was recorded. We utilized a critical-band-filter bank, which simulated the frequency resolution in adults’ auditory periphery. First, the correlations between the power fluctuations of the critical-band outputs represented by factor analysis were observed in order to see how the critical bands should be connected to each other, if a listener is to differentiate sounds in infants’ speech. In the following analysis, we analyzed the temporal fluctuations of factor scores by calculating autocorrelations. The present analysis identified three factors as had been observed in adult speech at 24 months of age in both linguistic environments. These three factors were shifted to a higher frequency range corresponding to the smaller vocal tract size of the infants. The results suggest that the vocal tract structures of the infants had developed to become adult-like configuration by 24 months of age in both language environments. The amount of utterances with periodic nature of shorter time increased with age in both environments. This trend was clearer in the Japanese environment. - See more at: http://www.frontiersin.org/language_sciences/10.3389/fpsyg.2013.00057/abstract#sthash.R2weBtfH.dpuf

DOI： 10.3389/fpsyg.2013.00057
Time-stretching: Illusory lengthening of filled auditory durations Reviewed International coauthorship International journal

@Takayuki Sasaki, Yoshitaka Nakajima, @Gert ten Hoopen, @Edwin van Buuringen, @Bob Massier, #Taku Kojo, #Tsuyoshi Kuroda, and Kazuo Ueda

Attention, Perception, & Psychophysics 72 2010.7

　More details

Language：English Publishing type：Research paper (scientific journal)
Identification of English /r/ and /l/ in noise: the effects of baseline performance Reviewed International journal

Kazuo Ueda, @Reiko Akahane-Yamada, @Ryo Komaki, and @Takahiro Adachi

Acoustical Science and Technology 2007.7

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)
Short-term auditory memory interference: the Deutsch demonstration revisited Reviewed International journal

Ueda, K.

Acoustical Science and Technology 2004.11

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)
Speech versus nonspeech in pitch memory Reviewed International coauthorship International journal

@Semal, C., @Demany, L., Ueda, K., and @Halle, P.

Journal of the Acoustical Society of America 100 ( 2 ) 1132 - 1140 1996.8

　More details

Language：English Publishing type：Research paper (scientific journal)
Sharpness and amplitude envelopes of broadband noise Reviewed International journal

Ueda, K., and Akagi, M.

Journal of the Acoustical Society of America 1990.2

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)
Perceptual components of pitch: Spatial representation using a multidimensional scaling technique Reviewed International journal

Ueda, K., and Ohgushi, K.

Journal of the Acoustical Society of America 1987.10

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)
応用知覚科学研究センター（ReCAPS）創立10 周年 : 当センターの目標と成果

Remijn GerardＢ, UEDA Kazuo, Hasuo Emi

芸術工学研究 39 25 - 29 2024.3 （ ISSN:13490915 ）

　More details

Language：Japanese Publisher：Faculty of Design, Kyushu University

応用知覚科学研究センター(ReCAPS)は，2013年に設立され，今年10周年を迎えた。本稿では，当センターの目標を述べ，これまでの活動の概要をまとめることによって，当センターが知覚に関連するさまざまな研究分野をつなぐ，開かれた交流の場として機能してきたことを示す。

DOI： 10.15017/7170836

CiNii Research
The 10th Anniversary of the Research Center for Applied Perceptual Science: ReCAPS’ Aims and Achievements

REMIJN Gerard B., UEDA Kazuo, Hasuo Emi

芸術工学研究 39 31 - 35 2024.3 （ ISSN:13490915 ）

　More details

Language：English Publisher：Faculty of Design, Kyushu University

Founded in 2013, the Research Center for Applied Perceptual Science (ReCAPS) of the Faculty of Design, Kyushu University is celebrating its 10-year anniversary. The purpose of this article is to describe the aims of Re- CAPS and to give an overview of its activities and its functioning as an open platform for scientific exchange in the field of perception.

DOI： 10.15017/7170837

CiNii Research
Auditory Ensemble Perception (Summary Statistics) for Music Scale Tones by Listeners with and without Absolute Pitch Reviewed International journal

Gerard B. Remijn, #Masaki Teramachi, and Kazuo Ueda

Auditory Perception & Cognition 7 ( 2 ) 163 - 178 2024.1 （ ISSN:25742442 eISSN:25742450 ）

　More details

Language：English Publishing type：Research paper (scientific journal) Publisher：T & F

DOI： 10.1080/25742442.2024.2310460

CiNii Research

researchmap

Other Link： https://doi.org/10.1080/25742442.2024.2310460
Erratum: Intelligibility of chimeric locally time-reversed speech: Relative contribution of four frequency bands [ JASA Express Lett. 1(6), 065201 (2021)] Reviewed International journal

Kazuo Ueda and @Ikuo Matsuo

JASA Express Letters 1 ( 9 ) 095201 - 095201 2021.9

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)

DOI： 10.1121/10.0006007

Other Link： https://asa.scitation.org/doi/10.1121/10.0005439

Repository Public URL： http://hdl.handle.net/2324/4495874
Erratum: Checkerboard speech vs interrupted speech: Effects of spectrotemporal segmentation on intelligibility [JASA Express Lett. 1(7), 075204 (2001)] Reviewed International journal

Ueda, K., #Kawakami, R., and @Takeichi, H.

JASA Express Letters 1 ( 8 ) 1 - 1 2021.8

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)

DOI： 10.1121/10.0005990

Other Link： https://asa.scitation.org/doi/10.1121/10.0005990

Repository Public URL： http://hdl.handle.net/2324/4495873
Intelligibility of English Mosaic Speech: Comparison between Native and Non-Native Speakers of English Reviewed International journal

#Santi, Yoshitaka Nakajima, Kazuo Ueda, and Gerard B. Remijn

Applied Sciences 10 ( 6920 ) 1 - 13 2020.10

　More details

Language：English Publishing type：Research paper (scientific journal)

DOI： doi:10.3390/app10196920
Comparison of Multivariate Analysis Methods as Applied to English Speech Reviewed International journal

#Yixin Zhang, Yoshitaka Nakajima, Kazuo Ueda,@Takuya Kishida, and Gerard B. Remijn

Applied Sciences 10 ( 7076 ) 1 - 10 2020.10

　More details

Language：English Publishing type：Research paper (scientific journal)

DOI： 10.3390/app10207076

Other Link： https://doi.org/10.3390/app10207076
Perceived Congruency in Audiovisual Stimuli Consisting of Gabor Patches and AM- and FM-tones Invited Reviewed International journal

#Natalia, Postnova, Yoshitaka Nakajima, Kazuo Ueda, Gerard B. Remijn

Multisensory Research 34 ( 5 ) 455 - 475 2020.10

　More details

Language：English Publishing type：Research paper (scientific journal)

DOI： 10.1163/22134808-bja10041
Does filled duration illusion occur for very short time intervals? Reviewed International journal

#Emi Hasuo, Yoshitaka Nakajima, and Kazuo Ueda

Acoustical Science and Technology 32 ( 2 ) 2011.3

　More details

Language：English Publishing type：Research paper (scientific journal)
Intelligibility of English phonemes in noise for native and non-native listeners Reviewed International journal

@Takahiro Adachi, @Reiko Akahane-Yamada, and Kazuo Ueda

Acoustical Science and Technology 2006.9

　More details

Language：English Publishing type：Research paper (scientific journal)
An artificial environment is often a noisy environment: Auditory scene analysis and speech perception in noise Reviewed International journal

Kazuo Ueda, Yoshitaka Nakajima, and @Reiko Akahane-Yamada

Journal of Physiological Anthropology and Applied Human Science 24 ( 1 ) 129 - 133 2005.2

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)
Perceptual organization of onsets and offsets of sounds Reviewed International journal

Nakajima, Y., @Sasaki, T., Remijn, G. B., and Ueda, K.

Journal of Physiological Anthropology and Applied Human Science 23 ( 6 ) 345 - 349 2004.12

　More details

Language：English Publishing type：Research paper (scientific journal)
Identification of English /r/ and /l/ in white noise by native and non-native listeners Reviewed International journal

Ueda, K., @Akahane-Yamada, R., and @Komaki, R.

Acoustical Science and Technology 23 ( 6 ) 336 - 338 2002.11

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)
The effect of sound pressure level difference on filled duration extension Reviewed International journal

Ueda, K., and Ohtsuki, M.

Journal of the Acoustical Society of Japan (E) 1996.5

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)
Frequency response of headphones measured in free field and diffuse field by loudness comparison Reviewed International journal

Ueda, K., and Hirahara, T.

Journal of the Acoustical Society of Japan (E) 1991.5

　More details

Authorship：Lead author,　Corresponding author Language：English Publishing type：Research paper (scientific journal)
音色の表現語に階層構造は存在するか Reviewed

上田和夫

日本音響学会誌 1988.2

　More details

Authorship：Lead author,　Corresponding author Language：Japanese Publishing type：Research paper (scientific journal)

Should we assume a hierarchical structure for adjectives describing timbre?
多次元尺度法による音の高さの二面性の空間的表現 Reviewed

上田和夫, 大串健吾

日本音響学会誌 1984.12

　More details

Authorship：Lead author,　Corresponding author Language：Japanese Publishing type：Research paper (scientific journal)

Spatial representations of two components of pitch using multidimensional scaling technique

Patent	Number of applications: 6	Number of registrations: 5
Utility model	Number of applications: 0	Number of registrations: 0
Design	Number of applications: 0	Number of registrations: 0
Trademark	Number of applications: 0	Number of registrations: 0

Information

Information

Research Areas

Research Areas

Degree

Degree

Research History

Research History

Education

Education

Research Interests・Research Keywords

Research Interests・Research Keywords

Awards

Awards

Papers

Papers

Books

Books

Presentations

Presentations

MISC

MISC

Industrial property rights

Industrial property rights

Professional Memberships

Professional Memberships

Committee Memberships

Committee Memberships

Academic Activities

Academic Activities

Research Projects

Research Projects

Educational Activities

Educational Activities

Class subject

Class subject

FD Participation

FD Participation

Visiting, concurrent, or part-time lecturers at other universities, institutions, etc.

Visiting, concurrent, or part-time lecturers at other universities, institutions, etc.

Participation in international educational events, etc.

Participation in international educational events, etc.

Teaching Student Awards

Teaching Student Awards

Other educational activity and Special note

Other educational activity and Special note

Outline of Social Contribution and International Cooperation activities

Outline of Social Contribution and International Cooperation activities

Social Activities

Social Activities

Activities contributing to policy formation, academic promotion, etc.

Activities contributing to policy formation, academic promotion, etc.

Acceptance of Foreign Researchers, etc.

Acceptance of Foreign Researchers, etc.

Travel Abroad

Travel Abroad