B

Beatrice T. Oshika

Mitre (United States)

Publishes on Natural Language Processing Techniques, Speech Recognition and Synthesis, Speech and dialogue systems. 26 papers and 567 citations.

26Publications
567Total Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

The OGI multi-language telephone speech corpus
Cited by 253

The OGI Multi-language Telephone Speech Corpus is designed to support research on automatic language identification and multi-language speech recognition. The corpus consists of up to nine separate responses from each caller, ranging from single words to short topic-specific descriptions to 60 seconds of unconstrained spontaneous speech. The utterances were spoken over commercial telephone lines by speakers of English, Farsi (Persian), French, German, Japanese, Korean, Mandarin Chinese, Spanish, Tamil, and Vietnamese. We have completed the initial phase of our data acquisition effort: the recording and initial verification of utterances produced by 100 different speakers in each of the 10 languages. We describe the recording protocol, data collection procedure, ongoing corpus development, preliminary results of the statistical evaluation of the 10 languages, and plans to provide orthographic transcriptions of the speech. INTRODUCTION Research in multi-language recognition systems wou...

The role of phonological rules in speech understanding research
Beatrice T. Oshika, Victor W. Zue, Raymond Weeks et al.|IEEE Transactions on Acoustics Speech and Signal Processing|1975
Cited by 98

This paper presents some phonological rules which describe systematic pronunciation variation occurring in natural continuous speech. It is argued that a speech understanding system must account for such variation by incorporating phonological rules, either implicitly or explicitly, into the system. Spectrographic evidence for the phonological phenomena described by the rules is included.

Long-term feature averaging for speaker recognition
J. Markel, Beatrice T. Oshika, Alfred Gray|IEEE Transactions on Acoustics Speech and Signal Processing|1977
Cited by 83

The potential benefits of long-term parameter averaging for speaker recognition were investigated. Parameters studied were pitch, gain, and reflection coefficients. Parameter variability was computed over various averaging lengths from one frame averaging (in effect, no averaging) to 1000 frame averaging (about 70 s of speech). It was demonstrated that the between-to-within speaker variance ratio, measured over several speakers, was significantly increased by performing long-term averaging of the parameter sets. The reflection coefficient averages for k <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</inf> and k <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">6</inf> , respectively, were shown to produce the highest variance ratios.

Labeler agreement in phonetic labeling of continuous speech
Cited by 15

This paper analyzes inter-labeler agreement of label choice and boundary placement for human phonetic transcriptions of continuous telephone speech in different languages. In experiment one, English, German, Mandarin and Spanish are labeled by fluent speakers of the languages. In experiment two, German and Hindi are labeled by linguists who do not speak the languages. Experiment two uses a somewhat finer phonetic transcription set than experiment one. We compare the transcriptions of the utterances in terms of the minimum number of substitutions, insertions and deletions needed to map one transcription to the other. Native speakers agree on the average 67.52% of the time at the finest level of labeling, including diacritics. Non-native linguists agree 34.41% of the time. The implications of the results are discussed for evaluation of phonetic recognition algorithms. 1. INTRODUCTION Phonetically transcribed continuous speech databases are important for understanding the phonological st...