24.1.4 Signal Processing, Speech Papers

Chapter Contents (Back)
These are mostly included since they are in the full ToC for journals that are taken completely.

Hanson, A.R., Riseman, E.M., Fisher, E.,
Context in word recognition,
PR(8), No. 1, January 1976, pp. 35-45.
WWW Version. 0309
BibRef

de Mori, R., Laface, P., Makhonine, V.A., Mezzalama, M.,
A syntactic procedure for the recognition of glottal pulses in continuous speech,
PR(9), No. 4, 1977, pp. 181-189.
WWW Version. 0309
BibRef

Maroy, J.P., Berthod, M.,
Natural language understanding by a robot: A pattern recognition problem,
PR(10), No. 2, 1978, pp. 63-71.
WWW Version. 0309
BibRef

Pal, S.K., Datta, A.K., Majumder, D.D.[D. Dutta],
A self-supervised vowel recognition system,
PR(12), No. 1, 1980, pp. 27-34.
WWW Version. 0309
BibRef

Pathak, A.[Amita], Pal, S.K.[Sankar K.],
On the convergence of 'A self-supervised vowel recognition system',
PR(20), No. 2, 1987, pp. 237-244.
WWW Version. 0309
BibRef

de Mori, R.[Renato], Giordano, G.[Giovanna],
Algorithms for syllabic hypothesization in continuous speech,
PR(14), No. 1-6, 1981, pp. 245-260.
WWW Version. 0309
BibRef

Howard, Jr., J.H.[James H.],
Feature selection in human auditory perception,
PR(15), No. 5, 1982, pp. 397-403.
WWW Version. 0309
BibRef

Thomason, M.G., Granum, E., Blake, R.E.,
Experiments in dynamic programming inference of Markov networks with strings representing speech data,
PR(19), No. 5, 1986, pp. 343-352.
WWW Version. 0309
BibRef

Tanaka, E.[Eiichi], Toyama, T.[Takanori], Kawai, S.[Sachiko],
High speed error correction of phoneme sequences,
PR(19), No. 5, 1986, pp. 407-412.
WWW Version. 0309
BibRef

Lee, L.S., Tseng, C.Y., Chen, K.J., Huang, J., Hwang, C.H., Ting, P.Y., Lin, L.J., Chen, C.C.,
A Mandarin dictation machine based upon a hierarchical recognition approach and Chinese natural language analysis,
PAMI(12), No. 7, July 1990, pp. 695-704.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401
BibRef

Kenny, P., Lennig, M., Mermelstein, P.,
Speaker adaptation in a large-vocabulary Gaussian HMM recognizer,
PAMI(12), No. 9, September 1990, pp. 917-920.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401
BibRef

Casacuberta, F.,
Some relations among stochastic finite state networks used in automatic speech recognition,
PAMI(12), No. 7, July 1990, pp. 691-695.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401
BibRef

Yannakoudakis, E.J., Tsomokos, I., Hutton, P.J.,
n-Grams and their implication to natural language understanding,
PR(23), No. 5, 1990, pp. 509-528.
WWW Version. 0401
BibRef

Hochberg, J., Mniszewski, S.M., Calleja, T., Papcun, G.J.,
A default hierarchy for pronouncing English,
PAMI(13), No. 9, September 1991, pp. 957-964.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401
BibRef

Carlson, B.A., Clements, M.A.,
A computationally compact divergence measure for speech processing,
PAMI(13), No. 12, December 1991, pp. 1255-1260.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401
BibRef

Ney, H.[Hermann],
A comparative study of two search strategies for connected word recognition: dynamic programming and heuristic search,
PAMI(14), No. 5, May 1992, pp. 586-595.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401
BibRef

Ney, H.[Hermann],
Stochastic Modelling: From Pattern Classification to Speech Recognition and Translation,
ICPR00(Vol III: 21-28).
IEEE DOI Link
HTML Version. 0009
BibRef

Wu, J.X.[Jian-Xiong], Chan, C.[Chorkin],
Isolated word recognition by neural network models with cross-correlation coefficients for speech dynamics,
PAMI(15), No. 11, November 1993, pp. 1174-1185.
IEEE Abstract. IEEE Top Reference.
WWW Version. 0401
BibRef

Liu, L.C.[Lih-Cherng], Chiou, D.[Denis], Wang, H.C.[Hsiao-Chuan],
A speech recognition method based on feature distributions,
PR(24), No. 8, 1991, pp. 717-722.
WWW Version. 0401
BibRef

Pinkowski, B.[Ben],
Multiscale fourier descriptors for classifying semivowels in spectrograms,
PR(26), No. 10, October 1993, pp. 1593-1602.
WWW Version. 0401
BibRef

Pinkowski, B.,
Principal Component Analysis of Speech Spectrogram Images,
PR(30), No. 5, May 1997, pp. 777-787.
WWW Version. 9705
BibRef

Chen, W.Y.[Wen-Yuan], Liao, Y.F.[Yuan-Fu], Chen, S.H.[Sin-Horng],
Speech recognition with hierarchical recurrent neural networks,
PR(28), No. 6, June 1995, pp. 795-805.
WWW Version. 0401
BibRef

Huo, Q.A.[Qi-Ang], Chan, C.[Chorkin],
Contextual vector quantization for speech recognition with discrete hidden Markov model,
PR(28), No. 4, April 1995, pp. 513-517.
WWW Version. 0401
BibRef

Pham, T.D.[Tuan D.], Wagner, M.[Michael],
A geostatistical model for linear prediction analysis of speech,
PR(31), No. 12, December 1998, pp. 1981-1991.
WWW Version. 0401
BibRef

Lee, T.[Tan], Ching, P.C., Chan, L.W.[Lai-Wan],
Isolated word recognition using modular recurrent neural networks,
PR(31), No. 6, June 1998, pp. 751-760.
WWW Version. 0401
BibRef

Tacer, B.[Berkant], Loughlin, P.J.[Patrick J.],
Non-stationary signal classification using the joint moments of time-frequency distributions,
PR(31), No. 11, November 1998, pp. 1635-1641.
WWW Version. 0401
BibRef

Han, J.[Jiqing], Gao, W.[Wen],
Robust telephone speech recognition based on channel compensation,
PR(32), No. 6, June 1999, pp. 1061-1067.
WWW Version. 0401
BibRef

Lewis, M.A.[Michael A.], Ramachandran, R.P.[Ravi P.],
Cochannel speaker count labelling based on the use of cepstral and pitch prediction derived features,
PR(34), No. 2, February 2001, pp. 499-507.
WWW Version. 0011
BibRef

Kant, S.[Shri], Verma, N.[Neelam],
An Effective Source Recognition Algorithm: Extraction of Significant Binary Words,
PRL(21), No. 11, October 2000, pp. 981-988. 0010
BibRef

Kwong, S., He, Q.H., Man, K.F., Tang, K.S.,
A maximum model distance approach for HMM-based speech recognition,
PR(31), No. 3, March 1998, pp. 219-229.
WWW Version. 0401
BibRef

He, Q.H., Kwong, S., Man, K.F., Tang, K.S.,
An improved maximum model distance approach for HMM-based speech recognition systems,
PR(33), No. 10, October 2000, pp. 1749-1758.
WWW Version. 0006
BibRef

Li, M., McAllister, H.G., Black, N.D., de Perez, T.A.,
Wavelet-based nonlinear AGC method for hearing aid loudness compensation,
VISP(147), No. 6, December 2000, pp. 502-507. 0101
BibRef

Gray, P., Hollier, M.P., Massara, R.E.,
Non-intrusive speech-quality assessment using vocal-tract models,
VISP(147), No. 6, December 2000, pp. 493-501. 0101
BibRef

Wu, C.H., Chen, Y.J., Yan, G.L.,
Integration of phonetic and prosodic information for robust utterance verification,
VISP(147), No. 1, February 2000, pp. 55. 0005
BibRef

Kim, W.[Wooil], Kang, S.[Sunmee], Ko, H.S.[Han-Seok],
Spectral subtraction based on phonetic dependency and masking effects,
VISP(147), No. 5, October 2000, pp. 423-427. 0101
BibRef

Hussain, A., Campbell, D.R.,
Intelligibility improvements using binaural diverse sub-band processing applied to speech corrupted with automobile noise,
VISP(148), No. 2, April 2001, pp. 127-132. 0106
BibRef

Bohez, E.L.J.[Erik L.J.], Senevirathne, T.R.,
Speech recognition using fractals,
PR(34), No. 11, November 2001, pp. 2227-2243.
WWW Version. 0108
BibRef

Sarkar, S., Poor, H.V.,
Multirate signal processing on finite fields,
VISP(148), No. 4, August 2001, pp. 254-262. 0201
BibRef

Chen, S.H., Wang, J.F.,
Application of wavelet transforms for C/V segmentation on Mandarin speech signals,
VISP(148), No. 2, April 2001, pp. 133-139. 0106
BibRef

Mouria-Beji, F.[Fériel],
A hierarchical Bayesian model for continuous speech recognition,
PRL(23), No. 7, May 2002, pp. 773-781.
HTML Version. 0203
BibRef

Chen, F.K., Yang, J.F., Yan, Y.L.,
Candidate scheme for fast ACELP search,
VISP(149), No. 1, February 2002, pp. 10-16.
IEEE Top Reference. 0205
Algebraic code excited linear prediction. Speech coding. BibRef

Mumolo, E.[Enzo],
Spectral domain texture analysis for speech enhancement,
PR(35), No. 10, October 2002, pp. 2181-2191.
WWW Version. 0206
BibRef

Liu, J.W.[Jing-Wei], Cheng, Q.S.[Qian-Sheng], Zheng, Z.G.[Zhong-Guo], Qian, M.[Minping],
A DTW-based probability model for speaker feature analysis and data mining,
PRL(23), No. 11, September 2002, pp. 1271-1276.
HTML Version. 0206
BibRef

Ding, Z.O., McLoughlin, I.V., Tan, E.C.,
Extension of proposal of standards for intelligibility tests of Chinese speech: CDRT-tone,
VISP(150), No. 1, February 2003, pp. 1-5.
IEEE Top Reference. 0304
BibRef

Huang, C.S.[Chao-Shih], Wang, H.C.[Hsiao-Chuan],
Bandwidth-adjusted LPC analysis for robust speech recognition,
PRL(24), No. 9-10, June 2003, pp. 1583-1587.
WWW Version. 0304
BibRef

Juang, Y.T.[Yau-Tarng], Huang, K.C.[Kuo-Chang], Ding, I.J.[Ing-Jr],
Speaker adaptation based on MAP estimation using fuzzy controller,
PRL(24), No. 15, November 2003, pp. 2807-2813.
WWW Version. 0308
BibRef

Ding, I.J.[Ing-Jr],
Incremental MLLR speaker adaptation by fuzzy logic control,
PR(40), No. 11, November 2007, pp. 3110-3119.
WWW Version. 0707
Speech recognition; Speaker adaptation; Hidden Markov model; Maximum likelihood linear regression; T-S fuzzy logic controller BibRef

Li, T.F.[Tze Fen],
Speech Recognition of Mandarin Monosyllables,
PR(36), No. 11, November 2003, pp. 2713-2721.
WWW Version. 0309
BibRef

Farooq, O., Datta, S.,
Wavelet based robust sub-band features for phoneme recognition,
VISP(151), No. 3, June 2004, pp. 187-193.
IEEE Abstract. IEEE Top Reference. 0409
BibRef

de Lamare, R.C., Alcaim, A.,
Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec,
VISP(152), No. 1, February 2005, pp. 74-86.
IEEE Abstract. IEEE Top Reference. 0501
BibRef

Ricotti, L.P.,
Multitapering and a wavelet variant of MFCC in speech recognition,
VISP(152), No. 1, February 2005, pp. 29-35.
IEEE Abstract. IEEE Top Reference. 0501
BibRef

Chen, K.[Ke],
On the use of different speech representations for speaker modeling,
SMC-C(35), No. 3, August 2005, pp. 301-314.
IEEE DOI Link 0508
BibRef

Vera-Candeas, P., Ruiz-Reyes, N., Rosa-Zurera, M., Lopez-Ferreras, F., Curpian-Alonso, J.,
New matching pursuit based sinusoidal modelling method for audio coding,
VISP(151), No. 1, February 2004, pp. 21-28.
IEEE Abstract. IEEE Top Reference. 0403
BibRef

Vera-Candeas, P.[Pedro], Ruiz-Reyes, N.[Nicolás], Rosa-Zurera, M.[Manuel], Cuevas-Martinez, J.C.[Juan C.], López-Ferreras, F.[Francisco],
Adaptive Signal Models for Wide-Band Speech and Audio Compression,
IbPRIA05(II:571).
Springer DOI Link 0509
BibRef

Zhong, W., Li, S., Tai, H.M.,
Signal subspace approach for narrowband noise reduction in speech,
VISP(152), No. 6, December 2005, pp. 800-805.
WWW Version. 0512
BibRef

Chen, B.[Berlin],
Exploring the use of latent topical information for statistical Chinese spoken document retrieval,
PRL(27), No. 1, 1 January 2006, pp. 9-18.
WWW Version. 0512
BibRef

Chen, B.[Berlin], Chen, Y.T.[Yi-Ting],
Extractive spoken document summarization for information retrieval,
PRL(29), No. 4, 1 March 2008, pp. 426-437.
WWW Version. 0711
Extractive summarization; Information retrieval; Topical mixture model; Spoken documents; Speech recognition BibRef

Wan, C.[Chunru], Liu, M.C.[Ming-Chun],
Content-based audio retrieval with relevance feedback,
PRL(27), No. 2, 15 January 2006, pp. 85-92.
WWW Version. 0512
BibRef

Li, C., Li, S., Zhang, D., Chen, G.,
Cryptanalysis of a data securityp protection scheme for VoIP,
VISP(153), No. 1, February 2006, pp. 1-10.
WWW Version. 0602
BibRef

Radhakrishnan, R.[Regunathan], Divakaran, A.[Ajay], Xiong, Z.Y.[Zi-You], Otsuka, I.[Isao],
A Content-Adaptive Analysis and Representation Framework for Audio Event Discovery from 'Unscripted' Multimedia,
JASP(2006), 2006, pp. 1-24.
WWW Version. 0603
BibRef

Chu, W.T.[Wei-Ta], Cheng, W.H.[Wen-Huang], Wu, J.L.[Ja-Ling],
Semantic Context Detection Using Audio Event Fusion,
JASP(2006), 2006, pp. 1-12.
WWW Version. 0603
BibRef

Sandler, M., Black, D.,
Scalable audio coding for compression and loss resilient streaming,
VISP(153), No. 3, June 2006, pp. 331-339.
WWW Version. 0608
BibRef

Chang, J.H.[Joon-Hyuk], Gazor, S.[Saeed], Kim, N.S.[Nam Soo], Mitra, S.K.[Sanjit K.],
Multiple statistical models for soft decision in noisy speech enhancement,
PR(40), No. 3, March 2007, pp. 1123-1134.
WWW Version. 0611
Speech enhancement; DCT; Multiple statistical model; Gaussian; Laplacian; Gamma; GOF; PSFM; SAP; PESQ BibRef

Liu, J.W.[Jing-Wei], Wang, Z.Y.[Zuo-Ying], Xiao, X.[Xi],
A hybrid SVM/DDBHMM decision fusion modeling for robust continuous digital speech recognition,
PRL(28), No. 8, 1 June 2007, pp. 912-920.
WWW Version. 0704
Speech recognition; Gaussian mixture model; Duration distribution based hidden Markov model (DDBHMM); Support vector machine BibRef

Guido, R.C.[Rodrigo Capobianco], Pereira, J.C.[Jose Carlos], Slaets, J.F.W.[Jan Frans Willem],
Introduction to the Special Issue: Advances on pattern recognition for speech and audio processing,
PRL(28), No. 11, 1 August 2007, pp. 1283-1284.
WWW Version. 0706
BibRef

Leavitt, N.,
Two technologies vie for recognition in speech market,
Computer(36), No. 6, June 2003, pp. 13-16.
IEEE DOI Link 0306
BibRef

Paulson, L.D.,
Speech Recognition Moves from Software to Hardware,
Computer(39), No. 11, November 2006, pp. 15-18.
IEEE DOI Link 0611
BibRef

Stavrakoudis, D.G., Theocharis, J.B.,
Pipelined Recurrent Fuzzy Neural Networks for Nonlinear Adaptive Speech Prediction,
SMC-B(37), No. 5, October 2007, pp. 1305-1320.
IEEE DOI Link 0711
BibRef

Frankel, J.[Joe], King, S.[Simon],
Factoring Gaussian precision matrices for linear dynamic models,
PRL(28), No. 16, December 2007, pp. 2264-2272.
WWW Version. 0711
Linear dynamic model; Error distribution; Precision matrix Speech. BibRef

Chouireb, F.[Fatima], Guerti, M.[Mhania],
Towards a high quality Arabic speech synthesis system based on neural networks and residual excited vocal tract model,
SIViP(2), No. 1, January 2008, pp. 73-87.
Springer DOI Link 0712
BibRef

Araujo, L.[Lourdes], Serrano, J.I.[J. Ignacio],
Highly accurate error-driven method for noun phrase detection,
PRL(29), No. 4, 1 March 2008, pp. 547-557.
WWW Version. 0711
Noun phrase detection; Evolutionary programming; Grammar induction; Information retrieval BibRef

Zhang, Y.X.[Yong-Xin], Scordilis, M.S.[Michael S.],
Effective online unsupervised adaptation of Gaussian mixture models and its application to speech classification,
PRL(29), No. 6, 15 April 2008, pp. 735-744.
WWW Version. 0803
Gaussian mixture model; Speech classification; Online adaptation; Unsupervised adaptation BibRef

Baluja, S.[Shumeet], Covell, M.[Michele],
Waveprint: Efficient wavelet-based audio fingerprinting,
PR(41), No. 11, November 2008, pp. 3467-3480.
WWW Version. 0808
Audio retrieval; Applications; Image/video retrieval; Pattern analysis BibRef

O'Shaughnessy, D.[Douglas],
Invited paper: Automatic speech recognition: History, methods and challenges,
PR(41), No. 10, October 2008, pp. 2965-2979.
WWW Version. 0808
Automatic speech recognition; Hidden Markov models; Adaptation; Compensation; Pattern recognition; Spectral representation BibRef

Zeng, J.[Jia], Xie, L.[Lei], Liu, Z.Q.A.[Zhi-Qi-Ang],
Type-2 fuzzy Gaussian mixture models,
PR(41), No. 12, December 2008, pp. 3636-3643.
WWW Version. 0810
BibRef
Earlier: A1, A3, Only:
Type-2 fuzzy hidden markov models to phoneme recognition,
ICPR04(I: 192-195).
IEEE DOI Link 0409
Type-2 fuzzy sets; Gaussian mixture models; Hidden Markov models BibRef

Chen, B.[Berlin], Liu, S.H.[Shih-Hung], Chu, F.H.[Fang-Hui],
Training data selection for improving discriminative training of acoustic models,
PRL(30), No. 13, 1 October 2009, pp. 1228-1235,.
Elsevier DOI Link
WWW Version. 0909
Continuous speech recognition; Discriminative training; Acoustic models; Data selection; Phone accuracy; Entropy BibRef


Qasemi Zadeh, B.[Behrang], Shen, J.[Jiali], O'Neill, I.[Ian], Miller, P.[Paul], Hanna, P.[Philip], Stewart, D.[Darryl], Wang, H.B.[Hong-Bin],
A Speech Based Approach to Surveillance Video Retrieval,
AVSBS09(336-339).
IEEE DOI Link 0909
BibRef

Kuhnapfel, T.[Thorsten], Tan, T.[Tele], Venkatesh, S.[Svertha], Igel, B.[Burkhard],
Distributed Audio Network for Speech Enhancement in Challenging Noise Backgrounds,
AVSBS09(308-313).
IEEE DOI Link 0909
BibRef

Cristani, M., Pesarin, A., Drioli, C., Tavano, A., Perina, A., Murino, V.,
Auditory dialog analysis and understanding by generative modelling of interactional dynamics,
CVPR4HB09(103-109).
IEEE DOI Link 0906
BibRef

Gosztolya, G.[Gábor], Bánhalmi, A.[András], Tóth, L.[László],
Using One-Class Classification Techniques in the Anti-phoneme Problem,
IbPRIA09(433-440).
Springer DOI Link 0906
BibRef

Chen, J.B.[Jin-Biao], Zhang, S.[Shiqing],
Manifold learning-based phoneme recognition,
IASP09(308-312).
IEEE DOI Link 0904
BibRef

Mahdhaoui, A.[Ammar], Chetouani, M.[Mohamed], Zong, C.[Cong],
Motherese detection based on segmental and supra-segmental features,
ICPR08(1-4).
IEEE DOI Link 0812
parent-infant interactions. BibRef

Zeng, Z.[Zhi], Li, X.[Xin], Ma, X.H.[Xiao-Hong], Ji, Q.A.[Qi-Ang],
Adaptive context recognition based on audio signal,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Luo, L.[Li], Lu, P.F.[Peng-Fei], Wang, Z.F.[Zeng-Fu],
A real-time accompaniment system based on sung voice recognition,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Pesarin, A., Cristani, M., Murino, V., Drioli, C., Perina, A., Tavano, A.,
A statistical signature for automatic dialogue classification,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Choi, H.[Heeyoul], Gutierrez-Osuna, R.[Ricardo], Choi, S.J.[Seung-Jin], Choe, Y.[Yoonsuck],
Kernel oriented discriminant analysis for speaker-independent phoneme spaces,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Terry, L.[Louis], Katsaggelos, A.K.[Aggelos K.],
A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Kuhnapfel, T.[Thorsten], Tan, T.[Tele], Venkatesh, S.[Svetha], Nordholm, S.E.[Sven Erik], Igel, B.[Burkhard],
Adaptive speech enhancement with varying noise backgrounds,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Krajewski, J.[Jarek], Batliner, A.[Anton], Wieland, R.[Rainer],
Multiple classifier applied on predicting microsleep from speech,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Banerjee, P.[Pratyush], Garg, G.[Gaurav], Mitra, P.[Pabitra], Basu, A.[Anupam],
Application of triphone clustering in acoustic modeling for continuous speech recognition in Bengali,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Li, X.K.[Xiao-Kun], Deng, Y.[Yunbin],
Combining speech energy and edge information for fast and efficient voice activity detection in noisy environments,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Ser, W.[Wee], Cen, L.[Ling], Yu, Z.L.[Zhu Liang],
A Hybrid PNN-GMM classification scheme for speech emotion recognition,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Dehzangi, O.[Omid], Ma, B.[Bin], Chng, E.S.[Eng Siong], Li, H.Z.[Hai-Zhou],
Fuzzy rule selection using Iterative Rule Learning for speech data classification,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Bouzid, A.[Aďcha], Ellouze, N.[Noureddine],
Voicing Detection in Noisy Speech Signal,
ICISP08(544-551).
Springer DOI Link 0807
BibRef

Kukharchik, P., Kheidorov, I., Bovbel, E., Ladeev, D.,
Speech Signal Processing Based on Wavelets and SVM for Vocal Tract Pathology Detection,
ICISP08(192-199).
Springer DOI Link 0807
BibRef

Türkmen, H.I.[H. Irem], Karsligil, M.E.[M. Elif],
Reconstruction of Dysphonic Speech by MELP,
CIARP08(767-774).
Springer DOI Link 0809
BibRef

Hain, T.[Thomas], Burget, L.[Lukas], Dines, J.[John], Garau, G.[Giulia], Karafiat, M.[Martin], van Leeuwen, D.[David], Lincoln, M.[Mike], Wan, V.[Vincent],
The 2007 AMI(DA) System for Meeting Transcription,
MTPH07(xx-yy).
Springer DOI Link 0705
BibRef

Lamel, L., Bilinski, E., Gauvain, J.L., Adda, G., Barras, C., Zhu, X.,
The LIMSI RT07 Lecture Transcription System,
MTPH07(xx-yy).
Springer DOI Link 0705
BibRef

Fiscus, J.G.[Jonathan G.], Ajot, J.[Jerome], Garofolo, J.S.[John S.],
The Rich Transcription 2007 Meeting Recognition Evaluation,
MTPH07(xx-yy).
Springer DOI Link 0705
BibRef

Stolcke, A.[Andreas], Anguera, X.[Xavier], Boakye, K.[Kofi], Çetin, Ö.[Özgür], Janin, A.[Adam], Magimai-Doss, M.[Mathew], Wooters, C.[Chuck], Zheng, J.[Jing],
The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System,
MTPH07(xx-yy).
Springer DOI Link 0705
BibRef

Huang, J.[Jing], Marcheret, E.[Etienne], Visweswariah, K.[Karthik], Libal, V.[Vit], Potamianos, G.[Gerasimos],
The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings,
MTPH07(xx-yy).
Springer DOI Link 0705
BibRef

Wölfel, M.[Matthias], Stüker, S.[Sebastian], Kraft, F.[Florian],
The ISL RT-07 Speech-to-Text System,
MTPH07(xx-yy).
Springer DOI Link 0705
BibRef

Schuller, B.[Björn], Wöllmer, M.[Martin], Moosmayr, T.[Tobias], Ruske, G.[Günther], Rigoll, G.[Gerhard],
Switching Linear Dynamic Models for Noise Robust In-Car Speech Recognition,
DAGM08(xx-yy).
Springer DOI Link 0806
BibRef

Patil, H.A.[Hemant A.], Basu, T.K.,
Cepstral Domain Teager Energy for Identifying Perceptually Similar Languages,
PReMI07(455-462).
Springer DOI Link 0712
BibRef

Manwani, N.[Naresh], Mitra, S.K.[Suman K.], Joshi, M.V.,
Spoken Language Identification for Indian Languages Using Split and Merge EM Algorithm,
PReMI07(463-468).
Springer DOI Link 0712
BibRef

Rao, K.S.[K. Sreenivasa], Laskar, R.H., Koolagudi, S.G.[Shashidhar G.],
Voice Transformation by Mapping the Features at Syllable Level,
PReMI07(479-486).
Springer DOI Link 0712
BibRef

Nagesha, Kumar, G.H.[G. Hemantha],
Signal Resampling Technique Combining Level Crossing and Auditory Features,
PReMI07(447-454).
Springer DOI Link 0712
BibRef

Hurtado, L.F.[Lluís F.], Griol, D.[David], Sanchis, E.[Emilio], Segarra, E.[Encarna],
A Statistical User Simulation Technique for the Improvement of a Spoken Dialog System,
CIARP07(743-752).
Springer DOI Link 0711
BibRef

Oropeza Rodríguez, J.L.[José Luis], Suárez Guerra, S.[Sergio], Sánchez Fernández, L.P.[Luis Pastor],
Using Adaptive Filter to Increase Automatic Speech Recognition Rate in a Digit Corpus,
CIARP07(78-87).
Springer DOI Link 0711
BibRef

Várallyay, G.[György],
SSM: A Novel Method to Recognize the Fundamental Frequency in Voice Signals,
CIARP07(88-95).
Springer DOI Link 0711
BibRef

Ohara, M.[Masatoshi], Utsumi, A.[Akira], Yamazoe, H.[Hirotake], Abe, S.[Shinji], Katayama, N.[Noriaki],
Attention Monitoring for Music Contents Based on Analysis of Signal-Behavior Structures,
ACCV07(I: 292-302).
Springer DOI Link 0711
BibRef

Simőes, C.[Carla], Teixeira, C.[Carlos], Dias, M.[Miguel], Braga, D.[Daniela], Calado, A.[António],
European Portuguese Accent in Acoustic Models for Non-native English Speakers,
CIARP07(734-742).
Springer DOI Link 0711
BibRef

Smeaton, A.F.[Alan F.], McHugh, M.[Mike],
Towards event detection in an audio-based sensor network,
VSSN05(87-94).
WWW Version. 0511
BibRef

Esposito, A.[Anna], Stejskal, V.[Vojtech], Smékal, Z.[Zdenek], Bourbakis, N.[Nikolaos],
The Significance of Empty Speech Pauses: Cognitive and Algorithmic Issues,
BVAI07(542-554).
Springer DOI Link 0710
BibRef

Hernández, I.[Igmar], García, P.[Paola], Nolazco, J.[Juan], Buera, L.[Luis], Lleida, E.[Eduardo],
Robust Automatic Speech Recognition Using PD-MEEMLIN,
IbPRIA07(II: 1-8).
Springer DOI Link 0706
BibRef

Chung, Y.J.[Yong-Joo], Bae, K.S.[Keun-Sung],
Data-Driven Jacobian Adaptation in a Multi-model Structure for Noisy Speech Recognition,
IbPRIA07(II: 452-459).
Springer DOI Link 0706
BibRef

Expósito, J.E.M.[J. E. Muńoz], Reyes, N.R.[N. Ruiz], Galán, S.G.[S. Garcia], Candeas, P.V.[P. Vera],
Speech/Music Classification Based on Distributed Evolutionary Fuzzy Logic for Intelligent Audio Coding,
IbPRIA07(II: 556-563).
Springer DOI Link 0706
BibRef

Ferrer, C.A.[Carlos A.], González, E.[Eduardo], Hernández-Díaz, M.E.[María E.],
Evaluation of Time and Frequency Domain-Based Methods for the Estimation of Harmonics-to-Noise-Ratios in Voice Signals,
CIARP06(406-415).
Springer DOI Link 0611
BibRef

Cano, S.[Sergio], Suaste, I.[Israel], Escobedo, D.[Daniel], Reyes-García, C.A.[Carlos A.], Ekkel, T.[Taco],
A Combined Classifier of Cry Units with New Acoustic Attributes,
CIARP06(416-425).
Springer DOI Link 0611
BibRef

Huerta-Hernández, L.D.[Luis D.], Reyes-García, C.A.[Carlos A.],
On the Processing of Fuzzy Patterns for Text Independent Phonetic Speech Segmentation,
CIARP06(437-445).
Springer DOI Link 0611
BibRef

Alghassi, H., Tafazoli, S., Lawrence, P.,
The Audio Surveillance Eye,
AVSBS06(106-106).
IEEE DOI Link 0611
BibRef

Yuan, L.[Lichi], Chen, Z.G.[Zhi-Gang],
A Novel Statistical Model for Speech Recognition and POS Tagging,
AVSBS06(61-61).
IEEE DOI Link 0611
BibRef

Yin, B.[Bo], Ambikairajah, E.[Eliathamby], Chen, F.[Fang],
Combining Cepstral and Prosodic Features in Language Identification,
ICPR06(IV: 254-257).
WWW Version. 0609
BibRef

Leila, Chollet, G.[Gerard],
Efficient Gaussian Mixture for Speech Recognition,
ICPR06(IV: 294-297).
WWW Version. 0609
BibRef

Vinciarelli, A.[Alessandro],
Sociometry Based Multiparty Audio Recordings Summarization,
ICPR06(II: 1154-1157).
WWW Version. 0609
BibRef

Wang, J.C.[Jia-Ching], Wang, J.F.[Jhing-Fa], Lin, C.B.[Cai-Bei], Jian, K.T.[Kun-Ting], Kuok, W.H.[Wai-He],
Content-Based Audio Classification Using Support Vector Machines and Independent Component Analysis,
ICPR06(IV: 157-160).
WWW Version. 0609
BibRef

Huang, R.Q.[Rong-Qing], Ma, C.X.[Chang-Xue],
Toward A Speaker-Independent Real-Time Affect Detection System,
ICPR06(I: 1204-1207).
WWW Version. 0609
BibRef

Wang, L.[Liang], Ambikairajah, E.[Eliathamby], Choi, E.H.C.[Eric H.C.],
Multi-lingual Phoneme Recognition and Language Identification Using Phonotactic Information,
ICPR06(IV: 245-248).
WWW Version. 0609
BibRef

Kruger, S.E.[Sven E.], Schaffoner, M.[Martin], Katz, M.[Marcel], Andelic, E.[Edin], Wendemuth, A.[Andreas],
Mixture of Support Vector Machines for HMM based Speech Recognition,
ICPR06(IV: 326-329).
WWW Version. 0609
BibRef

Zhang, S.L.[Shi-Lei], Zhang, S.W.[Shu-Wu], Xu, B.[Bo],
A Two-level Method for Unsupervised Speaker-based Audio Segmentation,
ICPR06(IV: 298-301).
WWW Version. 0609
BibRef

Pao, T.L.[Tsang-Long], Chen, Y.T.[Yu-Te], Yeh, J.H.[Jun-Heng], Li, P.J.[Pei-Jia],
Mandarin Emotional Speech Recognition Based on SVM and NN,
ICPR06(I: 1096-1100).
WWW Version. 0609
BibRef

Andelic, E.[Edin], Schaffoner, M.[Martin], Katz, M.[Marcel], Kruger, S.E.[Sven E.],
A Hybrid HMM-Based Speech Recognizer Using Kernel-Based Discriminants as Acoustic Models,
ICPR06(II: 1158-1161).
WWW Version. 0609
BibRef

You, M.[Mingyu], Chen, C.[Chun], Bu, J.J.[Jia-Jun], Liu, J.[Jia], Tao, J.H.[Jian-Hua],
Emotional Speech Analysis on Nonlinear Manifold,
ICPR06(III: 91-94).
WWW Version. 0609
BibRef

Halavati, R.[Ramin], Shouraki, S.B.[Saeed Bagheri], Tajik, H.[Hossein], Cholakian, A.[Arpineh], Razaghpour, M.[Mina],
A Novel Approach to Very Fast and Noise Robust, Isolated Word Speech Recognition,
ICPR06(III: 190-193).
WWW Version. 0609
BibRef

Lin, H.[Hui], Ou, Z.J.[Zhi-Jian],
Switching Auxiliary Chains for Speech Recognition based on Dynamic Bayesian Networks,
ICPR06(IV: 258-261).
WWW Version. 0609
BibRef

Li, W.H.[Wei-Hong], Liu, M.[Ming], Zhu, Z.G.[Zhi-Gang], Huang, T.S.[Thomas S.],
LDV Remote Voice Acquisition and Enhancement,
ICPR06(IV: 262-265).
WWW Version. 0609
BibRef

Maier, A.[Andreas], Hacker, C.[Christian], Noth, E.[Elmar], Nkenke, E.[Emeka], Haderlein, T.[Tino], Rosanowski, F.[Frank], Schuster, M.[Maria],
Intelligibility of Children with Cleft Lip and Palate: Evaluation by Speech Recognition Techniques,
ICPR06(IV: 274-277).
WWW Version. 0609
BibRef

Zioko, B.[Bartosz], Manandhar, S.[Suresh], Wilson, R.C.[Richard C.],
Phoneme segmentation of speech,
ICPR06(IV: 282-285).
WWW Version. 0609
BibRef

Choi, E.H.C.[Eric H. C.],
A Noise Robust Front-end for Speech Recognition Using Hough Transform and Cumulative Distribution Mapping,
ICPR06(IV: 286-289).
WWW Version. 0609
BibRef

Liu, M.[Ming], Huang, T.S.[Thomas S.],
A Bayesian Predictive Method for Automatic Speech Segmentation,
ICPR06(IV: 290-293).
WWW Version. 0609
BibRef

Xue, W.[Wei], Du, S.[Sidan], Fang, C.Z.[Cheng-Zhi], Ye, Y.[Yingxian],
Voice Activity Detection Using Wavelet-Based Multiresolution Spectrum and Support Vector Machines and Audio Mixing Algorithm,
CVHCI06(78-88).
Springer DOI Link 0605
BibRef

Haas, J.[Jürgen], Gallwitz, F.[Florian], Horndasch, A.[Axel], Huber, R.[Richard], Warnke, V.[Volker],
Telephone-Based Speech Dialog Systems,
DAGM05(125).
Springer DOI Link 0509
BibRef

Maier, A.[Andreas], Hacker, C.[Christian], Steidl, S.[Stefan], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Robust Parallel Speech Recognition in Multiple Energy Bands,
DAGM05(133).
Springer DOI Link 0509
BibRef

Hacker, C.[Christian], Cincarek, T.[Tobias], Gruhn, R.[Rainer], Steidl, S.[Stefan], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Pronunciation Feature Extraction,
DAGM05(141).
Springer DOI Link 0509
BibRef

Ivanecky, J.[Jozef], Fischer, J.[Julia], Mast, M.[Marion], Kunzmann, S.[Siegfried], Ross, T.[Thomas], Fischer, V.[Volker],
Multi-lingual and Multi-modal Speech Processing and Applications,
DAGM05(149).
Springer DOI Link 0509
BibRef

Dai, H.S.[Hai-Sheng], Zhu, X.Y.[Xiao-Yan], Luo, Y.P.[Yu-Pin], Yang, S.[Shiyuan],
An Utterance Verification Algorithm in Keyword Spotting System,
IbPRIA05(II:555).
Springer DOI Link 0509
BibRef

Rodríguez, L.J.[Luis Javier], Torres, M.I.[M. Inés],
A Clustering Algorithm for the Fast Match of Acoustic Conditions in Continuous Speech Recognition,
IbPRIA05(II:562).
Springer DOI Link 0509
BibRef

García-Perera, L.P.[L. Paola], Nolazco-Flores, J.A.[Juan A.], Mex-Perera, C.[Carlos],
Cryptographic-Speech-Key Generation Architecture Improvements,
IbPRIA05(II:579).
Springer DOI Link 0509
BibRef

Sánchez, J.A.[Joan Andreu], Benedí, J.M.[José Miguel], Linares, D.[Diego],
Performance of a SCFG-Based Language Model with Training Data Sets of Increasing Size,
IbPRIA05(II:586).
Springer DOI Link 0509
BibRef

Nolazco-Flores, J.A.[Juan A.], Salgado-Garza, L.R.[Luis R.], Peńa-Díaz, M.[Marco],
Speaker Dependent ASRs for Huastec and Western-Huastec Náhuatl Languages,
IbPRIA05(II:595).
Springer DOI Link 0509
BibRef

Ortiz, D.[Daniel], Varea, I.G.[Ismael García], Casacuberta, F.[Francisco],
A General Framework to Deal with the Scaling Problem in Phrase-Based Statistical Machine Translation,
IbPRIA07(II: 314-322).
Springer DOI Link 0706
BibRef

Tomás, J.[Jesús], Lloret, J.[Jaime], Casacuberta, F.[Francisco],
Phrase-Based Statistical Machine Translation Using Approximate Matching,
IbPRIA07(I: 475-482).
Springer DOI Link 0706
BibRef
Earlier:
Phrase-Based Alignment Models for Statistical Machine Translation,
IbPRIA05(II:605).
Springer DOI Link 0509
BibRef

García-Varea, I.[Ismael], Ortiz, D.[Daniel], Nevado, F.[Francisco], Gómez, P.A.[Pedro A.], Casacuberta, F.[Francisco],
Automatic Segmentation of Bilingual Corpora: A Comparison of Different Techniques,
IbPRIA05(II:614).
Springer DOI Link 0509
BibRef

Andrés, J.[Jesús], Navarro, J.R.[José R.], Juan, A.[Alfons], Casacuberta, F.[Francisco],
Word Translation Disambiguation Using Multinomial Classifiers,
IbPRIA05(II:622).
Springer DOI Link 0509
BibRef

Civera, J.[Jorge], Cubel, E.[Elsa], Juan, A.[Alfons], Vidal, E.[Enrique],
Different Approaches to Bilingual Text Classification Based on Grammatical Inference Techniques,
IbPRIA05(II:630).
Springer DOI Link 0509
BibRef

Ribadas, F.J.[Francisco Jose], Vilares, M.[Manuel], Vilares, J.[Jesus],
Semantic Similarity Between Sentences Through Approximate Tree Matching,
IbPRIA05(II:638).
Springer DOI Link 0509
BibRef

Welk, M.[Martin], Bergmeister, A.[Achim], Weickert, J.[Joachim],
Denoising of Audio Data by Nonlinear Diffusion,
ScaleSpace05(598-609).
WWW Version. 0505
BibRef

Chen, K.[Ke],
Speaker Modeling with Various Speech Representations,
ICBA04(592-599).
WWW Version. 0505
BibRef

Sit, C.H.[Chin-Hung], Mak, M.W.[Man-Wai], Kung, S.Y.[Sun-Yuan],
Maximum Likelihood and Maximum a Posteriori Adaptation for Distributed Speaker Recognition Systems,
ICBA04(640-647).
WWW Version. 0505
BibRef

Gutkin, A., King, S.,
Structural representation of speech for phonetic classification,
ICPR04(III: 438-441).
IEEE DOI Link 0409
BibRef

Cristani, M., Bicego, M., Murino, V.,
On-line adaptive background modelling for audio surveillance,
ICPR04(II: 399-402).
IEEE DOI Link 0409
BibRef

Demirekler, M., Karahan, F., Ciloglu, T.,
Fusing length and voicing information, and HMM decision using a Bayesian causal tree against insufficient training data,
ICPR00(Vol III: 102-105).
IEEE DOI Link 0403
BibRef

Kashino, K., Kurozumi, T., Murase, H.,
Feature fluctuation absorption for a quick audio retrieval from long recordings,
ICPR00(Vol III: 98-101).
IEEE DOI Link 0403
BibRef

Garcia-Varea, I., Sanchis, A., Casacuberta, F.,
A new approach to speech-input statistical translation,
ICPR00(Vol III: 90-93).
IEEE DOI Link 0403
BibRef

Gravier, G., Sigelle, M., Chollet, G.,
A Markov random field model for automatic speech recognition,
ICPR00(Vol III: 254-257).
IEEE DOI Link 0403
BibRef

Ruiz, N., Rosa, M., Lopez, F., Martinez, D., Mata, R.,
New algorithm for searching minimum bit rate wavelet representations with application to multiresolution-based perceptual audio coding,
ICPR00(Vol III: 286-289).
IEEE DOI Link 0403
BibRef

Steidl, S.[Stefan], Stemmer, G.[Georg], Hacker, C.[Christian], Nöth, E.[Elmar], Niemann, H.[Heinrich],
Improving Children's Speech Recognition by HMM Interpolation with an Adults' Speech Recognizer,
DAGM03(600-607).
HTML Version. 0310
BibRef

Stephenson, T.A., Magimai-Doss, M., Bourlard, H.,
Mixed bayesian networks with auxiliary variables for automatic speech recognition,
ICPR02(IV: 293-296).
IEEE DOI Link 0211
BibRef

Bourlard, H.,
Some recent advances in speech recognition with potential applications in other statistical pattern recognition areas,
ICPR02(III: 727-727).
IEEE DOI Link 0211
BibRef

Tanaka, K., Kojima, H., Fujimura, N., Itoh, Y.,
Constructing speech processing systems on universal phonetic codes accompanied with reference acoustic models,
ICPR02(III: 728-731).
IEEE DOI Link 0211
BibRef

Katz, M., Meier, H.G., Dolfing, H., Klakow, D.,
Robustness of linear discriminant analysis in automatic speech recognition,
ICPR02(III: 371-374).
IEEE DOI Link 0211
BibRef

Lefevre, S., Maillard, B., Vincent, N.,
A two level classifier process for audio segmentation,
ICPR02(III: 891-894).
IEEE DOI Link 0211
BibRef

de Stefano, C., Della Cioppa, A., Marcelli, A.,
An investigation on MPEG audio segmentation by evolutionary algorithms,
ICDAR01(952-956).
IEEE DOI Link 0109
BibRef

Nouza, J.,
Feature selection methods for hidden Markov model-based speech recognition,
ICPR96(II: 186-190).
IEEE DOI Link 0509
BibRef

Vande Wouwer, G., Scheunders, P., van Dyck, D.,
Wavelet-FILVQ classifier for speech analysis,
ICPR96(IV: 214-218).
IEEE DOI Link 0509
BibRef

Uma, S., Sridhar, V., Krishna, G.,
Time-normalization techniques for speaker-independent isolated word recognition,
ICPR92(III:537-540).
IEEE DOI Link 9208
BibRef

Rieck, S., Schukat-Talamazzini, E.G., Niemann, H.,
Speaker adaptation using semi-continuous hidden Markov models,
ICPR92(III:541-544).
IEEE DOI Link 9208
BibRef

He, H.Y.[Hai-Yan], Wen, C.Y.[Cheng-Yi],
ART2-based multiple MLPs neural network for speaker-independent recognition of isolated words,
ICPR92(II:590-593).
IEEE DOI Link 9208
BibRef

Edmonds, E.A., Pan, L.Y., O'Brien, S.M.,
Automatic feature extraction from spectrograms for acoustic-phonetic analysis,
ICPR92(II:701-704).
IEEE DOI Link 9208
BibRef

Ishikawa, Y., Nakajima, K.,
A real time connected word recognition system,
ICPR90(II: 215-217).
IEEE DOI Link 9008
BibRef

Chapter on New Unsorted Entries, and Other Miscellaneous Papers continues in
Speaker Verification, Speaker Identification .


Last update:Nov 16, 2009 at 19:35:14