Hanson, A.R.,
Riseman, E.M.,
Fisher, E.,
Context in word recognition,
PR(8), No. 1, January 1976, pp. 35-45.
WWW Version.
0309
BibRef
de Mori, R.,
Laface, P.,
Makhonine, V.A.,
Mezzalama, M.,
A syntactic procedure for the recognition of glottal pulses in
continuous speech,
PR(9), No. 4, 1977, pp. 181-189.
WWW Version.
0309
BibRef
Maroy, J.P.,
Berthod, M.,
Natural language understanding by a robot:
A pattern recognition problem,
PR(10), No. 2, 1978, pp. 63-71.
WWW Version.
0309
BibRef
Pal, S.K.,
Datta, A.K.,
Majumder, D.D.[D. Dutta],
A self-supervised vowel recognition system,
PR(12), No. 1, 1980, pp. 27-34.
WWW Version.
0309
BibRef
Pathak, A.[Amita],
Pal, S.K.[Sankar K.],
On the convergence of 'A self-supervised vowel recognition system',
PR(20), No. 2, 1987, pp. 237-244.
WWW Version.
0309
BibRef
de Mori, R.[Renato],
Giordano, G.[Giovanna],
Algorithms for syllabic hypothesization in continuous speech,
PR(14), No. 1-6, 1981, pp. 245-260.
WWW Version.
0309
BibRef
Howard, Jr., J.H.[James H.],
Feature selection in human auditory perception,
PR(15), No. 5, 1982, pp. 397-403.
WWW Version.
0309
BibRef
Thomason, M.G.,
Granum, E.,
Blake, R.E.,
Experiments in dynamic programming inference of Markov networks with
strings representing speech data,
PR(19), No. 5, 1986, pp. 343-352.
WWW Version.
0309
BibRef
Tanaka, E.[Eiichi],
Toyama, T.[Takanori],
Kawai, S.[Sachiko],
High speed error correction of phoneme sequences,
PR(19), No. 5, 1986, pp. 407-412.
WWW Version.
0309
BibRef
Lee, L.S.,
Tseng, C.Y.,
Chen, K.J.,
Huang, J.,
Hwang, C.H.,
Ting, P.Y.,
Lin, L.J.,
Chen, C.C.,
A Mandarin dictation machine based upon a hierarchical recognition
approach and Chinese natural language analysis,
PAMI(12), No. 7, July 1990, pp. 695-704.
IEEE Abstract. IEEE Top Reference.
WWW Version.
0401
BibRef
Kenny, P.,
Lennig, M.,
Mermelstein, P.,
Speaker adaptation in a large-vocabulary Gaussian HMM recognizer,
PAMI(12), No. 9, September 1990, pp. 917-920.
IEEE Abstract. IEEE Top Reference.
WWW Version.
0401
BibRef
Casacuberta, F.,
Some relations among stochastic finite state networks used in automatic
speech recognition,
PAMI(12), No. 7, July 1990, pp. 691-695.
IEEE Abstract. IEEE Top Reference.
WWW Version.
0401
BibRef
Yannakoudakis, E.J.,
Tsomokos, I.,
Hutton, P.J.,
n-Grams and their implication to natural language understanding,
PR(23), No. 5, 1990, pp. 509-528.
WWW Version.
0401
BibRef
Hochberg, J.,
Mniszewski, S.M.,
Calleja, T.,
Papcun, G.J.,
A default hierarchy for pronouncing English,
PAMI(13), No. 9, September 1991, pp. 957-964.
IEEE Abstract. IEEE Top Reference.
WWW Version.
0401
BibRef
Carlson, B.A.,
Clements, M.A.,
A computationally compact divergence measure for speech processing,
PAMI(13), No. 12, December 1991, pp. 1255-1260.
IEEE Abstract. IEEE Top Reference.
WWW Version.
0401
BibRef
Ney, H.[Hermann],
A comparative study of two search strategies for connected word
recognition: dynamic programming and heuristic search,
PAMI(14), No. 5, May 1992, pp. 586-595.
IEEE Abstract. IEEE Top Reference.
WWW Version.
0401
BibRef
Ney, H.[Hermann],
Stochastic Modelling: From Pattern Classification to Speech
Recognition and Translation,
ICPR00(Vol III: 21-28).
IEEE DOI Link
HTML Version.
0009
BibRef
Wu, J.X.[Jian-Xiong],
Chan, C.[Chorkin],
Isolated word recognition by neural network models with
cross-correlation coefficients for speech dynamics,
PAMI(15), No. 11, November 1993, pp. 1174-1185.
IEEE Abstract. IEEE Top Reference.
WWW Version.
0401
BibRef
Liu, L.C.[Lih-Cherng],
Chiou, D.[Denis],
Wang, H.C.[Hsiao-Chuan],
A speech recognition method based on feature distributions,
PR(24), No. 8, 1991, pp. 717-722.
WWW Version.
0401
BibRef
Pinkowski, B.[Ben],
Multiscale fourier descriptors for classifying semivowels in
spectrograms,
PR(26), No. 10, October 1993, pp. 1593-1602.
WWW Version.
0401
BibRef
Pinkowski, B.,
Principal Component Analysis of Speech Spectrogram Images,
PR(30), No. 5, May 1997, pp. 777-787.
WWW Version.
9705
BibRef
Chen, W.Y.[Wen-Yuan],
Liao, Y.F.[Yuan-Fu],
Chen, S.H.[Sin-Horng],
Speech recognition with hierarchical recurrent neural networks,
PR(28), No. 6, June 1995, pp. 795-805.
WWW Version.
0401
BibRef
Huo, Q.A.[Qi-Ang],
Chan, C.[Chorkin],
Contextual vector quantization for speech recognition with discrete
hidden Markov model,
PR(28), No. 4, April 1995, pp. 513-517.
WWW Version.
0401
BibRef
Pham, T.D.[Tuan D.],
Wagner, M.[Michael],
A geostatistical model for linear prediction analysis of speech,
PR(31), No. 12, December 1998, pp. 1981-1991.
WWW Version.
0401
BibRef
Lee, T.[Tan],
Ching, P.C.,
Chan, L.W.[Lai-Wan],
Isolated word recognition using modular recurrent neural networks,
PR(31), No. 6, June 1998, pp. 751-760.
WWW Version.
0401
BibRef
Tacer, B.[Berkant],
Loughlin, P.J.[Patrick J.],
Non-stationary signal classification using the joint moments of
time-frequency distributions,
PR(31), No. 11, November 1998, pp. 1635-1641.
WWW Version.
0401
BibRef
Han, J.[Jiqing],
Gao, W.[Wen],
Robust telephone speech recognition based on channel compensation,
PR(32), No. 6, June 1999, pp. 1061-1067.
WWW Version.
0401
BibRef
Lewis, M.A.[Michael A.],
Ramachandran, R.P.[Ravi P.],
Cochannel speaker count labelling based on the use of cepstral and
pitch prediction derived features,
PR(34), No. 2, February 2001, pp. 499-507.
WWW Version.
0011
BibRef
Kant, S.[Shri],
Verma, N.[Neelam],
An Effective Source Recognition Algorithm:
Extraction of Significant Binary Words,
PRL(21), No. 11, October 2000, pp. 981-988.
0010
BibRef
Kwong, S.,
He, Q.H.,
Man, K.F.,
Tang, K.S.,
A maximum model distance approach for HMM-based speech recognition,
PR(31), No. 3, March 1998, pp. 219-229.
WWW Version.
0401
BibRef
He, Q.H.,
Kwong, S.,
Man, K.F.,
Tang, K.S.,
An improved maximum model distance approach for HMM-based speech
recognition systems,
PR(33), No. 10, October 2000, pp. 1749-1758.
WWW Version.
0006
BibRef
Li, M.,
McAllister, H.G.,
Black, N.D.,
de Perez, T.A.,
Wavelet-based nonlinear AGC method for hearing aid loudness
compensation,
VISP(147), No. 6, December 2000, pp. 502-507.
0101
BibRef
Gray, P.,
Hollier, M.P.,
Massara, R.E.,
Non-intrusive speech-quality assessment using vocal-tract models,
VISP(147), No. 6, December 2000, pp. 493-501.
0101
BibRef
Wu, C.H.,
Chen, Y.J.,
Yan, G.L.,
Integration of phonetic and prosodic information for robust utterance
verification,
VISP(147), No. 1, February 2000, pp. 55.
0005
BibRef
Kim, W.[Wooil],
Kang, S.[Sunmee],
Ko, H.S.[Han-Seok],
Spectral subtraction based on phonetic dependency and masking effects,
VISP(147), No. 5, October 2000, pp. 423-427.
0101
BibRef
Hussain, A.,
Campbell, D.R.,
Intelligibility improvements using binaural diverse sub-band processing
applied to speech corrupted with automobile noise,
VISP(148), No. 2, April 2001, pp. 127-132.
0106
BibRef
Bohez, E.L.J.[Erik L.J.],
Senevirathne, T.R.,
Speech recognition using fractals,
PR(34), No. 11, November 2001, pp. 2227-2243.
WWW Version.
0108
BibRef
Sarkar, S.,
Poor, H.V.,
Multirate signal processing on finite fields,
VISP(148), No. 4, August 2001, pp. 254-262.
0201
BibRef
Chen, S.H.,
Wang, J.F.,
Application of wavelet transforms for C/V segmentation on Mandarin
speech signals,
VISP(148), No. 2, April 2001, pp. 133-139.
0106
BibRef
Mouria-Beji, F.[Fériel],
A hierarchical Bayesian model for continuous speech recognition,
PRL(23), No. 7, May 2002, pp. 773-781.
HTML Version.
0203
BibRef
Chen, F.K.,
Yang, J.F.,
Yan, Y.L.,
Candidate scheme for fast ACELP search,
VISP(149), No. 1, February 2002, pp. 10-16.
IEEE Top Reference.
0205
Algebraic code excited linear prediction. Speech coding.
BibRef
Mumolo, E.[Enzo],
Spectral domain texture analysis for speech enhancement,
PR(35), No. 10, October 2002, pp. 2181-2191.
WWW Version.
0206
BibRef
Liu, J.W.[Jing-Wei],
Cheng, Q.S.[Qian-Sheng],
Zheng, Z.G.[Zhong-Guo],
Qian, M.[Minping],
A DTW-based probability model for speaker feature analysis and data
mining,
PRL(23), No. 11, September 2002, pp. 1271-1276.
HTML Version.
0206
BibRef
Ding, Z.O.,
McLoughlin, I.V.,
Tan, E.C.,
Extension of proposal of standards for intelligibility tests of Chinese
speech: CDRT-tone,
VISP(150), No. 1, February 2003, pp. 1-5.
IEEE Top Reference.
0304
BibRef
Huang, C.S.[Chao-Shih],
Wang, H.C.[Hsiao-Chuan],
Bandwidth-adjusted LPC analysis for robust speech recognition,
PRL(24), No. 9-10, June 2003, pp. 1583-1587.
WWW Version.
0304
BibRef
Juang, Y.T.[Yau-Tarng],
Huang, K.C.[Kuo-Chang],
Ding, I.J.[Ing-Jr],
Speaker adaptation based on MAP estimation using fuzzy controller,
PRL(24), No. 15, November 2003, pp. 2807-2813.
WWW Version.
0308
BibRef
Ding, I.J.[Ing-Jr],
Incremental MLLR speaker adaptation by fuzzy logic control,
PR(40), No. 11, November 2007, pp. 3110-3119.
WWW Version.
0707
Speech recognition; Speaker adaptation; Hidden Markov model;
Maximum likelihood linear regression; T-S fuzzy logic controller
BibRef
Li, T.F.[Tze Fen],
Speech Recognition of Mandarin Monosyllables,
PR(36), No. 11, November 2003, pp. 2713-2721.
WWW Version.
0309
BibRef
Farooq, O.,
Datta, S.,
Wavelet based robust sub-band features for phoneme recognition,
VISP(151), No. 3, June 2004, pp. 187-193.
IEEE Abstract. IEEE Top Reference.
0409
BibRef
de Lamare, R.C.,
Alcaim, A.,
Strategies to improve the performance of very low bit rate speech
coders and application to a variable rate 1.2 kb/s codec,
VISP(152), No. 1, February 2005, pp. 74-86.
IEEE Abstract. IEEE Top Reference.
0501
BibRef
Ricotti, L.P.,
Multitapering and a wavelet variant of MFCC in speech recognition,
VISP(152), No. 1, February 2005, pp. 29-35.
IEEE Abstract. IEEE Top Reference.
0501
BibRef
Chen, K.[Ke],
On the use of different speech representations for speaker modeling,
SMC-C(35), No. 3, August 2005, pp. 301-314.
IEEE DOI Link
0508
BibRef
Vera-Candeas, P.,
Ruiz-Reyes, N.,
Rosa-Zurera, M.,
Lopez-Ferreras, F.,
Curpian-Alonso, J.,
New matching pursuit based sinusoidal modelling method for audio coding,
VISP(151), No. 1, February 2004, pp. 21-28.
IEEE Abstract. IEEE Top Reference.
0403
BibRef
Vera-Candeas, P.[Pedro],
Ruiz-Reyes, N.[Nicolás],
Rosa-Zurera, M.[Manuel],
Cuevas-Martinez, J.C.[Juan C.],
López-Ferreras, F.[Francisco],
Adaptive Signal Models for Wide-Band Speech and Audio Compression,
IbPRIA05(II:571).
Springer DOI Link
0509
BibRef
Zhong, W.,
Li, S.,
Tai, H.M.,
Signal subspace approach for narrowband noise reduction in speech,
VISP(152), No. 6, December 2005, pp. 800-805.
WWW Version.
0512
BibRef
Chen, B.[Berlin],
Exploring the use of latent topical information for statistical Chinese
spoken document retrieval,
PRL(27), No. 1, 1 January 2006, pp. 9-18.
WWW Version.
0512
BibRef
Chen, B.[Berlin],
Chen, Y.T.[Yi-Ting],
Extractive spoken document summarization for information retrieval,
PRL(29), No. 4, 1 March 2008, pp. 426-437.
WWW Version.
0711
Extractive summarization; Information retrieval; Topical mixture model;
Spoken documents; Speech recognition
BibRef
Wan, C.[Chunru],
Liu, M.C.[Ming-Chun],
Content-based audio retrieval with relevance feedback,
PRL(27), No. 2, 15 January 2006, pp. 85-92.
WWW Version.
0512
BibRef
Li, C.,
Li, S.,
Zhang, D.,
Chen, G.,
Cryptanalysis of a data securityp protection scheme for VoIP,
VISP(153), No. 1, February 2006, pp. 1-10.
WWW Version.
0602
BibRef
Radhakrishnan, R.[Regunathan],
Divakaran, A.[Ajay],
Xiong, Z.Y.[Zi-You],
Otsuka, I.[Isao],
A Content-Adaptive Analysis and Representation Framework for Audio
Event Discovery from 'Unscripted' Multimedia,
JASP(2006), 2006, pp. 1-24.
WWW Version.
0603
BibRef
Chu, W.T.[Wei-Ta],
Cheng, W.H.[Wen-Huang],
Wu, J.L.[Ja-Ling],
Semantic Context Detection Using Audio Event Fusion,
JASP(2006), 2006, pp. 1-12.
WWW Version.
0603
BibRef
Sandler, M.,
Black, D.,
Scalable audio coding for compression and loss resilient streaming,
VISP(153), No. 3, June 2006, pp. 331-339.
WWW Version.
0608
BibRef
Chang, J.H.[Joon-Hyuk],
Gazor, S.[Saeed],
Kim, N.S.[Nam Soo],
Mitra, S.K.[Sanjit K.],
Multiple statistical models for soft decision in noisy speech
enhancement,
PR(40), No. 3, March 2007, pp. 1123-1134.
WWW Version.
0611
Speech enhancement; DCT; Multiple statistical model; Gaussian;
Laplacian; Gamma; GOF; PSFM; SAP; PESQ
BibRef
Liu, J.W.[Jing-Wei],
Wang, Z.Y.[Zuo-Ying],
Xiao, X.[Xi],
A hybrid SVM/DDBHMM decision fusion modeling for robust continuous
digital speech recognition,
PRL(28), No. 8, 1 June 2007, pp. 912-920.
WWW Version.
0704
Speech recognition; Gaussian mixture model; Duration distribution based
hidden Markov model (DDBHMM); Support vector machine
BibRef
Guido, R.C.[Rodrigo Capobianco],
Pereira, J.C.[Jose Carlos],
Slaets, J.F.W.[Jan Frans Willem],
Introduction to the Special Issue:
Advances on pattern recognition for speech and audio processing,
PRL(28), No. 11, 1 August 2007, pp. 1283-1284.
WWW Version.
0706
BibRef
Leavitt, N.,
Two technologies vie for recognition in speech market,
Computer(36), No. 6, June 2003, pp. 13-16.
IEEE DOI Link
0306
BibRef
Paulson, L.D.,
Speech Recognition Moves from Software to Hardware,
Computer(39), No. 11, November 2006, pp. 15-18.
IEEE DOI Link
0611
BibRef
Stavrakoudis, D.G.,
Theocharis, J.B.,
Pipelined Recurrent Fuzzy Neural Networks for Nonlinear Adaptive Speech
Prediction,
SMC-B(37), No. 5, October 2007, pp. 1305-1320.
IEEE DOI Link
0711
BibRef
Frankel, J.[Joe],
King, S.[Simon],
Factoring Gaussian precision matrices for linear dynamic models,
PRL(28), No. 16, December 2007, pp. 2264-2272.
WWW Version.
0711
Linear dynamic model; Error distribution; Precision matrix
Speech.
BibRef
Chouireb, F.[Fatima],
Guerti, M.[Mhania],
Towards a high quality Arabic speech synthesis system based on neural
networks and residual excited vocal tract model,
SIViP(2), No. 1, January 2008, pp. 73-87.
Springer DOI Link
0712
BibRef
Araujo, L.[Lourdes],
Serrano, J.I.[J. Ignacio],
Highly accurate error-driven method for noun phrase detection,
PRL(29), No. 4, 1 March 2008, pp. 547-557.
WWW Version.
0711
Noun phrase detection; Evolutionary programming;
Grammar induction; Information retrieval
BibRef
Zhang, Y.X.[Yong-Xin],
Scordilis, M.S.[Michael S.],
Effective online unsupervised adaptation of Gaussian mixture models and
its application to speech classification,
PRL(29), No. 6, 15 April 2008, pp. 735-744.
WWW Version.
0803
Gaussian mixture model; Speech classification; Online adaptation;
Unsupervised adaptation
BibRef
Baluja, S.[Shumeet],
Covell, M.[Michele],
Waveprint: Efficient wavelet-based audio fingerprinting,
PR(41), No. 11, November 2008, pp. 3467-3480.
WWW Version.
0808
Audio retrieval; Applications; Image/video retrieval; Pattern analysis
BibRef
O'Shaughnessy, D.[Douglas],
Invited paper: Automatic speech recognition: History, methods and
challenges,
PR(41), No. 10, October 2008, pp. 2965-2979.
WWW Version.
0808
Automatic speech recognition; Hidden Markov models; Adaptation;
Compensation; Pattern recognition; Spectral representation
BibRef
Zeng, J.[Jia],
Xie, L.[Lei],
Liu, Z.Q.A.[Zhi-Qi-Ang],
Type-2 fuzzy Gaussian mixture models,
PR(41), No. 12, December 2008, pp. 3636-3643.
WWW Version.
0810
BibRef
Earlier: A1, A3, Only:
Type-2 fuzzy hidden markov models to phoneme recognition,
ICPR04(I: 192-195).
IEEE DOI Link
0409
Type-2 fuzzy sets; Gaussian mixture models; Hidden Markov models
BibRef
Chen, B.[Berlin],
Liu, S.H.[Shih-Hung],
Chu, F.H.[Fang-Hui],
Training data selection for improving discriminative training of
acoustic models,
PRL(30), No. 13, 1 October 2009, pp. 1228-1235,.
Elsevier DOI Link
WWW Version.
0909
Continuous speech recognition; Discriminative training; Acoustic
models; Data selection; Phone accuracy; Entropy
BibRef
Kuhnapfel, T.[Thorsten],
Tan, T.[Tele],
Venkatesh, S.[Svertha],
Igel, B.[Burkhard],
Distributed Audio Network for Speech Enhancement in Challenging Noise
Backgrounds,
AVSBS09(308-313).
IEEE DOI Link
0909
BibRef
Cristani, M.,
Pesarin, A.,
Drioli, C.,
Tavano, A.,
Perina, A.,
Murino, V.,
Auditory dialog analysis and understanding by generative modelling of
interactional dynamics,
CVPR4HB09(103-109).
IEEE DOI Link
0906
BibRef
Gosztolya, G.[Gábor],
Bánhalmi, A.[András],
Tóth, L.[László],
Using One-Class Classification Techniques in the Anti-phoneme Problem,
IbPRIA09(433-440).
Springer DOI Link
0906
BibRef
Chen, J.B.[Jin-Biao],
Zhang, S.[Shiqing],
Manifold learning-based phoneme recognition,
IASP09(308-312).
IEEE DOI Link
0904
BibRef
Mahdhaoui, A.[Ammar],
Chetouani, M.[Mohamed],
Zong, C.[Cong],
Motherese detection based on segmental and supra-segmental features,
ICPR08(1-4).
IEEE DOI Link
0812
parent-infant interactions.
BibRef
Zeng, Z.[Zhi],
Li, X.[Xin],
Ma, X.H.[Xiao-Hong],
Ji, Q.A.[Qi-Ang],
Adaptive context recognition based on audio signal,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Luo, L.[Li],
Lu, P.F.[Peng-Fei],
Wang, Z.F.[Zeng-Fu],
A real-time accompaniment system based on sung voice recognition,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Pesarin, A.,
Cristani, M.,
Murino, V.,
Drioli, C.,
Perina, A.,
Tavano, A.,
A statistical signature for automatic dialogue classification,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Choi, H.[Heeyoul],
Gutierrez-Osuna, R.[Ricardo],
Choi, S.J.[Seung-Jin],
Choe, Y.[Yoonsuck],
Kernel oriented discriminant analysis for speaker-independent phoneme
spaces,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Terry, L.[Louis],
Katsaggelos, A.K.[Aggelos K.],
A phone-viseme dynamic Bayesian network for audio-visual automatic
speech recognition,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Kuhnapfel, T.[Thorsten],
Tan, T.[Tele],
Venkatesh, S.[Svetha],
Nordholm, S.E.[Sven Erik],
Igel, B.[Burkhard],
Adaptive speech enhancement with varying noise backgrounds,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Krajewski, J.[Jarek],
Batliner, A.[Anton],
Wieland, R.[Rainer],
Multiple classifier applied on predicting microsleep from speech,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Banerjee, P.[Pratyush],
Garg, G.[Gaurav],
Mitra, P.[Pabitra],
Basu, A.[Anupam],
Application of triphone clustering in acoustic modeling for continuous
speech recognition in Bengali,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Li, X.K.[Xiao-Kun],
Deng, Y.[Yunbin],
Combining speech energy and edge information for fast and efficient
voice activity detection in noisy environments,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Ser, W.[Wee],
Cen, L.[Ling],
Yu, Z.L.[Zhu Liang],
A Hybrid PNN-GMM classification scheme for speech emotion recognition,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Dehzangi, O.[Omid],
Ma, B.[Bin],
Chng, E.S.[Eng Siong],
Li, H.Z.[Hai-Zhou],
Fuzzy rule selection using Iterative Rule Learning for speech data
classification,
ICPR08(1-4).
IEEE DOI Link
0812
BibRef
Bouzid, A.[Aďcha],
Ellouze, N.[Noureddine],
Voicing Detection in Noisy Speech Signal,
ICISP08(544-551).
Springer DOI Link
0807
BibRef
Kukharchik, P.,
Kheidorov, I.,
Bovbel, E.,
Ladeev, D.,
Speech Signal Processing Based on Wavelets and SVM for Vocal Tract
Pathology Detection,
ICISP08(192-199).
Springer DOI Link
0807
BibRef
Türkmen, H.I.[H. Irem],
Karsligil, M.E.[M. Elif],
Reconstruction of Dysphonic Speech by MELP,
CIARP08(767-774).
Springer DOI Link
0809
BibRef
Hain, T.[Thomas],
Burget, L.[Lukas],
Dines, J.[John],
Garau, G.[Giulia],
Karafiat, M.[Martin],
van Leeuwen, D.[David],
Lincoln, M.[Mike],
Wan, V.[Vincent],
The 2007 AMI(DA) System for Meeting Transcription,
MTPH07(xx-yy).
Springer DOI Link
0705
BibRef
Lamel, L.,
Bilinski, E.,
Gauvain, J.L.,
Adda, G.,
Barras, C.,
Zhu, X.,
The LIMSI RT07 Lecture Transcription System,
MTPH07(xx-yy).
Springer DOI Link
0705
BibRef
Fiscus, J.G.[Jonathan G.],
Ajot, J.[Jerome],
Garofolo, J.S.[John S.],
The Rich Transcription 2007 Meeting Recognition Evaluation,
MTPH07(xx-yy).
Springer DOI Link
0705
BibRef
Stolcke, A.[Andreas],
Anguera, X.[Xavier],
Boakye, K.[Kofi],
Çetin, Ö.[Özgür],
Janin, A.[Adam],
Magimai-Doss, M.[Mathew],
Wooters, C.[Chuck],
Zheng, J.[Jing],
The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System,
MTPH07(xx-yy).
Springer DOI Link
0705
BibRef
Huang, J.[Jing],
Marcheret, E.[Etienne],
Visweswariah, K.[Karthik],
Libal, V.[Vit],
Potamianos, G.[Gerasimos],
The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture
Meetings,
MTPH07(xx-yy).
Springer DOI Link
0705
BibRef
Wölfel, M.[Matthias],
Stüker, S.[Sebastian],
Kraft, F.[Florian],
The ISL RT-07 Speech-to-Text System,
MTPH07(xx-yy).
Springer DOI Link
0705
BibRef
Schuller, B.[Björn],
Wöllmer, M.[Martin],
Moosmayr, T.[Tobias],
Ruske, G.[Günther],
Rigoll, G.[Gerhard],
Switching Linear Dynamic Models for Noise Robust In-Car Speech
Recognition,
DAGM08(xx-yy).
Springer DOI Link
0806
BibRef
Patil, H.A.[Hemant A.],
Basu, T.K.,
Cepstral Domain Teager Energy for Identifying Perceptually Similar
Languages,
PReMI07(455-462).
Springer DOI Link
0712
BibRef
Manwani, N.[Naresh],
Mitra, S.K.[Suman K.],
Joshi, M.V.,
Spoken Language Identification for Indian Languages Using Split and
Merge EM Algorithm,
PReMI07(463-468).
Springer DOI Link
0712
BibRef
Rao, K.S.[K. Sreenivasa],
Laskar, R.H.,
Koolagudi, S.G.[Shashidhar G.],
Voice Transformation by Mapping the Features at Syllable Level,
PReMI07(479-486).
Springer DOI Link
0712
BibRef
Nagesha,
Kumar, G.H.[G. Hemantha],
Signal Resampling Technique Combining Level Crossing and Auditory
Features,
PReMI07(447-454).
Springer DOI Link
0712
BibRef
Hurtado, L.F.[Lluís F.],
Griol, D.[David],
Sanchis, E.[Emilio],
Segarra, E.[Encarna],
A Statistical User Simulation Technique for the Improvement of a Spoken
Dialog System,
CIARP07(743-752).
Springer DOI Link
0711
BibRef
Oropeza Rodríguez, J.L.[José Luis],
Suárez Guerra, S.[Sergio],
Sánchez Fernández, L.P.[Luis Pastor],
Using Adaptive Filter to Increase Automatic Speech Recognition Rate in
a Digit Corpus,
CIARP07(78-87).
Springer DOI Link
0711
BibRef
Várallyay, G.[György],
SSM: A Novel Method to Recognize the Fundamental Frequency in Voice
Signals,
CIARP07(88-95).
Springer DOI Link
0711
BibRef
Ohara, M.[Masatoshi],
Utsumi, A.[Akira],
Yamazoe, H.[Hirotake],
Abe, S.[Shinji],
Katayama, N.[Noriaki],
Attention Monitoring for Music Contents Based on Analysis of
Signal-Behavior Structures,
ACCV07(I: 292-302).
Springer DOI Link
0711
BibRef
Simőes, C.[Carla],
Teixeira, C.[Carlos],
Dias, M.[Miguel],
Braga, D.[Daniela],
Calado, A.[António],
European Portuguese Accent in Acoustic Models for Non-native English
Speakers,
CIARP07(734-742).
Springer DOI Link
0711
BibRef
Smeaton, A.F.[Alan F.],
McHugh, M.[Mike],
Towards event detection in an audio-based sensor network,
VSSN05(87-94).
WWW Version.
0511
BibRef
Esposito, A.[Anna],
Stejskal, V.[Vojtech],
Smékal, Z.[Zdenek],
Bourbakis, N.[Nikolaos],
The Significance of Empty Speech Pauses:
Cognitive and Algorithmic Issues,
BVAI07(542-554).
Springer DOI Link
0710
BibRef
Hernández, I.[Igmar],
García, P.[Paola],
Nolazco, J.[Juan],
Buera, L.[Luis],
Lleida, E.[Eduardo],
Robust Automatic Speech Recognition Using PD-MEEMLIN,
IbPRIA07(II: 1-8).
Springer DOI Link
0706
BibRef
Chung, Y.J.[Yong-Joo],
Bae, K.S.[Keun-Sung],
Data-Driven Jacobian Adaptation in a Multi-model Structure for Noisy
Speech Recognition,
IbPRIA07(II: 452-459).
Springer DOI Link
0706
BibRef
Expósito, J.E.M.[J. E. Muńoz],
Reyes, N.R.[N. Ruiz],
Galán, S.G.[S. Garcia],
Candeas, P.V.[P. Vera],
Speech/Music Classification Based on Distributed Evolutionary Fuzzy
Logic for Intelligent Audio Coding,
IbPRIA07(II: 556-563).
Springer DOI Link
0706
BibRef
Ferrer, C.A.[Carlos A.],
González, E.[Eduardo],
Hernández-Díaz, M.E.[María E.],
Evaluation of Time and Frequency Domain-Based Methods for the
Estimation of Harmonics-to-Noise-Ratios in Voice Signals,
CIARP06(406-415).
Springer DOI Link
0611
BibRef
Cano, S.[Sergio],
Suaste, I.[Israel],
Escobedo, D.[Daniel],
Reyes-García, C.A.[Carlos A.],
Ekkel, T.[Taco],
A Combined Classifier of Cry Units with New Acoustic Attributes,
CIARP06(416-425).
Springer DOI Link
0611
BibRef
Huerta-Hernández, L.D.[Luis D.],
Reyes-García, C.A.[Carlos A.],
On the Processing of Fuzzy Patterns for Text Independent Phonetic
Speech Segmentation,
CIARP06(437-445).
Springer DOI Link
0611
BibRef
Alghassi, H.,
Tafazoli, S.,
Lawrence, P.,
The Audio Surveillance Eye,
AVSBS06(106-106).
IEEE DOI Link
0611
BibRef
Yuan, L.[Lichi],
Chen, Z.G.[Zhi-Gang],
A Novel Statistical Model for Speech Recognition and POS Tagging,
AVSBS06(61-61).
IEEE DOI Link
0611
BibRef
Yin, B.[Bo],
Ambikairajah, E.[Eliathamby],
Chen, F.[Fang],
Combining Cepstral and Prosodic Features in Language Identification,
ICPR06(IV: 254-257).
WWW Version.
0609
BibRef
Leila,
Chollet, G.[Gerard],
Efficient Gaussian Mixture for Speech Recognition,
ICPR06(IV: 294-297).
WWW Version.
0609
BibRef
Vinciarelli, A.[Alessandro],
Sociometry Based Multiparty Audio Recordings Summarization,
ICPR06(II: 1154-1157).
WWW Version.
0609
BibRef
Wang, J.C.[Jia-Ching],
Wang, J.F.[Jhing-Fa],
Lin, C.B.[Cai-Bei],
Jian, K.T.[Kun-Ting],
Kuok, W.H.[Wai-He],
Content-Based Audio Classification Using Support Vector Machines and
Independent Component Analysis,
ICPR06(IV: 157-160).
WWW Version.
0609
BibRef
Huang, R.Q.[Rong-Qing],
Ma, C.X.[Chang-Xue],
Toward A Speaker-Independent Real-Time Affect Detection System,
ICPR06(I: 1204-1207).
WWW Version.
0609
BibRef
Wang, L.[Liang],
Ambikairajah, E.[Eliathamby],
Choi, E.H.C.[Eric H.C.],
Multi-lingual Phoneme Recognition and Language Identification Using
Phonotactic Information,
ICPR06(IV: 245-248).
WWW Version.
0609
BibRef
Kruger, S.E.[Sven E.],
Schaffoner, M.[Martin],
Katz, M.[Marcel],
Andelic, E.[Edin],
Wendemuth, A.[Andreas],
Mixture of Support Vector Machines for HMM based Speech Recognition,
ICPR06(IV: 326-329).
WWW Version.
0609
BibRef
Zhang, S.L.[Shi-Lei],
Zhang, S.W.[Shu-Wu],
Xu, B.[Bo],
A Two-level Method for Unsupervised Speaker-based Audio Segmentation,
ICPR06(IV: 298-301).
WWW Version.
0609
BibRef
Pao, T.L.[Tsang-Long],
Chen, Y.T.[Yu-Te],
Yeh, J.H.[Jun-Heng],
Li, P.J.[Pei-Jia],
Mandarin Emotional Speech Recognition Based on SVM and NN,
ICPR06(I: 1096-1100).
WWW Version.
0609
BibRef
Andelic, E.[Edin],
Schaffoner, M.[Martin],
Katz, M.[Marcel],
Kruger, S.E.[Sven E.],
A Hybrid HMM-Based Speech Recognizer Using Kernel-Based Discriminants
as Acoustic Models,
ICPR06(II: 1158-1161).
WWW Version.
0609
BibRef
You, M.[Mingyu],
Chen, C.[Chun],
Bu, J.J.[Jia-Jun],
Liu, J.[Jia],
Tao, J.H.[Jian-Hua],
Emotional Speech Analysis on Nonlinear Manifold,
ICPR06(III: 91-94).
WWW Version.
0609
BibRef
Halavati, R.[Ramin],
Shouraki, S.B.[Saeed Bagheri],
Tajik, H.[Hossein],
Cholakian, A.[Arpineh],
Razaghpour, M.[Mina],
A Novel Approach to Very Fast and Noise Robust, Isolated Word Speech
Recognition,
ICPR06(III: 190-193).
WWW Version.
0609
BibRef
Lin, H.[Hui],
Ou, Z.J.[Zhi-Jian],
Switching Auxiliary Chains for Speech Recognition based on Dynamic
Bayesian Networks,
ICPR06(IV: 258-261).
WWW Version.
0609
BibRef
Li, W.H.[Wei-Hong],
Liu, M.[Ming],
Zhu, Z.G.[Zhi-Gang],
Huang, T.S.[Thomas S.],
LDV Remote Voice Acquisition and Enhancement,
ICPR06(IV: 262-265).
WWW Version.
0609
BibRef
Maier, A.[Andreas],
Hacker, C.[Christian],
Noth, E.[Elmar],
Nkenke, E.[Emeka],
Haderlein, T.[Tino],
Rosanowski, F.[Frank],
Schuster, M.[Maria],
Intelligibility of Children with Cleft Lip and Palate:
Evaluation by Speech Recognition Techniques,
ICPR06(IV: 274-277).
WWW Version.
0609
BibRef
Zioko, B.[Bartosz],
Manandhar, S.[Suresh],
Wilson, R.C.[Richard C.],
Phoneme segmentation of speech,
ICPR06(IV: 282-285).
WWW Version.
0609
BibRef
Choi, E.H.C.[Eric H. C.],
A Noise Robust Front-end for Speech Recognition Using Hough Transform
and Cumulative Distribution Mapping,
ICPR06(IV: 286-289).
WWW Version.
0609
BibRef
Liu, M.[Ming],
Huang, T.S.[Thomas S.],
A Bayesian Predictive Method for Automatic Speech Segmentation,
ICPR06(IV: 290-293).
WWW Version.
0609
BibRef
Xue, W.[Wei],
Du, S.[Sidan],
Fang, C.Z.[Cheng-Zhi],
Ye, Y.[Yingxian],
Voice Activity Detection Using Wavelet-Based Multiresolution Spectrum
and Support Vector Machines and Audio Mixing Algorithm,
CVHCI06(78-88).
Springer DOI Link
0605
BibRef
Haas, J.[Jürgen],
Gallwitz, F.[Florian],
Horndasch, A.[Axel],
Huber, R.[Richard],
Warnke, V.[Volker],
Telephone-Based Speech Dialog Systems,
DAGM05(125).
Springer DOI Link
0509
BibRef
Maier, A.[Andreas],
Hacker, C.[Christian],
Steidl, S.[Stefan],
Nöth, E.[Elmar],
Niemann, H.[Heinrich],
Robust Parallel Speech Recognition in Multiple Energy Bands,
DAGM05(133).
Springer DOI Link
0509
BibRef
Hacker, C.[Christian],
Cincarek, T.[Tobias],
Gruhn, R.[Rainer],
Steidl, S.[Stefan],
Nöth, E.[Elmar],
Niemann, H.[Heinrich],
Pronunciation Feature Extraction,
DAGM05(141).
Springer DOI Link
0509
BibRef
Ivanecky, J.[Jozef],
Fischer, J.[Julia],
Mast, M.[Marion],
Kunzmann, S.[Siegfried],
Ross, T.[Thomas],
Fischer, V.[Volker],
Multi-lingual and Multi-modal Speech Processing and Applications,
DAGM05(149).
Springer DOI Link
0509
BibRef
Dai, H.S.[Hai-Sheng],
Zhu, X.Y.[Xiao-Yan],
Luo, Y.P.[Yu-Pin],
Yang, S.[Shiyuan],
An Utterance Verification Algorithm in Keyword Spotting System,
IbPRIA05(II:555).
Springer DOI Link
0509
BibRef
Rodríguez, L.J.[Luis Javier],
Torres, M.I.[M. Inés],
A Clustering Algorithm for the Fast Match of Acoustic Conditions in
Continuous Speech Recognition,
IbPRIA05(II:562).
Springer DOI Link
0509
BibRef
García-Perera, L.P.[L. Paola],
Nolazco-Flores, J.A.[Juan A.],
Mex-Perera, C.[Carlos],
Cryptographic-Speech-Key Generation Architecture Improvements,
IbPRIA05(II:579).
Springer DOI Link
0509
BibRef
Sánchez, J.A.[Joan Andreu],
Benedí, J.M.[José Miguel],
Linares, D.[Diego],
Performance of a SCFG-Based Language Model with Training Data Sets of
Increasing Size,
IbPRIA05(II:586).
Springer DOI Link
0509
BibRef
Nolazco-Flores, J.A.[Juan A.],
Salgado-Garza, L.R.[Luis R.],
Peńa-Díaz, M.[Marco],
Speaker Dependent ASRs for Huastec and Western-Huastec Náhuatl
Languages,
IbPRIA05(II:595).
Springer DOI Link
0509
BibRef
Ortiz, D.[Daniel],
Varea, I.G.[Ismael García],
Casacuberta, F.[Francisco],
A General Framework to Deal with the Scaling Problem in Phrase-Based
Statistical Machine Translation,
IbPRIA07(II: 314-322).
Springer DOI Link
0706
BibRef
Tomás, J.[Jesús],
Lloret, J.[Jaime],
Casacuberta, F.[Francisco],
Phrase-Based Statistical Machine Translation Using Approximate Matching,
IbPRIA07(I: 475-482).
Springer DOI Link
0706
BibRef
Earlier:
Phrase-Based Alignment Models for Statistical Machine Translation,
IbPRIA05(II:605).
Springer DOI Link
0509
BibRef
García-Varea, I.[Ismael],
Ortiz, D.[Daniel],
Nevado, F.[Francisco],
Gómez, P.A.[Pedro A.],
Casacuberta, F.[Francisco],
Automatic Segmentation of Bilingual Corpora: A Comparison of Different
Techniques,
IbPRIA05(II:614).
Springer DOI Link
0509
BibRef
Andrés, J.[Jesús],
Navarro, J.R.[José R.],
Juan, A.[Alfons],
Casacuberta, F.[Francisco],
Word Translation Disambiguation Using Multinomial Classifiers,
IbPRIA05(II:622).
Springer DOI Link
0509
BibRef
Civera, J.[Jorge],
Cubel, E.[Elsa],
Juan, A.[Alfons],
Vidal, E.[Enrique],
Different Approaches to Bilingual Text Classification Based on
Grammatical Inference Techniques,
IbPRIA05(II:630).
Springer DOI Link
0509
BibRef
Ribadas, F.J.[Francisco Jose],
Vilares, M.[Manuel],
Vilares, J.[Jesus],
Semantic Similarity Between Sentences Through Approximate Tree Matching,
IbPRIA05(II:638).
Springer DOI Link
0509
BibRef
Welk, M.[Martin],
Bergmeister, A.[Achim],
Weickert, J.[Joachim],
Denoising of Audio Data by Nonlinear Diffusion,
ScaleSpace05(598-609).
WWW Version.
0505
BibRef
Chen, K.[Ke],
Speaker Modeling with Various Speech Representations,
ICBA04(592-599).
WWW Version.
0505
BibRef
Sit, C.H.[Chin-Hung],
Mak, M.W.[Man-Wai],
Kung, S.Y.[Sun-Yuan],
Maximum Likelihood and Maximum a Posteriori Adaptation for Distributed
Speaker Recognition Systems,
ICBA04(640-647).
WWW Version.
0505
BibRef
Gutkin, A.,
King, S.,
Structural representation of speech for phonetic classification,
ICPR04(III: 438-441).
IEEE DOI Link
0409
BibRef
Cristani, M.,
Bicego, M.,
Murino, V.,
On-line adaptive background modelling for audio surveillance,
ICPR04(II: 399-402).
IEEE DOI Link
0409
BibRef
Demirekler, M.,
Karahan, F.,
Ciloglu, T.,
Fusing length and voicing information, and HMM decision using a
Bayesian causal tree against insufficient training data,
ICPR00(Vol III: 102-105).
IEEE DOI Link
0403
BibRef
Kashino, K.,
Kurozumi, T.,
Murase, H.,
Feature fluctuation absorption for a quick audio retrieval from long
recordings,
ICPR00(Vol III: 98-101).
IEEE DOI Link
0403
BibRef
Garcia-Varea, I.,
Sanchis, A.,
Casacuberta, F.,
A new approach to speech-input statistical translation,
ICPR00(Vol III: 90-93).
IEEE DOI Link
0403
BibRef
Gravier, G.,
Sigelle, M.,
Chollet, G.,
A Markov random field model for automatic speech recognition,
ICPR00(Vol III: 254-257).
IEEE DOI Link
0403
BibRef
Ruiz, N.,
Rosa, M.,
Lopez, F.,
Martinez, D.,
Mata, R.,
New algorithm for searching minimum bit rate wavelet representations
with application to multiresolution-based perceptual audio coding,
ICPR00(Vol III: 286-289).
IEEE DOI Link
0403
BibRef
Steidl, S.[Stefan],
Stemmer, G.[Georg],
Hacker, C.[Christian],
Nöth, E.[Elmar],
Niemann, H.[Heinrich],
Improving Children's Speech Recognition by HMM Interpolation with an
Adults' Speech Recognizer,
DAGM03(600-607).
HTML Version.
0310
BibRef
Stephenson, T.A.,
Magimai-Doss, M.,
Bourlard, H.,
Mixed bayesian networks with auxiliary variables for automatic speech
recognition,
ICPR02(IV: 293-296).
IEEE DOI Link
0211
BibRef
Bourlard, H.,
Some recent advances in speech recognition with potential applications
in other statistical pattern recognition areas,
ICPR02(III: 727-727).
IEEE DOI Link
0211
BibRef
Tanaka, K.,
Kojima, H.,
Fujimura, N.,
Itoh, Y.,
Constructing speech processing systems on universal phonetic codes
accompanied with reference acoustic models,
ICPR02(III: 728-731).
IEEE DOI Link
0211
BibRef
Katz, M.,
Meier, H.G.,
Dolfing, H.,
Klakow, D.,
Robustness of linear discriminant analysis in automatic speech
recognition,
ICPR02(III: 371-374).
IEEE DOI Link
0211
BibRef
Lefevre, S.,
Maillard, B.,
Vincent, N.,
A two level classifier process for audio segmentation,
ICPR02(III: 891-894).
IEEE DOI Link
0211
BibRef
de Stefano, C.,
Della Cioppa, A.,
Marcelli, A.,
An investigation on MPEG audio segmentation by evolutionary algorithms,
ICDAR01(952-956).
IEEE DOI Link
0109
BibRef
Nouza, J.,
Feature selection methods for hidden Markov model-based speech
recognition,
ICPR96(II: 186-190).
IEEE DOI Link
0509
BibRef
Vande Wouwer, G.,
Scheunders, P.,
van Dyck, D.,
Wavelet-FILVQ classifier for speech analysis,
ICPR96(IV: 214-218).
IEEE DOI Link
0509
BibRef
Uma, S.,
Sridhar, V.,
Krishna, G.,
Time-normalization techniques for speaker-independent isolated word
recognition,
ICPR92(III:537-540).
IEEE DOI Link
9208
BibRef
Rieck, S.,
Schukat-Talamazzini, E.G.,
Niemann, H.,
Speaker adaptation using semi-continuous hidden Markov models,
ICPR92(III:541-544).
IEEE DOI Link
9208
BibRef
He, H.Y.[Hai-Yan],
Wen, C.Y.[Cheng-Yi],
ART2-based multiple MLPs neural network for speaker-independent
recognition of isolated words,
ICPR92(II:590-593).
IEEE DOI Link
9208
BibRef
Edmonds, E.A.,
Pan, L.Y.,
O'Brien, S.M.,
Automatic feature extraction from spectrograms for acoustic-phonetic
analysis,
ICPR92(II:701-704).
IEEE DOI Link
9208
BibRef
Ishikawa, Y.,
Nakajima, K.,
A real time connected word recognition system,
ICPR90(II: 215-217).
IEEE DOI Link
9008
BibRef
Chapter on New Unsorted Entries, and Other Miscellaneous Papers continues in
Speaker Verification, Speaker Identification .