24.1.10.1 Speech Analysis, other than Recognition

Speech analysis: not so much what is said, but other kinds of analysis.

Howard, Jr., J.H.[James H.],
Feature selection in human auditory perception,
PR(15), No. 5, 1982, pp. 397-403.
WWW Version. 0309
BibRef

Thomason, M.G., Granum, E., Blake, R.E.,
Experiments in dynamic programming inference of Markov networks with strings representing speech data,
PR(19), No. 5, 1986, pp. 343-352.
WWW Version. 0309
BibRef

Hochberg, J., Mniszewski, S.M., Calleja, T., Papcun, G.J.,
A default hierarchy for pronouncing English,
PAMI(13), No. 9, September 1991, pp. 957-964.
IEEE DOI Link 0401
BibRef

Carlson, B.A., Clements, M.A.,
A computationally compact divergence measure for speech processing,
PAMI(13), No. 12, December 1991, pp. 1255-1260.
IEEE DOI Link 0401
BibRef

Tacer, B.[Berkant], Loughlin, P.J.[Patrick J.],
Non-stationary signal classification using the joint moments of time-frequency distributions,
PR(31), No. 11, November 1998, pp. 1635-1641.
WWW Version. 0401
BibRef

Li, M., McAllister, H.G., Black, N.D., de Perez, T.A.,
Wavelet-based nonlinear AGC method for hearing aid loudness compensation,
VISP(147), No. 6, December 2000, pp. 502-507. 0101
BibRef

Gray, P., Hollier, M.P., Massara, R.E.,
Non-intrusive speech-quality assessment using vocal-tract models,
VISP(147), No. 6, December 2000, pp. 493-501. 0101
BibRef

Sarkar, S., Poor, H.V.,
Multirate signal processing on finite fields,
VISP(148), No. 4, August 2001, pp. 254-262. 0201
BibRef

Mumolo, E.[Enzo],
Spectral domain texture analysis for speech enhancement,
PR(35), No. 10, October 2002, pp. 2181-2191.
WWW Version. 0206
BibRef

de Lamare, R.C., Alcaim, A.,
Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec,
VISP(152), No. 1, February 2005, pp. 74-86.
IEEE Abstract. 0501
BibRef

Vera-Candeas, P., Ruiz-Reyes, N., Rosa-Zurera, M., Lopez-Ferreras, F., Curpian-Alonso, J.,
New matching pursuit based sinusoidal modelling method for audio coding,
VISP(151), No. 1, February 2004, pp. 21-28.
IEEE Abstract. 0403
BibRef

Vera-Candeas, P.[Pedro], Ruiz-Reyes, N.[Nicolás], Rosa-Zurera, M.[Manuel], Cuevas-Martinez, J.C.[Juan C.], López-Ferreras, F.[Francisco],
Adaptive Signal Models for Wide-Band Speech and Audio Compression,
IbPRIA05(II:571).
Springer DOI Link 0509
BibRef

Li, C., Li, S., Zhang, D., Chen, G.,
Cryptanalysis of a data security protection scheme for VoIP,
VISP(153), No. 1, February 2006, pp. 1-10.
WWW Version. 0602
BibRef

Sandler, M., Black, D.,
Scalable audio coding for compression and loss resilient streaming,
VISP(153), No. 3, June 2006, pp. 331-339.
WWW Version. 0608
BibRef

Chang, J.H.[Joon-Hyuk], Gazor, S.[Saeed], Kim, N.S.[Nam Soo], Mitra, S.K.[Sanjit K.],
Multiple statistical models for soft decision in noisy speech enhancement,
PR(40), No. 3, March 2007, pp. 1123-1134.
WWW Version. 0611
Speech enhancement; DCT; Multiple statistical model; Gaussian; Laplacian; Gamma; GOF; PSFM; SAP; PESQ BibRef

Frankel, J.[Joe], King, S.[Simon],
Factoring Gaussian precision matrices for linear dynamic models,
PRL(28), No. 16, December 2007, pp. 2264-2272.
WWW Version. 0711
Linear dynamic model; Error distribution; Precision matrix Speech. BibRef

Mahdi, A.E.[Abdulhussain E.], Picovici, D.[Dorel],
New single-ended objective measure for non-intrusive speech quality evaluation,
SIViP(4), No. 1, March 2010, pp. xx-yy.
Springer DOI Link 1003
BibRef

Shafiee, S.[Soheil], Almasganj, F.[Farshad], Vazirnezhad, B.[Bahram], Jafari, A.[Ayyoob],
A two-stage speech activity detection system considering fractal aspects of prosody,
PRL(31), No. 9, 1 July 2010, pp. 936-948.
Elsevier DOI Link
WWW Version. 1004
Speech activity detection; Prosody; Fractal dimension BibRef

Dennis, J., Tran, H.D., Li, H.,
Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions,
SPLetters(18), No. 2, February 2011, pp. 130-133.
IEEE DOI Link 1101
BibRef

Liang, Y.[Yuan], Liu, X.L.[Xiang-Long], Lou, Y.H.[Yi-Hua], Shan, B.[Baosong],
An improved noise-robust voice activity detector based on hidden semi-Markov models,
PRL(32), No. 7, 1 May 2011, pp. 1044-1053.
Elsevier DOI Link
WWW Version. 1101
Voice activity detection; State duration; Observation distribution; Hidden semi-Markov model; Likelihood ratio test; Forward variable BibRef

Liu, X.L.[Xiang-Long], Liang, Y.[Yuan], Lou, Y.H.[Yi-Hua], Li, H.[He], Shan, B.[Baosong],
Noise-Robust Voice Activity Detector Based on Hidden Semi-Markov Models,
ICPR10(81-84).
IEEE DOI Link 1008
BibRef

Mohanty, M.N.[Mihir Narayan], Jena, B.[Bhagyalaxmi],
Analysis of stressed human speech,
IJCVR(2), No. 2, 2011, pp. 180-187.
WWW Version. 1109
BibRef

Lopez-Moreno, I., Ramos, D., Gonzalez-Dominguez, J., Gonzalez-Rodriguez, J.,
Von Mises-Fisher Models in the Total Variability Subspace for Language Recognition,
SPLetters(18), No. 12, December 2011, pp. 705-708.
IEEE DOI Link 1112
BibRef

Jelassi, S.[Sofiene], Rubino, G.[Gerardo],
A study of artificial speech quality assessors of VoIP calls subject to limited bursty packet losses,
JIVP(2011), No. 1, 2011, pp. xx-yy.
WWW Version. 1203
BibRef

Ben Aicha, A.[Anis], Ben Jebara, S.[Sofia],
Reduction of musical residual noise using perceptual tools with classic speech denoising techniques,
SIViP(6), No. 1, March 2012, pp. 85-97.
WWW Version. 1203
BibRef

Pulakka, H., Laaksonen, L., Myllyla, V., Yrttiaho, Y., Alku, P.,
Conversational Evaluation of Speech Bandwidth Extension Using a Mobile Handset,
SPLetters(19), No. 4, April 2012, pp. 203-206.
IEEE DOI Link 1203
BibRef


Krum, D.M.[David M.], Suma, E.A.[Evan A.], Bolas, M.[Mark],
Spatial misregistration of virtual human audio: Implications of the precedence effect,
3DUI12(147-148).
IEEE DOI Link 1204
BibRef

Yang, Y.J.[Ying-Jie], Zhang, H.H.[Huan-Huan], Guo, X.[Xiue],
A pitch tracking method mixing ACF and AMDF algorithms based on correlations,
IASP11(553-556).
IEEE DOI Link 1112
autocorrelation functions; average magnitude difference functions. Speech BibRef

Guo, S.[Shuni], Gao, L.[Lu], Yu, H.[Hongzhi],
Research on Lhasa Tibetan prosodic model of journalese based on respiratory signal,
IASP11(26-30).
IEEE DOI Link 1112
BibRef

Resmi, K., Kumar, S.[Satish], Sardana, H.K., Chhabra, R.[Radhika],
Graphical Speech Training system for hearing impaired,
ICIIP11(1-6).
IEEE DOI Link 1112
BibRef

Gao, L.[Lu], Yu, H.Z.[Hong-Zhi], Zhang, J.H.[Jins-Huang], Fang, H.P.[Hua-Ping],
Research on HMM_based speech synthesis for Lhasa dialect,
IASP11(429-433).
IEEE DOI Link 1112
BibRef

Gómez, J.A.[Jon Ander], Calvo, M.[Marcos],
Improvements on Automatic Speech Segmentation at the Phonetic Level,
CIARP11(557-564).
Springer DOI Link 1111
BibRef

Deng, S.[Shiwen], Han, J.[Jiqing],
Voice Activity Detection Based on Complex Exponential Atomic Decomposition and Likelihood Ratio Test,
ICPR10(89-92).
IEEE DOI Link 1008
BibRef

Le, P.N.[Phu Ngoc], Epps, J.[Julien], Choi, E.H.C.[Eric H.C.], Ambikairajah, E.[Eliathamby],
A Study of Voice Source and Vocal Tract Filter Based Features in Cognitive Load Classification,
ICPR10(4516-4519).
IEEE DOI Link 1008
BibRef

Stark, M.[Michael], Wohlmayr, M.[Michael], Pernkopf, F.[Franz],
Single Channel Speech Separation Using Source-Filter Representation,
ICPR10(826-829).
IEEE DOI Link 1008
BibRef

Stadelmann, T.[Thilo], Wang, Y.H.[Ying-Hui], Smith, M.[Matthew], Ewerth, R.[Ralph], Freisleben, B.[Bernd],
Rethinking Algorithm Design and Development in Speech Processing,
ICPR10(4476-4479).
IEEE DOI Link 1008
BibRef

Gonzalez-Caravaca, G.[Guillermo], Toledano, D.T.[Doroteo Torre], Puertas, M.[Maria],
Phone-Conditioned Suboptimal Wiener Filtering,
ICPR10(4480-4483).
IEEE DOI Link 1008
BibRef

Sepehr, H.[Hamid], Nooralahiyan, A.Y.[Amir Y.], Brennan, P.V.[Paul V.],
Improving Performance of a Noise Reduction Algorithm by Switching the Analysis Filter Bank,
ICISP10(262-271).
Springer DOI Link 1006
for speech BibRef

Kos, M., Grasic, M., Vlaj, D., Kacic, Z.,
On-Line Speech/Music Segmentation for Broadcast News Domain,
WSSIP09(1-4).
IEEE DOI Link 0906
BibRef

Grasic, M., Kos, M., Vlaj, D., Kacic, Z.,
The Influence of Speech/Non-Speech Segmentation on On-Line and Off-Line Speaker Segmentation Accuracy,
WSSIP09(1-4).
IEEE DOI Link 0906
BibRef

Zuta, V.[Vivien],
Voice Pleasantness of Female Voices and the Assessment of Physical Characteristics,
COST08(116-125).
Springer DOI Link 0810
BibRef

Tucková, J.[Jana], Holub, J.[Jan], Dubeda, T.[Tomáš],
Technical and Phonetic Aspects of Speech Quality Assessment: The Case of Prosody Synthesis,
COST08(126-132).
Springer DOI Link 0810
BibRef

Pignotti, A.[Alessio], Marcozzi, D.[Daniele], Cifani, S.[Simone], Squartini, S.[Stefano], Piazza, F.[Francesco],
A Blind Source Separation Based Approach for Speech Enhancement in Noisy and Reverberant Environment,
COST08(356-367).
Springer DOI Link 0810
BibRef

Stadelmann, T., Heinzl, S., Unterberger, M., Freisleben, B.,
WebVoice: A Toolkit for Perceptual Insights into Speech Processing,
CISP09(1-5).
IEEE DOI Link 0910
BibRef

Tang, Y.B.[Yi-Bin], Huang, R.[Rong], Wu, Z.Y.[Zhen-Yang],
A 2.4kbps Multiband Characteristic Waveform Interpolation Speech Coding Algorithm,
CISP09(1-4).
IEEE DOI Link 0910
BibRef

Zou, X.[Xia], Zhang, X.W.[Xiong-Wei],
A 450bps Speech Coding Algorithm Based on Multi-Mode Matrix Quantization,
CISP09(1-3).
IEEE DOI Link 0910
BibRef

Kuhnapfel, T.[Thorsten], Tan, T.[Tele], Venkatesh, S.[Svetha], Igel, B.[Burkhard],
Distributed Audio Network for Speech Enhancement in Challenging Noise Backgrounds,
AVSBS09(308-313).
IEEE DOI Link 0909
BibRef

Kuhnapfel, T.[Thorsten], Tan, T.[Tele], Venkatesh, S.[Svetha], Nordholm, S.E.[Sven Erik], Igel, B.[Burkhard],
Adaptive speech enhancement with varying noise backgrounds,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Li, X.K.[Xiao-Kun], Deng, Y.[Yunbin],
Combining speech energy and edge information for fast and efficient voice activity detection in noisy environments,
ICPR08(1-4).
IEEE DOI Link 0812
BibRef

Kukharchik, P., Kheidorov, I., Bovbel, E., Ladeev, D.,
Speech Signal Processing Based on Wavelets and SVM for Vocal Tract Pathology Detection,
ICISP08(192-199).
Springer DOI Link 0807
BibRef

Nagesha, Kumar, G.H.[G. Hemantha],
Signal Resampling Technique Combining Level Crossing and Auditory Features,
PReMI07(447-454).
Springer DOI Link 0712
BibRef

Ferrer, C.A.[Carlos A.], González, E.[Eduardo], Hernández-Díaz, M.E.[María E.],
Evaluation of Time and Frequency Domain-Based Methods for the Estimation of Harmonics-to-Noise-Ratios in Voice Signals,
CIARP06(406-415).
Springer DOI Link 0611
BibRef

Li, W.H.[Wei-Hong], Liu, M.[Ming], Zhu, Z.G.[Zhi-Gang], Huang, T.S.[Thomas S.],
LDV Remote Voice Acquisition and Enhancement,
ICPR06(IV: 262-265).
IEEE DOI Link 0609
BibRef

Xue, W.[Wei], Du, S.[Sidan], Fang, C.Z.[Cheng-Zhi], Ye, Y.[Yingxian],
Voice Activity Detection Using Wavelet-Based Multiresolution Spectrum and Support Vector Machines and Audio Mixing Algorithm,
CVHCI06(78-88).
Springer DOI Link 0605
BibRef

García-Perera, L.P.[L. Paola], Nolazco-Flores, J.A.[Juan A.], Mex-Perera, C.[Carlos],
Cryptographic-Speech-Key Generation Architecture Improvements,
IbPRIA05(II:579).
Springer DOI Link 0509
BibRef

Welk, M.[Martin], Bergmeister, A.[Achim], Weickert, J.[Joachim],
Denoising of Audio Data by Nonlinear Diffusion,
ScaleSpace05(598-609).
WWW Version. 0505
BibRef

Cristani, M., Bicego, M., Murino, V.,
On-line adaptive background modelling for audio surveillance,
ICPR04(II: 399-402).
IEEE DOI Link 0409
BibRef

Chapter on New Unsorted Entries, and Other Miscellaneous Papers continues in
Speaker Verification, Speaker Identification.


Last update: May 16, 2012 at 20:31:07