6 Zakończenie - Index of /rozprawy2/10009

W niniejszej rozprawie zaprezentowano wyniki badań mających na celu zdefiniowanie nowych sposobów segmentacji i parametryzacji sygnału mowy polskiej. Prowadzone prace miały charakter zarówno teoretyczny (np. kryteria ewaluacji, funkcje kosztu, rozwaŜania na temat WPCT i optymalizacji bazy) jak i praktyczny (implementacje algorytmów, ewaluacja skuteczności na podstawie bazy mowy Corpora).

Wyniki otrzymane za pomocą opracowanych algorytmów, przedstawione w rozdziałach 4 oraz 5 dowodzą słuszności następujących tez rozprawy:

1. Transformacja falkowa jest odpowiednim narzędziem do analizy sygnałów mowy.

2. Zastosowanie transformacji falkowej umoŜliwia racjonalną, nierównomierną segmentację sygnału mowy polskiej.

3. Transformacja falkowa umoŜliwia efektywną ekstrakcję parametrów sygnału w systemach rozpoznawania mowy polskiej.

Wszystkie cele pracy wyszczególnione w rozdziale 1.1, zostały zrealizowane. W tym celu konieczne było opracowanie nowatorskich metod i algorytmów.

Wkładem autora są:

1. Dwie nowe metody falkowej, nierównomiernej segmentacji sygnału mowy bez znajomości transkrypcji. Na szczególną uwagę zasługuje algorytm wyznaczania mapy istotności pasm falkowych i generowania dyskretnej funkcji zdarzeń, wykorzystanej do segmentacji sygnału (Rozdz. 4.4).

2. Nowe sposoby skutecznej parametryzacji falkowej dla systemów rozpoznawania mowy (Rozdz. 5.3 i 5.4).

3. Systematyzacja i uporządkowanie kryteriów oceny segmentacji i parametryzacji sygnału mowy (Rozdz. 4.3 i 5.1).

4. Przedstawienie sposobu dokładnej aproksymacji skal psychoakustycznych przez paczkową transformację falkową, na przykładzie skali melowej (Rozdz. 5.4.1).

5. Definicja optymalnej bazy dekompozycji zbioru zróŜnicowanych sygnałów i algorytm Mean Best Basis - uogólnienie algorytmu BB, do wyznaczania tej bazy (Rozdz. 5.4.2).

6. Zastosowanie WPCT - paczkowej transformacji falkowo-kosinusowej do wyznaczenia nowych schematów dekompozycji i parametryzacji sygnału mowy metodą MBB (Rozdz. 5.4.2).

7. Nowa funkcja kosztu – wskaźnik koncentracji, nadająca się do zastosowania w algorytmach BB oraz MBB, i zapewniająca skuteczne generowanie schematów dekompozycji w oparciu o transformację WPCT (Rozdz. 5.4.2).

Zakończenie

Oprócz wyszczególnionych idei i rozwiązań, wynikiem prac są takŜe działające implementacje wszystkich omawianych algorytmów. Powstałe oprogramowanie moŜe słuŜyć do dalszych badań nad problematyką falkowego przetwarzania mowy oraz być istotnym fragmentem prototypu automatycznego systemu rozpoznawania mowy.

Przedstawione rezultaty były publikowane i prezentowane na konferencjach międzynarodowych.

7 Bibliografia

[1] A. Alani, M. Deriche, A Novel Approach to Speech Segmentation Using The Wavelet Transform,

Proceedings of The Fifth International Symposium on Signal Processing and

its Applications ISSPA‘99, Brisbane 1999

[2] G. Almpanidis, C. Kotropoulos, Automatic Phonemic Segmentation Using The Bayesian Information Criterion With Generalised Gamma Priors, Proceedings of EUSIPCO 2007

[3] D. L. Baddeley, R.A. Owens, A New Metric for grey Scale Image Comparison, International

Journal of computer Vision, Vol. 24, 1995

[4] M. Bahoura, J. Rouat, Wavelet Speech Enhancement Based on the Teager Energy Operator,

IEEE Signal Processing Letters, Vol. 8, no. 1, IEEE Signal Processing Society 2001

[5] Cz. Basztura, J. Jurkiewicz, E. Tyburcy, Fonetyczna funkcja mowy (FFM) jako metoda segmentacji ciągłego sygnału mowy, Archiwum Akustyki, T.14, Z.2, 1979

[6] Cz. Basztura, S. Brachmański, T. Sawczyn, Automatyczna segmentacja sygnału mowy w modelu komputerowego rozpoznawania głosów niezaleŜnie od tekstu, Prace XXXVI Otwartego

Seminarium z Akustyki, BiałowieŜa 1988

[7] C. Becchetti, L. P. Ricotti, Speech recognition – Theory and C++ implementation, Wiley & Sons 2004

[8] J. Benesty, M. M. Sondhi, Y. Huang eds., Springer Handbook of Speech Processing, Springer 2008

[9] B. Bojar, Elementy językoznawstwa dla informatyków, PAN ODiIN, Warszawa 1974

[10] H. Bourlard, N. Morgan, Continuous Speech Recognition, IEEE Signal Processing Magazine, Vol. 12, No. 3, 1995

[11] F. Brugnara, D. Falavigna, M. Omologo, Automatic Segmentation And Labeling of Speech Based on Hidden Markov Models, Speech Communication, Vol.12 No. 4, Elsevier 1993

[12] J. C. Burges, i in., A Tutorial on Support Vector Machines for Pattern Recognition, Jornal of Data

Mining and Knowledge Discovery, Vol. 2, Springer 1998

[13] P. Cardinal, G. Boulianne, M. Comeau, Segmentation of recordings based on partial transcriptions,

Proceedings of Interspeech, 2005

[14] B. Carnero, A. Drygajlo, Perceptual Speech Coding and Enhancement Using Frame-Synchronized Fast Wavelet Packet Transform Algorithms, IEEE Transactions on Signal Processing, Vol. 47, No. 6, 1999

[15] M. Cettolo, M. Vescovi, Efficient audio segmentation algorithms based on the BIC, Proceedings

of ICASSP 2003

[16] S.-H. Chen, J.-F. Wang, Speech Enhancement Using Perceptual Wavelet Packet Decomposition and Teager Energy Operator, Journal of VLSI Signal Processing, Vol. 36, Kluwer Academic Pub. 2004

[17] H.-W. Chen, T. Olson, New Aggressive Way to Search for The Best Base in Wavelet Packets,

IEE Proceedings of Vision and Image Signal Process, Vol. 152, No. 6, 2005

[18] S.-S. Cheng, H.-M. Wang, A sequential metric-based audio segmentation method via the Bayesian information criterion, Proceedings of 8^th European Conference on Speech Communication and Technology – EUROSPEECH 2003, Geneva 2003

[19] S.-S. Cheng, H.-M. Wang, METRIC-SEQDAC: A Hybrid Approach for Audio Segmentation,

Proceedings of ICSLP 2004

[20] Z. Chengyi, Y. Yonghong, Fusion based speech segmentation in DARPA SPINE2 task,

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing -

ICASSP’04, Vol. 1, 2004

[21] S. Cherniz, M. Torres, H. L. Rufiner, A. Esposito, Multiresolution Analysis applied to Text-Independent Phone Segmentation, Journal of Physics: Conference Series, Vol. 90, IOP Publishing 2007

[22] Ch. K. Chui, An Introduction to Wavelets, Academic Press 1992

[23] L. Cohen, S. Umesh, D. Nelson, Frequency Warping and the Mel Scale, IEEE Signal Processing

Letters, Vol. 9, No. 3, 2002

[24] Couvreur, L. Couvreur, Wavelet-Based Method for Nonparametric Estimation of HMM’s,

Bibliografia

[25] R. M. Dansereau, W. Kinsner, V. Cevher, Wavelet Packet Best Basis Search Using Generalized Renyi Entropy, Proceedings of the 2002 IEEE Canadian Conference on Electrical & Computer

Engineering

[26] S. Datta, O. Farooq, Mel Filter-Like Admissible Wavelet Packet Structure for Speech Recognition,

IEEE Signal Processing Letters, Vol. 8, No. 7, 2001

[27] S. Datta, O. Farooq, Wavelet Based Robust Sub-band Features For Phoneme Recognition,

IEE Proceedings: Vision, Image and Signal Processing, Vol. 151(3), 2004

[28] S. Datta, O. Farooq, Mel-Scaled Wavelet Filter Based Features For Noisy Unvoiced Phoneme Recognition, Proceedings of ICSLP 2002

[29] S. Datta, O. Farooq, A Novel Wavelet based Pre-processing for Robust Features in ASR,

Int. Symposium on Communication Systems, Networks and Digital Signal Processing,

Staffordshire University, 2002

[30] S. Datta, C. J. Long, Wavelet Based Feature Extraction for Phoneme Recognition, Proceedings

of ICSLP 1996

[31] S. Datta, O. Farooq, Phoneme Recognition Using Wavelet Based Features, An International

Journal on Information Sciences, Vol. 150, Elsevier 2003

[32] S. Datta, M. Al-Zabibi, Spectral Variation Function and Its Application to Speech Segmentation,

Acoustic Letters, Vol. 14, 1990

[33] I. Daubechies, Ten Lectures on Wavelets, SIAM 1992

[34] S. B. Davis, P. Mermelstein, Comparison Of Parametric Representations For Monosyllabic Word Recognition In Continuously Spoken Sentences, IEEE Transactions on Acoustics, Speech and

Signal Processing, Vol. 28, 1980

[35] K. Demuynck, T. Laureys, A Comparison of Different Approaches to Automatic Speech Segmentation, Proceedings of the 5^th International Conference on Text, Speech and Dialogue, Lecture Notes In Computer Science, Vol. 2448, 2002

[36] L. Deng, J. Wu, J. Droppo, A. Acero, Analysis and comparison of two speech feature extraction/compensation algorithms. IEEE Signal Processing Letters, Vol. 12(6), 2005

[37] H. Drucker, C. J. C. Burges, L. Kaufman, A. Smola, V. Vapnik, Support Vector Regression Machines, Advances in Neural Information Processing Systems 9, NIPS 1996, MIT Press 1997 [38] Drygajlo, New Fast Wavelet Packet Transform Algorithms for Frame Synchronized Speech

Processing, Proceedings of Fourth IEEE International Conference on Spoken Language

ICSLP 1996

[39] Dubois, H. Prade, Fuzzy Sets and Systems, Academic Press 1998

[40] L. Dukiewicz, R. Piela, Wyrazistość i rozróŜnialność głosek w języku polskim w zaleŜności od górnej granicy częstotliwości, Przegląd Telekomunikacyjny, Vol. 7, 1962

[41] L. Dukiewicz, Gramatyka Współczesnego Języka Polskiego – Fonetyka, Instytut Języka Polskiego PAN, Kraków 1995

[42] E. Ercelebi, Second Generation Wavelet Transform-Based Pitch Period Estimation And Voiced/Unvoiced Decision For Speech Signals, Applied Acoustics, Vol. 64, Elsevier 2003 [43] G. Evangelista, S. Cavaliere, Discrete Frequency Warped Wavelets: Theory and Applications,

IEEE Transactions On Signal Processing, Vol. 46, No. 4, 1998

[44] X. Fang, Automatic Phoneme Segmentation of Continuous Speech Signals, IEEE Transactions

on Signal Processing, 1994

[45] J. Gałka, M. Kepiński, M. Ziółko, Speech Signals in Wavelet-Fourier Domain, Proceedings of The

Fiftieth Open Seminar on Acoustics - Speech Analysis, Synthesis And Recognition In Technology, Linguistics And Medicine, Archives of Acoustics, Vol. 28, No. 3

[46] J. Gałka, Distance Measures for Wavelet Representation of Speech Segments, Proceedings

of XII National Conference Application of Mathematics in Biology and Medicine - KKZMBM,

Koninki 2006

[47] J. Gałka, M. Ziółko, Wavelets in Speech Segmentation, Proceedings of The 14^th IEEE Mediterranean Electrotechnical Conference MELECON 2008

[48] J. Gałka, M. Dyrek, B. Ziółko, Wavelet Segmentation of Speech, Proceedings of The 8^th WSEAS International Conference On Multimedia Systems And Signal Processing, International Journal Of Circuits, Systems And Signal Processing, NAUN 2008

[49] J. Gałka, B. Ziółko, Study of Performance Evaluation Methods for Non-Uniform Speech Segmentation, Proceedings of The 8^th WSEAS International Conference On Multimedia Systems And Signal Processing, International Journal Of Circuits, Systems And Signal Processing,

[50] J. Gałka, M. Kępiński, Wavelet-Fourier Spectrum Parameterisation for Speech Signal Recognition, Proceedings of X National Conference Application of Mathematics in Biology and

Medicine, Święty KrzyŜ 2004

[51] J. Gałka, M. Kępiński, WFT context-sensitive speech signal representation, Advances in Soft

Computing: Proceedings of the International Intelligent Information Systems, Intelligent Information Processing and Web Mining Conference, Springer 2006

[52] J. Gałka, M. Kępiński, M. Ziółko, Wavelet-Fourier Analysis of Speech Signal, Proceedings of the

Workshop on Multimedia Communications and Services, Kielce 2003

[53] A. Gallardo-Antolín, J. Macías-Guarasa, J. Ferreiros, R. Córdoba, J. M. Montero-Martínez, R. San-Segundo, J. M. Pardo, A Comparison of Several Approaches to the Feature Extractor Design for ASR Tasks in Telephone Environment, Proceedings of the 15th International

Conference of Phonetic Sciences, Barcelona 2003

[54] T. Ganchev, M. Siafarikas, N. Fakotakis, Speaker Verification Based on Wavelet Packets, Lecture

Notes in Computer Science - Text, Speech and Dialogue, Springer 2004

[55] J. S. Garofolo ed., DARPA TIMIT – Acoustic-Phonetic Speech Corpus, NIST 1993

[56] R. Gemello, F. Mana, P. Pegoraro, R. De Mori, Robust Multiple Resolution Analysis for Automatic Speech Recognition, Computer Speech & Language, Vol. 20, No. 1, Elsevier 2006 [57] Y. Ghanbari, M. R. Karami-Mollaei, A New Approach For Speech Enhancement Based

on The Adaptive Thresholding Of The Wavelet Packets, Speech Communication, Vol. 48, Elsevier 2006

[58] B. Gold, N. Morgan, Speech and audio signal processing, Wiley & Sons 2000

[59] J. A. Gómez, M. J. Castro, Automatic Segmentation of Speech at the Phonetic Level, Lecture

Notes In Computer Science, Vol. 2396, Springer 2002

[60] J. N. Gowdy, Z. Tufekci, Mel-Scaled Discrete Wavelet Coefficients for Speech Recognition,

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing - ICASSP ‘00, Vol. 3, Istanbul 2000

[61] H. Gray Jr., J. D. Markel, Distance Measures for Speech Processing, IEEE Trans. Acoustics,

Speech, Signal Processing, Vol. 24 (5), 1976

[62] R. M. Gray, A. Buzo, A. H. Gray Jr., Y. Matsuyama, Distortion Measures for Speech Processing,

IEEE Trans. Acoustics, Speech, Signal Processing, Vol. 28 (4), 1980

[63] D.B. Grayden, M.S. Scordilis, Phonemic Segmentation of Fluent Speech, Proceedings

of ICASSP’91

[64] S. Grocholewski, First Database for Spoken Polish, Proceedings of International Conference

on Language Resources and Evaluation, Grenada 1998

[65] S. Grocholewski, Hidden Markov Models for Polish, Proceedings of Prosody 2000, UAM Poznań 2001

[66] S. Grocholewski, The Use of HMMs for Modeling Polish Triphones, Speech and Language

Technology, Vol. 5, 2001

[67] S. Grocholewski, Acoustic Modeling for Polish, Proceedings of International Workshop Speech

and Computer - SPECOM’2000, Petersburg 2000

[68] S. Grocholewski, E. Łukasik, Wavelet Transform in Speech Processing, Proceedings of Summer

School on Wavelets, Zakopane 1996

[69] S. Grocholewski, Baza Nagrań Sygnałów Mowy Corpora, UAM Poznań 1997

[70] S. Grocholewski, E. Łukasik, Comparison of Some Time-Frequency Analysis Methods for Classification of Plosives, Signal Processing IX, Theories and Applications, Typorama Publications, Greece 1998

[71] S. Grocholewski, Design of Polish Diphones Corpus, Proceedings of 4-th Int. Workshop

on Systems, Signals and Image Processing, Poznań 1997

[72] S. Grocholewski, CORPORA - Speech Database for Polish Diphones, Proceedings

of EUROSPEECH’97,Rodos 1997

[73] S. Grocholewski, M Szymański, Semi Automatic Segmentation of Speech: Manual Segmentation Strategy. Problem Space Analysis, Advances in Soft Computing, Computer Recognition Systems, Springer 2005

[74] S. Grocholewski, M. Szymański, Strategies of The Selected Manual Annotations in Semi-Automatic Speech Signal Segmentation, Speech and Language Technology, Vol.8, 2005

[75] S. Grocholewski, M. Szymański, Dynamic Programming Method for Fine-tuning the Boundary Points in Automatic Segmentation of Speech, Archives of Acoustics, No.1, Vol. 32, 2007

[76] S. Grocholewski, Podstawy systemu rozpoznawania mowy dla języka polskiego, Multimedialne

Bibliografia

[77] S. Grocholewski, M. Szymański, Post-processing of Automatic Segmentation of Speech Using Dynamic Programming, LNAI, Vol. 4188, Springer 2006

[78] S. Grocholewski, M. Szymański, Transcription-based Automatic Segmentation of Speech,

Archives of Control Sciences, Vol. 15, No. 3, 2005

[79] S. Grocholewski, G. Demenko, A. Wagner, M. Szymański, Prosody Annotation for Corpus Based Speech Synthesis, Proceedings of Eleventh International Conference on Speech Science

and Technology, Auckland 2006

[80] S. Grocholewski, An Analysis of Variability of Polish Vowels in the Cepstral Domain,

Proceedings of Signal Processing, 2001

[81] M. Gupta, A. Gilbert, Robust Speech Recognition Using Wavelet Coefficient Features,

Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding –

ASRU 2001

[82] J. K. Hammond, K. Shin, Fundamentals of Signal processing for sound and vibration engineers, Wiley & Sons 2008

[83] J. H. L. Hansen, B. W. Zhou, Unsupervised Audio Stream Segmentation and Clustering via the Bayesian Information Criterion. Proceeding of ICSLP 2000

[84] E. Hansler, G. Schmidt, Acoustic echo and noise control – A practical Approach, Wiley & Sons 2004

[85] J. P. Haton, J. C. Junqua, Robustness in automatic speech recognition, Kulwer Academic Publishers, Norwell MA 1996

[86] Hermansky, G. Adami, Segmentation Of Speech For Speaker And Language Recognition,

Proceedings of 8^th European Conference on Speech Communication and Technology -

EUROSPEECH, Geneva 2003

[87] H. Hermansky, N. Morgan, RASTA processing of speech, IEEE Transactions on Speech

and Audio Processing, Vol. 2(4), 1994

[88] P. Horak, Automatic Speech Segmentation Based On DTW With The Application Of The Czech TTS System, Improvements in Speech Synthesis, Wiley & Sons 2001

[89] C.-T. Hsieh, M.-C. Su, E. LAI, C.-H. Hsu, A Segmentation Method for Continuous Speech Utilizing Hybrid Neuro-Fuzzy Network, Journal Of Information Science And Engineering, Vol. 15, 1999

[90] W.-W. Hung, H.-C. Wang, On The Use of Weighted Filter Bank Analysis for the Derivation of Robust MFCCs, IEEE Signal Processing Letters, Vol. 8, No. 3, 2001

[91] L. Janer, J. Marti, C. Nadeu, E. Lleida-Solano, Wavelet Transforms for Non-Uniform Speech Recognition Systems, Proceedings of ICSLP 1996

[92] W. Jassem, Podstawy fonetyki akustycznej, PWN, Warszawa, 1973

[93] B.-H. Juang, C.-H. Lee, K. Wang, Selective Feature Extraction via Signal Decomposition,

IEEE Signal Processing Letters, Vol. 4, No. 3, 2002

[94] M. Kępiński, M. Ziółko, Speech Signal Segmentation, Proceedings of the VIII National

Conference Application of Mathematics in Biology and Medicine, Łajs 2002

[95] Kim, D. H. Youn, Ch. Lee, Evaluation of Wavelet Filters for Speech Recognition, Proceedings

of the IEEE International Conference on Systems, Man, and Cybernetics, 2000

[96] Kirchhoff, T. Schultz, Multilingual speech processing, Elsevier / Academic Press 2006

[97] C-H. Lee, K. K. Paliwal, F. K. Soong, Automatic speech and speaker recognition, Kluwer Academic Publishers, Norwell 1996

[98] K-F. Lee, A. Waibel, Readings in speech recognition, Morgan Kaufmann Pub. 1990 [99] P. C. Loizou, Speech Enhancement – Theory and practice, CRC Press 2007

[100] E. Łukasik, Classification Of Voiceless Plosives Using Wavelet Packet Based Approaches,

Proceedings of EUSIPCO 2000

[101] E. Łukasik, Wavelet Packets Based Features Selection for Voiceless Plosives Classification,

Proceedings of ICASSP 2000

[102] S.-Y. Lung, Wavelet Feature Selection Based Neural Networks With Application to The Text Independent Speaker Identification, Pattern Recognition, Vol. 39, Elsevier 2006

[103] S. G. Mallat, Wavelet Tour of Signal Processing, San Diego 1995

[104] S. G. Mallat, Z. Zhang, Matching Pursuits With Time-Frequency Dictionaries, IEEE Transactions

On Signal Processing, Vol. 41, No. 12, 1993

[105] J. P. Marques de Sa, Pattern recognition, Springer 2001

[106] R. Martinez, W. L. Martinez, Exploratory Data Analysis with Matlab, Chapman & Hall / CRC Press 2005

[107] P. M. McCourt, S. V. Vaseghi, B. Doherty, Multiresolution Sub-Band Features And Models For Hmm-based Phonetic Modelling, Computer Speech and Language, Vol. 14, No. 3, 2000

[108] J. M. McQueen, Segmentation of Continuous Speech Using Phonotactics, Journal Of Memory And Language, Vol. 39, 1998

[109] P. Mermelstein, Automatic Segmentation of Speech Into Syllabic’ Units, The Journal of the

Acoustical Society of America, Vol. 58 No. 4, 1975

[110] H. Misra, S. Ikbal, H. Bourlard, and H. Hermansky, Spectral Entropy Based Feature For Robust ASR, Proceedings of ICASSP 2004

[111] N. Morgan, S. Renals, H. Bourlard, M. Cohen, H. Franco, Connectionist Probability Estimators in HMM Speech Recognition, IEEE Transactions Speech and Audio Processing, Vol. 2(1), 1994 [112] G. Noel, B. J. van Wyk, Wavelet Packet Tree Selection for Vibration Data, Proceedings of IEEE

AFRICON 2004

[113] D. Ostaszewska, J. Tambor, Fonetyka i fonologia współczesnego języka polskiego. PWN Warszawa 2000

[114] S. S. Park, N. S. Kim, Automatic Speech Segmentation Based on Boundary-Type Candidate Selection, IEEE Signal Processing Letters, Vol. 13, No. 10, 2006

[115] Pulse code modulation (PCM) of voice frequencies, ITU-T Recommendation G.711, http://www.itu.int/ITU-T/

[116] M. Peinado, J. C. Segura, Speech recognition over digital channels, Wiley & Sons 2006

[117] P. Petropulu, C. Wendt, Pitch Determination and Speech Segmentation Using The Discrete Wavelet Transform, Proceedings of IEEE International Symposium on Circuits and Systems -

Connecting the World, Vol. 2, 1996

[118] I. Pinter, Perceptual Wavelet-Representation of Speech Signals and Its Application to Speech Enhancement, Computer Speech and Language, Vol. 10, Academic Pub. 1996

[119] V. K. Prasad, T. Nagarajan, H. A. Murthy, Automatic Segmentation of Continuous Speech Using Minimum Phase Group Delay Functions, Speech Communication, Vol. 42, Elsevier 2004

[120] T. F. Quatieri, Discrete-Time speech signal processing – Principles and Practice, Prentice Hall 2002

[121] R. Rabiner, R. W. Schafer, Introduction to digital speech processing, NOW Pub., Hanover MA, 2007

[122] L. Rabiner, B-I. Juang, Fundamentals of Speech recognition, Prentice Hall 1993

[123] M. Rajman ed., Speech and language engineering, EPFL Press / CRC Press, Boca Raton FL 2008 [124] D. Rao, K. Kreutz-Delgado, An Affine Scaling Methodology for Best Basis Selection,

IEEE Transactions On Signal Processing, Vol. 47, No. 1, 1999

[125] R. Reddy, Segmentation of Speech Sounds, The Journal of the Acoustical Society of America, Vol. 40, No. 2, 1966

[126] R. Reyesa, M. R. Zurerab, F. L. Ferrerasb, P. J. Amoresb, Adaptive Wavelet-Packet Analysis for Audio Coding Purposes, Signal Processing, Vol. 83, Elsevier 2003

[127] J. van Rijsbergen, Information Retrieval, Butterworths 1979

[128] B. Rocławski, Zarys fonologii, fonetyki, fonotaktyki i fonostatystyki współczesnego języka

polskiego, PWN, Gdańsk 1976

[129] SAMPA - A computer readable phonetic alphabet, http://www.phon.ucl.ac.uk/home/sampa/home.htm

[130] R. Sarikaya, J. H. L. Hansen, High Resolution Speech Feature Parameterization for Monophone – Based Stressed Speech recognition, IEEE Signal Processing Letters, Vol. 7, No. 7, 2000

[131] R. Sarikaya, J. N. Gowdy, Subband Based Classification of Speech Under Stress, Proceedings

of IEEE International Conference on Acoustics, Speech, and Signal Processing, Seattle 1998

[132] I. Sawicka, Gramatyka Współczesnego Języka Polskiego – Fonologia, Instytut Języka Polskiego PAN, Kraków 1995

[133] A. Sethy, S. Narayanan, Refined Speech Segmentation for Concatenative Speech Synthesis,

Proceedings of International Conference on Spoken Language Processing ICSLP 2002

[134] E. Shriberg, A. Stolcke, D. Hakkani-Tur, G. Tur, Prosody-Based Automatic Segmentation of Speech into Sentences and Topics, Speech Communication, Vol. 32 (1-2), Elsevier 2000 [135] S. Srinivasan, D. Wang, A Schema-Based Model for Phonemic Restoration, Speech

Communication, Vol. 45, Elsevier 2005

[136] J. 0. Stromberg, A modified Franklin system and higher order spline system on R" as unconditional bases for Hardy spaces, Proceedings of Conference in Honor of Antoni Zygmund, Vol. 2, Wadsworth, New York 1981

[137] A. Subramanya, J. Bilmes, C. P. Chen, Focused Word Segmentation for ASR, Proceedings

of Interspeech 2005

Bibliografia

[139] R. Tadeusiewicz, A. Izworski, Application of Computational Intelligence Methods in Processing And Recognition of The Pathological Speech Signals, Proceedings of The International

Conference on Computational Intelligence, Robotics and Autonomous Systems CIRAS 2001

[140] R. Tadeusiewicz, A. Izworski, Application of Neural Networks in The Diagnosis of Pathological Speech, Perspectives in Neural Computing: Artificial Neural Networks in Biomedicine, Springer 2000

[141] R. Tadeusiewicz, A. Izworski, W. Wszołek, T. Wszołek, Methods of Deformed Speech Analysis,

Models And Analysis of Vocal Emissions for Biomedical Applications, Firenze 1999

[142] R. Tadeusiewicz, G. Demenko, Technologie komputerowego przetwarzania mowy i ich moŜliwe zastosowania w pracy policji, Policja w Polsce: stan obecny i perspektywy, Wydawnictwo Naukowe Instytutu Nauk Politycznych i Dziennikarstwa Uniwersytetu im. Adama Mickiewicza, 2007

[143] R. Tadeusiewicz, Rozpoznawanie mowy przy pomocy sieci neuronowych, Sztuczna Inteligencja

i Cybernetyka Rozwoju, Siedlce 1993

[144] R. Tadeusiewicz, M. Flasiński, Rozpoznawanie obrazów, Biblioteka Główna AGH, Kraków, 2000. http://winntbg.bg.agh.edu.pl/skrypty/0005/

[145] R. Tadeusiewicz, A. Izworski, Metody komputerowej ekstrakcji parametrów dystynktywnych z ciągłego sygnału mowy polskiej, Archiwum Akustyki, Vol. 18, No. 3, 1983

[146] R. Tadeusiewicz, Sygnał mowy, Wydawnictwa Komunikacji i Łączności, Warszawa, 1988. [147] B. T. Tan, R. Lang, H. Schroder, A. Spray, Ph. Dermody, Applying Wavelet Analysis to Speech

Segmentation and Classification, Wavelet Applications – Proceedings of SPIE 1994

[148] B. T. Tan, M. Fu, A. Spray, Ph. Dermody, The Use of Wavelet Transforms in Phoneme Recognition, Proceedings of ICSLP 1996

[149] C. Taswell, Speech Compression With Cosine and Wavelet Packet Near-Best Bases, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP, 1996 [150] C. Taswell, Satisficing Search Algorithms for Selecting Near-Best Bases in Adaptive

Tree-Structured Wavelet Transforms, IEEE Transactions on Signal Processing, Vol. 44, No. 10, 1996

[151] Z. Tufekci, J. N. Gowdy, S. Gurbuz 1, E. Patterson, Applied Mel-Frequency Discrete Wavelet Coefficients and Parallel Model Compensation for Noise-Robust Speech Recognition, Speech

Communication, Vol. 48, Elsevier 2006

[152] D. Tufts, D. Mayhew, G. Dostie, A. Smith, Some Results In Automatic Speech Segmentation Using Wide-Band Filtering And Substraction, The Journal of the Acoustical Society of America, 08-1965

[153] J. W. Tukey, Nonlinear (Nonsuperposable) Methods for Smoothing Data, Proceedings

of EASCON’74, 1974

[154] J. W. Tukey, B. P. Bogert, M. J. R. Healy, The Quefrency Analysis Of Time Series For Echoes: Cepstrum, Pseudo-Autocovariance, Cross-Cepstrum, And Saphecracking, Proceedings of the Symposium on Time Series Analysis, 1963

[155] S. V. Vaseghi, Multimedia signal processing, Wiley & Sons 2007

[156] M. Vetterli, K. Ramchandran, Best Wavelet Packet Bases Using Rate-Distortion Criteria,

Proceedings of IEEE International Symposium on Circuits and Systems - ISCAS 1992

[157] M. Vetterli, C. Herley , J. Kovacevic, K. Ramchandran, Tilings of the Time-Frequency Plane: Construction of Arbitrary Orthogonal Bases and Fast Tiling Algorithms, IEEE Transactions on Signal Processing, Vol. 41, No. 12, 1993

[158] M. Vetterli, K. Ramchandran, C. Herley, Wavelets, Subband Coding, and Best Bases, Proceedings

of the IEEE, Vol. 84, No. 4, 1996

[159] R. Villing, J. Timoney, T. Ward, J. Costello, Automatic Blind Syllable Segmentation for Continuous Speech, Proceedings of ISSC 2004

[160] R. Villing, T. Ward, J. Timoney, Performance Limits for Envelope Based Automatic Syllable

W dokumencie Index of /rozprawy2/10009 (Stron 125-133)