Adriana STAN, PhD

Technical University of Cluj-Napoca, Romania

Speech Processing Group

Research interests: text-to-speech synthesis, acoustic modeling, speech and text alignment, machine learning algorithms and evolution programming in speech applications, multimedia databases.

Projects

Tools and Corpora

2017
Stefan-Adrian Toma, Adriana Stan, Mihai-Lica Pura, Traian Barsan, "MaRePhoR - An Open Access Machine-Readable Phonetic Dictionary for Romanian", In Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, Romania, 2017. [bib] [pdf]
Adriana Stan, Florina Dinescu, Cristina Tiple, Serban Meza, Bogdan Orza, Magdalena Chirila, Mircea Giurgiu, "The SWARA Speech Corpus: A Large Parallel Romanian Read Speech Dataset", In Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, Romania, 2017. [bib] [pdf]
2016
Adriana Stan, Cassia Valentini-Botinhao, Bogdan Orza, Mircea Giurgiu, "Blind Speech Segmentation using Spectrogram-image Based Features and Mel Cepstral Coefficients", In Proc. IEEE Workshop on Spoken Language Technology, San Diego, USA, 2016. [bib] [pdf]
Alexandru Moldovan, Adriana Stan, Mircea Giurgiu, "Improving Sentence-level Alignment of Speech with Imperfect Transcripts using Utterance Concatenation and VAD", In Proc. of IEEE ICCP, Cluj-Napoca, Romania, 2016. [bib] [pdf]
Adriana Stan, Yoshitaka Mamiya, Junichi Yamagishi, Peter Bell, Oliver Watts, Rob Clark, Simon King, "ALISA: An automatic lightly supervised speech segmentation and alignment tool", In Computer Speech and Language, vol. 35, pp. 116-133, 2016. [bib] [pdf] [doi]
2015
Adriana Stan, Cassia Valentini-Botinhao, Mircea Giurgiu, Simon King, "Phonetic Segmentation of Speech using STEP and t-SNE", In Proc. of the 8th International Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucuresti, Romania, 2015. [bib] [pdf]
2014
Jószef Domokos, Adriana Stan, Mircea Giurgiu, "An Approach to Lexical Stress Detection from Transcribed Continuous Speech Using Acoustic Features", In Proc. 22nd Telecommunications Forum, Belgrade, Serbia, 2014. [bib] [pdf]
Tiberiu Boros, Adriana Stan, Oliver Watts, Stefan Daniel Dumitrescu, "RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus", In Proc. The 9th edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014. [bib] [pdf]
O. Watts, S. Gangireddy, J. Yamagishi, S. King, S. Renals, A. Stan, M. Giurgiu, "Neural Net Word Representations for Phrase-Break Prediction Without a Part of Speech Tagger", In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, pp. 2599-2603, 2014. [bib] [pdf]
2013
Y. Mamiya, A. Stan, J. Yamagishi, P. Bell, O. Watts, R.A.J. Clark, S. King, "Using Adaptation to Improve Speech Transcription Alignment in Noisy and Reverberant Environments", In Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 2013. [bib] [pdf]
O. Watts, A. Stan, Y. Mamiya, A. Suni, M. Burgos, J.M. Montero, "The Simple4All entry to the Blizzard Challenge 2013", In Proc. Blizzard Challenge, Barcelona, Spain, 2013. [bib] [pdf]
O. Watts, A. Stan, R. Clark, Y. Mamiya, M. Giurgiu, J. Yamagishi, S. King, "Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from ‘found’ data: evaluation and analysis", In Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 2013. [bib] [pdf]
Adriana Stan, Peter Bell, Junichi Yamagishi, Simon King, "Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data", In Proc. Interspeech, Lyon, France, pp. 1525-1529, 2013. [bib] [pdf]
A. Stan, O. Watts, Y. Mamiya, M. Giurgiu, R. A. J. Clark, J. Yamagishi, S. King, "TUNDRA: A Multilingual Corpus of Found Data for TTS Research Created with Light Supervision", In Proc. Interspeech, Lyon, France, pp. 2331-2335, 2013. [bib] [pdf]
Ioana Muresan, Adriana Stan, Mircea Giurgiu, Rodica Potolea, "Evaluation of Sentiment Polarity Prediction using a Dimensional and a Categorical Approach", In Proc. SPED, Cluj-Napoca, Romania, 2013. [bib] [pdf]
Yoshitaka Mamiya, Junichi Yamagishi, Oliver Watts, Robert A.J. Clark, Simon King, Adriana Stan, "Lightly Supervised GMM VAD to use Audiobook for Speech Synthesiser", In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vancouver, Canada, pp. 7987-7991, 2013. [bib] [pdf]
2012
Adriana Stan, Peter Bell, Simon King, "A Grapheme-based Method for Automatic Alignment of Speech and Text Data", In Proc. IEEE Workshop on Spoken Language Technology, Miami, Florida, USA, pp. 286-290, 2012. [bib] [pdf]
2011
Adriana Stan, "Romanian HMM-based Text-to-Speech Synthesis with Interactive Intonation Optimisation", PhD thesis, Technical University of Cluj-Napoca, 2011. [bib] [pdf]
Adriana Stan, Junichi Yamagishi, Simon King, Matthew Aylett, "The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate", In Speech Communication, vol. 53, no. 3, pp. 442-450, 2011. [bib] [pdf] [doi]
Adriana Stan, Mircea Giurgiu, "A Superpositional Model Applied to F0 Parametrisation using DCT for Text-to-Speech Synthesis", In Proceedings of the 6th Conference on Speech Technology and Human-Computer Dialogue, Brasov, Romania, 2011. [bib]
Adriana Stan, Florin-Claudiu Pop, Marcel Cremene, Mircea Giurgiu, Denis Pallez, "Interactive Intonation Optimisation Using CMA-ES and DCT Parametrisation of the F0 Contour for Speech Synthesis", In Proceedings of the 5th Workshop on Nature Inspired Cooperative Strategies for Optimisation, Springer, vol. 387, pp. 57-71, 2011. [bib] [pdf] [doi]
2010
Adriana Stan, Mircea Giurgiu, "Romanian language statistics and resources for text-to-speech systems", In Proceedings of the 9th Edition of the International Symposium on Electronics and Telecommunications, Timisoara, Romania, pp. 381-384, 2010. [bib]
2009
Adriana Stan, "Linear Interpolation of Spectrotemporal Excitation Pattern Representations for Automatic Speech Recognition in the Presence of Noise", In Proceedings of the 5th Conference on Speech Technology and Human-Computer Dialogue, Constanta, Romania, 2009. [bib]
2007
Adriana Stan, "A Study on the Performances of CELP Speech Coding at Low Bit Rates", In Novice Insights, 2007. [bib]

Office

26-28 George Baritiu,
Room S2.3,
400027 Cluj-Napoca,
Romania,
Phone: +40-264-202452

Department

26-28 George Baritiu,
Room 364,
400027, Cluj-Napoca,
Romania,
Phone: +40-264-401226
Fax: +40-264-597083

Adriana (dot) STAN (at) com.utcluj.ro

©Copyright 2015 | Adriana STAN