Adriana STAN, PhD


Associate Professor
Technical University of Cluj-Napoca, Romania

Research interests: text-to-speech synthesis, acoustic modeling, speech and text alignment, machine learning algorithms and evolutionary programming in speech applications, multimedia databases.

Publications

> Dan Oneață, Beáta Lőrincz, Adriana Stan, Horia Cucu, "FlexLip: A Controllable Text-to-Lip System", In Special Issue Future Speech Interfaces with Sensors and Machine Intelligence, Sensors, MDPI, vol. 22, 2022. [bib] [pdf] [doi]
> Stefan Daniel Dumitrescu, Petru Rebeja, Beáta Lőrincz, Mihaela Gaman, Andrei Avram, Mihai Ilie, Andrei Pruteanu, Adriana Stan, Lorena Rosia, Cristina Iacobescu, Luciana Morogan, George Dima, Gabriel Marchidan, Traian Rebedea, Mădălina Chitez, Dani Yogatama, Sebastian Ruder, Radu Tudor Ionescu, Răzvan Pașcanu, Viorica Pătrăucean, "LiRo: Benchmark and leaderboard for Romanian language tasks", In Proceedings of NeurIPS, 2021. [bib] [pdf]
> Dan Oneață, Alexandru Caranica, Adriana Stan, Horia Cucu, "An Evaluation of Word-level Confidence Estimation for end-to-end Automatic Speech Recognition", In Proceedings of the 8th IEEE Spoken Language Technology Workshop (SLT 2021), Shenzhen, China, 2021. [bib] [pdf]
> Adriana Stan, Beáta Lőrincz, "Generating the Voice of the Interactive Virtual Assistant", Chapter in Virtual Assistants, IntechOpen, 2021. [bib] [pdf] [doi]
> Beáta Lőrincz, Adriana Stan, Mircea Giurgiu, "An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis", In Proceedings of 25th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, 2021. [bib] [pdf]
> Beáta Lőrincz, Adriana Stan, Mircea Giurgiu, "Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis", In Proceedings of EUSIPCO, 2021. [bib] [pdf]
> Dan Oneață, Adriana Stan, Horia Cucu, "Speaker disentanglement in video-to-speech conversion", In Proceedings of EUSIPCO, 2021. [bib] [pdf]
> Adriana Stan, Beáta Lőrincz, Maria Nuțu, Mircea Giurgiu, "The MARA corpus: Expressivity in end-to-end TTS systems using synthesised speech data", In Proceedings of SpeD, 2021. [bib] [pdf]
> Adriana Stan, Mircea Giurgiu, "Prelucrarea semnalului vocal folosind Python", UTPress, Cluj-Napoca, Romania, 2021. [bib] [pdf]
> Georgiana Săracu, Adriana Stan, "An analysis of the data efficiency in Tacotron2 speech synthesis system", In Proceedings of SpeD, 2021. [bib] [pdf]
> Kristen Scott, Simone Ashby, Adriana Stan, "Designing a Synthesized Content Feed System for Community Radio", In Proc. of NordiCHI, Tallinn, Estonia, 2020. [bib] [pdf]
> Adriana Stan, "RECOApy: Data recording, pre-processing and phonetic transcription for end-to-end speech-based applications", In Proceedings of Interspeech, Shanghai, China, 2020. [bib] [pdf]
> Beáta Lőrincz, Maria Nuțu, Adriana Stan, Mircea Giurgiu, "An Evaluation of Postfiltering for Deep Learning-based Speech Synthesis with Limited Data", In Proc. of 2020 IEEE 10th International Conference on Intelligent Systems, 2020. [bib] [pdf]
> Adriana Stan, "Input Encoding for Sequence-to-Sequence Learning of Romanian Grapheme-to-Phoneme Conversion", In Proceedings of the 10th IEEE International Conference on Speech Technology and Human-Computer Dialogue (SpeD), Timisoara, Romania, 2019. [bib] [pdf]
> Beáta Lőrincz, Maria Nuțu, Adriana Stan, "Romanian Part of Speech Tagging using LSTM Networks", In Proceedings of the IEEE 15th International Conference on Intelligent Computer Communication and Processing, Cluj-Napoca, Romania, 2019. [bib] [pdf]
> Maria Nuțu, Beáta Lőrincz, Adriana Stan, "Deep Learning for Automatic Diacritics Restoration in Romanian", In Proceedings of the IEEE 15th International Conference on Intelligent Computer Communication and Processing, Cluj-Napoca, Romania, 2019. [bib] [pdf]
> David A. Braude, Matthew P. Aylett, Caoimhin Laoide-Kemp, Simone Ashby, Kristen M. Scott, Brian O Raghallaigh, Anna Braudo, Alex Brouwer, Adriana Stan, "All Together Now: The Living Audio Dataset", In Proceedings of Interspeech, Graz, Austria, 2019. [bib] [pdf]
> Adriana Stan, Mircea Giurgiu, "A Comparison Between Traditional Machine Learning Approaches And Deep Neural Networks For Text Processing In Romanian", In Proceedings of the 13th International Conference on Linguistic Resources and Tools for Processing Romanian Language (ConsILR), Jassy, Romania, 2018. [bib] [pdf]
> Stefan-Adrian Toma, Adriana Stan, Mihai-Lica Pura, Traian Barsan, "MaRePhoR - An Open Access Machine-Readable Phonetic Dictionary for Romanian", In Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, Romania, 2017. [bib] [pdf]
> Adriana Stan, Florina Dinescu, Cristina Tiple, Serban Meza, Bogdan Orza, Magdalena Chirila, Mircea Giurgiu, "The SWARA Speech Corpus: A Large Parallel Romanian Read Speech Dataset", In Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, Romania, 2017. [bib] [pdf]
> Adriana Stan, Cassia Valentini-Botinhao, Bogdan Orza, Mircea Giurgiu, "Blind Speech Segmentation using Spectrogram-image Based Features and Mel Cepstral Coefficients", In Proc. IEEE Workshop on Spoken Language Technology, San Diego, USA, 2016. [bib] [pdf]
> Adriana Stan, Yoshitaka Mamiya, Junichi Yamagishi, Peter Bell, Oliver Watts, Rob Clark, Simon King, "ALISA: An automatic lightly supervised speech segmentation and alignment tool", In Computer Speech and Language, vol. 35, pp. 116-133, 2016. [bib] [pdf] [doi]
> Alexandru Moldovan, Adriana Stan, Mircea Giurgiu, "Improving Sentence-level Alignment of Speech with Imperfect Transcripts using Utterance Concatenation and VAD", In Proc. of IEEE ICCP, Cluj-Napoca, Romania, 2016. [bib] [pdf]
> Adriana Stan, Cassia Valentini-Botinhao, Mircea Giurgiu, Simon King, "Phonetic Segmentation of Speech using STEP and t-SNE", In Proc. of the 8th International Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, Romania, 2015. [bib] [pdf]
> József Domokos, Adriana Stan, Mircea Giurgiu, "An Approach to Lexical Stress Detection from Transcribed Continuous Speech Using Acoustic Features", In Proc. 22nd Telecommunications Forum, Belgrade, Serbia, 2014. [bib] [pdf]
> Tiberiu Boros, Adriana Stan, Oliver Watts, Stefan Daniel Dumitrescu, "RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus", In Proc. The 9th edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014. [bib] [pdf]
> O. Watts, S. Gangireddy, J. Yamagishi, S. King, S. Renals, A. Stan, M. Giurgiu, "Neural Net Word Representations for Phrase-Break Prediction Without a Part of Speech Tagger", In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, pp. 2599-2603, 2014. [bib] [pdf]
> A. Stan, O. Watts, Y. Mamiya, M. Giurgiu, R. A. J. Clark, J. Yamagishi, S. King, "TUNDRA: A Multilingual Corpus of Found Data for TTS Research Created with Light Supervision", In Proc. Interspeech, Lyon, France, pp. 2331-2335, 2013. [bib] [pdf]
> Y. Mamiya, A. Stan, J. Yamagishi, P. Bell, O. Watts, R.A.J. Clark, S. King, "Using Adaptation to Improve Speech Transcription Alignment in Noisy and Reverberant Environments", In Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 2013. [bib] [pdf]
> O. Watts, A. Stan, R. Clark, Y. Mamiya, M. Giurgiu, J. Yamagishi, S. King, "Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from ‘found’ data: evaluation and analysis", In Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 2013. [bib] [pdf]
> O. Watts, A. Stan, Y. Mamiya, A. Suni, M. Burgos, J.M. Montero, "The Simple4All entry to the Blizzard Challenge 2013", In Proc. Blizzard Challenge, Barcelona, Spain, 2013. [bib] [pdf]
> Ioana Muresan, Adriana Stan, Mircea Giurgiu, Rodica Potolea, "Evaluation of Sentiment Polarity Prediction using a Dimensional and a Categorical Approach", In Proc. SpeD, Cluj-Napoca, Romania, 2013. [bib] [pdf]
> Yoshitaka Mamiya, Junichi Yamagishi, Oliver Watts, Robert A.J. Clark, Simon King, Adriana Stan, "Lightly Supervised GMM VAD to use Audiobook for Speech Synthesiser", In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vancouver, Canada, pp. 7987-7991, 2013. [bib] [pdf]
> Adriana Stan, Peter Bell, Junichi Yamagishi, Simon King, "Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data", In Proc. Interspeech, Lyon, France, pp. 1525-1529, 2013. [bib] [pdf]
> Adriana Stan, Peter Bell, Simon King, "A Grapheme-based Method for Automatic Alignment of Speech and Text Data", In Proc. IEEE Workshop on Spoken Language Technology, Miami, Florida, USA, pp. 286-290, 2012. [bib] [pdf]
> Adriana Stan, Florin-Claudiu Pop, Marcel Cremene, Mircea Giurgiu, Denis Pallez, "Interactive Intonation Optimisation Using CMA-ES and DCT Parametrisation of the F0 Contour for Speech Synthesis", In Proceedings of the 5th Workshop on Nature Inspired Cooperative Strategies for Optimisation, Springer, vol. 387, pp. 57-71, 2011. [bib] [pdf] [doi]
> Adriana Stan, Junichi Yamagishi, Simon King, Matthew Aylett, "The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate", In Speech Communication, vol. 53, no. 3, pp. 442-450, 2011. [bib] [pdf] [doi]
> Adriana Stan, "Romanian HMM-based Text-to-Speech Synthesis with Interactive Intonation Optimisation", PhD thesis, Technical University of Cluj-Napoca, 2011. [bib] [pdf]
> Adriana Stan, Mircea Giurgiu, "A Superpositional Model Applied to F0 Parametrisation using DCT for Text-to-Speech Synthesis", In Proceedings of the 6th Conference on Speech Technology and Human-Computer Dialogue, Brasov, Romania, 2011. [bib]
> Adriana Stan, Mircea Giurgiu, "Romanian language statistics and resources for text-to-speech systems", In Proceedings of the 9th Edition of the International Symposium on Electronics and Telecommunications, Timisoara, Romania, pp. 381-384, 2010. [bib]
> Adriana Stan, "Linear Interpolation of Spectrotemporal Excitation Pattern Representations for Automatic Speech Recognition in the Presence of Noise", In Proceedings of the 5th Conference on Speech Technology and Human-Computer Dialogue, Constanta, Romania, 2009. [bib]
> Adriana Stan, "A Study on the Performances of CELP Speech Coding at Low Bit Rates", In Novice Insights, 2007. [bib]

Department

26-28 George Baritiu,
Room 364,
400027, Cluj-Napoca,
Romania
Phone: +40-264-401226
Fax: +40-264-597083

Adriana (dot) STAN (at) com.utcluj.ro

©Copyright 2022 | Adriana STAN