Adriana STAN, PhD

ORCID logo


Associate Professor
Technical University of Cluj-Napoca, Romania

Research interests: text-to-speech synthesis, acoustic modeling, speech and text alignment, machine learning algorithms and evolution programming in speech applications, multimedia databases.


Projects


Cadrul strategic național în domeniul inteligenței artificiale (2021-2023)

https://strategie-ia.utcluj.ro

SINTERO: Tehnologii de realizare a interfețelor om-mașină pentru sinteza text-vorbire cu expresivitate (2018-2021)

https://speech.utcluj.ro/sintero/

SWARA: Mobile System for Rehabilitative Vocal Assistance of Surgical Aphonia (2015-2017)

https://speech.utcluj.ro/swara/

SIMPLE4ALL: Speech Synthesis that Improves through Adaptive Learning (2011-2014)

https://simple4all.org/


Tools and Corpora


RoLEX: An extended Romanian Lexical Dataset

https://github.com/adrianastan/rolex

RECOApy - prompted speech recording app

https://github.com/adrianastan/recoapy

ALISA: An automatic lightly supervised speech segmentation and alignment tool

https://simple4all.org/product/alisa/

The Romanian Speech Synthesis CORPUS

https://romaniantts.com/rssdb/

The Tundra CORPUS

http://tundra.simple4all.org

The MaRePhor Lexicon

http://speech.utcluj.ro/marephor/

Cartea Sonoră - ”Mara” Corpus

http://speech.utcluj.ro/corpora/mara.html

Romanian TTS System

http://romaniantts.com


Publications


> Adrian Bogdan Stânea, Vlad Strilețchi, Cosmin Strilețchi, Adriana Stan, "An analysis of large speech models-based representations for speech emotion recognition", In 2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 100-104, 2023. [bib] [pdf] [doi]
> Adriana STAN, Johannah O'Mahony, "An analysis on the effects of speaker embedding choice in non auto-regressive TTS", In 12th ISCA Speech Synthesis Workshop (SSW), 2023. [bib] [pdf]
> Samuel Rutunda, Kleber Kabanda, Adriana Stan, "Kinyarwanda TTS: Using a multi-speaker dataset to build a Kinyarwanda TTS model", In 4th Workshop on African Natural Language Processing, ICLR, 2023. [bib] [pdf]
> Dan Oneață, Beáta Lőrincz, Adriana Stan, Horia Cucu, "FlexLip: A Controllable Text-to-Lip System", In Special Issue Future Speech Interfaces with Sensors and Machine Intelligence, Sensors, MDPI, vol. 22, 2022. [bib] [pdf] [doi]
> Beáta Lőrincz, Elena Irimia, Adriana Stan, Verginica Barbu Mititelu, "RoLEX: The development of an extended Romanian lexical dataset and its evaluation at predicting concurrent lexical information", In Natural Language Engineering, Cambridge University Press, pp. 1–26, 2022. [bib] [pdf] [doi]
> Adriana Stan, "Introducere în Python folosind Google Colab", UTPress, Cluj-Napoca, Romania, 2022. [bib] [pdf]
> Adriana Stan, "Residual Information in Deep Speaker Embedding Architectures", In Mathematics, vol. 10, no. 21, 2022. [bib] [pdf] [doi]
> Adriana Stan, "The ZevoMOS entry to VoiceMOS Challenge 2022", In Proc. Interspeech 2022, pp. 4516-4520, 2022. [bib] [pdf] [doi]
> Beáta Lőrincz, Adriana Stan, Mircea Giurgiu, "Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis", In Proceedings of EUSIPCO, 2021. [bib] [pdf]
> Beáta Lőrincz, Adriana Stan, Mircea Giurgiu, "An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis", In Proceedings of 25th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, 2021. [bib] [pdf]
> Stefan Daniel Dumitrescu, Petru Rebeja, Beáta Lőrincz, Mihaela Gaman, Andrei Avram, Mihai Ilie, Andrei Pruteanu, Adriana Stan, Lorena Rosia, Cristina Iacobescu, Luciana Morogan, George Dima, Gabriel Marchidan, Traian Rebedea, Mădălina Chitez, Dani Yogatama, Sebastian Ruder, Radu Tudor Ionescu, Răzvan Pașcanu, Viorica Pătrăucean, "Liro: Benchmark and leaderboard for Romanian language tasks", In Proceedings of NeurIPS, 2021. [bib] [pdf]
> Adriana Stan, Mircea Giurgiu, "Prelucrarea semnalului vocal folosind Python", UTPress, Cluj-Napoca, Romania, 2021. [bib] [pdf]
> Adriana Stan, Beáta Lőrincz, Maria Nuțu, Mircea Giurgiu, "The MARA corpus: Expressivity in end-to-end TTS systems using synthesised speech data", In Proceedings of SpeD, 2021. [bib] [pdf]
> Dan Oneață, Adriana Stan, Horia Cucu, "Speaker disentanglement in video-to-speech conversion", In Proceedings of EUSIPCO, 2021. [bib] [pdf]
> Georgiana Săracu, Adriana Stan, "An analysis of the data efficiency in Tacotron2 speech synthesis system", In Proceedings of SPED, 2021. [bib] [pdf]
> Adriana Stan, Beáta Lőrincz, "Generating the Voice of the Interactive Virtual Assistant", Chapter in Virtual Assistants, IntechOpen, 2021. [bib] [pdf] [doi]
> Dan Oneață, Alexandru Caranica, Adriana Stan, Horia Cucu, "An Evaluation of Word-level Confidence Estimation for end-to-end Automatic Speech Recognition", In Proceedings of the 8th IEEE Spoken Language Technology Workshop (SLT 2021), Shenzhen, China, 2021. [bib] [pdf]
> Kristen Scott, Simone Ashby, Adriana Stan, "Designing a Synthesized Content Feed System for Community Radio", In Proc. of NordICHI, Talinn, Estonia, 2020. [bib] [pdf]
> Adriana Stan, "RECOApy: Data recording, pre-processing and phonetic transcription for end-to-end speech-based applications", In Proceedings of Interspeech, Shanghai, China, 2020. [bib] [pdf]
> Beáta Lőrincz, Maria Nuțu, Adriana Stan, Mircea Giurgiu, "An Evaluation of Postfiltering for Deep Learning-based Speech Synthesis with Limited Data", In Proc. of 2020 IEEE 10th International Conference on Intelligent Systems, 2020. [bib] [pdf]
> Adriana Stan, "Input Encoding for Sequence-to-Sequence Learning of Romanian Grapheme-to-Phoneme Conversion", In Proceedings of the 10th IEEE International Conference on Speech Technology and Human-Computer Dialogue (SpeD), Timisoara, Romania, 2019. [bib] [pdf]
> Beáta Lőrincz, Maria Nuțu, Adriana Stan, "Romanian Part of Speech Tagging using LSTM Networks", In Proceedings of the IEEE 15th International Conference on Intelligent Computer Communication and Processing, Cluj-Napoca, Romania, 2019. [bib] [pdf]
> Maria Nuțu, Beáta Lőrincz, Adriana Stan, "Deep Learning for Automatic Diacritics Restoration in Romanian", In Proceedings of the IEEE 15th International Conference on Intelligent Computer Communication and Processing, Cluj-Napoca, Romania, 2019. [bib] [pdf]
> David A. Braude, Matthew P. Aylett, Caoimhin Laoide-Kemp, Simone Ashby, Kristen M. Scott, Brian O Raghallaigh, Anna Braudo, Alex Brouwer, Adriana Stan, "All Together Now: The Living Audio Dataset", In Proceedings of Interspeech, Graz, Austria, 2019. [bib] [pdf]
> Adriana Stan, Mircea Giurgiu, "A Comparison Between Traditional Machine Learning Approaches And Deep Neural Networks For Text Processing In Romanian", In Proceedings of the 13th International Conference on Linguistic Resources and Tools for Processing Romanian Language (ConsILR), Jassy, Romania, 2018. [bib] [pdf]
> Adriana Stan, Florina Dinescu, Cristina Tiple, Serban Meza, Bogdan Orza, Magdalena Chirila, Mircea Giurgiu, "The SWARA Speech Corpus: A Large Parallel Romanian Read Speech Dataset", In Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, Romania, 2017. [bib] [pdf]
> Stefan-Adrian Toma, Adriana Stan, Mihai-Lica Pura, Traian Barsan, "MaRePhoR - An Open Access Machine-Readable Phonetic Dictionary for Romanian", In Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, Romania, 2017. [bib] [pdf]
> Alexandru Moldovan, Adriana Stan, Mircea Giurgiu, "Improving Sentence-level Alignment of Speech with Imperfect Transcripts using Utterance Concatenation and VAD", In Proc. of IEEE ICCP, Cluj-Napoca, Romania, 2016. [bib] [pdf]
> Adriana Stan, Cassia Valentini-Botinhao, Bogdan Orza, Mircea Giurgiu, "Blind Speech Segmentation using Spectrogram-image Based Features and Mel Cepstral Coefficients", In Proc. IEEE Workshop on Spoken Language Technology, San Diego, USA, 2016. [bib] [pdf]
> Adriana Stan, Yoshitaka Mamiya, Junichi Yamagishi, Peter Bell, Oliver Watts, Rob Clark, Simon King, "ALISA: An automatic lightly supervised speech segmentation and alignment tool", In Computer Speech and Language, vol. 35, pp. 116-133, 2016. [bib] [pdf] [doi]
> Adriana Stan, Cassia Valentini-Botinhao, Mircea Giurgiu, Simon King, "Phonetic Segmentation of Speech using STEP and t-SNE", In Proc. of the 8th International Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucuresti, Romania, 2015. [bib] [pdf]
> Jószef Domokos, Adriana Stan, Mircea Giurgiu, "An Approach to Lexical Stress Detection from Transcribed Continuous Speech Using Acoustic Features", In Proc. 22nd Telecommunications Forum, Belgrade, Serbia, 2014. [bib] [pdf]
> Tiberiu Boros, Adriana Stan, Oliver Watts, Stefan Daniel Dumitrescu, "RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus", In Proc. The 9th edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014. [bib] [pdf]
> O. Watts, S. Gangireddy, J. Yamagishi, S. King, S. Renals, A. Stan, M. Giurgiu, "Neural Net Word Representations for Phrase-Break Prediction Without a Part of Speech Tagger", In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, pp. 2599-2603, 2014. [bib] [pdf]
> O. Watts, A. Stan, Y. Mamiya, A. Suni, M. Burgos, J.M. Montero, "The Simple4All entry to the Blizzard Challenge 2013", In Proc. Blizzard Challenge, Barcelona, Spain, 2013. [bib] [pdf]
> Y. Mamiya, A. Stan, J. Yamagishi, P. Bell, O. Watts, R.A.J. Clark, S. King, "Using Adaptation to Improve Speech Transcription Alignment in Noisy and Reverberant Environments", In Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 2013. [bib] [pdf]
> A. Stan, O. Watts, Y. Mamiya, M. Giurgiu, R. A. J. Clark, J. Yamagishi, S. King, "TUNDRA: A Multilingual Corpus of Found Data for TTS Research Created with Light Supervision", In Proc. Interspeech, Lyon, France, pp. 2331-2335, 2013. [bib] [pdf]
> O. Watts, A. Stan, R. Clark, Y. Mamiya, M. Giurgiu, J. Yamagishi, S. King, "Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from ‘found’ data: evaluation and analysis", In Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 2013. [bib] [pdf]
> Adriana Stan, Peter Bell, Junichi Yamagishi, Simon King, "Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data", In Proc. Interspeech, Lyon, France, pp. 1525-1529, 2013. [bib] [pdf]
> Ioana Muresan, Adriana Stan, Mircea Giurgiu, Rodica Potolea, "Evaluation of Sentiment Polarity Prediction using a Dimensional and a Categorical Approach", In Proc. SPED, Cluj-Napoca, Romania, 2013. [bib] [pdf]
> Yoshitaka Mamiya, Junichi Yamagishi, Oliver Watts, Robert A.J. Clark, Simon King, Adriana Stan, "Lightly Supervised GMM VAD to use Audiobook for Speech Synthesiser", In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vancouver, Canada, pp. 7987-7991, 2013. [bib] [pdf]
> Adriana Stan, Peter Bell, Simon King, "A Grapheme-based Method for Automatic Alignment of Speech and Text Data", In Proc. IEEE Workshop on Spoken Language Technology, Miami, Florida, USA, pp. 286-290, 2012. [bib] [pdf]
> Adriana Stan, "Romanian HMM-based Text-to-Speech Synthesis with Interactive Intonation Optimisation", PhD thesis, Technical University of Cluj-Napoca, 2011. [bib] [pdf]
> Adriana Stan, Mircea Giurgiu, "A Superpositional Model Applied to F0 Parametrisation using DCT for Text-to-Speech Synthesis", In Proceedings of the 6th Conference on Speech Technology and Human-Computer Dialogue, Brasov, Romania, 2011. [bib]
> Adriana Stan, Florin-Claudiu Pop, Marcel Cremene, Mircea Giurgiu, Denis Pallez, "Interactive Intonation Optimisation Using CMA-ES and DCT Parametrisation of the F0 Contour for Speech Synthesis", In Proceedings of the 5th Workshop on Nature Inspired Cooperative Strategies for Optimisation, Springer, vol. 387, pp. 57-71, 2011. [bib] [pdf] [doi]
> Adriana Stan, Junichi Yamagishi, Simon King, Matthew Aylett, "The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate", In Speech Communication, vol. 53, no. 3, pp. 442-450, 2011. [bib] [pdf] [doi]
> Adriana Stan, Mircea Giurgiu, "Romanian language statistics and resources for text-to-speech systems", In Proceedings of the 9th Edition of the International Symposium on Electronics and Telecommunications, Timisoara, Romania, pp. 381-384, 2010. [bib]
> Adriana Stan, "Linear Interpolation of Spectrotemporal Excitation Pattern Representations for Automatic Speech Recognition in the Presence of Noise", In Proceedings of the 5th Conference on Speech Technology and Human-Computer Dialogue, Constanta, Romania, 2009. [bib]
> Adriana Stan, "A Study on the Performances of CELP Speech Coding at Low Bit Rates", In Novice Insights, 2007. [bib]

Contact

Communications Department

26-28 George Baritiu,
Room 364,
400027, Cluj-Napoca,
Romania,
Phone: +40-264-401226
Fax: +40-264-597083

Adriana (dot) STAN (at) com.utcluj.ro

©Copyright 2023 | Adriana STAN