Adriana Stan -- Official Webpage

Projects

EU Horizon "AI4Trust" (no. 101070190) (2023-2026)

https://ai4trust.eu

National Strategic Framework in the Field of Artificial Intelligence (2021-2023)

https://strategie-ia.utcluj.ro

SINTERO: Technologies for building human-machine interfaces for expressive text-to-speech synthesis (2018-2021)

https://speech.utcluj.ro/sintero/

SWARA: Mobile System for Rehabilitative Vocal Assistance of Surgical Aphonia (2015-2017)

https://speech.utcluj.ro/swara/

SIMPLE4ALL: Speech Synthesis that Improves through Adaptive Learning (2011-2014)

https://simple4all.org/

Tools and Corpora

RoLEX: An extended Romanian Lexical Dataset

https://github.com/adrianastan/rolex

RECOApy - prompted speech recording app

https://github.com/adrianastan/recoapy

ALISA: An automatic lightly supervised speech segmentation and alignment tool

https://simple4all.org/product/alisa/

The SWARA CORPUS

http://speech.utcluj.ro/swarasc/

The Romanian Speech Synthesis CORPUS

https://romaniantts.com/rssdb/

The Tundra CORPUS

https://zenodo.org/records/12543428

The MaRePhor Lexicon

http://speech.utcluj.ro/marephor/

Cartea Sonoră - "Mara" Corpus

http://speech.utcluj.ro/corpora/mara.html

Romanian TTS System

http://romaniantts.com

Publications

	> Adriana Stan, David Combei, Dan Oneata, Nicolas Müller, Horia Cucu, "TADA: Training-free Attribution and Out-of-Domain Detection of Audio Deepfakes", In Proceedings of Interspeech, 2025. [bib] [pdf]
	> Nicolas Müller, Piotr Kawa, Wei-Herng Choong, Adriana Stan, Aditya Tirumala Bukkapatnam, Karla Pizzi, Alexander Wagner, Philip Sperl, "Replay Attacks Against Audio Deepfake Detection", In Proceedings of Interspeech, 2025. [bib] [pdf]
	> David Combei, Adriana Stan, Dan Oneata, Nicolas Müller, Horia Cucu, "Unmasking real-world audio deepfakes: A data-centric approach", In Proceedings of Interspeech, 2025. [bib] [pdf]
	> Teodora Răgman, Adriana Stan, "Efficient Training Strategies for Natural Sounding Speech Synthesis and Speaker Adaptation Based on Fastpitch", In 2024 IEEE 20th International Conference on Intelligent Computer Communication and Processing (ICCP), pp. 1-6, 2024. [bib] [pdf] [doi]
	> David Combei, Adriana Stan, Dan Oneata, Horia Cucu, "WavLM model ensemble for audio deepfake detection", In The Automatic Speaker Verification Spoofing Countermeasures Workshop (ASVspoof 2024), pp. 170-175, 2024. [bib] [pdf] [doi]
	> Octavian Pascu, Adriana Stan, Dan Oneata, Elisabeta Oneata, Horia Cucu, "Towards generalisable and calibrated audio deepfake detection with self-supervised representations", In Proceedings of Interspeech, 2024. [bib] [pdf] [doi]
	> Vlad Striletchi, Cosmin Striletchi, Adriana Stan, "TBDM-Net: Bidirectional Dense Networks with Gender Information for Speech Emotion Recognition", In Proceedings of 2024 IEEE International Workshop on Machine Learning for Signal Processing, London, UK, 2024. [bib] [pdf] [doi]
	> Adrian Bogdan Stânea, Vlad Strilețchi, Cosmin Strilețchi, Adriana Stan, "An analysis of large speech models-based representations for speech emotion recognition", In 2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 100-104, 2023. [bib] [pdf] [doi]
	> Adriana Stan, Johannah O'Mahony, "An analysis on the effects of speaker embedding choice in non auto-regressive TTS", In 12th ISCA Speech Synthesis Workshop (SSW2023), pp. 134-138, 2023. [bib] [pdf] [doi]
	> Samuel Rutunda, Kleber Kabanda, Adriana Stan, "Kinyarwanda TTS: Using a multi-speaker dataset to build a Kinyarwanda TTS model", In 4th Workshop on African Natural Language Processing, ICLR, 2023. [bib] [pdf]
	> Beáta Lőrincz, Elena Irimia, Adriana Stan, Verginica Barbu Mititelu, "RoLEX: The development of an extended Romanian lexical dataset and its evaluation at predicting concurrent lexical information", In Natural Language Engineering, Cambridge University Press, pp. 1–26, 2022. [bib] [pdf] [doi]
	> Adriana Stan, "Residual Information in Deep Speaker Embedding Architectures", In Mathematics, vol. 10, no. 21, 2022. [bib] [pdf] [doi]
	> Adriana Stan, "Introducere în Python folosind Google Colab", UTPress, Cluj-Napoca, Romania, 2022. [bib] [pdf]
	> Adriana Stan, "The ZevoMOS entry to VoiceMOS Challenge 2022", In Proc. Interspeech 2022, pp. 4516-4520, 2022. [bib] [pdf] [doi]
	> Dan Oneață, Beáta Lőrincz, Adriana Stan, Horia Cucu, "FlexLip: A Controllable Text-to-Lip System", In Special Issue Future Speech Interfaces with Sensors and Machine Intelligence, Sensors, MDPI, vol. 22, 2022. [bib] [pdf] [doi]
	> Stefan Daniel Dumitrescu, Petru Rebeja, Beáta Lőrincz, Mihaela Gaman, Andrei Avram, Mihai Ilie, Andrei Pruteanu, Adriana Stan, Lorena Rosia, Cristina Iacobescu, Luciana Morogan, George Dima, Gabriel Marchidan, Traian Rebedea, Mădălina Chitez, Dani Yogatama, Sebastian Ruder, Radu Tudor Ionescu, Răzvan Pașcanu, Viorica Pătrăucean, "Liro: Benchmark and leaderboard for Romanian language tasks", In Proceedings of NeurIPS, 2021. [bib] [pdf]
	> Beáta Lőrincz, Adriana Stan, Mircea Giurgiu, "An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis", In Proceedings of 25th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, 2021. [bib] [pdf]
	> Adriana Stan, Mircea Giurgiu, "Prelucrarea semnalului vocal folosind Python", UTPress, Cluj-Napoca, Romania, 2021. [bib] [pdf]
	> Adriana Stan, Beáta Lőrincz, Maria Nuțu, Mircea Giurgiu, "The MARA corpus: Expressivity in end-to-end TTS systems using synthesised speech data", In Proceedings of SpeD, 2021. [bib] [pdf]
	> Dan Oneață, Adriana Stan, Horia Cucu, "Speaker disentanglement in video-to-speech conversion", In 29th European Signal Processing Conference (EUSIPCO), 2021. [bib] [pdf]
	> Beáta Lőrincz, Adriana Stan, Mircea Giurgiu, "Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis", In 29th European Signal Processing Conference (EUSIPCO), pp. 26-30, 2021. [bib] [pdf] [doi]
	> Georgiana Săracu, Adriana Stan, "An analysis of the data efficiency in Tacotron2 speech synthesis system", In 2021 International Conference on Speech Technology and Human-Computer Dialogue (SpeD), pp. 172-176, 2021. [bib] [pdf] [doi]
	> Adriana Stan, Beáta Lőrincz, "Generating the Voice of the Interactive Virtual Assistant", Chapter in Virtual Assistants, IntechOpen, 2021. [bib] [pdf] [doi]
	> Dan Oneață, Alexandru Caranica, Adriana Stan, Horia Cucu, "An Evaluation of Word-level Confidence Estimation for end-to-end Automatic Speech Recognition", In Proceedings of the 8th IEEE Spoken Language Technology Workshop (SLT 2021), Shenzhen, China, 2021. [bib] [pdf]
	> Kristen Scott, Simone Ashby, Adriana Stan, "Designing a Synthesized Content Feed System for Community Radio", In Proc. of NordiCHI, Tallinn, Estonia, 2020. [bib] [pdf]
	> Adriana Stan, "RECOApy: Data recording, pre-processing and phonetic transcription for end-to-end speech-based applications", In Proceedings of Interspeech, Shanghai, China, 2020. [bib] [pdf]
	> Beáta Lőrincz, Maria Nuțu, Adriana Stan, Mircea Giurgiu, "An Evaluation of Postfiltering for Deep Learning-based Speech Synthesis with Limited Data", In Proc. of 2020 IEEE 10th International Conference on Intelligent Systems, 2020. [bib] [pdf]
	> Adriana Stan, "Input Encoding for Sequence-to-Sequence Learning of Romanian Grapheme-to-Phoneme Conversion", In Proceedings of the 10th IEEE International Conference on Speech Technology and Human-Computer Dialogue (SpeD), Timisoara, Romania, 2019. [bib] [pdf]
	> Maria Nuţu, Beáta Lőrincz, Adriana Stan, "Deep Learning for Automatic Diacritics Restoration in Romanian", In IEEE 15th International Conference on Intelligent Computer Communication and Processing (ICCP), pp. 235-240, 2019. [bib] [pdf] [doi]
	> Beáta Lőrincz, Maria Nuţu, Adriana Stan, "Romanian Part of Speech Tagging using LSTM Networks", In IEEE 15th International Conference on Intelligent Computer Communication and Processing (ICCP), pp. 223-228, 2019. [bib] [pdf] [doi]
	> David A. Braude, Matthew P. Aylett, Caoimhin Laoide-Kemp, Simone Ashby, Kristen M. Scott, Brian O Raghallaigh, Anna Braudo, Alex Brouwer, Adriana Stan, "All Together Now: The Living Audio Dataset", In Proceedings of Interspeech, Graz, Austria, 2019. [bib] [pdf]
	> Adriana Stan, Mircea Giurgiu, "A Comparison Between Traditional Machine Learning Approaches And Deep Neural Networks For Text Processing In Romanian", In Proceedings of the 13th International Conference on Linguistic Resources and Tools for Processing Romanian Language (ConsILR), Jassy, Romania, 2018. [bib] [pdf]
	> Adriana Stan, Florina Dinescu, Cristina Tiple, Serban Meza, Bogdan Orza, Magdalena Chirila, Mircea Giurgiu, "The SWARA Speech Corpus: A Large Parallel Romanian Read Speech Dataset", In Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, Romania, 2017. [bib] [pdf]
	> Stefan-Adrian Toma, Adriana Stan, Mihai-Lica Pura, Traian Barsan, "MaRePhoR - An Open Access Machine-Readable Phonetic Dictionary for Romanian", In Proceedings of the 9th Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, Romania, 2017. [bib] [pdf]
	> Alexandru Moldovan, Adriana Stan, Mircea Giurgiu, "Improving Sentence-level Alignment of Speech with Imperfect Transcripts using Utterance Concatenation and VAD", In Proc. of IEEE ICCP, Cluj-Napoca, Romania, 2016. [bib] [pdf]
	> Adriana Stan, Cassia Valentini-Botinhao, Bogdan Orza, Mircea Giurgiu, "Blind Speech Segmentation using Spectrogram-image Based Features and Mel Cepstral Coefficients", In Proc. IEEE Workshop on Spoken Language Technology, San Diego, USA, 2016. [bib] [pdf]
	> Adriana Stan, Yoshitaka Mamiya, Junichi Yamagishi, Peter Bell, Oliver Watts, Rob Clark, Simon King, "ALISA: An automatic lightly supervised speech segmentation and alignment tool", In Computer Speech and Language, vol. 35, pp. 116-133, 2016. [bib] [pdf] [doi]
	> Adriana Stan, Cassia Valentini-Botinhao, Mircea Giurgiu, Simon King, "Phonetic Segmentation of Speech using STEP and t-SNE", In Proc. of the 8th International Conference on Speech Technology and Human-Computer Dialogue (SpeD), Bucharest, Romania, 2015. [bib] [pdf]
	> József Domokos, Adriana Stan, Mircea Giurgiu, "An Approach to Lexical Stress Detection from Transcribed Continuous Speech Using Acoustic Features", In Proc. 22nd Telecommunications Forum, Belgrade, Serbia, 2014. [bib] [pdf]
	> Tiberiu Boros, Adriana Stan, Oliver Watts, Stefan Daniel Dumitrescu, "RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus", In Proc. The 9th edition of the Language Resources and Evaluation Conference, Reykjavik, Iceland, 2014. [bib] [pdf]
	> O. Watts, S. Gangireddy, J. Yamagishi, S. King, S. Renals, A. Stan, M. Giurgiu, "Neural Net Word Representations for Phrase-Break Prediction Without a Part of Speech Tagger", In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, pp. 2599-2603, 2014. [bib] [pdf]
	> A. Stan, O. Watts, Y. Mamiya, M. Giurgiu, R. A. J. Clark, J. Yamagishi, S. King, "TUNDRA: A Multilingual Corpus of Found Data for TTS Research Created with Light Supervision", In Proc. Interspeech, Lyon, France, pp. 2331-2335, 2013. [bib] [pdf]
	> Y. Mamiya, A. Stan, J. Yamagishi, P. Bell, O. Watts, R.A.J. Clark, S. King, "Using Adaptation to Improve Speech Transcription Alignment in Noisy and Reverberant Environments", In Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 2013. [bib] [pdf]
	> O. Watts, A. Stan, R. Clark, Y. Mamiya, M. Giurgiu, J. Yamagishi, S. King, "Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from ‘found’ data: evaluation and analysis", In Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 2013. [bib] [pdf]
	> Yoshitaka Mamiya, Junichi Yamagishi, Oliver Watts, Robert A.J. Clark, Simon King, Adriana Stan, "Lightly Supervised GMM VAD to use Audiobook for Speech Synthesiser", In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vancouver, Canada, pp. 7987-7991, 2013. [bib] [pdf]
	> O. Watts, A. Stan, Y. Mamiya, A. Suni, M. Burgos, J.M. Montero, "The Simple4All entry to the Blizzard Challenge 2013", In Proc. Blizzard Challenge, Barcelona, Spain, 2013. [bib] [pdf]
	> Ioana Muresan, Adriana Stan, Mircea Giurgiu, Rodica Potolea, "Evaluation of Sentiment Polarity Prediction using a Dimensional and a Categorical Approach", In Proc. SPED, Cluj-Napoca, Romania, 2013. [bib] [pdf]
	> Adriana Stan, Peter Bell, Junichi Yamagishi, Simon King, "Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data", In Proc. Interspeech, Lyon, France, pp. 1525-1529, 2013. [bib] [pdf]
	> Adriana Stan, Peter Bell, Simon King, "A Grapheme-based Method for Automatic Alignment of Speech and Text Data", In Proc. IEEE Workshop on Spoken Language Technology, Miami, Florida, USA, pp. 286-290, 2012. [bib] [pdf]
	> Adriana Stan, "Romanian HMM-based Text-to-Speech Synthesis with Interactive Intonation Optimisation", PhD thesis, Technical University of Cluj-Napoca, 2011. [bib] [pdf]
	> Adriana Stan, Mircea Giurgiu, "A Superpositional Model Applied to F0 Parametrisation using DCT for Text-to-Speech Synthesis", In Proceedings of the 6th Conference on Speech Technology and Human-Computer Dialogue, Brasov, Romania, 2011. [bib]
	> Adriana Stan, Florin-Claudiu Pop, Marcel Cremene, Mircea Giurgiu, Denis Pallez, "Interactive Intonation Optimisation Using CMA-ES and DCT Parametrisation of the F0 Contour for Speech Synthesis", In Proceedings of the 5th Workshop on Nature Inspired Cooperative Strategies for Optimisation, Springer, vol. 387, pp. 57-71, 2011. [bib] [pdf] [doi]
	> Adriana Stan, Junichi Yamagishi, Simon King, Matthew Aylett, "The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate", In Speech Communication, vol. 53, no. 3, pp. 442-450, 2011. [bib] [pdf] [doi]
	> Adriana Stan, Mircea Giurgiu, "Romanian language statistics and resources for text-to-speech systems", In Proceedings of the 9th Edition of the International Symposium on Electronics and Telecommunications, Timisoara, Romania, pp. 381-384, 2010. [bib]
	> Adriana Stan, "Linear Interpolation of Spectrotemporal Excitation Pattern Representations for Automatic Speech Recognition in the Presence of Noise", In Proceedings of the 5th Conference on Speech Technology and Human-Computer Dialogue, Constanta, Romania, 2009. [bib]
	> Adriana Stan, "A Study on the Performances of CELP Speech Coding at Low Bit Rates", In Novice Insights, 2007. [bib]

Contact

Communications Department

26-28 George Baritiu,
Room 364,
400027, Cluj-Napoca,
Romania
Phone: +40-264-401226
Fax: +40-264-597083

Adriana (dot) STAN (at) com.utcluj.ro