Our Publications
Investigating Disentanglement in a Phoneme-level Speech Codec for Prosody Modeling
Sotirios Karapiperis, Nikolaos Ellinas, Alexandra Vioni, Junkwang Oh, Gunu Jho, Inchul Hwang and Spyros Raptis
SLT 2024
Link (Samples available)
Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification
Michail Mitsios, Georgios Vamvoukakis, Georgia Maniati, Nikolaos Ellinas, Georgios Dimitriou, Konstantinos Markopoulos, Panos Kakoulidis, Alexandra Vioni, Myrsini Christidou, Junkwang Oh, Gunu Jho, Inchul Hwang, Georgios Vardaxoglou, Aimilios Chalamandaris, Pirros Tsiakoulis and Spyros Raptis
NAACL 2024
Link
Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations
Panos Kakoulidis,
Nikolaos Ellinas,
Georgios Vamvoukakis,
Myrsini Christidou,
Alexandra Vioni,
Georgia Maniati,
Junkwang Oh,
Gunu Jho,
Inchul Hwang,
Pirros Tsiakoulis and Aimilios Chalamandaris
ΙΕΕΕ ICASSP SASB 2024
Link (Samples available)
Generating Multilingual Gender-Ambiguous Text-to-Speech Voices
Konstantinos Markopoulos,
Georgia Maniati,
Georgios Vamvoukakis,
Nikolaos Ellinas,
Georgios Vardaxoglou,
Panos Kakoulidis,
Junkwang Oh,
Gunu Jho,
Inchul Hwang,
Aimilios Chalamandaris,
Pirros Tsiakoulis and Spyros Raptis
Interspeech 2023
Link (Samples available)
Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features
Alexandra Vioni,
Georgia Maniati,
Nikolaos Ellinas,
June Sig Sung,
Inchul Hwang,
Aimilios Chalamandaris and
Pirros Tsiakoulis
ICASSP 2023
Link
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis
Georgia Maniati,
Alexandra Vioni,
Nikolaos Ellinas,
Karolos Nikitaras,
Konstantinos Klapsas,
June Sig Sung,
Gunu Jho,
Aimilios Chalamandaris and
Pirros Tsiakoulis
Interspeech 2022
Link (Samples available)
Karaoker: Alignment-free singing voice synthesis with speech training data
Panos Kakoulidis,
Nikolaos Ellinas,
Georgios Vamvoukakis,
Konstantinos Markopoulos,
June Sig Sung,
Gunu Jho,
Pirros Tsiakoulis and
Aimilios Chalamandaris
Interspeech 2022
Link (Samples available)
Fine-grained Noise Control for Multispeaker Speech Synthesis
Karolos Nikitaras,
Georgios Vamvoukakis,
Nikolaos Ellinas,
Konstantinos Klapsas,
Konstantinos Markopoulos,
Spyros Raptis,
June Sig Sung,
Gunu Jho,
Aimilios Chalamandaris and Pirros Tsiakoulis
Interspeech 2022
Link (Samples available)
Self supervised learning for robust voice cloning
Konstantinos Klapsas,
Nikolaos Ellinas,
Karolos Nikitaras,
Georgios Vamvoukakis,
Panos Kakoulidis,
Konstantinos Markopoulos,
Spyros Raptis,
June Sig Sung,
Gunu Jho,
Aimilios Chalamandaris and
Pirros Tsiakoulis
Interspeech 2022
Link (Samples available)
Controllable speech synthesis by learning discrete phoneme-level prosodic representations
Nikolaos Ellinas,
Myrsini Christidou,
Alexandra Vioni,
June Sig Sung,
Aimilios Chalamandaris,
Pirros Tsiakoulis and Paris Mastorocostas
Speech Communication
Link (Samples available)
Word-level Style Control for Expressive, Non-attentive Speech Synthesis
Konstantinos Klapsas,
Nikolaos Ellinas,
June Sig Sung,
Hyoungmin Park and Spyros Raptis
SPECOM 2021
Link (Samples available)
Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control
Myrsini Christidou*Equal Contribution,
Alexandra Vioni*Equal Contribution,
Nikolaos Ellinas,
Georgios Vamvoukakis,
Konstantinos Markopoulos,
Panos Kakoulidis,
June Sig Sung,
Hyoungmin Park,
Aimilios Chalamandaris and Pirros Tsiakoulis
SPECOM 2021
Link (Samples available)
Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
Konstantinos Markopoulos,
Nikolaos Ellinas,
Alexandra Vioni,
Myrsini Christidou,
Panos Kakoulidis,
Georgios Vamvoukakis,
Georgia Maniati,
June Sig Sung,
Hyoungmin Park,
Pirros Tsiakoulis and Aimilios Chalamandaris
SSW 2021
Link (Samples available)
Cross-lingual Low Resource Speaker Adaptation Using Phonological Features
Georgia Maniati*Equal Contribution,
Nikolaos Ellinas*Equal Contribution,
Konstantinos Markopoulos,
Georgios Vamvoukakis,
June Sig Sung,
Hyoungmin Park,
Aimilios Chalamandaris and Pirros Tsiakoulis
Interspeech 2021
Link (Samples available)
Prosodic Clustering for Phoneme-level Prosody Control in End-to-end Speech Synthesis
Alexandra Vioni*Equal Contribution,
Myrsini Christidou*Equal Contribution,
Nikolaos Ellinas,
Georgios Vamvoukakis,
Panos Kakoulidis,
Taehoon Kim,
June Sig Sung,
Hyoungmin Park,
Aimilios Chalamandaris and Pirros Tsiakoulis
ICASSP 2021
Link (Samples available)
High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency
Nikolaos Ellinas,
Georgios Vamvoukakis,
Konstantinos Markopoulos,
Aimilios Chalamandaris,
Georgia Maniati,
Panos Kakoulidis,
Spyros Raptis,
June Sig Sung,
Hyoungmin Park and Pirros Tsiakoulis
Interspeech 2020
Link (Samples available)