Publications | Thomas Rolland

LREC 2026

FalAR: A Large-scale Speaker-Annotated European Portuguese Speech Corpus of Parliamentary Sessions

Releases a large-scale European Portuguese speech dataset derived from parliamentary session recordings.

PDF

Preprint 2026

Exploring the Potential and Limitations of Model Merging for Multi-Domain Adaptation in ASR

Studies model merging for multi-domain ASR and proposes a more stable merging strategy for European Portuguese speech systems.

PDF

Preprint 2025

Group-Aware Partial Model Merging for Children’s Automatic Speech Recognition

Introduces a parameter-efficient approach that combines clustering, partial fine-tuning, and model merging for children’s ASR.

PDF

ASRU 2025

CAMOES: A Comprehensive Automatic Speech Recognition Benchmark for European Portuguese

Creates a new benchmark for European Portuguese ASR.

PDF

Interspeech 2025

Children's Voice Privacy: First Steps And Emerging Challenges

First paper on children's voice privacy, exploring technical challenges.

PDF

Interspeech 2025

Exploring Shared-Weight Mechanisms in Transformer and Conformer Architectures for Automatic Speech Recognition

Explores different configurations of weight-sharing strategies in transformer and conformer architectures for ASR.

PDF

Interspeech 2025

Acoustic and Linguistic Biomarkers for Cognitive Impairment Detection from Speech

Explores combinations of acoustic and linguistic biomarkers for cognitive impairment detection.

PDF

Preprint 2025

Tackling Cognitive Impairment Detection from Speech: A Submission to the PROCESS Challenge

Compares knowledge-based and pre-trained embeddings for pathology detection.

PDF

Iberspeech 2024

Towards Improved Automatic Speech Recognition for Children

Thesis summary on improving ASR for children.

PDF

ICASSP 2024

Improved Children's Automatic Speech Recognition Combining Adapters and Synthetic Data Augmentation

Combines adapters and synthetic data augmentation to improve children's ASR performance.

Link

Interspeech 2024

Introduction to Partial Fine-Tuning: A Comprehensive Evaluation of End-to-End Children's Automatic Speech Recognition Adaptation

Proposes and evaluates a new selective fine-tuning approach for children's ASR.

PDF

LREC 2022

Multilingual Transfer Learning for Children Automatic Speech Recognition

Multi-task training for multilingual low-resource children's ASR.

PDF

Interspeech 2021

Transfer Learning-Based Cough Representations for Automatic Detection of COVID-19

Uses pre-trained embeddings for COVID-19 detection.

PDF

Interspeech 2020

The INESC-ID Multi-Modal System for the ADReSS 2020 Challenge

Uses speech and text pre-trained embeddings for pathological speech detection.

PDF

Preprint 2020

Pathological Speech Detection Using X-Vector Embeddings

Uses x-vector embeddings for pathological speech detection across different diseases.

PDF

SPARS 2019

Label-Consistent Sparse Auto-Encoders

Explores auto-encoders with label consistency.

PDF

Research outputs across speech, adaptation, and evaluation.
