Dr. Thomas Rolland, AI Research scientist.

My research focuses on advancing Automatic Speech Recognition (ASR) through the design of parameter-efficient architectures, large-scale synthetic data augmentation, and post-training strategies aimed at improving robustness, adaptability, and fairness across diverse languages, speakers, and domains.

Portrait of Thomas Rolland

About

Who I am

I am a Postdoctoral Researcher at INESC‑ID (Lisbon) working on parameter‑efficient Transformers and speech technologies. I focus on ASR/TTS, synthetic data augmentation, and post‑training strategies for robustness and fairness.

  • 🗣️Children & low‑resource ASR with scalable TTS generation + filtering.
  • 🧩Parameter‑efficient learning: adapters, LoRA, shared-weights.
  • 📈Evaluation, reproducibility and privacy.

Publications

Selected work

Peer-reviewed and preprint work across ASR, PEFT, and evaluation. See the dedicated page for the full list.

Research Interests

Topics I'm interested in

Speech processing

Automatic Speech Recognition

Children and Low-ressource ASR. Building large-scale TTS and filtering pipelines to expand training corpora and improve robustness.

ASRSynthetic data generation & filtering pipeline
NLP & Speech Processing

Multimodal & Responsible AI

Integrating acoustic and linguistic features for pathology detection and ensuring that speech technologies are interpretable, inclusive, and privacy-preserving.

MultimodalPrivacy
NLP & Speech processing

Post-training strategies

Exploring transfer learning, fine-tuning, and domain adaptation methods to enhance the adaptability and robustness of ASR and large language models (LLMs).

PEFTRLHF

Projects

Past and present work

Speech processing

European Portuguese ASR

Development of state-of-the-art ASR systems for European Portuguese, as well as CAMÕES a comprehensive benchmark for ASR evaluation.

ASR
Speech Processing

Children's speech data in the wild

Collecting children's speech from the wild. This project aims to create a diverse dataset that reflects real-world scenarios for testing purposes.

Data collection
NLP & Speech processing

Looped-Transformer

Exploring Exploring Transformer variant that uses loop mechanism for parameter-efficiency.

PretrainingParameter-Efficiency

Blog

Explainers & deep dives

Contact

Let’s collaborate