Hello, I'm

Dr. Satwinder Singh

Specializing in inclusive AI, speech recognition for atypical speech, and accessible language technologies. Passionate about bridging the gap between advanced machine learning and real-world clinical applications.

Get To Know More

About Me

Experience

7+ years
Research & Teaching

Education

PhD Computer Science
Massey University, NZ

I work on inclusive AI for speech and language technologies, with a focus on automatic speech recognition for atypical speech, low-resource settings, and accessibility-driven systems. I also supervise student projects and collaborate with clinicians and community partners to ensure research impact.

Explore My

Experience & Skills

Technical Skills

Python PyTorch TensorFlow C/C++ Linux/Bash SQL Git Docker Web Dev

Research Expertise

Speech Recognition (ASR) Deep Learning NLP AI Accessibility Low-resource Languages Atypical Speech Human-Computer Interaction

Relevant Experience

Postdoctoral Research Fellow

Dept. of Electrical, Computer and Software Engineering, University of Auckland | Auckland, NZ

Jan 2024 - Jan 2026

Working on Automatic Speech Recognition for Dysarthric Speech.
Supervising undergraduate and postgraduate research.

Research Supervision Deep Learning Machine Learning AI

Postdoctoral Research Fellow

School of Mathematical and Computational Sciences, Massey University | Auckland, NZ

Jul 2023 - Dec 2023

Worked on natural language processing for Q&A in indigenous/vernacular languages.
Project funded by MBIE Catalyst: Strategic – New Zealand-Singapore Data Science Research Programme.

View Project

Research Teaching Deep Learning Machine Learning AI

Research Assistant

School of Mathematical and Computational Sciences, Massey University | Auckland, NZ

Nov 2022 - Mar 2023

Assisted with the course Information Sciences Research Methods (158750).
Worked on funding applications such as Marsden and MURF.
Conducted research and data collection for the Catalyst Funding Project.

Research Teaching Marking

Tutor

Pinnacle Global Academy (PGA) | Auckland, NZ

Jul 2021 - Feb 2022

Taught Mathematics and Python programming.
Prepared students for competitive exams.

Mathematics Programming Computer Skills

Assistant Professor

Faculty of Computational Sciences, GNA University | Punjab, India

Jul 2016 - Feb 2018

Taught undergraduate and postgraduate courses.
Managed lecture scheduling and supervised examinations.
Supervised students' postgraduate research.

AI Databases Programming Languages OS Software Engineering Web Technologies

What I've Been Up To

Recent Activities

October 2025

Best Research Awards at Exhibition Day 2025

Part 4 research students delivered outstanding work across a range of impactful topics and secured best industry project in image and voice processing categories 1 and 2.

September 2025

Transformer Models Workshop

Organized and led a one-day workshop on Transformer models with hands-on practical sessions at the University of Auckland.

April 2025

ICASSP 2025 Conference

Presented two papers on dysarthric speech recognition at the IEEE International Conference on Acoustics, Speech and Signal Processing in India.

December 2024

ICONIP 2024 Conference

Presented research on comprehensive performance evaluation of Whisper models in dysarthric speech recognition at ICONIP in New Zealand.

December 2024

Local Arrangement Chair

Served as Local Arrangement Chair for the ACM Multimedia Asia Conference held in Auckland, New Zealand.

November 2024

Best Research Awards at Exhibition Day 2024

Supervised students Steven Li and Adi Shenoy, who won the Best Project Award for developing a Neural Speaker Diarization System for Doctors.

Ongoing

PhD Student Supervision

Currently supervising multiple PhD and Honours students on projects related to dysarthric speech recognition, low-resource ASR, and AI accessibility tools.

Browse My

Recent Publications

2026 Under review

Low-Burden Data Augmentation for Dysarthric ASR via Zero-Shot Voice Cloning

Satwinder Singh*, Qianli Wang*, Zihan Zhong*, Clarion Mendes†, Mark Hasegawa-Johnson†, Waleed Abdulla*, Seyed Reza Shahamiri*

Interspeech

Audio Samples

2025 Published

Convolution-Augmented Transformers for Enhanced Speaker-Independent Dysarthric Speech Recognition

Zihan Zhong, Qianli Wang, Satwinder Singh, Clarion Mendes, Mark Hasegawa-Johnson, Waleed Abdulla, Seyed Reza Shahamiri

IEEE Transactions on Neural Systems and Rehabilitation Engineering

Paper

2025 Published

Robust Cross-Etiology and Speaker-Independent Dysarthric Speech Recognition

Satwinder Singh, Qianli Wang, Zihan Zhong, Clarion Mendes, Mark Hasegawa-Johnson, Waleed Abdulla, Seyed Reza Shahamiri

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), India, 2025

Paper

2025 Published

Dysarthric Speech Conformer: Adaptation for Sequence-to-Sequence Dysarthric Speech Recognition

Qianli Wang, Zihan Zhong, Satwinder Singh, Clarion Mendes, Mark Hasegawa-Johnson, Waleed Abdulla, Seyed Reza Shahamiri

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), India, 2025

Paper

2024 Published

Mix-fine-tune: An Alternate Fine-tuning Strategy for Domain Adaptation and Generalization of Low-resource ASR

Chengxi Lei, Satwinder Singh, Feng Hou, Ruili Wang

Proceedings of the 6th ACM International Conference on Multimedia in Asia

Paper

2024 Published

A Comprehensive Performance Evaluation of Whisper Models in Dysarthric Speech Recognition

Satwinder Singh, Zihan Zhong, Qianli Wang, Clarion Mendes, Mark Hasegawa-Johnson, Waleed Abdulla, Seyed Reza Shahamiri

International Conference on Neural Information Processing (ICONIP), New Zealand, 2024

Paper

2023 Published

PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition

Chengxi Lei, Satwinder Singh*, Xiaoyun Jia, Feng Hou, Ruili Wang

ACM MM Asia, 2023

Paper

2023 Published

Real and Synthetic Punjabi Speech Datasets for Speech Recognition

Satwinder Singh, Ruili Wang, Feng Hou

Data in Brief Journal, 2023

Paper Dataset

2023 Published

A Novel Self-training Approach for Low-resource Speech Recognition

Satwinder Singh, Ruili Wang, Feng Hou

Proc. Interspeech, Ireland, pp. 1588-1592, 2023

Paper

Explore My

Featured Projects

Active

Dysarthric Speech Recognition System

Deep Learning • PyTorch • Accessibility

An end-to-end automatic speech recognition system designed specifically for dysarthric speech using convolution-augmented transformers. Achieves state-of-the-art performance across multiple etiology groups.

2024

Neural Speaker Diarization for Medical Consultations

Supervised Project • Audio Processing • Healthcare AI

Supervised development of an AI system to automatically identify and separate speakers in doctor-patient conversations. Developed by students Steven Li and Adi Shenoy, winning Best Project Award in 2024.

Research

Te Reo Māori Text-to-Speech

Low-Resource ASR • Transfer Learning • Cultural AI

Developing robust TTS and ASR models for te reo Māori using self-training and meta-learning approaches to overcome data scarcity. Collaborating with Māori language experts to ensure cultural appropriateness.

Dataset

2023

Punjabi Speech Corpus

Dataset • Speech Synthesis • Open Source

Created comprehensive real and synthetic speech datasets for the Punjabi language, containing over 100 hours of annotated audio. Publicly available for research purposes on Mendeley Data.

Paper Dataset

2023

PhasePerturbation Augmentation

Data Augmentation • ASR Improvement • Signal Processing

Novel speech data augmentation technique using phase perturbation to improve ASR model robustness and generalization without requiring additional labeled data. Published at ACM MM Asia 2023.

Paper

Updates

News and Blog

2025-12-21

Scaling Accessibility: How Zero-Shot Voice Cloning Boosts Dysarthric Speech Recognition

Read