Dr. Satwinder Singh
About Experience Publications Activities Projects Blog Contact Admin
Hello, I'm

Dr. Satwinder Singh

Specializing in inclusive AI, speech recognition for atypical speech, and accessible language technologies. Passionate about bridging the gap between advanced machine learning and real-world clinical applications.

Dr. Satwinder Singh

Get To Know More

About Me

Profile picture
Experience icon

Experience

7+ years
Research & Teaching

Education icon

Education

PhD Computer Science
Massey University, NZ

I work on inclusive AI for speech and language technologies, with a focus on automatic speech recognition for atypical speech, low-resource settings, and accessibility-driven systems. I also supervise student projects and collaborate with clinicians and community partners to ensure research impact.

Explore My

Experience & Skills

Technical Skills

Python PyTorch TensorFlow C/C++ Linux/Bash SQL Git Docker Web Dev

Research Expertise

Speech Recognition (ASR) Deep Learning NLP AI Accessibility Low-resource Languages Atypical Speech Human-Computer Interaction

Relevant Experience

Postdoctoral Research Fellow

Dept. of Electrical, Computer and Software Engineering, University of Auckland | Auckland, NZ
Jan 2024 - Jan 2026
  • Working on Automatic Speech Recognition for Dysarthric Speech.
  • Supervising undergraduate and postgraduate research.
Research Supervision Deep Learning Machine Learning AI

Postdoctoral Research Fellow

School of Mathematical and Computational Sciences, Massey University | Auckland, NZ
Jul 2023 - Dec 2023
  • Worked on Natural language processing for Q&A in indigenous/vernacular languages.
  • Project funded by MBIE Catalyst: Strategic – New Zealand-Singapore Data Science Research Programme.
Research Teaching Deep Learning Machine Learning AI

Research Assistant

School of Mathematical and Computational Sciences, Massey University | Auckland, NZ
Nov 2022 - Mar 2023
  • Assisted with Course: Information Sciences Research Methods (158750).
  • Worked on funding applications such as Marsden and MURF.
  • Conducted research and data collection for Catalyst Funding Project.
Research Teaching Marking

Tutor

Pinnacle Global Academy (PGA) | Auckland, NZ
Jul 2021 - Feb 2022
  • Taught Mathematics and Python programming.
  • Prepared students for competitive exams.
Mathematics Programming Computer Skills

Assistant Professor

Faculty of Computational Sciences, GNA University | Punjab, INDIA
Jul 2016 - Feb 2018
  • Taught graduate and post-graduate courses.
  • Managed lecture scheduling and supervised examinations.
  • Supervised students' postgraduate research.
AI Databases Programming Languages OS Software Engineering Web Technologies

What I've Been Up To

Recent Activities

Part IV 2025 image 1 Part IV 2025 image 2 Part IV 2025 image 3 Part IV 2025 image 4
October 2025
Best Research Awards at Exihibition Day 2025
Part 4 Research students delivered outstanding work across a range of impactful topics and Secured best industry project in image and voice processing catrory 1 and 2.
Transformer workshop image 1 Transformer workshop image 2 Transformer workshop image 3
September 2025
Transformer Models Workshop
Organized and led a one-day workshop on Transformer models with hands-on practical sessions at the University of Auckland.
ICASSP 2025 image 1 ICASSP 2025 image 2 ICASSP 2025 image 3
April 2025
ICASSP 2025 Conference
Presented two papers on dysarthric speech recognition at the IEEE International Conference on Acoustics, Speech and Signal Processing in India.
ICONIP 2024 image 1 ICONIP 2024 image 2 ICONIP 2024 image 3
December 2024
ICONIP 2024 Conference
Presented research on comprehensive performance evaluation of Whisper models in dysarthric speech recognition at ICONIP in New Zealand.
ACM MM Asia image 1 ACM MM Asia image 2 ACM MM Asia image 3
December 2024
Local Arrangement Chair
Served as Local Arrangement Chair for the ACM Multimedia Asia Conference held in Auckland, New Zealand.
Best Project Award image 1 Best Project Award image 2 Best Project Award image 3
November 2024
Best Project Award 2024
Supervised students Steven Li and Adi Shenoy, who won the Best Project Award for developing a Neural Speaker Diarization System for Doctors.
Supervision image 1 Supervision image 2 Supervision image 3
Ongoing
PhD Student Supervision
Currently supervising multiple PhD and Honours students on projects related to dysarthric speech recognition, low-resource ASR, and AI accessibility tools.

Browse My

Recent Publications

2025 Under review
Low-Burden Data Augmentation for Dysarthric ASR via Zero-Shot Voice Cloning
Satwinder Singh*, Qianli Wang*, Zihan Zhong*, Clarion Mendes†, Mark Hasegawa-Johnson†, Waleed Abdulla*, Seyed Reza Shahamiri*
ICASSP
2025 Published
Convolution-Augmented Transformers for Enhanced Speaker-Independent Dysarthric Speech Recognition
Zihan Zhong, Qianli Wang, Satwinder Singh, Clarion Mendes, Mark Hasegawa-Johnson, Waleed Abdulla, Seyed Reza Shahamiri
IEEE Transactions on Neural Systems and Rehabilitation Engineering
2025 Published
Robust Cross-Etiology and Speaker-Independent Dysarthric Speech Recognition
Satwinder Singh, Qianli Wang, Zihan Zhong, Clarion Mendes, Mark Hasegawa-Johnson, Waleed Abdulla, Seyed Reza Shahamiri
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), India, 2025
2025 Published
Dysarthric Speech Conformer: Adaptation for Sequence-to-Sequence Dysarthric Speech Recognition
Qianli Wang, Zihan Zhong, Satwinder Singh, Clarion Mendes, Mark Hasegawa-Johnson, Waleed Abdulla, Seyed Reza Shahamiri
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), India, 2025
2024 Published
Mix-fine-tune: An Alternate Fine-tuning Strategy for Domain Adaptation and Generalization of Low-resource ASR
Lei, Chengxi; Singh, Satwinder; Hou, Feng; Wang, Ruili
Proceedings of the 6th ACM International Conference on Multimedia in Asia
2024 Published
A Comprehensive Performance Evaluation of Whisper Models in Dysarthric Speech Recognition
Satwinder Singh, Zihan Zhong, Qianli Wang, Clarion Mendes, Mark Hasegawa-Johnson, Waleed Abdulla, Seyed Reza Shahamiri
International Conference on Neural Information Processing (ICONIP), New Zealand, 2024
2023 Published
PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition
Chengxi Lei, Satwinder Singh*, Xiaoyun Jia, Feng Hou, Ruili Wang
ACM MM Asia, 2023
2023 Published
Real and Synthetic Punjabi Speech Datasets for Speech Recognition
Satwinder Singh, Ruili Wang, Feng Hou
Data in Brief Journal, 2023
2023 Published
A Novel Self-training Approach for Low-resource Speech Recognition
Satwinder Singh, Ruili Wang, Feng Hou
Proc. Interspeech, Ireland, pp. 1588-1592, 2023

Explore My

Featured Projects

Active
Dysarthric Speech Recognition System
Deep Learning • PyTorch • Accessibility
An end-to-end automatic speech recognition system designed specifically for dysarthric speech using convolution-augmented transformers. Achieves state-of-the-art performance across multiple etiology groups.
2024
Neural Speaker Diarization for Medical Consultations
Supervised Project • Audio Processing • Healthcare AI
Supervised development of an AI system to automatically identify and separate speakers in doctor-patient conversations. Developed by students Steven Li and Adi Shenoy, winning Best Project Award in 2024.
Research
Te Reo Māori Speech Recognition
Low-Resource ASR • Transfer Learning • Cultural AI
Developing robust ASR models for te reo Māori using self-training and meta-learning approaches to overcome data scarcity. Collaborating with Māori language experts to ensure cultural appropriateness.
2023
Punjabi Speech Corpus
Dataset • Speech Synthesis • Open Source
Created comprehensive real and synthetic speech datasets for Punjabi language containing over 100 hours of annotated audio. Publicly available for research purposes on Mendeley Data.
2023
PhasePerturbation Augmentation
Data Augmentation • ASR Improvement • Signal Processing
Novel speech data augmentation technique using phase perturbation to improve ASR model robustness and generalization without requiring additional labeled data. Published at ACM MM Asia 2023.

Updates

News and Blog

2025-12-21
Scaling Accessibility: How Zero-Shot Voice Cloning Boosts Dysarthric Speech Recognition