Publications

Selected works by Satwinder Singh

2025

“Robust Cross-Etiology and Speaker-Independent Dysarthric Speech Recognition.”
Satwinder Singh, Qianli Wang, Zihan Zhong, Clarion Mendes, Mark Hasegawa‑Johnson, Waleed Abdulla, and Seyed Reza Shahamiri.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), India, 2025.
“Efficient Adaptation of Large‑Scale ASR for Robust Dysarthric Speech Recognition.” Submitted
Qianli Wang, Zihan Zhong, Satwinder Singh, Clarion Mendes, Mark Hasegawa‑Johnson, Waleed Abdulla, and Seyed Reza Shahamiri.
IEEE Signal Processing Letters, 2025.
“Empowering Māori Automatic Speech Recognition through EMD‑Based Augmentation.” Submitted
Chengxi Lei, Satwinder Singh, Feng Hou, Huia Jahnke, and Ruili Wang.
Pacific Rim International Conference on Artificial Intelligence, 2025.
“Convolution‑Augmented Transformers for Enhanced Speaker‑Independent Dysarthric Speech Recognition.” Submitted
Zihan Zhong, Qianli Wang, Satwinder Singh, Clarion Mendes, Mark Hasegawa‑Johnson, Waleed Abdulla, and Seyed Reza Shahamiri.
IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2025.
“Beyond Binary Detection: Multi‑Etiology Dysarthria Classification with Pre‑trained Speech Models.” Submitted
Zihan Zhong, Qianli Wang, Satwinder Singh, Clarion Mendes, Mark Hasegawa‑Johnson, Waleed Abdulla, and Seyed Reza Shahamiri.
Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025.
“Dysarthric Speech Conformer: Adaptation for Sequence‑to‑Sequence Dysarthric Speech Recognition.”
Qianli Wang, Zihan Zhong, Satwinder Singh, Clarion Mendes, Mark Hasegawa‑Johnson, Waleed Abdulla, and Seyed Reza Shahamiri.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), India, 2025.

2024

“A Comprehensive Performance Evaluation of Whisper Models in Dysarthric Speech Recognition.”
Satwinder Singh, Zihan Zhong, Qianli Wang, Clarion Mendes, Mark Hasegawa‑Johnson, Waleed Abdulla, and Seyed Reza Shahamiri.
Proceedings of the International Conference on Neural Information Processing (ICONIP), New Zealand, 2024.

2023

“PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition.”
Chengxi Lei, Satwinder Singh*, Xiaoyun Jia, Feng Hou, and Ruili Wang.
ACM MM Asia, 2023.
“Punjabi Speech: A labeled Speech Corpus.” Dataset
Satwinder Singh, Ruili Wang, and Feng Hou.
Mendeley Data, V1, 2023.
“Google‑synth: A Synthesized Punjabi Speech Dataset.” Dataset
Satwinder Singh, Ruili Wang, and Feng Hou.
Figshare, V1, 2023.
“CMU‑synth: A Synthesized Punjabi Speech Dataset.” Dataset
Satwinder Singh, Ruili Wang, and Feng Hou.
Figshare, V1, 2023.
“Real and Synthetic Punjabi Speech Datasets for Speech Recognition.”
Satwinder Singh, Ruili Wang, and Feng Hou.
Data in Brief, 2023.
“A Novel Self‑training Approach for Low‑resource Speech Recognition.”
Satwinder Singh, Ruili Wang, and Feng Hou.
Proc. Interspeech, Ireland, pp. 1588–1592, 2023.

2022

“Enhancing End‑to‑End Automatic Speech Recognition for Low‑Resource Punjabi Language Using Synthesized Datasets.”
Satwinder Singh, Ruili Wang, Feng Hou, and Zhizhong Ma.
Available at SSRN 4181844, 2022.
“Improved Meta Learning for Low Resource Speech Recognition.”
Satwinder Singh, Ruili Wang, and Feng Hou.
ICASSP 2022 — IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4798–4802, Singapore, 2022.
“CyclicAugment: Speech Data Random Augmentation with Cosine Annealing Scheduler for Automatic Speech Recognition.”
Zhihan Wang, Feng Hou, Yuanhang Qiu, Zhizhong Ma, Satwinder Singh, and Ruili Wang.
Proc. Interspeech, Incheon, Korea, pp. 3859–3863, 2022.
“Automatic Speech‑based Smoking Status Identification.”
Zhizhong Ma, Feng Hou, Satwinder Singh, Yuanhang Qiu, Ruili Wang, Christopher Bullen, and Joanna Ting Wai Chu.
Computing Conference, July 2022, London, United Kingdom.

2021

“DEEPF0: End‑To‑End Fundamental Frequency Estimation for Music and Speech Signals.”
Satwinder Singh, Ruili Wang, and Yuanhang Qiu.
ICASSP 2021 — IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 61–65, May 2021, Toronto, ON, Canada.
“Towards the Objective Speech Assessment of Smoking Status based on Voice Features: A Review of the Literature.”
Zhizhong Ma, Chris Bullen, Joanna Ting Wai Chu, Ruili Wang, Yingchun Wang, and Satwinder Singh.
Journal of Voice, January 2021.
“Self‑Supervised Learning Based Phone‑Fortified Speech Enhancement.”
Yuanhang Qiu, Ruili Wang, Satwinder Singh, Zhizhong Ma, and Feng Hou.
Proc. Interspeech, pp. 211–215, August 2021, Brno, Czechia.

2015

“Dual Layer Security of Data Using LSB Image Steganography Method and AES Encryption Algorithm.”
Satwinder Singh and Varinder Kaur Attri.
International Journal of Signal Processing, Image Processing and Pattern Recognition (IJSIP), Vol. 8, No. 5, pp. 259–266, May 2015.
“State‑of‑the‑art Review on Steganographic Techniques.”
Satwinder Singh and Varinder Kaur Attri.
International Journal of Signal Processing, Image Processing and Pattern Recognition (IJSIP), Vol. 8, No. 7, pp. 161–170, July 2015.