My Projects and Publications

2025

Scalable Controllable Accented TTS - Aug 2025
Henry Li Xinyuan, Zexin Cai, Ashi Garg, Kevin Duh, Leibny Paola García-Perera, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner
[ASRU 2025]
Demo Page
HLTCOE Submission to the VoicePrivacy Attacker Challenge - Apr 2025
Henry Li Xinyuan, Ashi Garg, Zexin Cai, Kevin Duh, Leibny Paola García-Perera, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner
[ICASSP 2025]
Video Presentation
GenVC: Self-Supervised Zero-Shot Voice Conversion - Feb 2025
Zexin Cai, Henry Li Xinyuan, Ashi Garg, Leibny Paola García-Perera, Kevin Duh, Sanjeev Khudanpur, Matthew Wiesner, Nicholas Andrews
[ASRU 2025]
ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts - Feb 2025
Ashi Garg, Zexin Cai, Henry Li Xinyuan, Leibny Paola García-Perera, Kevin Duh, Sanjeev Khudanpur, Matthew Wiesner, Nicholas Andrews
[ASRU 2025]

2024

HLTCOE JHU Submission to the Voice Privacy Challenge 2024 - Sep 2024
Henry Li Xinyuan, Zexin Cai, Ashi Garg, Kevin Duh, Leibny Paola García-Perera, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner
[Voice Privacy Challenge 2024; Symposium on Security and Privacy in Speech Communication 2024]
Video Presentation
Clean Label Attacks against SLU Systems - Sep 2024
Henry Li Xinyuan, Sonal Joshi, Thomas Thebaud, Jesus Villalba, Najim Dehak, Sanjeev Khudanpur
[IEEE SLT 2024]
Video Presentation
Privacy versus Emotion Preservation Trade-offs in Emotion-Preserving Speaker Anonymization - Sep 2024
Zexin Cai, Henry Li Xinyuan, Ashi Garg, Leibny Paola García-Perera, Kevin Duh, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner
[IEEE SLT 2024]
JHU IWSLT 2024 Dialectal and Low-resource System Description - August 2024
Nathaniel Romney Robinson, Kaiser Sun, Cihan Xiao, Niyati Bafna, Weiting Tan, Haoran Xu, Henry Li Xinyuan, Ankur Kejriwal, Sanjeev Khudanpur, Kenton Murray, Paul McNamee
[IWSLT 2024]

2023

JHU IWSLT 2023 Multilingual Speech Translation System Description - July 2023
Henry Li Xinyuan, Neha Verma, Bismarck Bamfo Odoom, Ujvala Pradeep, Matthew Wiesner, Sanjeev Khudanpur
[IWSLT 2023]
Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System - June 2023
Thomas Thebaud, Sonal Joshi, Henry Li, Martin Sustek, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak
[ASRU 2023]
HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation - June 2023
Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur
[Interspeech 2023]
Learning a Formality-Aware Japanese Sentence Representation - Jan 2023
Henry Li Xinyuan, Ray Lee, Jerry Chen, Kelly Marchisio

2022 and Earlier

Minecraft Settlement Generator: Entry to GDMC 2022
Approximating the Multi Commodity Flow problem - May 2021
Studying consonant voicing in Shanghainese - May 2021
Voice and Ambient-Control Rave Goggles at Oxford Hack 2018 - Nov. 2018