Mohammed Safi Ur Rahman Khan
@safikhan2k
PhD @ai4bharat, @iitmadras, @WSAI_IITM
ID: 1548294082371686401
http://safikhanSoofiyani.github.io 16-07-2022 13:11:02
33 Tweets
154 Followers
876 Following
🚨🚨🚨 Excited to share our latest work: "Pralekha: An Indic Document Alignment Evaluation Benchmark", focusing on document-level alignment across 11 Indic languages. Paper: arxiv.org/abs/2411.19096 Github: github.com/AI4Bharat/Pral… Hugging Face 🤗: huggingface.co/datasets/ai4bh… 🧵 1/N
Introducing Indic Parler-TTS: Open-Source Text-to-Speech for Over a Billion Indic Speakers! In collaboration with Hugging Face, we are excited to release Indic Parler-TTS, a state-of-the-art open-source text-to-speech system designed to bring accessible and high-quality
🚨🚨🚨 Tutorial accepted to EMNLP 2025! Anoop Kunchukuttan, Rudra, Mohammed Safi Ur Rahman Khan, Thanmay and I will be doing a tutorial on "Data and Model Centric Approaches for Expansion of Large Language Models to New Languages" at EMNLP. You want to expand LLMs? We will share our knowledge on how!
I'll be presenting our poster for IndicVoices-R at the NeurIPS Conference (Friday, Dec 13, 11 AM, West Ballroom A-D #5110). I'd love to chat about the work we do at AI4Bharat, from speech to LLMs! I'm also seeking summer research internships and would appreciate any guidance or connections.
Glad to share we have a tutorial accepted at EMNLP 2025 on "Data and Model Centric Approaches for Expansion of Large Language Models to New Languages" - with @prajdabre1 Mohammed Safi Ur Rahman Khan Thanmay Rudra Murthy You can see an early version of this here: anoopkunchukuttan.gitlab.io/publications/p…
Had an amazing time working on this with Danish Pruthi and Mansi! Catch janki nawale and me at our poster at ACL 2025 in Vienna - would love to chat!