About Me
I am an Assistant Professor in the NLP group at IT University of Copenhagen.
I completed my PhD at the University of Edinburgh, where my thesis on neural planning for long-document generation received the best dissertation in Scotland award from SICSA. During my PhD, I also interned with the Summarization team at Google Research, London.
My research interests include:
Planning and Long-Context Modeling: I work on improving models’ ability to plan and operate over long contexts, both in general NLP and scientific domains. This includes neural planning for text generation (TACL’21, TACL’22), long-context architectures for summarization and sequence modeling (ACL’23, ICLR’25 W), and genome modeling via task-specific self-pretraining (ICML-GenBio’25).
Multilinguality, Transfer Learning, and Interpretability: I explore methods to make LLMs effective for low-resource and non-Roman script languages through romanization (RomanSetu, ACL’24) and language-relatedness-based chunking (DecoMT, EMNLP’23). I also study how LLMs internally represent such multilingual data, including latent romanization (RomanLens, ACL’25).
Reasoning: I study mathematical reasoning in open-weights LLMs. Our work (VerityMath, ICML-AI4Math’24) identifies unit consistency as a key challenge and introduces Unit Consistency Programs (UCPs) as a solution.
News
- 12 Jun 2025: Paper on self-pretraining for genome modeling accepted to ICML 2025 Workshop on Generative AI for Biology - Paper
- 16 May 2025: RomanLens to appear in Findings of ACL 2025 - Paper
- 11 Feb 2025: RomanLens paper on latent romanization in multilingual LLMs - Paper
- 25 Sep 2024: Paper on vocabulary expansion and initialization strategies for LLMs accepted to CoNLL 2024 - Paper
- 13 Jun 2024: VerityMath paper accepted to AI4Math workshop at ICML 2024 - Paper
- 16 May 2024: Two papers accepted to ACL: RomanSetu and a paper on Indic MT Eval - RomanSetu Preprint - Indic MT Eval Preprint
- 25 Jan 2024: Introducing Airavata, Hindi Instruction-tuned LLM - Blog
- 24 Jan 2024: RomanSetu for unlocking multilingual capabilities of Large Language Models via Romanization - Preprint
- 21 Nov 2023: IndicTrans2 is accepted to Transactions of Machine Learning Research (TMLR) - Preprint
- 13 Nov 2023: VerityMath for applying unit consistency check for math problem solving - Preprint
- 9 Oct 2023: Two papers accepted to EMNLP. DecoMT is accepted to Main and CTQScorer to Findings - DecoMT Preprint - CTQScorer Preprint
Selected Publications
For my latest publications, please visit my Google Scholar profile.
- RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs Alan Saji, Jaavid Aktar Husain, Thanmay Jayakumar, Raj Dabre, Anoop Kunchukuttan, Ratish Puduppully. In Findings of the 63rd Annual Meeting of the Association for Computational Linguistics. 2025.
- Improving Genomic Models via Task-Specific Self-Pretraining Sohan Mupparapu, Parameswari Krishnamurthy, Ratish Puduppully. In Proceedings of the Workshop on Generative AI for Biology at the 42nd International Conference on Machine Learning. 2025.
- RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization Jaavid Aktar Husain, Raj Dabre, Aswanth Kumar, Ratish Puduppully, Anoop Kunchukuttan. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics. 2024.
- VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency Vernon Toh, Ratish Puduppully, Nancy F. Chen. In AI4Math Workshop at ICML. 2024.
- Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models Code
Ratish Puduppully, Anoop Kunchukuttan, Raj Dabre, Ai Ti Aw, Nancy F. Chen. In Proceedings of EMNLP. 2023. - IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages Code
AI4Bharat, Jay Gala, Pranjal A. Chitale, Raghavan AK, Sumanth Doddapaneni, Varun Gumma, Aswanth Kumar, Janki Nawale, Anupama Sujatha, Ratish Puduppully, Vivek Raghavan, Pratyush Kumar, Mitesh M. Khapra, Raj Dabre, Anoop Kunchukuttan. In Transactions of Machine Learning Research (TMLR). 2023. - Multi-Document Summarization with Centroid-Based Pretraining Code
Ratish Surendran Puduppully, Parag Jain, Nancy Chen, Mark Steedman. In Proceedings of ACL. 2023. - Data-to-text Generation with Variational Sequential Planning. Code
Ratish Puduppully and Yao Fu and Mirella Lapata. In Transactions of the Association for Computational Linguistics (TACL). 2022. - Data-to-text Generation with Macro Planning. Code
Ratish Puduppully and Mirella Lapata. In Transactions of the Association for Computational Linguistics (TACL). 2021. - Data-to-text Generation with Entity Modeling. Code
Ratish Puduppully, Li Dong, Mirella Lapata. In Proceedings of ACL. 2019. - Data-to-Text Generation with Content Selection and Planning Code
Ratish Puduppully, Li Dong, Mirella Lapata. In Proceedings of AAAI. 2019.