Manan Suri
PhD in Computer Science at University of Maryland, College Park

I am a PhD student in Computer Science at the University of Maryland, College Park, advised by Prof. Dinesh Manocha at the GAMMA Lab.
My research focuses on grounding language models through retrieval, attribution, agents, and reasoning, with an emphasis on semi-structured and multimodal data such as documents, tables, and charts. I am particularly interested in how context can be constructed and used effectively in generation, decision-making, and task-oriented agents.
Previously, I worked on greenwashing detection as a Data Science for Social Good Fellow at the University of Warwick, collaborating with the Algorithmic Transparency Institute. I also contributed to fact attribution and document retrieval systems at Scalenut.
news
May 15, 2025 | Our paper “ChartLens: Fine-grained Visual Attribution in Charts” was accepted at ACL 2025, main conference! |
---|---|
Jan 22, 2025 | Our paper “VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation” was accepted at NAACL 2025, main conference! |
Nov 20, 2024 | I presented our paper “DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding” at EMNLP 2024, in Miami! I was also a Volunteer Coordinator at the conference. |
Nov 1, 2024 | Served as a reviewer for ARR October 2024 Cycle (NAACL), and ICASSP 2025. |
Sep 20, 2024 | Our paper “DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding” got accepted at EMNLP 2024, main conference! |
selected publications
2025
- VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented GenerationIn Proceedings of the 2025 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Apr 2025
2024
- Doc2Command: Furthering Language Guided Document EditingIn The Second Tiny Papers Track at ICLR 2024, Apr 2024
- DocEdit-v2: Document Structure Editing Via Multimodal LLM GroundingIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
2023
- ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NERIn Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2023
- CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic NetworkIn Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023
- I Don’t Feel so Good! Detecting Depressive Tendencies Using Transformer-Based Multimodal FrameworksIn Proceedings of the 2022 5th International Conference on Machine Learning and Natural Language Processing, Dec 2023
2022
- Boosting Pre-trained Language Models with Task Specific Metadata and Cost Sensitive LearningIn Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), Jul 2022