Manan Suri

PhD in Computer Science at University of Maryland, College Park

manansuri.jpeg
5108, 8125 Paint Branch Dr College Park, MD 20742

I am a PhD student in Computer Science at the University of Maryland, College Park, advised by Prof. Dinesh Manocha at the GAMMA Lab.

My research focuses on grounding language models through retrieval, attribution, agents, and reasoning, with an emphasis on semi-structured and multimodal data such as documents, tables, and charts. I am particularly interested in how context can be constructed and used effectively in generation, decision-making, and task-oriented agents.

Previously, I worked on greenwashing detection as a Data Science for Social Good Fellow at the University of Warwick, collaborating with the Algorithmic Transparency Institute. I also contributed to fact attribution and document retrieval systems at Scalenut.

news

May 15, 2025 Our paper “ChartLens: Fine-grained Visual Attribution in Charts” was accepted at ACL 2025, main conference!
Jan 22, 2025 Our paper “VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation” was accepted at NAACL 2025, main conference!
Nov 20, 2024 I presented our paper “DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding” at EMNLP 2024, in Miami! I was also a Volunteer Coordinator at the conference.
Nov 1, 2024 Served as a reviewer for ARR October 2024 Cycle (NAACL), and ICASSP 2025.
Sep 20, 2024 Our paper “DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding” got accepted at EMNLP 2024, main conference!

selected publications

2025

  1. VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation
    Manan Suri, Puneet Mathur, Franck Dernoncourt, Kanika Goswami, Ryan A. Rossi, and Dinesh Manocha
    In Proceedings of the 2025 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Apr 2025

2024

  1. Doc2Command: Furthering Language Guided Document Editing
    Manan Suri, Puneet Mathur, Ramit Sawhney, Preslav Nakov, and Dinesh Manocha
    In The Second Tiny Papers Track at ICLR 2024, Apr 2024
  2. DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding
    Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I Morariu, Ramit Sawhney, Preslav Nakov, and Dinesh Manocha
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024

2023

  1. ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER
    Sreyan Ghosh, Utkarsh Tyagi, Manan Suri, Sonal Kumar, Ramaneswaran S, and Dinesh Manocha
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2023
  2. CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network
    Sreyan Ghosh, Manan Suri, Purva Chiniya, Utkarsh Tyagi, Sonal Kumar, and Dinesh Manocha
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023
  3. I Don’t Feel so Good! Detecting Depressive Tendencies Using Transformer-Based Multimodal Frameworks
    Manan Suri, Nalin Semwal, Divya ChaudharyIan Gorton, and Bijendra Kumar
    In Proceedings of the 2022 5th International Conference on Machine Learning and Natural Language Processing, Dec 2023

2022

  1. Boosting Pre-trained Language Models with Task Specific Metadata and Cost Sensitive Learning
    Manan Suri
    In Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), Jul 2022