news

May 2026 Excited to join Meta SuperIntelligence Lab as a Research Scientist Intern, working on multimodal LLM post-training and reasoning!
Apr 2026 Two papers accepted at ACL 2026: a Findings paper on a hierarchical visual agent with compact visual/text context for chart reasoning, and a Main Conference survey on thinking with images.
Mar 2026 Our Amazon work, Visual Reasoning through Tool-supervised Reinforcement Learning, was accepted to CVPR 2026 Findings and is now available on arXiv. If you find it interesting, feel free to upvote it on Hugging Face Papers.
Mar 2026 The code and data for our ICLR 2026 paper Ref-Adv: Exploring MLLM Visual Reasoning in Referring Expression Tasks are released at ref-adv.github.io.
Feb 2026 Our new preprint Fine-T2I is released: Fine-T2I: An Open, Large-Scale, and Diverse Dataset for High-Quality T2I Fine-Tuning. The dataset is available on HuggingFace and was trending #1 in datasets!