Publications

(2024). Autonomous Evaluation and Refinement of Digital Agents. COLM 2024.


(2024). OpenHands: An Open Platform for AI Software Developers as Generalist Agents. Preprint.


(2024). Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning. NeurIPS 2024.


(2024). DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning. Preprint.


(2024). ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL. ICML 2024.


(2023). Inversion-Free Image Editing with Natural Language. CVPR 2024.


(2023). Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans? EMNLP 2023.


(2023). SEAGULL: An Embodied Agent for Instruction Following through Situated Dialog. In Alexa Prize SimBot Challenge Proceedings.


(2022). Data-Efficient Learning of Natural Language to Linear Temporal Logic Translators for Robot Task Specification. International Conference on Robotics and Automation (ICRA) 2023.


(2022). DANLI: Deliberative Agent for Following Natural Language Instructions. EMNLP 2022.
