Yifan Du (都一凡)

I am a Ph.D. student at the Gaoling School of Artificial Intelligence, Renmin University of China, and I have the fortune of being advised by Prof. Wayne Xin Zhao. My primary research interests are centered around vision-language, with a particular focus on Multimodal Large Language Models (MLLMs). I welcome communication, please feel free to drop me an email. :)

Publication

  • What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning (Preprint)
    Yifan Du†, Hangyu Guo†, Kun Zhou†, Wayne Xin Zhao, Jinpeng Wang, Chuyuan Wang, Mingchen Cai, Ruihua Song, Ji-Rong Wen
    [pdf] [code]

  • Evaluating Object Hallucination in Large Vision-Language Models (EMNLP 2023)
    Yifan Li†, Yifan Du†, Kun Zhou†, Jinpeng Wang, Wayne Xin Zhao, Ji-Rong Wen
    [pdf] [code]

  • A Survey of Large Language Models
    Wayne Xin Zhao, Kun Zhou†, Junyi Li†, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-Yun Nie, Ji-Rong Wen
    [pdf] [code]

  • Zero-shot Visual Question Answering with Language Model Feedback (ACL 2023 Findings)
    Yifan Du, Junyi Li, Tianyi Tang, Wayne Xin Zhao, Ji-Rong Wen
    [pdf] [code]

  • Learning to Imagine: Visually-Augmented Natural Language Generation (ACL 2023)
    Tianyi Tang, Yushuo Chen, Yifan Du, Junyi Li, Wayne Xin Zhao, Ji-Rong Wen
    [pdf] [code]

  • A Survey of Vision-Language Pre-Trained Models (IJCAI 2022)
    Yifan Du†, Zikang Liu†, Junyi Li, Wayne Xin Zhao
    [pdf] [code]