(* indicates equal contribution)
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Yiheng Xu*, Zekun Wang*, Junli Wang*, Dunjie Lu, Tianbao Xie, Amrita Saha, Doyen Sahoo, Tao Yu, Caiming Xiong
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
Yiheng Xu*, Dunjie Lu*, Zhennan Shen*, Junli Wang, Zekun Wang, Yuchen Mao, Caiming Xiong, Tao Yu
Lemur: Harmonizing Natural Language and Code for Language Agents
Yiheng Xu*, Hongjin Su*, Chen Xing*, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu
ICLR 2024 Spotlight (Top 5%)
[PDF]
[Code]
[Model]
[Blog]
DiT: Self-supervised Pre-training for Document Image Transformer
Junlong Li, Yiheng Xu, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei
ACM Multimedia 2022
[PDF]
[Code]
[Demo]
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding
Junlong Li*, Yiheng Xu*, Lei Cui, Furu Wei
ACL 2022
[PDF]
[Code]
[Model]
[Blog]
XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding
Yiheng Xu, Tengchao Lv, Lei Cui, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Furu Wei
ACL 2022 Findings
[PDF]
[Data]
LayoutReader: Pre-training of Text and Layout for Reading Order Detection
Zilong Wang, Yiheng Xu, Lei Cui, Jingbo Shang, Furu Wei
EMNLP 2021
[PDF]
[Code]
[Blog]
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Yang Xu*, Yiheng Xu*, Tengchao Lv*, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Wanxiang Che, Min Zhang, Lidong Zhou
ACL 2021
[PDF]
[Code]
[Model]
[Blog]
DocBank: A Benchmark Dataset for Document Layout Analysis
Minghao Li*, Yiheng Xu*, Lei Cui, Shaohan Huang, Furu Wei, Zhoujun Li, Ming Zhou
COLING 2020
[PDF]
[Code]
[Blog]
LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Yiheng Xu*, Minghao Li*, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou
KDD 2020
[PDF]
[Code]
[Model]
[Blog]
[PaperDigest Most Influential Papers]
Graph Convolutional Networks with Markov Random Field Reasoning for Social Spammer Detection
Yongji Wu, Defu Lian, Yiheng Xu, Le Wu, Enhong Chen
AAAI 2020
[PDF]
Reviewer: AAAI, ACL, CCL, COLING, EMNLP, NLPCC
Powered by Jekyll and Minimal Light theme.