
Haoran Liu*, Yuwei Zhang*, Xiyao Li, Bohan Lyu, Jingbo Shang (* equal contribution)
Preprint 2026
HERO turns next environment observations into hindsight reflection signals for multi-turn agent self-distillation, improving task success and reducing unnecessary tool-use turns under limited training budgets.
Haoran Liu*, Yuwei Zhang*, Xiyao Li, Bohan Lyu, Jingbo Shang (* equal contribution)
Preprint 2026
HERO turns next environment observations into hindsight reflection signals for multi-turn agent self-distillation, improving task success and reducing unnecessary tool-use turns under limited training budgets.

Yuwei Zhang, Chengyu Dong, Shuowei Jin, Changlong Yu, Hejie Cui, Hongye Jin, Xinyang Zhang, Hamed Bonab, Colin Lockard, Jianshu Chen, Zhenyu Shi, Jingbo Shang, Xian Li, Bing Yin
ICML 2026
CoMem decouples agent reasoning from memory summarization so long-horizon agents can preserve most long-context performance while reducing latency through asynchronous context management.
Yuwei Zhang, Chengyu Dong, Shuowei Jin, Changlong Yu, Hejie Cui, Hongye Jin, Xinyang Zhang, Hamed Bonab, Colin Lockard, Jianshu Chen, Zhenyu Shi, Jingbo Shang, Xian Li, Bing Yin
ICML 2026
CoMem decouples agent reasoning from memory summarization so long-horizon agents can preserve most long-context performance while reducing latency through asynchronous context management.

Zhongkai Yu, Yichen Lin, Chenyang Zhou, Yuwei Zhang, Kun Zhou, Junxia Cui, Haotian Ye, Zhengding Hu, Zaifeng Pan, Ruiyi Wang, Yujie Zhao, Hejia Zhang, Jingbo Shang, Jishen Zhao, Yufei Ding
Preprint 2026
ChipMATE trains Verilog and Python reference-model agents to mutually verify RTL generation without a golden oracle, improving deployable hardware-code generation for industrial settings.
Zhongkai Yu, Yichen Lin, Chenyang Zhou, Yuwei Zhang, Kun Zhou, Junxia Cui, Haotian Ye, Zhengding Hu, Zaifeng Pan, Ruiyi Wang, Yujie Zhao, Hejia Zhang, Jingbo Shang, Jishen Zhao, Yufei Ding
Preprint 2026
ChipMATE trains Verilog and Python reference-model agents to mutually verify RTL generation without a golden oracle, improving deployable hardware-code generation for industrial settings.

Yuwei Zhang, Sha Li, Changlong Yu, Qin Lu, Shuowei Jin, Chengyu Dong, Haoran Liu, Ilgee Hong, Xintong Li, Zhenyu Shi, Bing Yin, Jingbo Shang
Preprint 2026
RESD turns failed rollouts into reflection-based supervision and reusable playbook knowledge, letting models improve efficiently even when successful rollouts are rare.
Yuwei Zhang, Sha Li, Changlong Yu, Qin Lu, Shuowei Jin, Chengyu Dong, Haoran Liu, Ilgee Hong, Xintong Li, Zhenyu Shi, Bing Yin, Jingbo Shang
Preprint 2026
RESD turns failed rollouts into reflection-based supervision and reusable playbook knowledge, letting models improve efficiently even when successful rollouts are rare.

Yuwei Zhang, Wenhao Yu, Shangbin Feng, Yifan Zhu, Letian Peng, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang
ACL 2026 Oral
WikiDYK benchmarks real-world knowledge injection from fresh Wikipedia facts and shows bidirectional language models memorize injected knowledge more reliably than causal LMs.
Yuwei Zhang, Wenhao Yu, Shangbin Feng, Yifan Zhu, Letian Peng, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang
ACL 2026 Oral
WikiDYK benchmarks real-world knowledge injection from fresh Wikipedia facts and shows bidirectional language models memorize injected knowledge more reliably than causal LMs.
Guangchen Lan, Sipeng Zhang, Tianle Wang, Yuwei Zhang, Daoan Zhang, Xinpeng Wei, Xiaoman Pan, Hongming Zhang, Dong-Jun Han, Christopher G Brinton
Preprint 2025
MaPPO reframes preference optimization as a maximum-a-posteriori objective that injects prior reward knowledge to improve alignment without extra hyperparameters.
Guangchen Lan, Sipeng Zhang, Tianle Wang, Yuwei Zhang, Daoan Zhang, Xinpeng Wei, Xiaoman Pan, Hongming Zhang, Dong-Jun Han, Christopher G Brinton
Preprint 2025
MaPPO reframes preference optimization as a maximum-a-posteriori objective that injects prior reward knowledge to improve alignment without extra hyperparameters.
Yuwei Zhang, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang
Preprint 2025
Attrieval uses attention from chain-of-thought tokens as a training-free signal to retrieve implicit facts that long-context LLMs miss during multi-step reasoning.
Yuwei Zhang, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang
Preprint 2025
Attrieval uses attention from chain-of-thought tokens as a training-free signal to retrieve implicit facts that long-context LLMs miss during multi-step reasoning.
Xintong Li, Jalend Bantupalli, Ria Dharmani, Yuwei Zhang, Jingbo Shang
EMNLP 2025 Oral
ImplexConv and TaciTree target implicit personalization across many conversation sessions by structuring memory into a hierarchy that supports efficient, fine-grained reasoning.
Xintong Li, Jalend Bantupalli, Ria Dharmani, Yuwei Zhang, Jingbo Shang
EMNLP 2025 Oral
ImplexConv and TaciTree target implicit personalization across many conversation sessions by structuring memory into a hierarchy that supports efficient, fine-grained reasoning.
Zilong Wang, Zifeng Wang, Long Le, Steven Zheng, Swaroop Mishra, Vincent Perot, Yuwei Zhang, Anush Mattapalli, Ankur Taly, Jingbo Shang, Chen-Yu Lee, Tomas Pfister
ICLR 2025
Speculative RAG speeds up retrieval-augmented generation by letting a small specialist draft evidence-grounded answers in parallel while a larger model verifies them once.
Zilong Wang, Zifeng Wang, Long Le, Steven Zheng, Swaroop Mishra, Vincent Perot, Yuwei Zhang, Anush Mattapalli, Ankur Taly, Jingbo Shang, Chen-Yu Lee, Tomas Pfister
ICLR 2025
Speculative RAG speeds up retrieval-augmented generation by letting a small specialist draft evidence-grounded answers in parallel while a larger model verifies them once.
Di Wu, Hongwei Wang, Wenhao Yu, Yuwei Zhang, Kai-Wei Chang, Dong Yu
ICLR 2025
LongMemEval stress-tests chat assistants on long-term memory abilities and motivates retrieval and indexing optimizations for sustained multi-session conversations.
Di Wu, Hongwei Wang, Wenhao Yu, Yuwei Zhang, Kai-Wei Chang, Dong Yu
ICLR 2025
LongMemEval stress-tests chat assistants on long-term memory abilities and motivates retrieval and indexing optimizations for sustained multi-session conversations.
Letian Peng, Yuwei Zhang, Jingbo Shang
ACL Findings 2024
Letian Peng, Yuwei Zhang, Jingbo Shang
ACL Findings 2024
Letian Peng*, Yuwei Zhang*, Zilong Wang, Jayanth Srinivasa, Gaowen Liu, Zihan Wang, Jingbo Shang (* equal contribution)
ACL 2024
Letian Peng*, Yuwei Zhang*, Zilong Wang, Jayanth Srinivasa, Gaowen Liu, Zihan Wang, Jingbo Shang (* equal contribution)
ACL 2024
Yuwei Zhang, Siffi Singh, Sailik Sengupta, Igor Shalyminov, Hang Su, Hwanjun Song, Saab Mansour
ACL 2024
Yuwei Zhang, Siffi Singh, Sailik Sengupta, Igor Shalyminov, Hang Su, Hwanjun Song, Saab Mansour
ACL 2024
Yuwei Zhang, Zihan Wang, Jingbo Shang
EMNLP 2023
Yuwei Zhang, Zihan Wang, Jingbo Shang
EMNLP 2023
Yuwei Zhang*, Chih-Hui Ho*, Nuno Vasconcelos (* equal contribution)
ICCV 2023
Yuwei Zhang*, Chih-Hui Ho*, Nuno Vasconcelos (* equal contribution)
ICCV 2023
Haode Zhang, Haowen Liang, Yuwei Zhang, Liming Zhan, Xiao-Ming Wu, Xiaolei Lu, Albert Y.S. Lam
NAACL 2022 Oral
Haode Zhang, Haowen Liang, Yuwei Zhang, Liming Zhan, Xiao-Ming Wu, Xiaolei Lu, Albert Y.S. Lam
NAACL 2022 Oral
Yuwei Zhang, Haode Zhang, Li-Ming Zhan, Xiao-Ming Wu, Albert Y.S. Lam
ACL 2022
Yuwei Zhang, Haode Zhang, Li-Ming Zhan, Xiao-Ming Wu, Albert Y.S. Lam
ACL 2022
Jing Li*, Yuwei Zhang*, Qing Chen, Zhenhua Pan, Jun Chen, Meixiu Sun, Junfeng Wang, Yingxin Li, Qing Ye (* equal contribution)
Frontiers in Oncology 2022
Jing Li*, Yuwei Zhang*, Qing Chen, Zhenhua Pan, Jun Chen, Meixiu Sun, Junfeng Wang, Yingxin Li, Qing Ye (* equal contribution)
Frontiers in Oncology 2022
Yuwei Zhang*, Shuai Xu*, Ronghua Zhang, Zhichao Deng, Yin Liu, Jianguo Tian, Li Yu, Qiongzheng Hu, Qing Ye (* equal contribution)
Analytical Chemistry 2022
Yuwei Zhang*, Shuai Xu*, Ronghua Zhang, Zhichao Deng, Yin Liu, Jianguo Tian, Li Yu, Qiongzheng Hu, Qing Ye (* equal contribution)
Analytical Chemistry 2022
Chengjin Wang*, Yuwei Zhang*, Shuai Xu, Yuyan Liu, Lindan Xie, Changlong Wu, Qianhui Yang, Yanhua Chu, Qing Ye (* equal contribution)
Current Eye Research 2022
Chengjin Wang*, Yuwei Zhang*, Shuai Xu, Yuyan Liu, Lindan Xie, Changlong Wu, Qianhui Yang, Yanhua Chu, Qing Ye (* equal contribution)
Current Eye Research 2022
Haode Zhang*, Yuwei Zhang*, Li-Ming Zhan, Jiaxin Chen, Guangyuan Shi, Xiao-Ming Wu, Albert Y.S. Lam (* equal contribution)
EMNLP Findings 2021
Haode Zhang*, Yuwei Zhang*, Li-Ming Zhan, Jiaxin Chen, Guangyuan Shi, Xiao-Ming Wu, Albert Y.S. Lam (* equal contribution)
EMNLP Findings 2021
Matthew Ricci, Minju Jung, Yuwei Zhang, Mathieu Chalvidal, Aneri Soni, Thomas Serre
Preprint 2021
Matthew Ricci, Minju Jung, Yuwei Zhang, Mathieu Chalvidal, Aneri Soni, Thomas Serre
Preprint 2021