2026

HERO: Hindsight-Enhanced Reflection from Environment Observations for Agentic Self-Distillation
HERO: Hindsight-Enhanced Reflection from Environment Observations for Agentic Self-Distillation

Haoran Liu*, Yuwei Zhang*, Xiyao Li, Bohan Lyu, Jingbo Shang (* equal contribution)

Preprint 2026

HERO turns next environment observations into hindsight reflection signals for multi-turn agent self-distillation, improving task success and reducing unnecessary tool-use turns under limited training budgets.

HERO: Hindsight-Enhanced Reflection from Environment Observations for Agentic Self-Distillation

Haoran Liu*, Yuwei Zhang*, Xiyao Li, Bohan Lyu, Jingbo Shang (* equal contribution)

Preprint 2026

HERO turns next environment observations into hindsight reflection signals for multi-turn agent self-distillation, improving task success and reducing unnecessary tool-use turns under limited training budgets.

CoMem: Context Management with A Decoupled Long-Context Model
CoMem: Context Management with A Decoupled Long-Context Model

Yuwei Zhang, Chengyu Dong, Shuowei Jin, Changlong Yu, Hejie Cui, Hongye Jin, Xinyang Zhang, Hamed Bonab, Colin Lockard, Jianshu Chen, Zhenyu Shi, Jingbo Shang, Xian Li, Bing Yin

ICML 2026

CoMem decouples agent reasoning from memory summarization so long-horizon agents can preserve most long-context performance while reducing latency through asynchronous context management.

CoMem: Context Management with A Decoupled Long-Context Model

Yuwei Zhang, Chengyu Dong, Shuowei Jin, Changlong Yu, Hejie Cui, Hongye Jin, Xinyang Zhang, Hamed Bonab, Colin Lockard, Jianshu Chen, Zhenyu Shi, Jingbo Shang, Xian Li, Bing Yin

ICML 2026

CoMem decouples agent reasoning from memory summarization so long-horizon agents can preserve most long-context performance while reducing latency through asynchronous context management.

ChipMATE: Multi-Agent Training via Reinforcement Learning for Enhanced RTL Generation
ChipMATE: Multi-Agent Training via Reinforcement Learning for Enhanced RTL Generation

Zhongkai Yu, Yichen Lin, Chenyang Zhou, Yuwei Zhang, Kun Zhou, Junxia Cui, Haotian Ye, Zhengding Hu, Zaifeng Pan, Ruiyi Wang, Yujie Zhao, Hejia Zhang, Jingbo Shang, Jishen Zhao, Yufei Ding

Preprint 2026

ChipMATE trains Verilog and Python reference-model agents to mutually verify RTL generation without a golden oracle, improving deployable hardware-code generation for industrial settings.

ChipMATE: Multi-Agent Training via Reinforcement Learning for Enhanced RTL Generation

Zhongkai Yu, Yichen Lin, Chenyang Zhou, Yuwei Zhang, Kun Zhou, Junxia Cui, Haotian Ye, Zhengding Hu, Zaifeng Pan, Ruiyi Wang, Yujie Zhao, Hejia Zhang, Jingbo Shang, Jishen Zhao, Yufei Ding

Preprint 2026

ChipMATE trains Verilog and Python reference-model agents to mutually verify RTL generation without a golden oracle, improving deployable hardware-code generation for industrial settings.

Learning with Rare Success but Rich Feedback via Reflection-Enhanced Self-Distillation
Learning with Rare Success but Rich Feedback via Reflection-Enhanced Self-Distillation

Yuwei Zhang, Sha Li, Changlong Yu, Qin Lu, Shuowei Jin, Chengyu Dong, Haoran Liu, Ilgee Hong, Xintong Li, Zhenyu Shi, Bing Yin, Jingbo Shang

Preprint 2026

RESD turns failed rollouts into reflection-based supervision and reusable playbook knowledge, letting models improve efficiently even when successful rollouts are rare.

Learning with Rare Success but Rich Feedback via Reflection-Enhanced Self-Distillation

Yuwei Zhang, Sha Li, Changlong Yu, Qin Lu, Shuowei Jin, Chengyu Dong, Haoran Liu, Ilgee Hong, Xintong Li, Zhenyu Shi, Bing Yin, Jingbo Shang

Preprint 2026

RESD turns failed rollouts into reflection-based supervision and reusable playbook knowledge, letting models improve efficiently even when successful rollouts are rare.

Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection

Yuwei Zhang, Wenhao Yu, Shangbin Feng, Yifan Zhu, Letian Peng, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang

ACL 2026 Oral

WikiDYK benchmarks real-world knowledge injection from fresh Wikipedia facts and shows bidirectional language models memorize injected knowledge more reliably than causal LMs.

Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection

Yuwei Zhang, Wenhao Yu, Shangbin Feng, Yifan Zhu, Letian Peng, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang

ACL 2026 Oral

WikiDYK benchmarks real-world knowledge injection from fresh Wikipedia facts and shows bidirectional language models memorize injected knowledge more reliably than causal LMs.

2025

MaPPO: Maximum a posteriori preference optimization with prior knowledge

Guangchen Lan, Sipeng Zhang, Tianle Wang, Yuwei Zhang, Daoan Zhang, Xinpeng Wei, Xiaoman Pan, Hongming Zhang, Dong-Jun Han, Christopher G Brinton

Preprint 2025

MaPPO reframes preference optimization as a maximum-a-posteriori objective that injects prior reward knowledge to improve alignment without extra hyperparameters.

MaPPO: Maximum a posteriori preference optimization with prior knowledge

Guangchen Lan, Sipeng Zhang, Tianle Wang, Yuwei Zhang, Daoan Zhang, Xinpeng Wei, Xiaoman Pan, Hongming Zhang, Dong-Jun Han, Christopher G Brinton

Preprint 2025

MaPPO reframes preference optimization as a maximum-a-posteriori objective that injects prior reward knowledge to improve alignment without extra hyperparameters.

Attention Reveals More Than Tokens: Training-Free Long-Context Reasoning with Attention-guided Retrieval

Yuwei Zhang, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang

Preprint 2025

Attrieval uses attention from chain-of-thought tokens as a training-free signal to retrieve implicit facts that long-context LLMs miss during multi-step reasoning.

Attention Reveals More Than Tokens: Training-Free Long-Context Reasoning with Attention-guided Retrieval

Yuwei Zhang, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang

Preprint 2025

Attrieval uses attention from chain-of-thought tokens as a training-free signal to retrieve implicit facts that long-context LLMs miss during multi-step reasoning.

Toward Multi-Session Personalized Conversation: A Large-Scale Dataset and Hierarchical Tree Framework for Implicit Reasoning

Xintong Li, Jalend Bantupalli, Ria Dharmani, Yuwei Zhang, Jingbo Shang

EMNLP 2025 Oral

ImplexConv and TaciTree target implicit personalization across many conversation sessions by structuring memory into a hierarchy that supports efficient, fine-grained reasoning.

Toward Multi-Session Personalized Conversation: A Large-Scale Dataset and Hierarchical Tree Framework for Implicit Reasoning

Xintong Li, Jalend Bantupalli, Ria Dharmani, Yuwei Zhang, Jingbo Shang

EMNLP 2025 Oral

ImplexConv and TaciTree target implicit personalization across many conversation sessions by structuring memory into a hierarchy that supports efficient, fine-grained reasoning.

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

Zilong Wang, Zifeng Wang, Long Le, Steven Zheng, Swaroop Mishra, Vincent Perot, Yuwei Zhang, Anush Mattapalli, Ankur Taly, Jingbo Shang, Chen-Yu Lee, Tomas Pfister

ICLR 2025

Speculative RAG speeds up retrieval-augmented generation by letting a small specialist draft evidence-grounded answers in parallel while a larger model verifies them once.

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting

Zilong Wang, Zifeng Wang, Long Le, Steven Zheng, Swaroop Mishra, Vincent Perot, Yuwei Zhang, Anush Mattapalli, Ankur Taly, Jingbo Shang, Chen-Yu Lee, Tomas Pfister

ICLR 2025

Speculative RAG speeds up retrieval-augmented generation by letting a small specialist draft evidence-grounded answers in parallel while a larger model verifies them once.

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Di Wu, Hongwei Wang, Wenhao Yu, Yuwei Zhang, Kai-Wei Chang, Dong Yu

ICLR 2025

LongMemEval stress-tests chat assistants on long-term memory abilities and motivates retrieval and indexing optimizations for sustained multi-session conversations.

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Di Wu, Hongwei Wang, Wenhao Yu, Yuwei Zhang, Kai-Wei Chang, Dong Yu

ICLR 2025

LongMemEval stress-tests chat assistants on long-term memory abilities and motivates retrieval and indexing optimizations for sustained multi-session conversations.

2024

Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute Manipulation

Letian Peng, Yuwei Zhang, Jingbo Shang

ACL Findings 2024

Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute Manipulation

Letian Peng, Yuwei Zhang, Jingbo Shang

ACL Findings 2024

Answer is All You Need: Instruction-following Text Embedding via Answering the Question

Letian Peng*, Yuwei Zhang*, Zilong Wang, Jayanth Srinivasa, Gaowen Liu, Zihan Wang, Jingbo Shang (* equal contribution)

ACL 2024

Answer is All You Need: Instruction-following Text Embedding via Answering the Question

Letian Peng*, Yuwei Zhang*, Zilong Wang, Jayanth Srinivasa, Gaowen Liu, Zihan Wang, Jingbo Shang (* equal contribution)

ACL 2024

Can your model tell a negation from an implicature? Unravelling challenges with intent encoders

Yuwei Zhang, Siffi Singh, Sailik Sengupta, Igor Shalyminov, Hang Su, Hwanjun Song, Saab Mansour

ACL 2024

Can your model tell a negation from an implicature? Unravelling challenges with intent encoders

Yuwei Zhang, Siffi Singh, Sailik Sengupta, Igor Shalyminov, Hang Su, Hwanjun Song, Saab Mansour

ACL 2024

2023

ClusterLLM: Large Language Models as a Guide for Text Clustering

Yuwei Zhang, Zihan Wang, Jingbo Shang

EMNLP 2023

ClusterLLM: Large Language Models as a Guide for Text Clustering

Yuwei Zhang, Zihan Wang, Jingbo Shang

EMNLP 2023

Toward Unsupervised Realistic Visual Question Answering

Yuwei Zhang*, Chih-Hui Ho*, Nuno Vasconcelos (* equal contribution)

ICCV 2023

Toward Unsupervised Realistic Visual Question Answering

Yuwei Zhang*, Chih-Hui Ho*, Nuno Vasconcelos (* equal contribution)

ICCV 2023

2022

Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pretraining and Isotropization

Haode Zhang, Haowen Liang, Yuwei Zhang, Liming Zhan, Xiao-Ming Wu, Xiaolei Lu, Albert Y.S. Lam

NAACL 2022 Oral

Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pretraining and Isotropization

Haode Zhang, Haowen Liang, Yuwei Zhang, Liming Zhan, Xiao-Ming Wu, Xiaolei Lu, Albert Y.S. Lam

NAACL 2022 Oral

New Intent Discovery with Pre-training and Contrastive Learning

Yuwei Zhang, Haode Zhang, Li-Ming Zhan, Xiao-Ming Wu, Albert Y.S. Lam

ACL 2022

New Intent Discovery with Pre-training and Contrastive Learning

Yuwei Zhang, Haode Zhang, Li-Ming Zhan, Xiao-Ming Wu, Albert Y.S. Lam

ACL 2022

Development and validation of a screening model for lung cancer using machine learning: a large-scale, multi-center study of biomarkers in breath

Jing Li*, Yuwei Zhang*, Qing Chen, Zhenhua Pan, Jun Chen, Meixiu Sun, Junfeng Wang, Yingxin Li, Qing Ye (* equal contribution)

Frontiers in Oncology 2022

Development and validation of a screening model for lung cancer using machine learning: a large-scale, multi-center study of biomarkers in breath

Jing Li*, Yuwei Zhang*, Qing Chen, Zhenhua Pan, Jun Chen, Meixiu Sun, Junfeng Wang, Yingxin Li, Qing Ye (* equal contribution)

Frontiers in Oncology 2022

Automated Calculation of Liquid Crystal Sensing Images Based on Deep Learning

Yuwei Zhang*, Shuai Xu*, Ronghua Zhang, Zhichao Deng, Yin Liu, Jianguo Tian, Li Yu, Qiongzheng Hu, Qing Ye (* equal contribution)

Analytical Chemistry 2022

Automated Calculation of Liquid Crystal Sensing Images Based on Deep Learning

Yuwei Zhang*, Shuai Xu*, Ronghua Zhang, Zhichao Deng, Yin Liu, Jianguo Tian, Li Yu, Qiongzheng Hu, Qing Ye (* equal contribution)

Analytical Chemistry 2022

Research on assistant diagnosis of fundus optic neuropathy based on deep learning

Chengjin Wang*, Yuwei Zhang*, Shuai Xu, Yuyan Liu, Lindan Xie, Changlong Wu, Qianhui Yang, Yanhua Chu, Qing Ye (* equal contribution)

Current Eye Research 2022

Research on assistant diagnosis of fundus optic neuropathy based on deep learning

Chengjin Wang*, Yuwei Zhang*, Shuai Xu, Yuyan Liu, Lindan Xie, Changlong Wu, Qianhui Yang, Yanhua Chu, Qing Ye (* equal contribution)

Current Eye Research 2022

2021

Effectiveness of Pre-training for Few-shot Intent Classification

Haode Zhang*, Yuwei Zhang*, Li-Ming Zhan, Jiaxin Chen, Guangyuan Shi, Xiao-Ming Wu, Albert Y.S. Lam (* equal contribution)

EMNLP Findings 2021

Effectiveness of Pre-training for Few-shot Intent Classification

Haode Zhang*, Yuwei Zhang*, Li-Ming Zhan, Jiaxin Chen, Guangyuan Shi, Xiao-Ming Wu, Albert Y.S. Lam (* equal contribution)

EMNLP Findings 2021

KuraNet: Systems of Coupled Oscillators that Learn to Synchronize

Matthew Ricci, Minju Jung, Yuwei Zhang, Mathieu Chalvidal, Aneri Soni, Thomas Serre

Preprint 2021

KuraNet: Systems of Coupled Oscillators that Learn to Synchronize

Matthew Ricci, Minju Jung, Yuwei Zhang, Mathieu Chalvidal, Aneri Soni, Thomas Serre

Preprint 2021