publications | Zisu

2026

arXiv

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Yifan Yang^*, Ziyang Gong^*, Weiquan Huang^*, Qihao Yang^*, Ziwei Zhou^*, Zisu Huang^*, and 9 more authors

arXiv preprint, 2026

3k+ stars in week 1 · 1M+ views · 🤗 #1 Paper of the Day

arXiv

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Zisu Huang^*, Jingwen Xu^*, Yifan Yang, and 13 more authors

arXiv preprint, 2026

ACL 2026

Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction

Zisu Huang^*, Muzhao Tian^*, Xiaohua Wang, and 8 more authors

Annual Meeting of the Association for Computational Linguistics (ACL), 2026

Main Conference

ICML 2026

TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios

Yuanzhe Shen^*, Zisu Huang^*, Zhengyuan Wang^*, and 14 more authors

International Conference on Machine Learning (ICML), 2026

Poster

ICLR 2026

RECAST: Strengthening LLMs’ Complex Instruction Following with Constraint-Verifiable Data

Wenhao Liu, Zhengkang Guo, Mingchen Xie, Jingwen Xu, Zisu Huang, and 9 more authors

International Conference on Learning Representations (ICLR), 2026

Poster

arXiv

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Changze Lv, Jie Zhou, Wentao Zhao, Jingwen Xu, Zisu Huang, and 7 more authors

Under Review, 2026

arXiv

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Qi Qian, Chengsong Huang, Jingwen Xu, Changze Lv, Muling Wu, Wenhao Liu, Xiaohua Wang, Zhenghua Wang, Zisu Huang, and 7 more authors

Under Review, 2026

arXiv

CSSG: Measuring Code Similarity with Semantic Graphs

Jingwen Xu^*, Yiyang Lu^*, Changze Lv, Zisu Huang, and 5 more authors

Under Review, 2026

Survey

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Xiaohua Wang^*, Muzhao Tian^*, Yuqi Zeng^*, Zisu Huang^*, and 19 more authors

arXiv preprint, 2026

arXiv

BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-Translation

Jingwen Xu^*, Yiyang Lu^*, Zisu Huang, and 9 more authors

Under Review, 2026

arXiv

From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator

Xiaohua Wang, Jiakang Yuan, Zisu Huang, and 5 more authors

arXiv preprint, 2026

2025

EMNLP 2025

SATER: A Self-Aware and Token-Efficient Approach to Routing and Cascading

Yu Shen, Yihan Liu, Zisu Huang, and 3 more authors

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025

Main Conference

EMNLP 2025

Enhancing Model Privacy in Federated Learning with Random Masking and Quantization

Zhibo Xu, Jianhao Zhu, Jingwen Xu, Changze Lv, Zisu Huang, and 5 more authors

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025

Findings

arXiv

Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning

Muling Wu, Qi Qian, Wenhao Liu, Xiaohua Wang, Zisu Huang, and 10 more authors

arXiv preprint, 2025

arXiv

IntentionReasoner: Facilitating Adaptive LLM Safeguards through Intent Reasoning and Selective Query Refinement

Yu Shen, Zisu Huang, Zhengkang Guo, and 5 more authors

arXiv preprint, 2025

2024

arXiv

Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement

Zisu Huang^*, Xiaohua Wang^*, Feiran Zhang, and 5 more authors

arXiv preprint, 2024