Zisu Huang
I am a first-year M.S. student in Computer Science at Fudan University, and a member of Fudan NLP Group, advised by A.P. Xiaoqing Zheng and Prof. Xuanjing Huang. I received my B.E. in Software Engineering from Fudan University in 2025.
I am currently a research intern in the Visual Computing Group at Microsoft Research Asia, working with Yifan Yang.
My research interests center on AI agents, with a particular focus on:
- Personalized agents: developing more effective personalized agents through both model-level adaptation and harness/system design.
- Agent evolution:
- Parametric evolution: improving agentic capabilities through parameter updates, e.g., agentic RL.
- Non-parametric evolution: expanding the capability boundary of agents in a training-free manner, e.g., through agent skills, orchestration, and harness design.
Feel free to reach out if youโd like to chat about research or collaborate.
News
| May 01, 2026 | Our paper TRIP-Bench has been accepted to ICML 2026. ๐ |
|---|---|
| Apr 16, 2026 | Check out our latest survey on reward hacking: Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges! ๐ฅ |
| Apr 07, 2026 | Our paper SteeM has been accepted to ACL 2026 (Main Conference). ๐ |
| Mar 03, 2026 | Started a research internship at the Visual Computing Group, Microsoft Research Asia (Shanghai), working with Yifan Yang. ๐ |
| Jan 26, 2026 | Our paper RECAST has been accepted to ICLR 2026. ๐ |
| Aug 21, 2025 | Two papers accepted to EMNLP 2025 โ SATER and FedQSN. ๐ |
Survey
- Survey
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, ChallengesarXiv preprint, 2026
Selected Publications
See the full publication list โ
- ACL 2026
Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent InteractionAnnual Meeting of the Association for Computational Linguistics (ACL), 2026Main Conference - ICML 2026
TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World ScenariosInternational Conference on Machine Learning (ICML), 2026Poster - arXiv
Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query RefinementarXiv preprint, 2024 - ICLR 2026
RECAST: Strengthening LLMsโ Complex Instruction Following with Constraint-Verifiable DataInternational Conference on Learning Representations (ICLR), 2026Poster
Miscellaneous
Recently, I have really enjoyed playing tennis ๐พ and have been working hard to improve my game. My favorite player is Novak Djokovic.