publications
Publications by category, in reverse chronological order. Generated by jekyll-scholar.
2026
- ACL 2026
  Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction
  Annual Meeting of the Association for Computational Linguistics (ACL), 2026
  Main Conference
- ICML 2026
  TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios
  International Conference on Machine Learning (ICML), 2026
  Poster
- ICLR 2026
  RECAST: Strengthening LLMs’ Complex Instruction Following with Constraint-Verifiable Data
  International Conference on Learning Representations (ICLR), 2026
  Poster
- arXiv
  Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation
  Under Review, 2026
- arXiv
  Benchmark^2: Systematic Evaluation of LLM Benchmarks
  Under Review, 2026
- arXiv
  CSSG: Measuring Code Similarity with Semantic Graphs
  Under Review, 2026
- Survey
  Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges
  arXiv preprint, 2026
- arXiv
  BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-Translation
  Under Review, 2026
2025
- EMNLP 2025
  SATER: A Self-Aware and Token-Efficient Approach to Routing and Cascading
  Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
  Main Conference
- EMNLP 2025
  Enhancing Model Privacy in Federated Learning with Random Masking and Quantization
  Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
  Findings
- arXiv
  Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning
  arXiv preprint, 2025
- arXiv
  IntentionReasoner: Facilitating Adaptive LLM Safeguards through Intent Reasoning and Selective Query Refinement
  arXiv preprint, 2025
2024
- arXiv
  Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement
  arXiv preprint, 2024