About Me
I’m Keqing, currently working at Meituan LLM Team. My professional experience includes reasoning models(like o1), mixture of experts(MOE) and LLM alignment. Before that, I also participate in dialogue systems, including end2end dialogue system and dialogue pretrain.
My research interests focus on LLM, including:
- complex reasoning: Complex reasoning abilities are a key milestone in the development of LLMs, and the rise of reasoning models has rapidly advanced the field. My focus is on the evolution of foundational models and the optimization of Long-COT RL. For reasoning models, we need to build new technical pipelines—innovating from pre-training to post-training, from data to algorithms—to push the boundaries of what’s possible.
- reinforcement learning in real-world settings: While LLMs have made impressive strides in reasoning tasks like code and math, they’ve yet to translate into real-world productivity. LLM-driven end-to-end agent systems — such as DeepResearch, GUI Agent, and Embodied Agent — offer an exciting and imaginative path forward. My core interest lies in reinforcement learning in real-world settings, pushing the limits of intelligence through interaction with dynamic environments.
- LLM alignment: Alignment is an essential process when working with LLMs to ensure that models align with human values. My primary focus is on scalable alignment learning, including data evaluation and optimization, as well as preference learning algorithms. This work is crucial for shaping models that are not only powerful but also ethically sound and aligned with our goals.
News
- 2025.1: We have one papers accepted by ICLR2025
- 2024.6: We have two papers accepted by EMNLP2024
- 2024.2: We have two papers accepted by COLING2024, one paper accepted by ACL2024
- 2023.12: We have one paper accepted by ICLR2023
- 2023.10: We have four papers accepted by EMNLP2023
- 2023.5: We have four papers accepted by ACL2023
- 2022.12: We have four papers accepted by EMNLP2022
- 2022.8: We have three papers accepted by COLING2022
- 2022.8: We have one paper accepted by CIKM2022
- 2022.4: We have two papers accepted by NAACL2022
- 2022.3: We have one paper accepted by SIGIR2022
- 2022.2: We have one paper accepted by ACL2022
Experience
- Full employee in Meituan LLM Group, Mar 2023 - Now:
- Research area in reasoing models(like o1), mixture of experts(MOE) and LLM aligment.
- Full employee in Meituan NLP Group, Jun 2021 - Mar 2023:
- Research area in dialogue system and dialogue pretrain.
- Research Intern in Alibaba DAMO, Jun 2020 - Oct 2020:
- Research area in recommendation system.
- Research Intern in Tencent Wechat AI Lab, Mar 2020 - Jun 2020:
- Research area in zero-shot learning and slot filling.
- Research Intern in Meituan NLP Group, Oct 2019 - Mar 2020:
- Research area in GCN and dialogue system.
Education
2018-2021, Master in Artificial Intelligence, BEIJING UNIVERSITY OF POSTS AND TELECOMMUNICATIONS
2014-2018, Bachelor in Communication Engineering, BEIJING UNIVERSITY OF POSTS AND TELECOMMUNICATIONS
Publication
Please see the full paper list in Semantic Scholar
- SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild, Arxiv
- AgentRefine: Enhancing Agent Generalization through Refinement Tuning, ICLR2025
- DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning, ACL2024
- Yejie Wang*, Keqing He*, Mengdi Zhang, Jingang Wang, Xunliang Cai, Weiran Xu, etc
- paper
- How Do Your Code LLMs perform? Empowering Code Instruction Tuning with Really Good Data, EMNLP2024
- Yejie Wang*, Keqing He*, Jingang Wang, Mengdi Zhang, Xunliang Cai, Weiran Xu, etc
- paper
- Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models, EMNLP2024
- Siqi Wang, Zhengyu Chen, Bei Li, Keqing He, Min Zhang, Jingang Wang
- paper
- What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning, ICLR2023
- Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT, EMNLP2023
- DemoNSF: A Multi-task Demonstration-based Generative Framework for Noisy Slot Filling Task, EMNLP2023 Findings
- Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition, EMNLP2023 Findings
- APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection, EMNLP2023 Findings
- Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery, ACL2023
- FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue, ACL2023
- Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation, ACL2023
- Weihao Zeng*, Lulu Zhao*, Keqing He, Ruotong Geng, Jingang Wang, Wei Wu, Weiran Xu
- paper
- Generative Zero-Shot Prompt Learning for Cross-Domain Slot Filling with Inverse Prompting, ACL2023 Findings
- UniNL: Aligning Representation Learning with Scoring Function for OOD Detection via Unified Neighborhood Learning, EMNLP2022
- Watch the Neighbors: A Unified K-Nearest Neighbor Contrastive Learning Framework for OOD Intent Discovery, EMNLP2022 oral
- Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems, EMNLP2022 SereTOD Workshop (Championship of Track II)
- Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning, EMNLP2022 SereTOD Workshop
- Distribution Calibration for Out-of-Domain Detection with Bayesian Approximation, COLING2022
- PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling, COLING2022
- Guanting Dong*, Daichi Guo*, LiWen Wang*, Xuefeng Li*, Zechen Wang, Chen Zeng, Keqing He, Jinzheng Zhao, Hao Lei, Xinyue Cui, Yi Huang, Junlan Feng, Weiran Xu
- paper
- Unified Knowledge Prompt Pretraining for Customer Service Dialogues, CIKM2022
- Keqing He, Jingang Wang, Chaobo Sun, Wei Wu
- paper
- Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization, NAACL2022 oral
- Revisit Overconfidence for OOD Detection: Reassigned Contrastive Learning with Adaptive Class-dependent Threshold, NAACL2022
- ADPL: Adversarial Prompt-based Domain Adaptation for Dialogue Summarization with Knowledge Disentanglement, SIGIR2022
- Lulu Zhao*, Fujia Zheng*, Weihao Zeng, Keqing He, Ruotong Geng, Huixing Jiang, Wei Wu, Weiran Xu
- paper
- Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning, ACL2022
- Bridge to Target Domain by Prototypical Contrastive Learning and Label Confusion: Re-explore Zero-Shot Learning for Slot Filling, EMNLP2021 oral
- A Finer-grain Universal Dialogue Semantic Structures based Model For Abstractive Dialogue Summarization, EMNLP2021 Findings
- Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the Task-Oriented Dialogue System, ACL2021 oral
- Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning, ACL2021
- Scheduled Dialog Policy Learning: An Automatic Curriculum Learning Framework for Task-oriented Dialog System, ACL2021 Findings
- Sihong Liu, Jinchao Zhang, Keqing He, Weiran Xu and Jie Zhou
- paper
- Adversarial Self-Supervised Learning for Out-of-Domain Detection, NAACL2021 oral
- Dynamically Disentangling Social Bias from Task-Oriented Representations with Adversarial Attack, NAACL2021
- Contrastive Zero-Shot Learning for Cross-Domain Slot Filling with Adversarial Attack, COLING2020 oral
- Keqing He, Jinchao Zhang, Yuanmeng Yan, Weiran XU, Cheng Niu, Jie Zhou
- paper
- Syntactic Graph Convolution Network for Spoken Language Understanding, COLING2020
- Keqing He*, Shuyu Lei*, Jiangnan Xia, Yushu Yang, Huixing Jiang, Zhongyuan Wang
- paper
- A Deep Generative Distance-Based Classifier for Out-of-Domain Detection with Mahalanobis Space, COLING2020 oral
- Adversarial Semantic Decoupling for Recognizing Open-Vocabulary Slots, EMNLP2020 oral
- Learning to Tag OOV Tokens by Integrating Contextual Representation and Background Knowledge, ACL2020
- Keqing He, Yuanmeng Yan, Hong Xu, Sihong Liu, Weiran Xu
- paper
Contact
- Address: Beijing, China
- Email: helicbupt@gmail.com
- Blog: https://helicqin.github.io