About Me

About Me

I am looking for talented and self-motivated research interns. Please contact me (kqin@bupt.cn) if you are interested in LLMs.

News

  • 2024.2: We have two papers accepted by COLING2024, including BootTOD: Bootstrap Task-oriented Dialogue Representations by Aligning Diverse Responses; Beyond the Known: Investigating LLMs Performance on Out-of-Domain Intent Detection
  • 2023.12: We have one paper accepted by ICLR2023, including What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
  • 2023.10: We have four papers accepted by EMNLP2023, including Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT; DemoNSF: A Multi-task Demonstration-based Generative Framework for Noisy Slot Filling Task; Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition; APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection
  • 2023.5: We have four papers accepted by ACL2023, including Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery; FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue; Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation; Generative Zero-Shot Prompt Learning for Cross-Domain Slot Filling with Inverse Prompting
  • 2022.12: We have four papers accepted by EMNLP2022, including UniNL: Aligning Representation Learning with Scoring Function for OOD Detection via Unified Neighborhood Learning; Watch the Neighbors: A Unified K-Nearest Neighbor Contrastive Learning Framework for OOD Intent Discovery; Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems; Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning
  • 2022.8: We have three papers accepted by COLING2022, including Generalized Intent Discovery: Learning from Open World Dialogue System; Distribution Calibration for Out-of-Domain Detection with Bayesian Approximation; PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling
  • 2022.8: We have one paper accepted by CIKM2022, including Unified Knowledge Prompt Pretraining for Customer Service Dialogues
  • 2022.4: We have two papers accepted by NAACL2022, including Revisit Overconfidence for OOD Detection: Reassigned Contrastive Learning with Adaptive Class-dependent Threshold; Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization.
  • 2022.3: We have one paper accepted by SIGIR2022, including ADPL: Adversarial Prompt-based Domain Adaptation for Dialogue Summarization with Knowledge Disentanglement.
  • 2022.2: We have one paper accepted by ACL2022, including Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning.

Research Area

Currently, I am working on Large Language Models(LLM), including pre-training, alignment and reasoning. I’m also interested in sparsity of LLMs, scaling of data, algorithm and infrastructure. Before that, I focused on neural conversational AI: natural language understanding, dialog policy learning, out-of-domain detection and dialogue summarization.

Education

  • 2021-Now, Working in Meituan Group, Beijing
  • 2018-2021, Master in Artificial Intelligence, BEIJING UNIVERSITY OF POSTS AND TELECOMMUNICATIONS
  • 2014-2018, Bachelor in Communication Engineering, BEIJING UNIVERSITY OF POSTS AND TELECOMMUNICATIONS

Experience

  1. Research Intern in Alibaba DAMO, Jun 2020 - Oct 2020

  2. Research Intern in Tencent Wechat AI Lab, Mar 2020 - Jun 2020

  3. Research Intern in Meituan NLP Group, Oct 2019 - Mar 2020

  4. Research and engineering Intern in GBSAA, IBM, SEP 2017 - FEB 2018

  5. Research Intern in BUPT PRIS LAB, MAR 2017 - SEP 2017

Publication

  1. What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning, ICLR2023

    • Wei Liu*, Weihao Zeng*, Keqing He, Yong Jiang, Junxian He
    • paper, code
  2. Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT, EMNLP2023

    • Xiaoshuai Song*, Keqing He*, Pei Wang, Guanting Dong, Yutao Mou, Jingang Wang, Yunsen Xian, Xunliang Cai, Weiran Xu
    • paper, code
  3. DemoNSF: A Multi-task Demonstration-based Generative Framework for Noisy Slot Filling Task, EMNLP2023 Findings

    • Guanting Dong*, Tingfeng Hui*, Zhuoma GongQue, Jinxu Zhao, Daichi Guo, Gang Zhao, Keqing He, Weiran Xu
    • paper, code
  4. Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition, EMNLP2023 Findings

    • Xiaoshuai Song*, Yutao Mou*, Keqing He*, Yueyan Qiu, Jinxu Zhao, Pei Wang, Weiran Xu
    • paper, code
  5. APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection, EMNLP2023 Findings

    • Pei Wang*, Keqing He*, Yutao Mou*, Xiaoshuai Song, Yanan Wu, Jingang Wang, Yunsen Xian, Xunliang Cai, Weiran Xu
    • paper, code
  6. Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery, ACL2023

    • Yutao Mou*, Xiaoshuai Song*, Keqing He*, Chen Zeng, Pei Wang, Jingang Wang, Yunsen Xian, Weiran Xu
    • paper, code
  7. FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue, ACL2023

    • Weihao Zeng*, Keqing He*, Yejie Wang, Chen Zeng, Jingang Wang, Yunsen Xian, Weiran Xu
    • paper, code
  8. Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation, ACL2023

    • Weihao Zeng*, Lulu Zhao*, Keqing He, Ruotong Geng, Jingang Wang, Wei Wu, Weiran Xu
    • paper
  9. Generative Zero-Shot Prompt Learning for Cross-Domain Slot Filling with Inverse Prompting, ACL2023 Findings

    • Xuefeng Li*, Liwen Wang*, Guanting Dong*, Keqing He, Jinzheng Zhao, Hao Lei, Jiachi Liu, Weiran Xu
    • paper, code
  10. UniNL: Aligning Representation Learning with Scoring Function for OOD Detection via Unified Neighborhood Learning, EMNLP2022

    • Yutao Mou*, Pei Wang*, Keqing He*, Yanan Wu, Jingang Wang, Wei Wu, Weiran Xu
    • paper, code
  11. Watch the Neighbors: A Unified K-Nearest Neighbor Contrastive Learning Framework for OOD Intent Discovery, EMNLP2022 oral

    • Yutao Mou*, Keqing He*, Pei Wang, Yanan Wu, Jingang Wang, Wei Wu, Weiran Xu
    • paper, code
  12. Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems, EMNLP2022 SereTOD Workshop (Championship of Track II)

    • Weihao Zeng*, Keqing He*, Zechen Wang*, Dayuan Fu, Guanting Dong, Ruotong Geng, Pei Wang, Jingang Wang, Chaobo Sun, Wei Wu, Weiran Xu
    • paper, code
  13. Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning, EMNLP2022 SereTOD Workshop

    • Yanan Wu*, Zhiyuan Zeng*, Keqing He*, Yutao Mou, Pei Wang, Yuanmeng Yan, Weiran Xu
    • paper, code
  14. Distribution Calibration for Out-of-Domain Detection with Bayesian Approximation, COLING2022

    • Yanan Wu*, Zhiyuan Zeng*, Keqing He*, Yutao Mou, Pei Wang, Weiran Xu
    • paper, code
  15. PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling, COLING2022

    • Guanting Dong*, Daichi Guo*, LiWen Wang*, Xuefeng Li*, Zechen Wang, Chen Zeng, Keqing He, Jinzheng Zhao, Hao Lei, Xinyue Cui, Yi Huang, Junlan Feng, Weiran Xu
    • paper
  16. Unified Knowledge Prompt Pretraining for Customer Service Dialogues, CIKM2022

    • Keqing He, Jingang Wang, Chaobo Sun, Wei Wu
    • paper
  17. Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization, NAACL2022 oral

    • Lulu Zhao*, Fujia Zheng*, Weihao Zeng, Keqing He, Weiran Xu, Huixing Jiang, Wei Wu, Yanan Wu
    • paper, code
  18. Revisit Overconfidence for OOD Detection: Reassigned Contrastive Learning with Adaptive Class-dependent Threshold, NAACL2022

    • Yanan Wu*, Keqing He*, Yuanmeng Yan, Qixiang Gao, Zhiyuan Zeng, Fujia Zheng, Lulu Zhao, Huixing Jiang, Wei Wu, Weiran Xu
    • paper, code
  19. ADPL: Adversarial Prompt-based Domain Adaptation for Dialogue Summarization with Knowledge Disentanglement, SIGIR2022

    • Lulu Zhao*, Fujia Zheng*, Weihao Zeng, Keqing He, Ruotong Geng, Huixing Jiang, Wei Wu, Weiran Xu
    • paper
  20. Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning, ACL2022

    • Yutao Mou*, Keqing He*, Yanan Wu*, Zhiyuan Zeng, Hong Xu, Huixing Jiang, Wei Wu, Weiran Xu
    • paper, code
  21. Bridge to Target Domain by Prototypical Contrastive Learning and Label Confusion: Re-explore Zero-Shot Learning for Slot Filling, EMNLP2021 oral

    • Liwen Wang*, Xuefeng Li*, Jiachi Liu, Keqing He, Yuanmeng Yan, Weiran Xu
    • paper, code
  22. A Finer-grain Universal Dialogue Semantic Structures based Model For Abstractive Dialogue Summarization, EMNLP2021 Findings

    • Yuejie Lei*, Fujia Zheng*, Yuanmeng Yan, Keqing He, Weiran Xu
    • paper, code
  23. Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the Task-Oriented Dialogue System, ACL2021 oral

    • Yanan Wu*, Zhiyuan Zeng*, Keqing He*, Hong Xu, Yuanmeng Yan, Huixing Jiang and Weiran Xu
    • paper, code
  24. Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning, ACL2021

    • Zhiyuan Zeng*, Keqing He*, Yuanmeng Yan, Zijun Liu, Yanan Wu, Hong Xu, Huixing Jiang and Weiran Xu
    • paper, code
  25. Scheduled Dialog Policy Learning: An Automatic Curriculum Learning Framework for Task-oriented Dialog System, ACL2021 Findings

    • Sihong Liu, Jinchao Zhang, Keqing He, Weiran Xu and Jie Zhou
    • paper
  26. Adversarial Self-Supervised Learning for Out-of-Domain Detection, NAACL2021 oral

    • Zhiyuan Zeng, Keqing He, Yuanmeng Yan, Hong Xu, Weiran Xu
    • paper, code
  27. Dynamically Disentangling Social Bias from Task-Oriented Representations with Adversarial Attack, NAACL2021

    • Liwen Wang*, Yuanmeng Yan*, Keqing He, Yanan Wu, Weiran Xu
    • paper, code
  28. Hierarchical Speaker-Aware Sequence-to-Sequence Model for Dialogue Summarization, ICASSP2021

    • Yuejie Lei, Yuanmeng Yan, Zhiyuan Zeng, Keqing He, Ximing Zhang, Weiran Xu
    • paper
  29. Adversarial Generative Distance-Based Classifier for Robust Out-of-Domain Detection, ICASSP2021

    • Zhiyuan Zeng*, Hong Xu*, Keqing He, Yuanmeng Yan, Sihong Liu, Zijun Liu, Weiran Xu
    • paper
  30. Contrastive Zero-Shot Learning for Cross-Domain Slot Filling with Adversarial Attack, COLING2020 oral

    • Keqing He, Jinchao Zhang, Yuanmeng Yan, Weiran XU, Cheng Niu, Jie Zhou
    • paper
  31. Syntactic Graph Convolution Network for Spoken Language Understanding, COLING2020

    • Keqing He*, Shuyu Lei*, Jiangnan Xia, Yushu Yang, Huixing Jiang, Zhongyuan Wang
    • paper
  32. A Deep Generative Distance-Based Classifier for Out-of-Domain Detection with Mahalanobis Space, COLING2020 oral

    • Hong Xu, Keqing He, Yuanmeng Yan, Sihong Liu, Zijun Liu, Weiran XU
    • paper, code
  33. Adversarial Semantic Decoupling for Recognizing Open-Vocabulary Slots, EMNLP2020 oral

    • Yuanmeng Yan*, Keqing He*, Hong Xu, Sihong Liu, Fanyu Meng, Min Hu, Weiran XU
    • paper, code
  34. Learning to Tag OOV Tokens by Integrating Contextual Representation and Background Knowledge, ACL2020

    • Keqing He, Yuanmeng Yan, Hong Xu, Sihong Liu, Weiran Xu
    • paper
  35. Learning Label-Relational Output Structure for Adaptive Sequence Labeling, IJCNN2020

    • Keqing He, Yuanmeng Yan, Hong Xu, Weiran Xu
    • paper

Contact