I am looking for talented and self-motivated research interns. Please contact me (kqin@bupt.cn) if you are interested in LLMs.
News
- 2024.2: We have two papers accepted by COLING2024, including
BootTOD: Bootstrap Task-oriented Dialogue Representations by Aligning Diverse Responses
;Beyond the Known: Investigating LLMs Performance on Out-of-Domain Intent Detection
- 2023.12: We have one paper accepted by ICLR2023, including
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
- 2023.10: We have four papers accepted by EMNLP2023, including
Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT
;DemoNSF: A Multi-task Demonstration-based Generative Framework for Noisy Slot Filling Task
;Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition
;APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection
- 2023.5: We have four papers accepted by ACL2023, including
Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery
;FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue
;Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation
;Generative Zero-Shot Prompt Learning for Cross-Domain Slot Filling with Inverse Prompting
- 2022.12: We have four papers accepted by EMNLP2022, including
UniNL: Aligning Representation Learning with Scoring Function for OOD Detection via Unified Neighborhood Learning
;Watch the Neighbors: A Unified K-Nearest Neighbor Contrastive Learning Framework for OOD Intent Discovery
;Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems
;Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning
- 2022.8: We have three papers accepted by COLING2022, including
Generalized Intent Discovery: Learning from Open World Dialogue System
;Distribution Calibration for Out-of-Domain Detection with Bayesian Approximation
;PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling
- 2022.8: We have one paper accepted by CIKM2022, including
Unified Knowledge Prompt Pretraining for Customer Service Dialogues
- 2022.4: We have two papers accepted by NAACL2022, including
Revisit Overconfidence for OOD Detection: Reassigned Contrastive Learning with Adaptive Class-dependent Threshold
;Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization
. - 2022.3: We have one paper accepted by SIGIR2022, including
ADPL: Adversarial Prompt-based Domain Adaptation for Dialogue Summarization with Knowledge Disentanglement
. - 2022.2: We have one paper accepted by ACL2022, including
Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning
.
Research Area
Currently, I am working on Large Language Models(LLM), including pre-training, alignment and reasoning. I’m also interested in sparsity of LLMs, scaling of data, algorithm and infrastructure. Before that, I focused on neural conversational AI: natural language understanding, dialog policy learning, out-of-domain detection and dialogue summarization.
Education
- 2021-Now, Working in Meituan Group, Beijing
- 2018-2021, Master in Artificial Intelligence, BEIJING UNIVERSITY OF POSTS AND TELECOMMUNICATIONS
- 2014-2018, Bachelor in Communication Engineering, BEIJING UNIVERSITY OF POSTS AND TELECOMMUNICATIONS
Experience
Research Intern in Alibaba DAMO, Jun 2020 - Oct 2020
Research Intern in Tencent Wechat AI Lab, Mar 2020 - Jun 2020
Research Intern in Meituan NLP Group, Oct 2019 - Mar 2020
Research and engineering Intern in GBSAA, IBM, SEP 2017 - FEB 2018
Research Intern in BUPT PRIS LAB, MAR 2017 - SEP 2017
Publication
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning, ICLR2023
Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT, EMNLP2023
DemoNSF: A Multi-task Demonstration-based Generative Framework for Noisy Slot Filling Task, EMNLP2023 Findings
Continual Generalized Intent Discovery: Marching Towards Dynamic and Open-world Intent Recognition, EMNLP2023 Findings
APP: Adaptive Prototypical Pseudo-Labeling for Few-shot OOD Detection, EMNLP2023 Findings
Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery, ACL2023
FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue, ACL2023
Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation, ACL2023
- Weihao Zeng*, Lulu Zhao*, Keqing He, Ruotong Geng, Jingang Wang, Wei Wu, Weiran Xu
- paper
Generative Zero-Shot Prompt Learning for Cross-Domain Slot Filling with Inverse Prompting, ACL2023 Findings
UniNL: Aligning Representation Learning with Scoring Function for OOD Detection via Unified Neighborhood Learning, EMNLP2022
Watch the Neighbors: A Unified K-Nearest Neighbor Contrastive Learning Framework for OOD Intent Discovery, EMNLP2022 oral
Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems, EMNLP2022 SereTOD Workshop (Championship of Track II)
Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning, EMNLP2022 SereTOD Workshop
Distribution Calibration for Out-of-Domain Detection with Bayesian Approximation, COLING2022
PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling, COLING2022
- Guanting Dong*, Daichi Guo*, LiWen Wang*, Xuefeng Li*, Zechen Wang, Chen Zeng, Keqing He, Jinzheng Zhao, Hao Lei, Xinyue Cui, Yi Huang, Junlan Feng, Weiran Xu
- paper
Unified Knowledge Prompt Pretraining for Customer Service Dialogues, CIKM2022
- Keqing He, Jingang Wang, Chaobo Sun, Wei Wu
- paper
Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization, NAACL2022 oral
Revisit Overconfidence for OOD Detection: Reassigned Contrastive Learning with Adaptive Class-dependent Threshold, NAACL2022
ADPL: Adversarial Prompt-based Domain Adaptation for Dialogue Summarization with Knowledge Disentanglement, SIGIR2022
- Lulu Zhao*, Fujia Zheng*, Weihao Zeng, Keqing He, Ruotong Geng, Huixing Jiang, Wei Wu, Weiran Xu
- paper
Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning, ACL2022
Bridge to Target Domain by Prototypical Contrastive Learning and Label Confusion: Re-explore Zero-Shot Learning for Slot Filling, EMNLP2021 oral
A Finer-grain Universal Dialogue Semantic Structures based Model For Abstractive Dialogue Summarization, EMNLP2021 Findings
Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the Task-Oriented Dialogue System, ACL2021 oral
Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning, ACL2021
Scheduled Dialog Policy Learning: An Automatic Curriculum Learning Framework for Task-oriented Dialog System, ACL2021 Findings
- Sihong Liu, Jinchao Zhang, Keqing He, Weiran Xu and Jie Zhou
- paper
Adversarial Self-Supervised Learning for Out-of-Domain Detection, NAACL2021 oral
Dynamically Disentangling Social Bias from Task-Oriented Representations with Adversarial Attack, NAACL2021
Hierarchical Speaker-Aware Sequence-to-Sequence Model for Dialogue Summarization, ICASSP2021
- Yuejie Lei, Yuanmeng Yan, Zhiyuan Zeng, Keqing He, Ximing Zhang, Weiran Xu
- paper
Adversarial Generative Distance-Based Classifier for Robust Out-of-Domain Detection, ICASSP2021
- Zhiyuan Zeng*, Hong Xu*, Keqing He, Yuanmeng Yan, Sihong Liu, Zijun Liu, Weiran Xu
- paper
Contrastive Zero-Shot Learning for Cross-Domain Slot Filling with Adversarial Attack, COLING2020 oral
- Keqing He, Jinchao Zhang, Yuanmeng Yan, Weiran XU, Cheng Niu, Jie Zhou
- paper
Syntactic Graph Convolution Network for Spoken Language Understanding, COLING2020
- Keqing He*, Shuyu Lei*, Jiangnan Xia, Yushu Yang, Huixing Jiang, Zhongyuan Wang
- paper
A Deep Generative Distance-Based Classifier for Out-of-Domain Detection with Mahalanobis Space, COLING2020 oral
Adversarial Semantic Decoupling for Recognizing Open-Vocabulary Slots, EMNLP2020 oral
Learning to Tag OOV Tokens by Integrating Contextual Representation and Background Knowledge, ACL2020
- Keqing He, Yuanmeng Yan, Hong Xu, Sihong Liu, Weiran Xu
- paper
Learning Label-Relational Output Structure for Adaptive Sequence Labeling, IJCNN2020
- Keqing He, Yuanmeng Yan, Hong Xu, Weiran Xu
- paper
Contact
- Address: Beijing, China
- Email: kqin@bupt.cn
- Blog: https://helicqin.github.io