About Me
     Currently, I am a fourth-year PhD candidate at the College of Computing and Data Science (CCDS), Nanyang Technological University (NTU), Singapore, under the supervision of Prof. Chunyan Miao.
     Before that, I obtained my master's degree from the School of Computer Science, Fudan University (FDU) in 2021, where I worked with Prof. Xipeng Qiu and Prof. Xuanjing Huang. While at Fudan, I was a member of the Fudan NLP Group and the fastNLP development team, and one of the main contributors to fastNLP [GitHub] [Gitee].
     From 2014 to 2018, I completed my bachelor's at Taishan College, Shandong University (SDU), where I worked with Prof. Jun Ma.
Education Bio
- 2021 - present: PhD Student, College of Computing and Data Science (CCDS), Nanyang Technological University (NTU). Working with Prof. Chunyan Miao.
- 2018 - 2021: M.Sc. in Computer Science, Fudan University. Member of the Fudan NLP Group and the fastNLP development team; worked with Prof. Xipeng Qiu and Prof. Xuanjing Huang.
- 2014 - 2018: B.Eng. in Computer Science and Technology, Taishan College, Shandong University. Worked with Prof. Jun Ma. Taishan College is an honors college (a.k.a. elite class) of Shandong University. Our major selects fewer than 20 students from more than 300 undergraduates each year.
Research Interest
My research interests center on Machine Learning and Natural Language Processing (NLP), with a specific focus on Large Language Models (LLMs). I work on making knowledge transfer in LLMs more efficient along the following directions:
- Efficient Optimization: I aim to develop advanced optimization techniques to improve the alignment and soft fine-tuning of LLMs. By refining these methods, I seek to enable LLMs to handle complex tasks with greater efficiency and accuracy.
- Efficient Inference: I focus on streamlining inference processes to enable LLMs to deliver faster predictions while maintaining comparable accuracy, which is crucial for real-time applications.
- Efficient Adaptation: I explore strategies for adapting LLMs to various downstream tasks with minimal supervision, employing few-shot learning techniques to address domain-specific challenges using limited annotated data.
Teaching
At NTU
- SC1007/CE1107/CZ1107 Data Structures and Algorithms (Semester 2, AY2022-2023). Teaching Assistant
- CZ3007 Compiler Techniques (Semester 1, AY2022-2023). Teaching Assistant
At FDU
- DATA62004.01 Neural Network and Deep Learning (Spring 2020). Teaching Assistant
- COMP130137.01 Pattern Recognition & Machine Learning (Spring 2020). Teaching Assistant
- MANA130376.01 Big Data driven Business Analytics and Application (Spring 2019). Teaching Assistant
Awards
- Outstanding Students of Master's Degrees at Fudan University, 2020
- How to Fine-Tune BERT for Text Classification?, CCL 2019 Best Paper Award
Keynotes & Talks
- An Introduction to Prompting Methods, NTU Singapore, 04/05/2022. [Slides]
- Multi-perspective Optimization of Pre-trained Language Model, at NTU Student Lecture Series (SLS), Singapore, 24/03/2022. [Slides][Video]
- An Introduction of Transformer, NTU Singapore, 25/08/2021. [Slides]
Professional Services
Conference Reviewer / PC Members
- ACL Rolling Review (since January 2022)
- ACL (2021, 2023, 2024)
- EMNLP (2021-2024)
- NAACL (2021, 2022, 2024)
- EACL (2024)
- COLM (2024)
- NLPCC (2024)
Journal Reviewer
- Information Sciences
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
Publications
- [New!] RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference, (EMNLP), 2024. [BibTeX] [PDF] [Code]
  Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao. [Abstract]
- Efficient Cross-Task Prompt Tuning for Few-Shot Conversational Emotion Recognition, (Findings of EMNLP), 2023. [BibTeX] [PDF] [Slides]
  Yige Xu, Zhiwei Zeng, Zhiqi Shen. [Abstract]
- Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation, JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY (JCST), Vol. 38(4), pp. 853-866, July 2023. [BibTeX] [DOI] [PDF]
  Yige Xu, Xipeng Qiu, Ligao Zhou, Xuanjing Huang. [Abstract]
- MedChemLens: An Interactive Visual Tool to Support Direction Selection in Interdisciplinary Experimental Research of Medicinal Chemistry, IEEE Transactions on Visualization and Computer Graphics (In Proceedings of VIS 2022), 2023. [BibTeX] [PDF] [Slides]
  Chuhan Shi, Fei Nie, Yicheng Hu, Yige Xu, Lei Chen, Xiaojuan Ma, Qiong Luo. [Abstract]
- Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning, (Findings of EMNLP), 2021. [BibTeX] [PDF] [Slides] [Video] [Code]
  Yichao Luo*, Yige Xu*, Jiacheng Ye, Xipeng Qiu, Qi Zhang. [Abstract]
- Searching Effective Transformer for Seq2Seq Keyphrase Generation, CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC), 2021. [BibTeX] [DOI] [PDF]
  Yige Xu*, Yichao Luo*, Yicheng Zou, Zhengyan Li, Qi Zhang, Xipeng Qiu, Xuanjing Huang. [Abstract]
- ONE2SET: Generating Diverse Keyphrases as a Set, (ACL), 2021. [BibTeX] [PDF] [Code]
  Jiacheng Ye, Tao Gui, Yichao Luo, Yige Xu, Qi Zhang. [Abstract]
- Pre-trained Models for Natural Language Processing: A Survey, SCIENCE CHINA Technological Sciences (Invited Paper, Most Influential Paper of SCTS in 2020), 2020. [BibTeX] [DOI] [PDF]
  Xipeng Qiu, Tianxiang Sun, Yige Xu, Yunfan Shao, Ning Dai, Xuanjing Huang. [Abstract]
- How to Fine-Tune BERT for Text Classification? China National Conference on Chinese Computational Linguistics (CCL, Best Paper Award), 2019. [BibTeX] [DOI] [PDF] [Code]
  Chi Sun, Xipeng Qiu, Yige Xu, Xuanjing Huang. [Abstract]
(*: Equal contribution)
@inproceedings{xu-etal-2024-revmux, title = "RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference", author = "Xu, Yige and Guo, Xu and Zeng, Zhiwei and Miao, Chunyan", booktitle = "EMNLP 2024", year = "2024", publisher = "Association for Computational Linguistics", }
@inproceedings{xu-etal-2023-efficient, title = "Efficient Cross-Task Prompt Tuning for Few-Shot Conversational Emotion Recognition", author = "Xu, Yige and Zeng, Zhiwei and Shen, Zhiqi", booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2023", month = dec, year = "2023", address = "Singapore", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2023.findings-emnlp.780", pages = "11654--11666" }
@article{xu2023jcst-self-distillation, title={Improving {BERT} Fine-Tuning via Self-Ensemble and Self-Distillation}, author={Xu, Yige and Qiu, Xipeng and Zhou, Ligao and Huang, Xuanjing}, journal={J. Comput. Sci. Technol.}, volume={38}, number={4}, pages={853--866}, year = {2023}, doi = {10.1007/s11390-021-1119-0} }
@article{DBLP:journals/tvcg/ShiNHXCML23, author = {Chuhan Shi and Fei Nie and Yicheng Hu and Yige Xu and Lei Chen and Xiaojuan Ma and Qiong Luo}, title = {{MedChemLens}: An Interactive Visual Tool to Support Direction Selection in Interdisciplinary Experimental Research of Medicinal Chemistry}, journal = {{IEEE} Trans. Vis. Comput. Graph.}, volume = {29}, number = {1}, pages = {63--73}, year = {2023}, url = {https://doi.org/10.1109/TVCG.2022.3209434}, doi = {10.1109/TVCG.2022.3209434}, }
@inproceedings{luo2021keyphrase, title = "Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning", author = "Luo, Yichao and Xu, Yige and Ye, Jiacheng and Qiu, Xipeng and Zhang, Qi", booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021", month = nov, year = "2021", address = "Punta Cana, Dominican Republic", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2021.findings-emnlp.45", pages = "497--507", }
@inproceedings{xu2021nlpcc-searching-keyphrase, title={Searching Effective Transformer for Seq2Seq Keyphrase Generation}, author={Xu, Yige and Luo, Yichao and Zou, Yicheng and Li, Zhengyan and Zhang, Qi and Qiu, Xipeng and Huang, Xuanjing}, booktitle={Natural Language Processing and Chinese Computing - 10th {CCF} International Conference, {NLPCC} 2021, Qingdao, China, October 13-17, 2021, Proceedings, Part {II}}, series={Lecture Notes in Computer Science}, volume={13029}, pages={86--97}, publisher={Springer}, year={2021}, url={https://doi.org/10.1007/978-3-030-88483-3\_7}, }
@inproceedings{ye2021one2set, title = "{O}ne2{S}et: {G}enerating Diverse Keyphrases as a Set", author = "Ye, Jiacheng and Gui, Tao and Luo, Yichao and Xu, Yige and Zhang, Qi", booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)", month = aug, year = "2021", address = "Online", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2021.acl-long.354", doi = "10.18653/v1/2021.acl-long.354", pages = "4598--4608", }
@article{qiu2020:scts-ptms, author = {Xipeng Qiu and Tianxiang Sun and Yige Xu and Yunfan Shao and Ning Dai and Xuanjing Huang}, title = {Pre-trained Models for Natural Language Processing: A Survey}, journal = {SCIENCE CHINA Technological Sciences}, publisher = {Science China Press}, year = {2020}, volume = {63}, number = {10}, pages = {1872--1897}, doi = {10.1007/s11431-020-1647-3} }
@inproceedings{sun2019fine, title={How to fine-tune {BERT} for text classification?}, author={Sun, Chi and Qiu, Xipeng and Xu, Yige and Huang, Xuanjing}, booktitle={China National Conference on Chinese Computational Linguistics}, pages={194--206}, year={2019}, organization={Springer} }