About Me
     Now I am NOT available on the job market, thanks for your interests.
     Currently, I am a fourth-year PhD candidate at the College of Computing and Data Science (CCDS), Nanyang Technological University (NTU), Singapore, under the supervision of Prof. Chunyan Miao.
    
Before that, I obtained my master degree in School of Computer Science, Fudan University (FDU) in 2021, where I worked with Prof. Xipeng Qiu and Prof. Xuanjing Huang.
When I was in Fudan, I was a member of Fudan NLP Group and fastnlp develop team.
I was one of the main contributors of fastNLP [GitHub] [Gitee].
     From 2014 to 2018, I completed my bachelor's at Taishan College, Shandong University (SDU), where I worked with Prof. Jun Ma.
Education Bio
- 2021 - present: PhD Student, College of Computing and Data Science (CCDS), Nanyang Technological University (NTU). Working with Prof. Chunyan Miao.
- 2018 - 2021: M.Sc. Computer Science from Fudan University, member of Fudan NLP Group and fastnlp develop team, worked with Prof. Xipeng Qiu and Prof. Xuanjing Huang.
- 2014 - 2018: B.Eng. Computer Science and Technology from Taishan College, Shandong University, worked with Prof. Jun Ma. I completed the China Top-Notch Undergraduate Training Program at Taishan College, the honors college of Shandong University.
Research Interest
My research interests are centred on Machine Learning and Natural Language Processing (NLP), with a specific focus on Large Language Models (LLMs). I am dedicated to enhancing the efficient knowledge transfer for LLMs through the following key avenues:
- Efficient Optimization: I aim to develop advanced optimization techniques to improve the alignment and soft fine-tuning of LLMs. By refining these methods, I seek to enable LLMs to handle complex tasks with greater efficiency and accuracy.
- Efficient Inference: I focus on streamlining inference processes to enable LLMs to deliver faster predictions while maintaining comparable accuracy, which is crucial for real-time applications.
- Efficient Adaptation: I explore strategies for adapting LLMs to various downstream tasks with minimal supervision, employing few-shot learning techniques to address domain-specific challenges using limited annotated data.
Teaching
At NTU
- SC1007/CE1107/CZ1107 Data Structures and Algorithm (Semester 2, AY2022-2023). Teaching Assistant
- CZ3007 Compile Techniques (Semester 1, AY2022-2023). Teaching Assistant
At FDU
- DATA62004.01 Neural Network and Deep Learning (Spring 2020). Teaching Assistant
- COMP130137.01 Pattern Recognition & Machine Learning (Spring 2020). Teaching Assistant
- MANA130376.01 Big Data driven Business Analytics and Application (Spring 2019). Teaching Assistant
Awards
- Outstanding Students of Master's Degrees at Fudan University, 2020
- How to Fine-Tune BERT for Text Classification?, CCL 2019 Best Paper Award
Keynotes & Talks
- An Introduction to Prompting Methods, NTU Singapore, 04/05/2022.[Slides]
- Multi-perspective Optimization of Pre-trained Language Model, at NTU Student Lecture Series (SLS), Singapore, 24/03/2022. [Slides][Video]
- An Introduction of Transformer, NTU Singapore, 25/08/2021.[Slides]
Professional Services
Conference Reviewer / PC Members
- ACL Rolling Review (since January 2022)
- ACL (2021, 2023-2025)
- EMNLP (2021-2025)
- NAACL (2021, 2022, 2024, 2025)
- EACL (2024)
- NeurIPS (2025)
- COLM (2024, 2025)
- NLPCC (2024, 2025)
Journal Reviewer
- Information Sciences
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
Publications
-
[New!]
SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning, (arXiv preprint), 2025.
[BibTeX]
[PDF]
[Code]
Yige Xu*, Xu Guo*, Zhiwei Zeng, Chunyan Miao. [Abstract] -
[New!]
SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs, (ACL), 2025.
[BibTeX]
[PDF]
[Code]
Yige Xu*, Xu Guo*, Zhiwei Zeng, Chunyan Miao. [Abstract] -
RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference, (EMNLP), 2024.
[BibTeX]
[PDF]
[Slides]
[Code]
Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao. [Abstract] -
Efficient Cross-Task Prompt Tuning for Few-Shot Conversational Emotion Recognition, (Findings of EMNLP), 2023.
[BibTeX]
[PDF]
[Slides]
Yige Xu, Zhiwei Zeng, Zhiqi Shen. [Abstract] -
Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation, JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, (JCST), July 2023, Vol. 38(4), pp. 853-866, 2023.
[BibTeX]
[DOI]
[PDF]
Yige Xu, Xipeng Qiu, Ligao Zhou, Xuanjing Huang. [Abstract] -
MedChemLens: An Interactive Visual Tool to Support Direction Selection in Interdisciplinary Experimental Research of Medicinal Chemistry, IEEE Transactions on Visualization and Computer Graphics, (In Proceedings of VIS 2022), 2023.
[BibTeX]
[PDF]
[Slides]
Chuhan Shi, Fei Nie, Yicheng Hu, Yige Xu, Lei Chen, Xiaojuan Ma, Qiong Luo. [Abstract] -
Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning, (Findings of EMNLP), 2021.
[BibTeX]
[PDF]
[Slides]
[Video]
[Code]
Yichao Luo*, Yige Xu*, Jiacheng Ye, Xipeng Qiu, Qi Zhang. [Abstract] -
Searching Effective Transformer for Seq2Seq Keyphrase Generation, CCF International Conference on Natural Language Processing and Chinese Computing, (NLPCC), 2021.
[BibTeX]
[DOI]
[PDF]
Yige Xu*, Yichao Luo*, Yicheng Zou, Zhengyan Li, Qi Zhang, Xipeng Qiu, Xuanjing Huang [Abstract] -
ONE2SET: Generating Diverse Keyphrases as a Set, (ACL), 2021.
[BibTeX]
[PDF]
[Code]
Jiacheng Ye, Tao Gui, Yichao Luo, Yige Xu, Qi Zhang. [Abstract] -
Pre-trained Models for Natural Language Processing: A Survey, SCIENCE CHINA Technological Sciences, (Most Influential Paper of SCTS in 2021), 2020.
[BibTeX]
[DOI]
[PDF]
Xipeng Qiu, Tianxiang Sun, Yige Xu, Yunfan Shao, Ning Dai, Xuanjing Huang. [Abstract] -
How to Fine-Tune BERT for Text Classification? China National Conference on Chinese Computational Linguistics, (CCL, Best Paper Award), 2019.
[BibTeX]
[DOI]
[PDF]
[Code]
Chi Sun, Xipeng Qiu, Yige Xu, Xuanjing Huang. [Abstract]
(*: Equal contribution)
@article{xu2025softcotpp, title={{SoftCoT++}: Test-Time Scaling with Soft Chain-of-Thought Reasoning}, author={Xu, Yige and Guo, Xu and Zeng, Zhiwei and Miao, Chunyan}, journal={arXiv preprint arXiv:2505.11484}, year={2025} }
@inproceedings{xu2025softcot, title={{SoftCoT}: Soft Chain-of-Thought for Efficient Reasoning with LLMs}, author={Xu, Yige and Guo, Xu and Zeng, Zhiwei and Miao, Chunyan}, booktitle={Proceedings of {ACL}}, year={2025} }
@inproceedings{xu-etal-2024-revmux, title = "{R}ev{MUX}: Data Multiplexing with Reversible Adapters for Efficient {LLM} Batch Inference", author = "Xu, Yige and Guo, Xu and Zeng, Zhiwei and Miao, Chunyan", booktitle = "Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing", month = nov, year = "2024", address = "Miami, Florida, USA", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2024.emnlp-main.1232", pages = "22072--22087", }
@inproceedings{xu-etal-2023-efficient, title = "Efficient Cross-Task Prompt Tuning for Few-Shot Conversational Emotion Recognition", author = "Xu, Yige and Zeng, Zhiwei and Shen, Zhiqi", booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2023", month = dec, year = "2023", address = "Singapore", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2023.findings-emnlp.780", pages = "11654--11666" }
@article{xu2023jcst-self-distillation, title={Improving {BERT} Fine-Tuning via Self-Ensemble and Self-Distillation}, author={Xu, Yige and Qiu, Xipeng and Zhou, Ligao and Huang, Xuanjing}, journal={J. Comput. Sci. Technol.}, volume={38}, number={4}, pages={853--866}, year = {2023}, doi = {https://doi.org/10.1007/s11390-021-1119-0} }
@article{DBLP:journals/tvcg/ShiNHXCML23, author = {Chuhan Shi and Fei Nie and Yicheng Hu and Yige Xu and Lei Chen and Xiaojuan Ma and Qiong Luo}, title = {{MedChemLens}: An Interactive Visual Tool to Support Direction Selection in Interdisciplinary Experimental Research of Medicinal Chemistry}, journal = {{IEEE} Trans. Vis. Comput. Graph.}, volume = {29}, number = {1}, pages = {63--73}, year = {2023}, url = {https://doi.org/10.1109/TVCG.2022.3209434}, doi = {10.1109/TVCG.2022.3209434}, }
@inproceedings{luo2021keyphrase, title = "Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning", author = "Luo, Yichao and Xu, Yige and Ye, Jiacheng and Qiu, Xipeng and Zhang, Qi", booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021", month = nov, year = "2021", address = "Punta Cana, Dominican Republic", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2021.findings-emnlp.45", pages = "497--507", }
@inproceedings{xu2021nlpcc-searching-keyphrase, title={Searching Effective Transformer for Seq2Seq Keyphrase Generation}, author={Xu, Yige and Luo, Yichao and Zou, Yicheng and Li, Zhengyan and Zhang, Qi and Qiu, Xipeng and Huang, Xuanjing}, title={Searching Effective Transformer for Seq2Seq Keyphrase Generation}, booktitle={Natural Language Processing and Chinese Computing - 10th {CCF} International Conference, {NLPCC} 2021, Qingdao, China, October 13-17, 2021, Proceedings, Part {II}}, series={Lecture Notes in Computer Science}, volume={13029}, pages={86--97}, publisher={Springer}, year={2021}, url={https://doi.org/10.1007/978-3-030-88483-3\_7}, }
@inproceedings{ye2021one2set, title = "{O}ne2{S}et: {G}enerating Diverse Keyphrases as a Set", author = "Ye, Jiacheng and Gui, Tao and Luo, Yichao and Xu, Yige and Zhang, Qi", booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)", month = aug, year = "2021", address = "Online", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2021.acl-long.354", doi = "10.18653/v1/2021.acl-long.354", pages = "4598--4608", }
@article{qiu2020:scts-ptms, author = {Xipeng Qiu and TianXiang Sun and Yige Xu and Yunfan Shao and Ning Dai and Xuanjing Huang}, title = {Pre-trained Models for Natural Language Processing: A Survey}, journal = {SCIENCE CHINA Technological Sciences}, publisher = {Science China Press}, year = {2020}, volume = {63}, number = {10}, pages = {1872--1897}, doi = {https://doi.org/10.1007/s11431-020-1647-3} }
@inproceedings{sun2019fine, title={How to fine-tune {BERT} for text classification?}, author={Sun, Chi and Qiu, Xipeng and Xu, Yige and Huang, Xuanjing}, booktitle={China National Conference on Chinese Computational Linguistics}, pages={194--206}, year={2019}, organization={Springer} }