Towards a Unified View of Parameter-Efficient Transfer Learning J He, C Zhou, X Ma, T Berg-Kirkpatrick, G Neubig International Conference on Learning Representations (ICLR) 2022, 2022 | 892 | 2022 |
On the Sentence Embeddings from Pre-trained Language Models B Li, H Zhou, J He, M Wang, Y Yang, L Li Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 720 | 2020 |
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models Y Huang, Y Bai, Z Zhu, J Zhang, J Zhang, T Su, J Liu, C Lv, Y Zhang, J Lei, ... Conference on Neural Information Processing Systems (NeurIPS) 2023, 2023 | 384* | 2023 |
Lagging Inference Networks and Posterior Collapse in Variational Autoencoders J He, D Spokoyny, G Neubig, T Berg-Kirkpatrick International Conference on Learning Representations (ICLR) 2019, 2019 | 347 | 2019 |
Revisiting Self-Training for Neural Sequence Generation J He, J Gu, J Shen, MA Ranzato International Conference on Learning Representations (ICLR) 2020, 2020 | 264 | 2020 |
Choosing Transfer Languages for Cross-Lingual Learning YH Lin, CY Chen, J Lee, Z Li, Y Zhang, M Xia, S Rijhwani, J He, Z Zhang, ... Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019 | 249 | 2019 |
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs M Xiong, Z Hu, X Lu, Y Li, J Fu, J He, B Hooi International Conference on Learning Representations (ICLR) 2024, 2024 | 209 | 2024 |
Mega: Moving Average Equipped Gated Attention X Ma, C Zhou, X Kong, J He, L Gui, G Neubig, J May, L Zettlemoyer International Conference on Learning Representations (ICLR) 2023, 2023 | 164* | 2023 |
CTRLsum: Towards Generic Controllable Text Summarization J He, W Kryściński, B McCann, N Rajani, C Xiong Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2020 | 153 | 2020 |
A Probabilistic Formulation of Unsupervised Text Style Transfer J He, X Wang, G Neubig, T Berg-Kirkpatrick International Conference on Learning Representations (ICLR) 2020, 2020 | 144 | 2020 |
FacTool: Factuality Detection in Generative AI--A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios I Chern, S Chern, S Chen, W Yuan, K Feng, C Zhou, J He, G Neubig, ... arXiv preprint arXiv:2307.13528, 2023 | 135 | 2023 |
StructVAE: Tree-Structured Latent Variable Models for Semi-Supervised Semantic Parsing P Yin, C Zhou, J He, G Neubig Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018 | 124 | 2018 |
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning W Liu, W Zeng, K He, Y Jiang, J He International Conference on Learning Representations (ICLR) 2024, 2024 | 109 | 2024 |
Self-Evaluation Guided Beam Search for Reasoning Y Xie, K Kawaguchi, Y Zhao, X Zhao, MY Kan, J He, Q Xie Conference on Neural Information Processing Systems (NeurIPS) 2023, 2023 | 107* | 2023 |
Efficient Nearest Neighbor Language Models J He, G Neubig, T Berg-Kirkpatrick Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 99 | 2021 |
Composing Parameter-Efficient Modules with Arithmetic Operations J Zhang, S Chen, J Liu, J He Conference on Neural Information Processing Systems (NeurIPS) 2023, 2023 | 95 | 2023 |
A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text B Li, J He, G Neubig, T Berg-Kirkpatrick, Y Yang Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019 | 85 | 2019 |
K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization C Deng, T Zhang, Z He, Q Chen, Y Shi, Y Xu, L Fu, W Zhang, X Wang, ... Proceedings of the 17th ACM International Conference on Web Search and Data …, 2024 | 71 | 2024 |
Prompt Consistency for Zero-Shot Task Generalization C Zhou, J He, X Ma, T Berg-Kirkpatrick, G Neubig Findings of EMNLP 2022, 2022 | 70 | 2022 |
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation Z Hu, H Shi, B Tan, W Wang, Z Yang, T Zhao, J He, L Qin, D Wang, X Ma, ... arXiv preprint arXiv:1809.00794, 2018 | 67 | 2018 |