The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024 | 1824 | 2024 |
Code llama: Open foundation models for code B Roziere, J Gehring, F Gloeckle, S Sootla, I Gat, XE Tan, Y Adi, J Liu, ... arXiv preprint arXiv:2308.12950, 2023 | 1535 | 2023 |
Deeppath: A reinforcement learning method for knowledge graph reasoning W Xiong, T Hoang, WY Wang EMNLP 2017, 2017 | 947 | 2017 |
One-shot relational learning for knowledge graphs W Xiong, M Yu, S Chang, X Guo, WY Wang EMNLP 2018, 2018 | 312 | 2018 |
Hybridqa: A dataset of multi-hop question answering over tabular and textual data W Chen, H Zha, Z Chen, W Xiong, H Wang, W Wang EMNLP 2020 Findings, 2020 | 297 | 2020 |
Pretrained encyclopedia: Weakly supervised knowledge-pretrained language model W Xiong, J Du, WY Wang, V Stoyanov ICLR 2020, 2019 | 253* | 2019 |
Look before you leap: Bridging model-free and model-based reinforcement learning for planned-ahead vision-and-language navigation X Wang*, W Xiong*, H Wang, WY Wang ECCV 2018, 37-53, 2018 | 249 | 2018 |
Answering complex open-domain questions with multi-hop dense retrieval W Xiong, XL Li, S Iyer, J Du, P Lewis, WY Wang, Y Mehdad, W Yih, ... ICLR 2021, 2020 | 161* | 2020 |
Improving question answering over incomplete kbs with knowledge-aware reader W Xiong, M Yu, S Chang, X Guo, WY Wang ACL 2019, 2019 | 161 | 2019 |
Effective long-context scaling of foundation models W Xiong, J Liu, I Molybog, H Zhang, P Bhargava, R Hou, L Martin, ... arXiv preprint arXiv:2309.16039, 2023 | 159 | 2023 |
Variational knowledge graph reasoning W Chen, W Xiong, X Yan, W Wang arXiv preprint arXiv:1803.06581, 2018 | 151 | 2018 |
xformers: A modular and hackable transformer modelling library B Lefaudeux, F Massa, D Liskovich, W Xiong, V Caggiano, S Naren, M Xu, ... | 145 | 2022 |
Sentence embedding alignment for lifelong relation extraction H Wang, W Xiong, M Yu, X Guo, S Chang, WY Wang arXiv preprint arXiv:1903.02588, 2019 | 134 | 2019 |
3dgen: Triplane latent diffusion for textured mesh generation A Gupta, W Xiong, Y Nie, I Jones, B Oğuz arXiv preprint arXiv:2303.05371, 2023 | 133 | 2023 |
Scrolls: Standardized comparison over long language sequences U Shaham, E Segal, M Ivgi, A Efrat, O Yoran, A Haviv, A Gupta, W Xiong, ... EMNLP 2022, 2022 | 129 | 2022 |
Prompting large language models with speech recognition abilities Y Fathullah, C Wu, E Lakomkin, J Jia, Y Shangguan, K Li, J Guo, W Xiong, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 105 | 2024 |
Lm-infinite: Simple on-the-fly length generalization for large language models C Han, Q Wang, W Xiong, Y Chen, H Ji, S Wang arXiv preprint arXiv:2308.16137, 2023 | 100 | 2023 |
Learning to learn and predict: A meta-learning approach for multi-label classification J Wu, W Xiong, WY Wang arXiv preprint arXiv:1909.04176, 2019 | 89 | 2019 |
TWEETQA: A social media focused question answering dataset W Xiong, J Wu, H Wang, V Kulkarni, M Yu, S Chang, X Guo, WY Wang ACL 2019, 2019 | 84 | 2019 |
Self-supervised learning for contextualized extractive summarization H Wang, X Wang, W Xiong, M Yu, X Guo, S Chang, WY Wang arXiv preprint arXiv:1906.04466, 2019 | 76 | 2019 |