The curious case of neural text degeneration A Holtzman, J Buys, L Du, M Forbes, Y Choi arXiv preprint arXiv:1904.09751, 2019 | 3005 | 2019 |
Qlora: Efficient finetuning of quantized llms T Dettmers, A Pagnoni, A Holtzman, L Zettlemoyer Advances in Neural Information Processing Systems 36, 2024 | 1765 | 2024 |
Hellaswag: Can a machine really finish your sentence? R Zellers, A Holtzman, Y Bisk, A Farhadi, Y Choi arXiv preprint arXiv:1905.07830, 2019 | 1543 | 2019 |
Rethinking the role of demonstrations: What makes in-context learning work? S Min, X Lyu, A Holtzman, M Artetxe, M Lewis, H Hajishirzi, L Zettlemoyer arXiv preprint arXiv:2202.12837, 2022 | 1146 | 2022 |
Defending against neural fake news R Zellers, A Holtzman, H Rashkin, Y Bisk, A Farhadi, F Roesner, Y Choi Advances in neural information processing systems 32, 2019 | 1114 | 2019 |
Clipscore: A reference-free evaluation metric for image captioning J Hessel, A Holtzman, M Forbes, RL Bras, Y Choi arXiv preprint arXiv:2104.08718, 2021 | 1031 | 2021 |
Abductive commonsense reasoning C Bhagavatula, RL Bras, C Malaviya, K Sakaguchi, A Holtzman, ... arXiv preprint arXiv:1908.05739, 2019 | 423 | 2019 |
Experience grounds language Y Bisk, A Holtzman, J Thomason, J Andreas, Y Bengio, J Chai, M Lapata, ... arXiv preprint arXiv:2004.10151, 2020 | 403 | 2020 |
Learning to write with cooperative discriminators A Holtzman, J Buys, M Forbes, A Bosselut, D Golub, Y Choi arXiv preprint arXiv:1805.06087, 2018 | 284 | 2018 |
Contrastive decoding: Open-ended text generation as optimization XL Li, A Holtzman, D Fried, P Liang, J Eisner, T Hashimoto, L Zettlemoyer, ... arXiv preprint arXiv:2210.15097, 2022 | 217 | 2022 |
Surface form competition: Why the highest probability answer isn't always right A Holtzman, P West, V Shwartz, Y Choi, L Zettlemoyer arXiv preprint arXiv:2104.08315, 2021 | 209 | 2021 |
Tactical rewind: Self-correction via backtracking in vision-and-language navigation L Ke, X Li, Y Bisk, A Holtzman, Z Gan, J Liu, J Gao, Y Choi, S Srinivasa Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 183 | 2019 |
Counterfactual story reasoning and generation L Qin, A Bosselut, A Holtzman, C Bhagavatula, E Clark, Y Choi arXiv preprint arXiv:1909.04076, 2019 | 148 | 2019 |
Simulating action dynamics with neural process networks A Bosselut, O Levy, A Holtzman, C Ennis, D Fox, Y Choi arXiv preprint arXiv:1711.05313, 2017 | 137 | 2017 |
Connotation frames of power and agency in modern films M Sap, MC Prasettio, A Holtzman, H Rashkin, Y Choi Proceedings of the 2017 conference on empirical methods in natural language …, 2017 | 129 | 2017 |
Do neural language representations learn physical commonsense? M Forbes, A Holtzman, Y Choi arXiv preprint arXiv:1908.02899, 2019 | 120 | 2019 |
Demix layers: Disentangling domains for modular language modeling S Gururangan, M Lewis, A Holtzman, NA Smith, L Zettlemoyer arXiv preprint arXiv:2108.05036, 2021 | 104 | 2021 |
Sounding board: A user-centric and content-driven social chatbot H Fang, H Cheng, M Sap, E Clark, A Holtzman, Y Choi, NA Smith, ... arXiv preprint arXiv:1804.10202, 2018 | 79 | 2018 |
QLoRA: efficient finetuning of quantized LLMs (2023) T Dettmers, A Pagnoni, A Holtzman, L Zettlemoyer arXiv preprint arXiv:2305.14314 52, 3982-3992, 2023 | 74 | 2023 |
PIGLeT: Language grounding through neuro-symbolic interaction in a 3D world R Zellers, A Holtzman, M Peters, R Mottaghi, A Kembhavi, A Farhadi, ... arXiv preprint arXiv:2106.00188, 2021 | 72 | 2021 |