MultiModalQA: Complex Question Answering over Text, Tables and Images A Talmor, O Yoran, A Catav, D Lahav, Y Wang, A Asai, G Ilharco, ... arXiv preprint arXiv:2104.06039, 2021 | 147 | 2021 |
SCROLLS: Standardized CompaRison Over Long Language Sequences U Shaham, E Segal, M Ivgi, A Efrat, O Yoran, A Haviv, A Gupta, W Xiong, ... arXiv preprint arXiv:2201.03533, 2022 | 129 | 2022 |
Making Retrieval-Augmented Language Models Robust to Irrelevant Context O Yoran, T Wolfson, O Ram, J Berant arXiv preprint arXiv:2310.01558, 2023 | 113 | 2023 |
Commonsenseqa 2.0: Exposing the limits of ai through gamification A Talmor, O Yoran, RL Bras, C Bhagavatula, Y Goldberg, Y Choi, J Berant arXiv preprint arXiv:2201.05320, 2022 | 106 | 2022 |
Evaluating the ripple effects of knowledge editing in language models R Cohen, E Biran, O Yoran, A Globerson, M Geva Transactions of the Association for Computational Linguistics 12, 283-298, 2024 | 102 | 2024 |
Answering Questions by Meta-Reasoning over Multiple Chains of Thought O Yoran, T Wolfson, B Bogin, U Katz, D Deutch, J Berant arXiv preprint arXiv:2304.13007, 2023 | 72 | 2023 |
Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills O Yoran, A Talmor, J Berant arXiv preprint arXiv:2107.07261, 2021 | 52 | 2021 |
QAMPARI: A Benchmark for Open-domain Questions with Many Answers S Amouyal, T Wolfson, O Rubin, O Yoran, J Herzig, J Berant The 61st Annual Meeting Of The Association For Computational Linguistics, 2023 | 46* | 2023 |
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? O Yoran, SJ Amouyal, C Malaviya, B Bogin, O Press, J Berant arXiv preprint arXiv:2407.15711, 2024 | 6* | 2024 |
From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty M Ivgi, O Yoran, J Berant, M Geva arXiv preprint arXiv:2407.06071, 2024 | 4 | 2024 |
The BrowserGym Ecosystem for Web Agent Research D Chezelles, T Le Sellier, M Gasse, A Lacoste, A Drouin, M Caccia, ... arXiv preprint arXiv:2412.05467, 2024 | | 2024 |