Follow
Max Bain
Max Bain
Reka / VGG, University of Oxford
Verified email at reka.ai - Homepage
Title
Cited by
Cited by
Year
Frozen in time: A joint video and image encoder for end-to-end retrieval
M Bain, A Nagrani, G Varol, A Zisserman
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021
10902021
WhisperX: Time-accurate speech transcription of long-form audio
M Bain, J Huh, T Han, A Zisserman
Interspeech 2023, 2023
1952023
Condensed Movies: Story Based Retrieval with Contextual Embeddings
M Bain, A Nagrani, A Brown, A Zisserman
Asian Conference on Computer Vision (ACCV), 2020, 2020
1102020
A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning
H Berg, SM Hall, Y Bhalgat, W Yang, HR Kirk, A Shtedritski, M Bain
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the …, 2022
1052022
The CLIP-Hitchhiker's Guide to Long Video Retrieval
M Bain, A Nagrani, G Varol, A Zisserman
arXiv preprint arXiv:2205.08508, 2022
692022
Automated audiovisual behavior recognition in wild primates
M Bain, A Nagrani, D Schofield, S Berdugo, J Bessa, J Owen, ...
Science Advances 7 (46), eabi4883, 2021
562021
AutoAD: Movie Description in Context
T Han*, M Bain*, A Nagrani, G Varol, W Xie, A Zisserman
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023, 2023
552023
AutoAD II: The sequel-who, when, and what in movie audio description
T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
382023
Balancing the picture: Debiasing vision-language datasets with synthetic contrast sets
B Smith, M Farinha, SM Hall, HR Kirk, A Shtedritski, M Bain
arXiv preprint arXiv:2305.15407, 2023
202023
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
P Padlewski*, M Bain*, M Henderson, Z Zhu, N Relan, H Pham, D Ong, ...
arXiv preprint arXiv:2405.02287, 2024
192024
Reka core, flash, and edge: A series of powerful multimodal language models
R Team, A Ormazabal, C Zheng, CM d'Autume, D Yogatama, D Fu, D Ong, ...
arXiv preprint arXiv:2404.12387, 2024
19*2024
AutoAD III: The Prequel-Back to the Pixels
T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
142024
Count, crop and recognise: Fine-grained recognition in the wild
M Bain, A Nagrani, D Schofield, A Zisserman
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
142019
Autoad-zero: A training-free framework for zero-shot audio description
J Xie, T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman
Proceedings of the Asian Conference on Computer Vision, 2265-2281, 2024
42024
Understanding video through the lens of language
M Bain
University of Oxford, 2023
12023
Culture in communication: inter-community variation in buttress drumming by wild chimpanzees
J Bessa, M Bain, A Nagrani, A Zisserman, J Di, KH Giovanni, D Biro
Chimpanzee Culture in Cantanhez National Park, Guinea-Bissau, 98, 0
The system can't perform the operation now. Try again later.
Articles 1–16