Alaaeldin El-Nouby
Alaaeldin El-Nouby
Research Scientist, Apple
Dirección de correo verificada de - Página principal
Citado por
Citado por
DINOv2: Learning Robust Visual Features without Supervision
M Oquab, T Darcet, T Moutakanni, H Vo, M Szafraniec, V Khalidov, ...
arXiv preprint arXiv:2304.07193, 2023
LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference
B Graham, A El-Nouby, H Touvron, P Stock, A Joulin, H Jégou, M Douze
International Conference on Computer Vision 2021, 2021
Resmlp: Feedforward networks for image classification with data-efficient training
H Touvron, P Bojanowski, M Caron, M Cord, A El-Nouby, E Grave, ...
IEEE transactions on pattern analysis and machine intelligence 45 (4), 5314-5321, 2022
Imagebind: One embedding space to bind them all
R Girdhar, A El-Nouby, Z Liu, M Singh, KV Alwala, A Joulin, I Misra
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
XCiT: Cross-Covariance Image Transformers
A El-Nouby, H Touvron, M Caron, P Bojanowski, M Douze, A Joulin, ...
35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021
Training vision transformers for image retrieval
A El-Nouby, N Neverova, I Laptev, H Jégou
arXiv preprint arXiv:2102.05644, 2021
Tell, draw, and repeat: Generating and modifying images based on continual linguistic instruction
A El-Nouby, S Sharma, H Schulz, D Hjelm, LE Asri, SE Kahou, Y Bengio, ...
Proceedings of the IEEE International Conference on Computer Vision, 10304-10312, 2019
Are large-scale datasets necessary for self-supervised pre-training?
A El-Nouby, G Izacard, H Touvron, I Laptev, H Jegou, E Grave
arXiv preprint arXiv:2112.10740, 2021
Three things everyone should know about vision transformers
H Touvron, M Cord, A El-Nouby, J Verbeek, H Jégou
European Conference on Computer Vision, 497-515, 2022
Omnimae: Single model masked pretraining on images and videos
R Girdhar, A El-Nouby, M Singh, KV Alwala, A Joulin, I Misra
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
Augmenting convolutional networks with attention-based aggregation
H Touvron, M Cord, A El-Nouby, P Bojanowski, A Joulin, G Synnaeve, ...
arXiv preprint arXiv:2112.13692, 2021
Real-Time End-to-End Action Detection with Two-Stream Networks
A Ali, GW Taylor
2018 15th Conference on Computer and Robot Vision (CRV), 31-38, 2018
Image compression with product quantized masked image modeling
A El-Nouby, MJ Muckley, K Ullrich, I Laptev, J Verbeek, H Jégou
arXiv preprint arXiv:2212.07372, 2022
Scalable Pre-training of Large Autoregressive Image Models
A El-Nouby, M Klein, S Zhai, MA Bautista, A Toshev, V Shankar, ...
International Conference on Machine Learning, 2024, 2024
Skip-Clip: Self-Supervised Spatiotemporal Representation Learning by Future Clip Order Ranking
A El-Nouby, S Zhai, GW Taylor, JM Susskind
Holistic Video Understanding Workshop ICCV2019, 2019
Improving statistical fidelity for neural image compression with implicit local likelihood models
MJ Muckley, A El-Nouby, K Ullrich, H Jégou, J Verbeek
International Conference on Machine Learning, 25426-25443, 2023
Variable rate allocation for vector-quantized autoencoders
F Baldassarre, A El-Nouby, H Jégou
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
DataComp-LM: In search of the next generation of training sets for language models
J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ...
arXiv preprint arXiv:2406.11794, 2024
Contributions to the Design and Training of Transformers in Computer Vision
INRIA Paris; ENS Paris-Ecole Normale Supérieure de Paris; PSL University, 2023
Are Visual Recognition Models Robust to Image Compression?
JM Janeiro, S Frolov, A El-Nouby, J Verbeek
arXiv preprint arXiv:2304.04518, 2023
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20