フォロー
Alaaeldin El-Nouby
Alaaeldin El-Nouby
Research Scientist, Apple
確認したメール アドレス: apple.com - ホームページ
タイトル
引用先
引用先
DINOv2: Learning Robust Visual Features without Supervision
M Oquab, T Darcet, T Moutakanni, H Vo, M Szafraniec, V Khalidov, ...
arXiv preprint arXiv:2304.07193, 2023
725*2023
LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference
B Graham, A El-Nouby, H Touvron, P Stock, A Joulin, H Jégou, M Douze
International Conference on Computer Vision 2021, 2021
622*2021
Resmlp: Feedforward networks for image classification with data-efficient training
H Touvron, P Bojanowski, M Caron, M Cord, A El-Nouby, E Grave, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (4), 5314-5321, 2022
585*2022
XCiT: Cross-Covariance Image Transformers
A El-Nouby, H Touvron, M Caron, P Bojanowski, M Douze, A Joulin, ...
35th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021
393*2021
Imagebind: One embedding space to bind them all
R Girdhar, A El-Nouby, Z Liu, M Singh, KV Alwala, A Joulin, I Misra
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
319*2023
Training vision transformers for image retrieval
A El-Nouby, N Neverova, I Laptev, H Jégou
arXiv preprint arXiv:2102.05644, 2021
1562021
Tell, draw, and repeat: Generating and modifying images based on continual linguistic instruction
A El-Nouby, S Sharma, H Schulz, D Hjelm, LE Asri, SE Kahou, Y Bengio, ...
Proceedings of the IEEE International Conference on Computer Vision, 10304-10312, 2019
142*2019
Are large-scale datasets necessary for self-supervised pre-training?
A El-Nouby, G Izacard, H Touvron, I Laptev, H Jegou, E Grave
arXiv preprint arXiv:2112.10740, 2021
1202021
Three things everyone should know about vision transformers
H Touvron, M Cord, A El-Nouby, J Verbeek, H Jégou
European Conference on Computer Vision, 497-515, 2022
772022
Omnimae: Single model masked pretraining on images and videos
R Girdhar, A El-Nouby, M Singh, KV Alwala, A Joulin, I Misra
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
63*2023
Augmenting convolutional networks with attention-based aggregation
H Touvron, M Cord, A El-Nouby, P Bojanowski, A Joulin, G Synnaeve, ...
arXiv preprint arXiv:2112.13692, 2021
48*2021
Real-Time End-to-End Action Detection with Two-Stream Networks
A Ali, GW Taylor
2018 15th Conference on Computer and Robot Vision (CRV), 31-38, 2018
32*2018
Image compression with product quantized masked image modeling
A El-Nouby, MJ Muckley, K Ullrich, I Laptev, J Verbeek, H Jégou
arXiv preprint arXiv:2212.07372, 2022
182022
Skip-Clip: Self-Supervised Spatiotemporal Representation Learning by Future Clip Order Ranking
A El-Nouby, S Zhai, GW Taylor, JM Susskind
Holistic Video Understanding Workshop ICCV2019, 2019
162019
Improving statistical fidelity for neural image compression with implicit local likelihood models
MJ Muckley, A El-Nouby, K Ullrich, H Jégou, J Verbeek
International Conference on Machine Learning, 25426-25443, 2023
8*2023
Scalable Pre-training of Large Autoregressive Image Models
A El-Nouby, M Klein, S Zhai, MA Bautista, A Toshev, V Shankar, ...
arXiv preprint arXiv:2401.08541, 2024
62024
Variable Rate Allocation for Vector-Quantized Autoencoders
F Baldassarre, A El-Nouby, H Jégou
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
22023
Are Visual Recognition Models Robust to Image Compression?
JM Janeiro, S Frolov, A El-Nouby, J Verbeek
arXiv preprint arXiv:2304.04518, 2023
2023
Spatiotemporal Representation Learning For Human Action Recognition And Localization
A Ali
University of Guelph, 2019
2019
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–19