Action and event recognition with fisher vectors on a compact feature set D Oneata, J Verbeek, C Schmid Proceedings of the IEEE international conference on computer vision, 1817-1824, 2013 | 512 | 2013 |
A robust and efficient video representation for action recognition H Wang, D Oneata, J Verbeek, C Schmid International journal of computer vision 119, 219-238, 2016 | 370 | 2016 |
Spatio-temporal object detection proposals D Oneata, J Revaud, J Verbeek, C Schmid Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland …, 2014 | 235 | 2014 |
The LEAR submission at Thumos 2014 D Oneata, J Verbeek, C Schmid | 149 | 2014 |
Efficient action localization with approximately normalized fisher vectors D Oneata, J Verbeek, C Schmid Proceedings of the IEEE conference on computer vision and pattern …, 2014 | 92 | 2014 |
The AXES submissions at TrecVid 2013 R Aly, R Arandjelovic, K Chatfield, M Douze, B Fernando, Z Harchaoui, ... | 41 | 2013 |
AXES at TRECVid 2012: KIS, INS, and MED D Oneata, M Douze, J Revaud, S Jochen, D Potapov, H Wang, ... TRECVID workshop, 2012 | 39* | 2012 |
An evaluation of word-level confidence estimation for end-to-end automatic speech recognition D Oneaţă, A Caranica, A Stan, H Cucu 2021 IEEE Spoken Language Technology Workshop (SLT), 258-265, 2021 | 25 | 2021 |
Improving multimodal speech recognition by data augmentation and speech representations D Oneață, H Cucu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 19 | 2022 |
Weakly-supervised deepfake localization in diffusion-generated images DC Țânțaru, E Oneață, D Oneață Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 14 | 2024 |
Kite: Automatic speech recognition for unmanned aerial vehicles D Oneata, H Cucu arXiv preprint arXiv:1907.01195, 2019 | 14 | 2019 |
Speaker disentanglement in video-to-speech conversion D Oneaţă, A Stan, H Cucu 2021 29th European Signal Processing Conference (EUSIPCO), 46-50, 2021 | 12 | 2021 |
Keyword localisation in untranscribed speech using visually grounded speech models K Olaleye, D Oneaţă, H Kamper IEEE Journal of Selected Topics in Signal Processing 16 (6), 1454-1466, 2022 | 10 | 2022 |
Data-filtering methods for self-training of automatic speech recognition systems AL Georgescu, C Manolache, D Oneaţă, H Cucu, C Burileanu 2021 IEEE Spoken Language Technology Workshop (SLT), 1-7, 2021 | 9 | 2021 |
Multilingual multimodal learning with machine translated text C Qiu, D Oneata, E Bugliarello, S Frank, D Elliott arXiv preprint arXiv:2210.13134, 2022 | 8 | 2022 |
Multimodal speech recognition for unmanned aerial vehicles D Oneață, H Cucu Computers & Electrical Engineering 90, 106943, 2021 | 8 | 2021 |
Towards generalisable and calibrated audio deepfake detection with self-supervised representations O Pascu, A Stan, D Oneata, E Oneata, H Cucu Proc. Interspeech 2024, 4828-4832, 2024 | 7* | 2024 |
YFACC: A Yorùbá speech-image dataset for cross-lingual keyword localisation through visual grounding K Olaleye, D Oneata, H Kamper arXiv preprint arXiv:2210.04600, 2022 | 7 | 2022 |
The INRIA-LIM-VocR and AXES submissions to TRECVID 2014 multimedia event detection M Douze, D Oneata, M Paulin, C Leray, N Chesneau, D Potapov, ... | 6 | 2014 |
Revisiting SincNet: An Evaluation of Feature and Network Hyperparameters for Speaker Recognition D Oneata, L Georgescu, H Cucu, D Burileanu, C Burileanu 28th European Signal Processing Conference, 361–365, 2020 | 5* | 2020 |