Masked reconstruction based self-supervision for human activity recognition H Haresamudram, A Beedu, V Agrawal, PL Grady, I Essa, J Hoffman, ... Proceedings of the 2020 ACM International Symposium on Wearable Computers, 45-49, 2020 | 129 | 2020 |
Video based Object 6D Pose Estimation using Transformers A Beedu, H Alamri, I Essa NeurIPS 2022 workshop on Vision Transformers: Theory and Applications, 2022 | 9 | 2022 |
Multi-stage based feature fusion of multi-modal data for human activity recognition H Choi, A Beedu, H Haresamudram, I Essa arXiv preprint arXiv:2211.04331, 2022 | 6 | 2022 |
Multimodal Contrastive Learning with Hard Negative Sampling for Human Activity Recognition H Choi, A Beedu, I Essa ICCV 2023 workshop on PerDream: PERception, Decision making and REAsoning …, 2023 | 5 | 2023 |
End-to-End Multimodal Representation Learning for Video Dialog H Alamri, A Bilic, M Hu, A Beedu, I Essa NeurIPS 2022 workshop on Vision Transformers: Theory and Applications, 2022 | 4 | 2022 |
VideoPose: Estimating 6D object pose from videos A Beedu, Z Ren, V Agrawal, I Essa arXiv preprint arXiv:2111.10677, 2021 | 2 | 2021 |
Location based payload imaging J Apoorva, B Mohan, A Beedu, MM Nayak, D Rao, VK Agrawal 2015 IEEE International Conference on Electronics, Computing and …, 2015 | 2 | 2015 |
On the Efficacy of Text-Based Input Modalities for Action Anticipation A Beedu, H Haresamudram, K Samel, I Essa arXiv preprint arXiv:2401.12972 (Under review), 2024 | 1 | 2024 |
Exploring Efficient Foundational Multi-modal Models for Video Summarization K Samel, A Beedu, N Sontakke, I Essa arXiv preprint arXiv:2410.07405, 2024 | | 2024 |
Mamba Fusion: Learning Actions Through Questioning Z Dong, A Beedu, J Sheinkopf, I Essa arXiv preprint arXiv:2409.11513, 2024 | | 2024 |
Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition--And Ways to Overcome Them H Haresamudram, A Beedu, M Rabbi, S Saha, I Essa, T Ploetz arXiv preprint arXiv:2408.12023, 2024 | | 2024 |