Search
Publication Authors

Prof. Dr. Didier Stricker

Dr. Alain Pagani

Dr. Gerd Reis

Eric Thil

Keonna Cunningham

Monika Miersch

Dr. Oliver Wasenmüller

Dr. Muhammad Zeshan Afzal

Dr. Gabriele Bleser

Dr. Muhammad Jameel Nawaz Malik

Dr. Bruno Mirbach

Dr. Jason Raphael Rambach

Dr. Nadia Robertini

Dr. René Schuster

Dr. Bertram Taetz

Ahmed Aboukhadra

Dr. Sk Aziz Ali

Mhd Rashed Al Koutayni

Yuriy Anisimov
Anmol Prasad

Muhammad Asad Ali

Jilliam Maria Diaz Barros

Ramy Battrawy
Katharina Bendig
Hammad Butt

Mahdi Chamseddine
Chun-Peng Chang
Steve Dias da Cruz
Fangwen Shu

Torben Fetzer

Ahmet Firintepe

Sophie Folawiyo

David Michael Fürst
Anshu Garg

Christiano Couto Gava
Suresh Guttikonda

Tewodros Amberbir Habtegebrial

Simon Häring

Khurram Azeem Hashmi

Dr. Anna Katharina Hebborn

Hamoun Heidarshenas
Henri Hoyez

Pragati Jaiswal

Alireza Javanmardi

Sai Srinivas Jeevanandam

Jigyasa Singh Katrolia

Matin Keshmiri

Andreas Kölsch
Ganesh Shrinivas Koparde
Onorina Kovalenko

Stephan Krauß
Bastian Krayer
Paul Lesur

Michael Lorenz

Dr. Markus Miezal

Mina Ameli

Nareg Minaskan Karabid

Mohammad Minouei

Shashank Mishra

Pramod Murthy

Mathias Musahl
Peter Neigel

Manthan Pancholi

Mariia Podguzova

Praveen Nathan
Qinzhuan Qian
Rishav

Marcel Rogge
María Alejandra Sánchez Marín
Dr. Kripasindhu Sarkar

Alexander Schäfer

Pascal Schneider

Dr. Mohamed Selim

Tahira Shehzadi
Lukas Stefan Staecker

Yongzhi Su

Xiaoying Tan
Pelle Thielmann
Dr. Mohit Vaishnav

Shaoxiang Wang
Christian Witte

Yaxu Xie

Vemburaj Yadav

Yu Zhou

Dr. Vladislav Golyanik

Dr. Aditya Tewari

André Luiz Brandão
Publication Archive
New title
- @VISOR
- @VISOR-HH
- 4DUS
- ActivityPlus
- AlphaView
- AlterEgo
- Anna C-trus
- AR-Handbook
- ARinfuse
- ARVIDA
- AuRoRas
- AVES/DyMoSiHR
- AVILUS+
- Be-greifen
- BIONIC
- Body Analyzer
- CAPTURE
- Co2Team
- COGNITO
- CONTACT
- DAKARA
- DENSITY
- DYNAMICS
- EASY-IMP
- EMERGENT
- ENNOS
- EPOS
- Eyes Of Things
- GreifbAR
- HyperCOG
- HYSOCIATEA
- iACT
- IMCVO
- iMP
- Infinity
- IVHM
- IVMT
- KI-Absicherung
- LARA
- LiSA
- Marmorbild
- Micro-Dress
- Moveon
- NetVis
- Odysseus Studio
- On Eye
- OrcaM
- PAMAP
- ProWiLAN
- ServiceFactory
- SIMILAR
- SINNODIUM
- STREET3D
- SUDPLAN
- SwarmTrack
- SYNERGIE
- TuBUs-Pro
- VES
- VIDETE
- VIDP
- Virtual Try-On
- VisIMon
- VISTRA
- VIZTA
- You in 3D
Attention-Guided Disentangled Feature Aggregation for Video Object Detection
Attention-Guided Disentangled Feature Aggregation for Video Object Detection
Shishir Muralidhara, Khurram Azeem Hashmi, Alain Pagani, Marcus Liwicki, Didier Stricker, Muhammad Zeshan Afzal
Sensors - Open Access Journal (Sensors) 22 21 Seiten 1-17 MDPI Switzerland 11/2022 .
- Abstract:
- Object detection is a computer vision task that involves the localisation and classification of objects in an image. Video data implicitly introduces several challenges, such as blur, occlusion and defocus, making video object detection more challenging in comparison to still image object detection, which is performed on individual and independent images. This paper tackles these challenges by proposing an attention-heavy framework for video object detection that aggregates the disentangled features extracted from individual frames. The proposed framework is a two-stage object detector based on the Faster R-CNN architecture. The disentanglement head integrates scale, spatial and task-aware attention and applies it to the features extracted by the backbone network across all the frames. Subsequently, the aggregation head incorporates temporal attention and improves detection in the target frame by aggregating the features of the support frames. These include the features extracted from the disentanglement network along with the temporal features. We evaluate the proposed framework using the ImageNet VID dataset and achieve a mean Average Precision (mAP) of 49.8 and 52.5 using the backbones of ResNet-50 and ResNet-101, respectively. The improvement in performance over the individual baseline methods validates the efficacy of the proposed approach.