Presentation + Paper
12 June 2023 Frequency-based aerial video recognition
Divya Kothandaraman, Xijun Wang, Tianrui Guan, Sean Hu, Ming Lin, Dinesh Manocha
Author Affiliations +
Abstract
Aerial video recognition is challenging due to various factors. Prior work on action recognition imposes constraints in terms of unavailability of object detection bounding box ground-truth inhibiting the application of localization models and computational constraints preventing the usage of expensive space-time self-attention. Optical flow and pretrained models for detecting human actor performing action do not work too well due to domain gap issues. Our contributions1, 2 are as follows: 1. We present a frequency-domain space-time attention method that encapsulates long-range space-time dependencies by emulating the weighted outer product in the frequency domain. 2. We present a frequency-based object background disentanglement method to inherently separate out the moving human actor from the background. 3. We present a mathematical model for static salient regions and an identity loss function to learn disentangled features in a differentiable manner.
Conference Presentation
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Divya Kothandaraman, Xijun Wang, Tianrui Guan, Sean Hu, Ming Lin, and Dinesh Manocha "Frequency-based aerial video recognition", Proc. SPIE 12544, Open Architecture/Open Business Model Net-Centric Systems and Defense Transformation 2023, 125440J (12 June 2023); https://doi.org/10.1117/12.2663491
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Action recognition

Object detection

Unmanned aerial vehicles

Army

Cameras

Mathematical modeling

RELATED CONTENT


Back to Top