PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.
Aerial video recognition is challenging due to various factors. Prior work on action recognition imposes constraints in terms of unavailability of object detection bounding box ground-truth inhibiting the application of localization models and computational constraints preventing the usage of expensive space-time self-attention. Optical flow and pretrained models for detecting human actor performing action do not work too well due to domain gap issues. Our contributions1, 2 are as follows: 1. We present a frequency-domain space-time attention method that encapsulates long-range space-time dependencies by emulating the weighted outer product in the frequency domain. 2. We present a frequency-based object background disentanglement method to inherently separate out the moving human actor from the background. 3. We present a mathematical model for static salient regions and an identity loss function to learn disentangled features in a differentiable manner.
Divya Kothandaraman,Xijun Wang,Tianrui Guan,Sean Hu,Ming Lin, andDinesh Manocha
"Frequency-based aerial video recognition", Proc. SPIE 12544, Open Architecture/Open Business Model Net-Centric Systems and Defense Transformation 2023, 125440J (12 June 2023); https://doi.org/10.1117/12.2663491
ACCESS THE FULL ARTICLE
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.
The alert did not successfully save. Please try again later.
Divya Kothandaraman, Xijun Wang, Tianrui Guan, Sean Hu, Ming Lin, Dinesh Manocha, "Frequency-based aerial video recognition," Proc. SPIE 12544, Open Architecture/Open Business Model Net-Centric Systems and Defense Transformation 2023, 125440J (12 June 2023); https://doi.org/10.1117/12.2663491