Abstract
Early action prediction deals with inferring the ongoing action from partially-observed videos, typically at the outset of the video. We propose a bottleneck-based attention model that captures the evolution of the action, through progressive sampling over fine-to-coarse scales. Our proposed Temporal Progressive (TemPr) model is composed of multiple attention towers, one for each scale. The predicted action label is based on the collective agreement considering confidences of these towers. Extensive experiments over four video datasets showcase state-of-the-art performance on the task of Early Action Prediction across a range of encoder architectures. We demonstrate the effectiveness and consistency of TemPr through detailed ablations. † † Code is available at: https://tinyurl.com/temprog
Original language | English |
---|---|
Title of host publication | 2023 IEEE/CVF Conference On Computer Vision And Pattern Recognition |
Subtitle of host publication | CVPR 2023 |
Publisher | IEEE |
Pages | 14709-14719 |
Number of pages | 11 |
ISBN (Print) | 979-8-3503-0129-8 |
DOIs | |
Publication status | Published - 2023 |
Externally published | Yes |
Event | IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 - Vancouver, Canada Duration: 18 Jun 2023 → 22 Jun 2023 https://cvpr2023.thecvf.com/ |
Publication series
Name | Proceedings |
---|---|
Publisher | IEEE |
ISSN (Print) | 2575-7075 |
Conference
Conference | IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 |
---|---|
Abbreviated title | CVPR 2023 |
Country/Territory | Canada |
City | Vancouver |
Period | 18/06/23 → 22/06/23 |
Internet address |
Keywords
- n/a OA procedure