notesum.ai
Published at November 22About Time: Advances, Challenges, and Outlooks of Action Understanding
cs.CV
cs.AI
Released Date: November 22, 2024
Authors: Alexandros Stergiou1, Ronald Poppe1
Aff.: 1Not provided

| Abbreviations used | ||||
| NVIR: Non-Verbal Interaction Recognition | VAD: Video Anomaly Detection | VC: Video Captioning | VR: Video Retrieval | TAL: Temporal Action Localization |
| APP: Action Progress Prediction | EAP: Early Action Prediction | VA: Video Alignment | AA: Action Anticipation | STAD: SpetaoTemporal Action Detection |
| VFP: Video Frame Prediction | Z/F: Zero- and Few-shot | AOD: Active Object Detection | TSG: Temporal Sentence Grounding | EBD: Event Boundary Detection |
| VRC: Video Repetition Counting | OSCD: Object State Change Detection | VAR: Video Abductive Reasoning | T2V: Text to Video Generation | I2V: Image to Video Generation |