A*: Atrous Spatial Temporal Action - Detailed Analysis & Overview

A*: Atrous Spatial Temporal Action Recognition for Real Time Applications

Authors: Myeongjun Kim; Federica Spinola; Philipp Benz; Tae-hoon Kim
Description: Deep learning has become a popular tool ...

ST-GCN: Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition

ST-GCN is the first GCN-based method for the task of skeleton-based

Spatio-Temporal Action Detection with Occlusion

Something-Else: Compositional Action Recognition With Spatial-Temporal Interaction Networks

Authors: Joanna Materzynska, Tete Xiao, Roei Herzig, Huijuan Xu, Xiaolong Wang, Trevor Darrell
Description: Human

Spatio-Temporal Action Detection with Multi-Object Interaction

Spatio-Temporal Action Detection Under Large Motion

Authors: Gurkirt Singh (ETH Zurich)*; Vasileios Choutas (ETH Zurich); Suman Saha (ETH Zurich); Fisher Yu (ETH Zurich); Luc Van ...

EScALation: A Framework for Efficient and Scalable Spatio-temporal Action Localization

ACM MMSys 2021 talks.

GATSBI: Generative Agent-centric Spatio-temporal Object Interaction

Group 106: Spatial Temporal Convolutional Networks for Action Recognition

ACM ICMR 2026 A Unified Object Centric Spatio Temporal Graph Reasoning Framework for AVQA

VIRST: Video-Instructed Reasoning Assistant for SpatioTemporal Segmentation

Referring Video Object Segmentation (RVOS) aims to segment target objects in videos based on natural language descriptions.

Towards Grounded Spatio-Temporal Reasoning

To develop an Artificial Intelligence (AI) system that can understand the world around us, it needs to be able to interpret and ...

Spatial-Temporal Graph Convolution Networks for skeleton-based Action Recognition
