Skip to content

firework8/Awesome-Skeleton-based-Action-Recognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 

Repository files navigation

Awesome Skeleton-based Action Recognition

Awesome PRs Welcome

We collect existing papers on skeleton-based action recognition published in prominent conferences and journals.

This paper list will be continuously updated at the end of each month.

Table of Contents

Survey

  • Human Action Recognition from Various Data Modalities: A Review (TPAMI 2022) [paper]
  • Human action recognition and prediction: A survey (IJCV 2022) [paper]
  • Transformer for Skeleton-based action recognition: A review of recent advances (Neurocomputing 2023) [paper]
  • Action recognition based on RGB and skeleton data sets: A survey (Neurocomputing 2022) [paper]
  • A Comparative Review of Recent Kinect-based Action Recognition Algorithms (TIP 2019) [paper]
  • Self-Supervised Skeleton Action Representation Learning: A Benchmark and Beyond (2024 arXiv paper) [paper]
  • A Comprehensive Methodological Survey of Human Activity Recognition Across Divers Data Modalities (2024 arXiv paper) [paper]
  • ANUBIS: Review and Benchmark Skeleton-Based Action Recognition Methods with a New Dataset (2022 arXiv paper) [paper]
  • A Survey on 3D Skeleton-Based Action Recognition Using Learning Method (2020 arXiv paper) [paper]

Papers

Statistics: 🔥 relatively highly cited | ⭐ code is available and star > 100

2024

CVPR

  • BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition [paper] [code]
  • Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognition [paper] [code]
  • Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning [paper] [code]
  • LLMs are Good Action Recognizers [paper]
  • MaskCLR: Attention-Guided Contrastive Learning for Robust Action Representation Learning [paper]

ECCV

  • SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition [paper] [code]
  • MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion [paper] [code]
  • Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition [paper] [code]
  • SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders [paper] [code]
  • CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner [paper] [code]
  • On the Utility of 3D Hand Poses for Action Recognition [paper] [code]
  • Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph [paper] [code]
  • VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViG [paper] [code]
  • Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action Segmentation [paper] [code]
  • S-JEPA: A Joint Embedding Predictive Architecture for Skeletal Action Recognition [paper]
  • Towards Physical World Backdoor Attacks against Skeleton Action Recognition [paper]

NeurIPS

  • CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition [paper] [code]
  • Recovering Complete Actions for Cross-dataset Skeleton Action Recognition [paper] [code]

AAAI

  • Dynamic Semantic-Based Spatial Graph Convolution Network for Skeleton-Based Human Action Recognition [paper] [code]
  • SCD-Net: Spatiotemporal Clues Disentanglement Network for Self-supervised Skeleton-based Action Recognition [paper] [code]
  • Navigating Open Set Scenarios for Skeleton-based Action Recognition [paper] [code]
  • Behavioral Recognition of Skeletal Data Based on Targeted Dual Fusion Strategy [paper]
  • Spatio-Temporal Fusion for Human Action Recognition via Joint Trajectory Graph [paper]

IJCAI

  • Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action Recognition [paper] [code]

ACM MM

  • Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition [paper] [code]
  • Fine-Grained Side Information Guided Dual-Prompts for Zero-Shot Skeleton Action Recognition [paper] [code]
  • Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer [paper]

CVPRW

  • Efficient Skeleton-Based Action Recognition for Real-Time Embedded Systems [paper]

ICPR

  • Mask and Compress: Efficient Skeleton-based Action Recognition in Continual Learning [paper] [code]

ICIP

  • Hierarchical Vertex-Wise Intensification Graph Convolution for Skeleton-Based Activity Recognition [paper]
  • Cross-Action Cross-Subject Skeleton Action Recognition Via Simultaneous Action-Subject Learning With Two-Step Feature Removal [paper]

ICASSP

  • Elevating Skeleton-Based Action Recognition with Efficient Multi-Modality Self-Supervision [paper] [code]
  • Wavelet-Decoupling Contrastive Enhancement Network for Fine-Grained Skeleton-Based Action Recognition [paper]
  • A Novel Contrastive Diffusion Graph Convolutional Network for Few-Shot Skeleton-Based Action Recognition [paper]

IROS

  • Skeleton-Based Human Action Recognition with Noisy Labels [paper] [code]

ICMEW

  • HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition [paper] [code]

TPAMI

  • InfoGCN++: Learning Representation by Predicting the Future for Online Skeleton-based Action Recognition [paper] [code]
  • One-Shot Action Recognition via Multi-Scale Spatial-Temporal Skeleton Matching [paper]

IJCV

  • View-invariant Skeleton Action Representation Learning via Motion Retargeting [paper] [code]

TIP

  • DeGCN: Deformable Graph Convolutional Networks for Skeleton-Based Action Recognition [paper] [code]
  • SelfGCN: Graph Convolution Network With Self-Attention for Skeleton-Based Action Recognition [paper] [code]
  • Dynamic Semantic-based Spatial-Temporal Graph Convolution Network for Skeleton-based Human Action Recognition [paper] [code]
  • Mutual Information Driven Equivariant Contrastive Learning for 3D Action Representation Learning [paper]
  • Multi-View Time-Series Hypergraph Neural Network for Action Recognition [paper]

TMM

  • Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning [paper] [code]
  • Localized Linear Temporal Dynamics for Self-supervised Skeleton Action Recognition [paper]
  • Hierarchical Aggregated Graph Neural Network for Skeleton-based Action Recognition [paper]

TCSVT

  • SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition [paper] [code]
  • Asynchronous Joint-based Temporal Pooling for Skeleton-based Action Recognition [paper] [code]
  • Glimpse and Zoom: Spatio-Temporal Focused Dynamic Network for Skeleton-based Action Recognition [paper]
  • Multi-scale Structural Graph Convolutional Network for Skeleton-based Action Recognition [paper]
  • Decoupled Knowledge Embedded Graph Convolutional Network for Skeleton-based Human Action Recognition [paper]
  • Global and Local Contrastive Learning for Self-supervised Skeleton-Based Action Recognition [paper]
  • Motion-Aware Mask Feature Reconstruction for Skeleton-Based Action Recognition [paper]
  • Asynchronous Joint-based Temporal Pooling for Skeleton-based Action Recognition [paper]
  • Enhancing Skeleton-Based Action Recognition with Language Descriptions from Pre-trained Large Multimodal Models [paper]
  • DSDC-GCN: Decoupled Static-Dynamic Co-occurrence Graph Convolutional Networks for Skeleton-Based Action Recognition [paper]

TNNLS

  • Language-Guided 3-D Action Feature Learning Without Ground-Truth Sample Class Label [paper] [code]
  • GRA: Graph Representation Alignment for Semi-Supervised Action Recognition [paper]
  • Multi-Dimensional Refinement Graph Convolutional Network with Robust Decouple Loss for Fine-Grained Skeleton-Based Action Recognition [paper]

PR

  • Improving self-supervised action recognition from extremely augmented skeleton sequences [paper] [code]
  • Spatiotemporal Progressive Inward-Outward Aggregation Network for skeleton-based action recognition [paper]

Neurocomputing

  • A motion-aware and temporal-enhanced Spatial–Temporal Graph Convolutional Network for skeleton-based human action segmentation [paper] [code]
  • Independent Dual Graph Attention Convolutional Network for skeleton-based action recognition [paper]
  • Representation modeling learning with multi-domain decoupling for unsupervised skeleton-based action recognition [paper]
  • Multi-scale sampling attention graph convolutional networks for skeleton-based action recognition [paper]
  • Modeling the skeleton-language uncertainty for 3D action recognition [paper]
  • Prompt-supervised dynamic attention graph convolutional network for skeleton-based action recognition [paper]
  • Language-guided temporal primitive modeling for skeleton-based action recognition [paper]

arXiv papers

  • Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition [paper] [code]
  • Topological Symmetry Enhanced Graph Convolution for Skeleton-Based Action Recognition [paper] [code]
  • Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation [paper] [code]
  • Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition [paper] [code]
  • Graph in Graph Neural Network [paper] [code]
  • EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition [paper] [code]
  • Language Supervised Human Action Recognition with Salient Fusion: Construction Worker Action Recognition as a Use Case [paper] [code]
  • AutoGCN - Towards Generic Human Activity Recognition with Neural Architecture Search [paper] [code]
  • GCN-DevLSTM: Path Development for Skeleton-Based Action Recognition [paper] [code]
  • Active Generation Network of Human Skeleton for Action Recognition [paper] [code]
  • STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences [paper] [code]
  • Spatial Hierarchy and Temporal Attention Guided Cross Masking for Self-supervised Skeleton-based Action Recognition [paper] [code]
  • TDSM:Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition [paper] [code]
  • Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition [paper] [code]
  • A Two-stream Hybrid CNN-Transformer Network for Skeleton-based Human Interaction Recognition [paper]
  • Learning Mutual Excitation for Hand-to-Hand and Human-to-Human Interaction Recognition [paper]
  • Skeleton2vec: A Self-supervised Learning Framework with Contextualized Target Representations for Skeleton Sequence [paper]
  • Unsupervised Spatial-Temporal Feature Enrichment and Fidelity Preservation Network for Skeleton based Action Recognition [paper]
  • Multi-Scale Spatial-Temporal Self-Attention Graph Convolutional Networks for Skeleton-based Action Recognition [paper]
  • Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos [paper]
  • MK-SGN: A Spiking Graph Convolutional Network with Multimodal Fusion and Knowledge Distillation for Skeleton-based Action Recognition [paper]
  • An Improved Graph Pooling Network for Skeleton-Based Action Recognition [paper]
  • Action-OOD: An End-to-End Skeleton-Based Model for Robust Out-of-Distribution Human Action Detection [paper]
  • An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition [paper]
  • LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition [paper]
  • Skeleton-Based Action Recognition with Spatial-Structural Graph Convolution [paper]
  • Signal-SGN: A Spiking Graph Convolutional Network for Skeletal Action Recognition via Learning Temporal-Frequency Dynamics [paper]
  • TASAR: Transferable Attack on Skeletal Action Recognition [paper]
  • Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment [paper]
  • Language-Assisted Human Part Motion Learning for Skeleton-Based Temporal Action Segmentation [paper]
  • Explaining Human Activity Recognition with SHAP: Validating Insights with Perturbation and Quantitative Measures [paper]
  • Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections [paper]
  • SkelMamba: A State Space Model for Efficient Skeleton Action Recognition of Neurological Disorders [paper]

2023

CVPR

  • Learning Discriminative Representations for Skeleton Based Action Recognition [paper] [code]
  • Neural Koopman Pooling: Control-Inspired Temporal Dynamics Encoding for Skeleton-Based Action Recognition [paper] [code]
  • Actionlet-Dependent Contrastive Learning for Unsupervised Skeleton-Based Action Recognition [paper] [code]
  • HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised Learning of Actions [paper] [code]
  • 3Mformer: Multi-order Multi-mode Transformer for Skeletal Action Recognition [paper]
  • Unified Pose Sequence Modeling [paper]
  • Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling [paper]
  • Prompt-Guided Zero-Shot Anomaly Action Recognition using Pretrained Deep Skeleton Features [paper]

ICCV

  • Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition [paper] [code]
  • Leveraging Spatio-Temporal Dependency for Skeleton-Based Action Recognition [paper] [code]
  • Generative Action Description Prompts for Skeleton-based Action Recognition [paper] [code]
  • Masked Motion Predictors are Strong 3D Action Representation Learners [paper] [code]
  • SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training [paper] [code]
  • MotionBERT: A Unified Perspective on Learning Human Motion Representations [paper] [code]
  • Parallel Attention Interaction Network for Few-Shot Skeleton-based Action Recognition [paper] [code]
  • Modeling the Relative Visual Tempo for Self-supervised Skeleton-based Action Recognition [paper] [code]
  • FSAR: Federated Skeleton-based Action Recognition with Adaptive Topology Structure and Knowledge Distillation [paper] [code]
  • Hard No-Box Adversarial Attack on Skeleton-Based Human Action Recognition with Skeleton-Motion-Informed Gradient [paper] [code]
  • LAC - Latent Action Composition for Skeleton-based Action Segmentation [paper] [code]
  • SkeleTR: Towards Skeleton-based Action Recognition in the Wild [paper]
  • Cross-Modal Learning with 3D Deformable Attention for Action Recognition [paper]

ICML

  • Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition [paper] [code]

ICLR

  • Graph Contrastive Learning for Skeleton-based Action Recognition [paper] [code]
  • Hyperbolic Self-paced Learning for Self-supervised Skeleton-based Action Representations [paper] [code]

AAAI

  • Hierarchical Consistent Contrastive Learning for Skeleton-Based Action Recognition with Growing Augmentations [paper] [code]
  • Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences [paper] [code]
  • Frame-Level Label Refinement for Skeleton-Based Weakly-Supervised Action Recognition [paper] [code]
  • Hierarchical Contrast for Unsupervised Skeleton-based Action Representation Learning [paper] [code]
  • Anonymization for Skeleton Action Recognition [paper] [code]
  • Defending Black-box Skeleton-based Human Activity Classifiers [paper] [code]
  • Novel Motion Patterns Matter for Practical Skeleton-based Action Recognition [paper]
  • Self-Supervised Learning for Multilevel Skeleton-Based Forgery Detection via Temporal-Causal Consistency of Actions [paper]

ACM MM

  • Skeleton MixFormer: Multivariate Topology Representation for Skeleton-based Action Recognition [paper] [code]
  • Prompted Contrast with Masked Motion Modeling: Towards Versatile 3D Action Representation Learning [paper] [code]
  • Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action Understanding [paper] [code]
  • Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization [paper] [code]
  • Self-Relational Graph Convolution Network for Skeleton-Based Action Recognition [paper]
  • Skeletal Spatial-Temporal Semantics Guided Homogeneous-Heterogeneous Multimodal Network for Action Recognition [paper]
  • Occluded Skeleton-Based Human Action Recognition with Dual Inhibition Training [paper]

IJCAI

  • Part Aware Contrastive Learning for Self-Supervised Action Recognition [paper] [code]
  • Action Recognition with Multi-stream Motion Modeling and Mutual Information Maximization [paper]

ICCVW

  • A Lightweight Skeleton-Based 3D-CNN for Real-Time Fall Detection and Action Recognition [paper]

BMVC

  • STEP CATFormer: Spatial-Temporal Effective Body-Part Cross Attention Transformer for Skeleton-based Action Recognition [paper] [code]

WACV

  • Adaptive Local-Component-aware Graph Convolutional Network for One-shot Skeleton-based Action Recognition [paper]
  • STAR-Transformer: A Spatio-Temporal Cross Attention Transformer for Human Action Recognition [paper]

ICIP

  • Temporal-Channel Topology Enhanced Network for Skeleton-Based Action Recognition [paper] [code]
  • Part Aware Graph Convolution Network with Temporal Enhancement for Skeleton-Based Action Recognition [paper]
  • Skeleton Action Recognition Based on Spatio-Temporal Features [paper]

ICME

  • DD-GCN: Directed Diffusion Graph Convolutional Network for Skeleton-based Human Action Recognition [paper] [code]
  • Dynamic Spatial-temporal Hypergraph Convolutional Network for Skeleton-based Action Recognition [paper]

WACVW

  • Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art [paper]

ICMEW

  • SkeletonMAE: Spatial-Temporal Masked Autoencoders for Self-supervised Skeleton Action Recognition [paper]

ICASSP

  • Body Prior Guided Graph Convolutional Neural Network for Skeleton-Based Action Recognition [paper] [code]

IROS

  • Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition [paper] [code]

TPAMI

  • Self-Supervised 3D Action Representation Learning with Skeleton Cloud Colorization [paper]

TIP

  • DMMG: Dual Min-Max Games for Self-Supervised Skeleton-Based Action Recognition [paper]

TMM

  • Delving Deep into One-Shot Skeleton-based Action Recognition with Diverse Occlusions [paper] [code]
  • Temporal Decoupling Graph Convolutional Network for Skeleton-based Gesture Recognition [paper] [code]
  • Skeleton-based Action Recognition through Contrasting Two-Stream Spatial-Temporal Networks [paper]
  • Learning Representations by Contrastive Spatio-temporal Clustering for Skeleton-based Action Recognition [paper]
  • Skeleton-Based Gesture Recognition With Learnable Paths and Signature Features [paper]
  • Skeleton-Based Action Recognition with Select-Assemble-Normalize Graph Convolutional Networks [paper]
  • Joints-Centered Spatial-Temporal Features Fused Skeleton Convolution Network for Action Recognition [paper]

TCSVT

  • Motion Complement and Temporal Multifocusing for Skeleton-based Action Recognition [paper] [code]
  • TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition [paper] [code]

TNNLS

  • Spatiotemporal Decouple-and-Squeeze Contrastive Learning for Semi-Supervised Skeleton-based Action Recognition [paper]
  • Learning Heterogeneous Spatial–Temporal Context for Skeleton-Based Action Recognition [paper]
  • Self-Adaptive Graph With Nonlocal Attention Network for Skeleton-Based Action Recognition [paper]

PR

  • Continual spatio-temporal graph convolutional networks [paper] [code]
  • Relation-mining self-attention network for skeleton-based human action recognition [paper] [code]
  • SpatioTemporal Focus for Skeleton-based Action Recognition [paper]
  • Multi-grained clip focus for skeleton-based action recognition [paper]

Neurocomputing

  • SPAR: An efficient self-attention network using Switching Partition Strategy for skeleton-based action recognition [paper] [code]
  • Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition [paper]
  • Spatio-temporal segments attention for skeleton-based action recognition [paper]
  • STDM-transformer: Space-time dual multi-scale transformer network for skeleton-based action recognition [paper]

arXiv papers

  • Language Knowledge-Assisted Representation Learning for Skeleton-Based Action Recognition [paper] [code]
  • TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potential [paper] [code]
  • High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition [paper] [code]
  • Spatial-Temporal Decoupling Contrastive Learning for Skeleton-based Human Action Recognition [paper] [code]
  • Hulk: A Universal Knowledge Translator for Human-Centric Tasks [paper] [code]
  • Balanced Representation Learning for Long-tailed Skeleton-based Action Recognition [paper] [code]
  • Joint Adversarial and Collaborative Learning for Self-Supervised Action Recognition [paper] [code]
  • Unveiling the Hidden Realm: Self-supervised Skeleton-based Action Recognition in Occluded Environments [paper] [code]
  • Pyramid Self-attention Polymerization Learning for Semi-supervised Skeleton-based Action Recognition [paper] [code]
  • Skeleton-based Human Action Recognition via Convolutional Neural Networks (CNN) [paper]
  • Cross-view Action Recognition via Contrastive View-invariant Representation [paper]
  • Attack is Good Augmentation: Towards Skeleton-Contrastive Representation Learning [paper]
  • I2MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation [paper]
  • Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition [paper]
  • Modiff: Action-Conditioned 3D Motion Generation with Denoising Diffusion Probabilistic Models [paper]
  • Skeleton-based action analysis for ADHD diagnosis [paper]
  • Fine-grained Action Analysis: A Multi-modality and Multi-task Dataset of Figure Skating [paper]

2022

CVPR

  • InfoGCN: Representation Learning for Human Skeleton-based Action Recognition [paper] [code] [🔥]
  • Revisiting Skeleton-based Action Recognition [paper] [code] [🔥] [⭐]

ECCV

  • CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation [paper] [code]
  • Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning [paper] [code]
  • Collaborating Domain-shared and Target-specific Feature Clustering for Cross-domain 3D Action Recognition [paper] [code]
  • Global-local Motion Transformer for Unsupervised Skeleton-based Action Learning [paper] [code]
  • IGFormer: Interaction Graph Transformer for Skeleton-Based Human Interaction Recognition [paper]
  • Contrastive Positive Mining for Unsupervised 3D Action Representation Learning [paper]
  • Learning Spatial-Preserved Skeleton Representations for Few-Shot Action Recognition [paper]
  • Uncertainty-DTW for Time Series and Sequences [paper]

AAAI

  • Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition [paper] [code] [🔥]
  • Topology-aware Convolutional Neural Network for Efficient Skeleton-based Action Recognition [paper] [code] [🔥] [⭐]
  • Towards To-a-T Spatio-Temporal Focus for Skeleton-Based Action Recognition [paper]

ACM MM

  • PYSKL: Towards Good Practices for Skeleton Action Recognition [paper] [code] [⭐]
  • Shifting Perspective to See Difference: A Novel Multi-View Method for Skeleton based Action Recognition [paper] [code]
  • Skeleton-based Action Recognition via Adaptive Cross-Form Learning [paper] [code]
  • Global-Local Cross-View Fisher Discrimination for View-Invariant Action Recognition [paper]

CVPRW

  • Bootstrapped Representation Learning for Skeleton-Based Action Recognition [paper]

ECCVW

  • Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks [paper] [code]
  • PSUMNet: Unified Modality Part Streams are All You Need for Efficient Pose-based Action Recognition [paper] [code]
  • Strengthening Skeletal Action Recognizers via Leveraging Temporal Patterns [paper]

ACCV

  • Focal and Global Spatial-Temporal Transformer for Skeleton-based Action Recognition [paper]
  • Temporal-Viewpoint Transportation Plan for Skeletal Few-shot Action Recognition [paper]

WACV

  • Skeleton-DML: Deep Metric Learning for Skeleton-Based One-Shot Action Recognition [paper] [code]
  • Generative Adversarial Graph Convolutional Networks for Human Action Synthesis [paper] [code]

ICPR

  • Skeletal Human Action Recognition using Hybrid Attention based Graph Convolutional Network [paper]

TPAMI

  • Constructing Stronger and Faster Baselines for Skeleton-based Action Recognition [paper] [code] [🔥]
  • Motif-GCNs With Local and Non-Local Temporal Blocks for Skeleton-Based Action Recognition [paper] [code]
  • Multi-Granularity Anchor-Contrastive Representation Learning for Semi-Supervised Skeleton-Based Action Recognition [paper] [code]

IJCV

  • Action2video: Generating Videos of Human 3D Actions [paper]

TIP

  • Contrast-reconstruction Representation Learning for Self-supervised Skeleton-based Action Recognition [paper] [code]
  • Multilevel Spatial–Temporal Excited Graph Network for Skeleton-Based Action Recognition [paper] [code]
  • SMAM: Self and Mutual Adaptive Matching for Skeleton-Based Few-Shot Action Recognition [paper]
  • X-Invariant Contrastive Augmentation and Representation Learning for Semi-Supervised Skeleton-Based Action Recognition [paper]

TMM

  • Skeleton-Based Mutually Assisted Interacted Object Localization and Human Action Recognition [paper]
  • Joint-bone Fusion Graph Convolutional Network for Semi-supervised Skeleton Action Recognition [paper]

TCSVT

  • Two-person Graph Convolutional Network for Skeleton-based Human Interaction Recognition [paper] [code]
  • Zoom Transformer for Skeleton-Based Group Activity Recognition [paper] [code]
  • Motion Guided Attention Learning for Self-Supervised 3D Human Action Recognition [paper]
  • Motion-Driven Spatial and Temporal Adaptive High-Resolution Graph Convolutional Networks for Skeleton-Based Action Recognition [paper]
  • View-Normalized and Subject-Independent Skeleton Generation for Action Recognition [paper]

TNNLS

  • Fusing Higher-Order Features in Graph Neural Networks for Skeleton-Based Action Recognition [paper] [code]

Neurocomputing

  • Forward-reverse adaptive graph convolutional networks for skeleton-based action recognition [paper] [code]
  • AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement [paper]
  • Hierarchical graph attention network with pseudo-metapath for skeleton-based action recognition [paper]
  • Skeleton-based similar action recognition through integrating the salient image feature into a center-connected graph convolutional network [paper]
  • PB-GCN: Progressive binary graph convolutional networks for skeleton-based action recognition [paper]

arXiv papers

  • Hypergraph Transformer for Skeleton-based Action Recognition [paper] [code]
  • DG-STGCN: Dynamic Spatial-Temporal Modeling for Skeleton-based Action Recognition [paper] [code]
  • Spatio-Temporal Tuples Transformer for Skeleton-Based Action Recognition [paper] [code]
  • Contrastive Learning from Spatio-Temporal Mixed Skeleton Sequences for Self-Supervised Skeleton-Based Action Recognition [paper] [code]
  • HAA4D: Few-Shot Human Atomic Action Recognition via 3D Spatio-Temporal Skeletal Alignment [paper] [code]
  • Skeleton-based Action Recognition Via Temporal-Channel Aggregation [paper]
  • A New Spatial Adjacency Matrix of Skeleton Data Based on Self-loop and Adaptive Weights [paper]
  • View-Invariant Skeleton-based Action Recognition via Global-Local Contrastive Learning [paper]

2021

CVPR

  • 3D Human Action Representation Learning via Cross-View Consistency Pursuit [paper] [code] [🔥]
  • BASAR:Black-box Attack on Skeletal Action Recognition [paper] [code]
  • Understanding the Robustness of Skeleton-based Action Recognition under Adversarial Attack [paper] [code]

ICCV

  • Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition [paper] [code] [🔥] [⭐]
  • AdaSGN: Adapting Joint Number and Model Size for Efficient Skeleton-Based Action Recognition [paper] [code]
  • Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning [paper]
  • Self-supervised 3D Skeleton Action Representation Learning with Motion Consistency and Continuity [paper]

NeurIPS

  • Unsupervised Motion Representation Learning with Capsule Autoencoders [paper] [code]

AAAI

  • Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition [paper] [code] [🔥]
  • Spatio-Temporal Difference Descriptor for Skeleton-Based Action Recognition [paper]

ACM MM

  • Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition [paper] [code]
  • STST: Spatial-Temporal Specialized Transformer for Skeleton-based Action Recognition [paper] [code]
  • Skeleton-Contrastive 3D Action Representation Learning [paper] [code]
  • Modeling the Uncertainty for Self-supervised 3D Skeleton Action Representation Learning [paper]

CVPRW

  • One-shot action recognition in challenging therapy scenarios [paper] [code]

BMVC

  • UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition [paper] [code]
  • Unsupervised Human Action Recognition with Skeletal Graph Laplacian and Self-Supervised Viewpoints Invariance [paper] [code]
  • LSTA-Net: Long short-term Spatio-Temporal Aggregation Network for Skeleton-based Action Recognition [paper]

WACV

  • JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition [paper]

ICPR

  • Learning Connectivity with Graph Convolutional Networks for Skeleton-based Action Recognition [paper]

ICPRW

  • Spatial Temporal Transformer Network for Skeleton-Based Action Recognition [paper] [code] [🔥] [⭐]

ICIP

  • Syntactically Guided Generative Embeddings for Zero-Shot Skeleton Action Recognition [paper] [code]

ICME

  • Graph Convolutional Hourglass Networks for Skeleton-Based Action Recognition [paper]

ICRA

  • Pose Refinement Graph Convolutional Network for Skeleton-basedAction Recognition [paper] [code]

TPAMI

  • Symbiotic Graph Neural Networks for 3D Skeleton-Based Human Action Recognition and Motion Prediction [paper] [🔥]
  • Tensor Representations for Action Recognition [paper]

IJCV

  • Quo Vadis, Skeleton Action Recognition? [paper] [code]

TIP

  • Extremely Lightweight Skeleton-Based Action Recognition with ShiftGCN++ [paper] [code]
  • Structural Knowledge Distillation for Efficient Skeleton-Based Action Recognition [paper] [code]
  • Feedback Graph Convolutional Network for Skeleton-Based Action Recognition [paper]
  • Hypergraph Neural Network for Skeleton-Based Action Recognition [paper]

TIFS

  • REGINA - Reasoning Graph Convolutional Networks in Human Action Recognition [paper] [code]

TMM

  • Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton Based Action Recognition [paper] [code]
  • Interaction Relational Network for Mutual Action Recognition [paper] [code]
  • LAGA-Net: Local-and-Global Attention Network for Skeleton Based Action Recognition [paper]
  • A Multi-Stream Graph Convolutional Networks-Hidden Conditional Random Field Model for Skeleton-Based Action Recognition [paper]
  • Multi-Localized Sensitive Autoencoder-Attention-LSTM For Skeleton-based Action Recognition [paper]
  • Dear-Net: Learning Diversities for Skeleton-Based Early Action Recognition [paper]
  • Efficient Spatio-Temporal Contrastive Learning for Skeleton-Based 3-D Action Recognition [paper]
  • GA-Net: A Guidance Aware Network for Skeleton-Based Early Activity Recognition [paper]

TCSVT

  • Fuzzy Integral-Based CNN Classifier Fusion for 3D Skeleton Action Recognition [paper] [code]
  • A Central Difference Graph Convolutional Operator for Skeleton-Based Action Recognition [paper] [code]
  • Multi-Stream Interaction Networks for Human Action Recognition [paper]
  • A Cross View Learning Approach for Skeleton-Based Action Recognition [paper]
  • Symmetrical Enhanced Fusion Network for Skeleton-Based Action Recognition [paper]
  • Graph2Net: Perceptually-enriched graph learning for skeleton-based action recognition [paper]

TNNLS

  • Memory Attention Networks for Skeleton-Based Action Recognition [paper] [code]

PR

  • Arbitrary-view human action recognition via novel-view action generation [paper] [code]
  • Tripool: Graph triplet pooling for 3D skeleton-based action recognition [paper]
  • Action recognition using kinematics posture feature on 3D skeleton joint locations [paper]
  • Scene image and human skeleton-based dual-stream human action recognition [paper]
  • Dyadic relational graph convolutional networks for skeleton-based human interaction recognition [paper]

Neurocomputing

  • Rethinking the ST-GCNs for 3D skeleton-based human action recognition [paper]
  • Attention adjacency matrix based graph convolutional networks for skeleton-based action recognition [paper]
  • Skeleton-based action recognition using sparse spatio-temporal GCN with edge effective resistance [paper]
  • Integrating vertex and edge features with Graph Convolutional Networks for skeleton-based action recognition [paper]
  • Adaptive multi-view graph convolutional networks for skeleton-based action recognition [paper]
  • Knowledge embedded GCN for skeleton-based two-person interaction recognition [paper]
  • Normal graph: Spatial temporal graph convolutional networks based prediction network for skeleton based video anomaly detection [paper]

arXiv papers

  • STAR: Sparse Transformer-based Action Recognition [paper] [code]
  • Self-attention based anchor proposal for skeleton-based action recognition [paper] [code]
  • Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition [paper]
  • 3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Na¨ıve [paper]

2020

CVPR

  • Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition [paper] [code] [🔥] [⭐]
  • Skeleton-Based Action Recognition with Shift Graph Convolutional Network [paper] [code] [🔥] [⭐]
  • Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition [paper] [code] [🔥] [⭐]
  • PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition [paper] [code] [⭐]
  • Dynamic Multiscale Graph Neural Networks for 3D Skeleton Based Human Motion Prediction [paper] [code] [🔥] [⭐]
  • Context Aware Graph Convolution for Skeleton-Based Action Recognition [paper]

ECCV

  • Decoupling GCN with DropGraph Module for Skeleton-Based Action Recognition [paper] [code] [🔥]
  • Unsupervised 3D Human Pose Representation with Viewpoint and Pose Disentanglement [paper] [code]
  • DDGCN: A Dynamic Directed Graph Convolutional Network for Action Recognition [paper]
  • Adversarial Self-supervised Learning for Semi-supervised 3D Action Recognition [paper]

AAAI

  • Learning Graph Convolutional Network for Skeleton-based Human Action Recognition by Neural Searching [paper] [code] [🔥] [⭐]
  • Part-Level Graph Convolutional Network for Skeleton-Based Action Recognition [paper]
  • Learning Diverse Stochastic Human-Action Generators by Learning Smooth Latent Transitions [paper]

ACM MM

  • Stronger, Faster and More Explainable: A Graph Convolutional Baseline for Skeleton-based Action Recognition [paper] [code] [🔥]
  • Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition [paper] [code] [⭐]
  • Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition [paper] [code]
  • MS2L: Multi-Task Self-Supervised Learning for Skeleton Based Action Recognition [paper] [code]
  • Action2Motion: Conditioned Generation of 3D Human Motions [paper] [code] [⭐]
  • Group-Skeleton-Based Human Action Recognition in Complex Events [paper]
  • Mix Dimension in Poincaré Geometry for 3D Skeleton-based Action Recognition [paper]

NIPSW

  • Contrastive Self-Supervised Learning for Skeleton Action Recognition [paper]

ACCV

  • Decoupled Spatial-Temporal Attention Network for Skeleton-Based Action-Gesture Recognition [paper]

TPAMI

  • Learning Multi-View Interactional Skeleton Graph for Action Recognition [paper] [code]
  • Multi-Task Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition [paper] [code] [⭐]

TIP

  • Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks [paper] [code] [🔥] [⭐]

TMM

  • Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition [paper]
  • Deep Manifold-to-Manifold Transforming Network for Skeleton-Based Action Recognition [paper]

TCSVT

  • Richly Activated Graph Convolutional Network for Robust Skeleton-based Action Recognition [paper] [code]

TNNLS

  • Adversarial Attack on Skeleton-Based Human Action Recognition [paper]

TOMM

  • A Benchmark Dataset and Comparison Study for Multi-modal Human Action Analytics [paper]

PR

  • Skeleton-based action recognition with hierarchical spatial reasoning and temporal stack learning network [paper]

Neurocomputing

  • Exploring a rich spatial–temporal dependent relational model for skeleton-based action recognition by bidirectional LSTM-CNN [paper]
  • HDS-SP: A novel descriptor for skeleton-based human action recognition [paper]

2019

CVPR

  • Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition [paper] [code] [🔥] [⭐]
  • Actional-Structural Graph Convolutional Networks for Skeleton-based Action Recognition [paper] [code] [🔥] [⭐]
  • Skeleton-Based Action Recognition with Directed Graph Neural Networks [paper] [code] [🔥] [⭐]
  • Bayesian Hierarchical Dynamic Model for Human Action Recognition [paper] [code]
  • An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition [paper] [🔥]

ICCV

  • Bayesian Graph Convolution LSTM for Skeleton Based Action Recognition [paper]
  • Making the Invisible Visible: Action Recognition Through Walls and Occlusions [paper]

AAAI

  • Graph CNNs with Motif and Variable Temporal Block for Skeleton-Based Action Recognition [paper] [code]
  • Spatio-Temporal Graph Routing for Skeleton-Based Action Recognition [paper]

CVPRW

  • Three-Stream Convolutional Neural Network With Multi-Task and Ensemble Learning for 3D Action Recognition [paper]

ICCVW

  • Spatial Residual Layer and Dense Connection Block Enhanced Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition [paper]

WACV

  • Unsupervised Feature Learning of Human Actions As Trajectories in Pose Embedding Manifold [paper]

ICIP

  • Richly Activated Graph Convolutional Network for Action Recognition with Incomplete Skeletons [paper] [code]

ICME

  • Skeleton-Based Action Recognition with Synchronous Local and Non-local Spatio-temporal Learning and Frequency Attention [paper]
  • Relational Network for Skeleton-Based Action Recognition [paper]

TPAMI

  • NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding [paper] [code] [🔥]
  • View Adaptive Neural Networks for High Performance Skeleton-based Human Action Recognition [paper] [code] [🔥] [⭐]

TIP

  • Sample Fusion Network: An End-to-End Data Augmentation Network for Skeleton-Based Human Action Recognition [paper] [code]
  • View-Invariant Human Action Recognition Based on a 3D Bio-Constrained Skeleton Model [paper] [code]
  • EleAtt-RNN: Adding Attentiveness to Neurons in Recurrent Neural Networks [paper]
  • Learning Latent Global Network for Skeleton-Based Action Prediction [paper]

TMM

  • 2-D Skeleton-Based Action Recognition via Two-Branch Stacked LSTM-RNNs [paper]
  • A Cuboid CNN Model With an Attention Mechanism for Skeleton-Based Action Recognition [paper]
  • Joint Learning in the Spatio-Temporal and Frequency Domains for Skeleton-Based Action Recognition [paper]

TCSVT

  • Action Recognition Scheme Based on Skeleton Representation With DS-LSTM Network [paper]

TNNLS

  • Graph Edge Convolutional Neural Networks for Skeleton-Based Action Recognition [paper]

Neurocomputing

  • Convolutional relation network for skeleton-based action recognition [paper]

2018

CVPR

  • Recognizing Human Actions as the Evolution of Pose Estimation Maps [paper] [code]
  • Independently Recurrent Neural Network (IndRNN): Building a Longer and Deeper RNN [paper] [code] [🔥] [⭐]
  • 2D/3D Pose Estimation and Action Recognition Using Multitask Deep Learning [paper] [code] [🔥] [⭐]
  • Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition [paper] [🔥]

ECCV

  • Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack [paper] [🔥]
  • Adding Attentiveness to the Neurons in Recurrent Neural Networks [paper]

AAAI

  • Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition [paper [code] [🔥] [⭐]
  • Unsupervised Representation Learning With Long-Term Dynamics for Skeleton Based Action Recognition [paper] [code]
  • Spatio-Temporal Graph Convolution for Skeleton Based Action Recognition [paper]

ACM MM

  • Optimized Skeleton-based Action Recognition via Sparsified Graph Regression [paper]
  • A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition [paper]

IJCAI

  • Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation [paper] [code] [🔥] [⭐]
  • Memory Attention Networks for Skeleton-based Action Recognition [paper] [code]

BMVC

  • Part-based Graph Convolutional Network for Action Recognition [paper] [code]
  • A Fine-to-Coarse Convolutional Neural Network for 3D Human Action Recognition [paper]

ICIP

  • Joints Relation Inference Network for Skeleton-Based Action Recognition [paper]

ICME

  • Skeleton-Based Human Action Recognition Using Spatial Temporal 3D Convolutional Neural Networks [paper]

TIP

  • Beyond Joints: Learning Representations From Primitive Geometries for Skeleton-Based Action Recognition and Detection [paper] [code]
  • Learning Clip Representations for Skeleton-Based 3D Action Recognition [paper]

TMM

  • Attention-Based Multiview Re-Observation Fusion Network for Skeletal Action Recognition [paper]
  • Fusing Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks [paper]

TCSVT

  • Skeleton-Based Action Recognition With Gated Convolutional Neural Networks [paper]
  • Action Recognition With Spatio–Temporal Visual Attention on Skeleton Image Sequences [paper]

PR

  • Learning content and style: Joint action recognition and person identification from human skeletons [paper]

2017

CVPR

  • Deep Learning on Lie Groups for Skeleton-based Action Recognition [paper] [code]
  • Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks [paper]
  • Global Context-Aware Attention LSTM Networks for 3D Action Recognition [paper] [🔥]
  • A New Representation of Skeleton Sequences for 3D Action Recognition [paper] [🔥]

ICCV

  • View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition from Skeleton Data [paper] [code] [🔥] [⭐]
  • Ensemble Deep Learning for Skeleton-Based Action Recognition Using Temporal Sliding LSTM Networks [paper] [code]
  • Learning Action Recognition Model From Depth and Skeleton Videos [paper]

AAAI

  • An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data [paper] [🔥]

CVPRW

  • Interpretable 3D Human Action Analysis with Temporal Convolutional Networks [paper] [code] [🔥] [⭐]

ICMEW

  • Skeleton based action recognition using translation-scale invariant image mapping and multi-scale deep CNN [paper]
  • Investigation of different skeleton features for CNN-based 3D action recognition [paper]
  • Skeleton-based action recognition using LSTM and CNN [paper]

TPAMI

  • Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates [paper] [code]

TIP

  • Skeleton-Based Human Action Recognition With Global Context-Aware Attention LSTM Networks [paper] [🔥]

PR

  • Learning discriminative trajectorylet detector sets for accurate skeleton-based action recognition [paper]
  • Enhanced skeleton visualization for view invariant human action recognition [paper] [🔥]

2016

CVPR

  • NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis [paper] [code] [🔥]
  • Rolling Rotations for Recognizing Human Actions from 3D Skeletal Data [paper]

ECCV

  • Temporal segment networks: Towards good practices for deep action recognition [paper] [code] [🔥] [⭐]
  • Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition [paper] [🔥]

AAAI

  • Co-occurrence Feature Learning for Skeleton based Action Recognition using Regularized Deep LSTM Networks [paper] [🔥]

ACM MM

  • Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks [paper]

TIP

  • Representation Learning of Temporal Dynamics for Skeleton-Based Action Recognition [paper]

TMM

  • Discriminative Multi-instance Multitask Learning for 3D Action Recognition [paper]

TCSVT

  • Skeleton Optical Spectra-Based Action Recognition Using Convolutional Neural Networks [paper]

2015

CVPR

  • Hierarchical Recurrent Neural Network for Skeleton Based Action Recognition [paper] [🔥]
  • Jointly learning heterogeneous features for RGB-D activity recognition [paper] [🔥]

ICCV

  • Learning Spatiotemporal Features with 3D Convolutional Networks [paper] [code] [🔥]

TPAMI

  • Multimodal Multipart Learning for Action Recognition in Depth Videos [paper]

TMM

  • Effective Active Skeleton Representation for Low Latency Human Action Recognition [paper]

Neurocomputing

  • Skeleton-based action recognition with extreme learning machines [paper]

2014

CVPR

  • Cross-view Action Modeling, Learning and Recognition [paper] [🔥]
  • Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group [paper] [🔥]

NeurIPS

  • Two-Stream Convolutional Networks for Action Recognition in Videos [paper] [🔥]

Other Resources

With all the resources available on the github website, this paper list is comprehensive and recently updated.

Last update: Dec 2, 2024

Feel free to contact me if you find any interesting paper is missing.