We collect existing papers on skeleton-based action recognition published in prominent conferences and journals.
This paper list will be continuously updated at the end of each month.
- Human Action Recognition from Various Data Modalities: A Review (TPAMI 2022) [paper]
- Human action recognition and prediction: A survey (IJCV 2022) [paper]
- Transformer for Skeleton-based action recognition: A review of recent advances (Neurocomputing 2023) [paper]
- Action recognition based on RGB and skeleton data sets: A survey (Neurocomputing 2022) [paper]
- A Comparative Review of Recent Kinect-based Action Recognition Algorithms (TIP 2019) [paper]
- Self-Supervised Skeleton Action Representation Learning: A Benchmark and Beyond (2024 arXiv paper) [paper]
- A Comprehensive Methodological Survey of Human Activity Recognition Across Divers Data Modalities (2024 arXiv paper) [paper]
- ANUBIS: Review and Benchmark Skeleton-Based Action Recognition Methods with a New Dataset (2022 arXiv paper) [paper]
- A Survey on 3D Skeleton-Based Action Recognition Using Learning Method (2020 arXiv paper) [paper]
Statistics: 🔥 relatively highly cited | ⭐ code is available and star > 100
CVPR
- BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition [paper] [code]
- Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognition [paper] [code]
- Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning [paper] [code]
- LLMs are Good Action Recognizers [paper]
- MaskCLR: Attention-Guided Contrastive Learning for Robust Action Representation Learning [paper]
ECCV
- SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition [paper] [code]
- MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion [paper] [code]
- Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition [paper] [code]
- SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders [paper] [code]
- CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner [paper] [code]
- On the Utility of 3D Hand Poses for Action Recognition [paper] [code]
- Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph [paper] [code]
- VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViG [paper] [code]
- Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action Segmentation [paper] [code]
- S-JEPA: A Joint Embedding Predictive Architecture for Skeletal Action Recognition [paper]
- Towards Physical World Backdoor Attacks against Skeleton Action Recognition [paper]
NeurIPS
- CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition [paper] [code]
- Recovering Complete Actions for Cross-dataset Skeleton Action Recognition [paper] [code]
AAAI
- Dynamic Semantic-Based Spatial Graph Convolution Network for Skeleton-Based Human Action Recognition [paper] [code]
- SCD-Net: Spatiotemporal Clues Disentanglement Network for Self-supervised Skeleton-based Action Recognition [paper] [code]
- Navigating Open Set Scenarios for Skeleton-based Action Recognition [paper] [code]
- Behavioral Recognition of Skeletal Data Based on Targeted Dual Fusion Strategy [paper]
- Spatio-Temporal Fusion for Human Action Recognition via Joint Trajectory Graph [paper]
IJCAI
- Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action Recognition [paper] [code]
ACM MM
- Multi-Modality Co-Learning for Efficient Skeleton-based Action Recognition [paper] [code]
- Fine-Grained Side Information Guided Dual-Prompts for Zero-Shot Skeleton Action Recognition [paper] [code]
- Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer [paper]
CVPRW
- Efficient Skeleton-Based Action Recognition for Real-Time Embedded Systems [paper]
ICPR
ICIP
- Hierarchical Vertex-Wise Intensification Graph Convolution for Skeleton-Based Activity Recognition [paper]
- Cross-Action Cross-Subject Skeleton Action Recognition Via Simultaneous Action-Subject Learning With Two-Step Feature Removal [paper]
ICASSP
- Elevating Skeleton-Based Action Recognition with Efficient Multi-Modality Self-Supervision [paper] [code]
- Wavelet-Decoupling Contrastive Enhancement Network for Fine-Grained Skeleton-Based Action Recognition [paper]
- A Novel Contrastive Diffusion Graph Convolutional Network for Few-Shot Skeleton-Based Action Recognition [paper]
IROS
ICMEW
- HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition [paper] [code]
TPAMI
- InfoGCN++: Learning Representation by Predicting the Future for Online Skeleton-based Action Recognition [paper] [code]
- One-Shot Action Recognition via Multi-Scale Spatial-Temporal Skeleton Matching [paper]
IJCV
TIP
- DeGCN: Deformable Graph Convolutional Networks for Skeleton-Based Action Recognition [paper] [code]
- SelfGCN: Graph Convolution Network With Self-Attention for Skeleton-Based Action Recognition [paper] [code]
- Dynamic Semantic-based Spatial-Temporal Graph Convolution Network for Skeleton-based Human Action Recognition [paper] [code]
- Mutual Information Driven Equivariant Contrastive Learning for 3D Action Representation Learning [paper]
- Multi-View Time-Series Hypergraph Neural Network for Action Recognition [paper]
TMM
- Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning [paper] [code]
- Localized Linear Temporal Dynamics for Self-supervised Skeleton Action Recognition [paper]
- Hierarchical Aggregated Graph Neural Network for Skeleton-based Action Recognition [paper]
TCSVT
- SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition [paper] [code]
- Asynchronous Joint-based Temporal Pooling for Skeleton-based Action Recognition [paper] [code]
- Glimpse and Zoom: Spatio-Temporal Focused Dynamic Network for Skeleton-based Action Recognition [paper]
- Multi-scale Structural Graph Convolutional Network for Skeleton-based Action Recognition [paper]
- Decoupled Knowledge Embedded Graph Convolutional Network for Skeleton-based Human Action Recognition [paper]
- Global and Local Contrastive Learning for Self-supervised Skeleton-Based Action Recognition [paper]
- Motion-Aware Mask Feature Reconstruction for Skeleton-Based Action Recognition [paper]
- Asynchronous Joint-based Temporal Pooling for Skeleton-based Action Recognition [paper]
- Enhancing Skeleton-Based Action Recognition with Language Descriptions from Pre-trained Large Multimodal Models [paper]
- DSDC-GCN: Decoupled Static-Dynamic Co-occurrence Graph Convolutional Networks for Skeleton-Based Action Recognition [paper]
TNNLS
- Language-Guided 3-D Action Feature Learning Without Ground-Truth Sample Class Label [paper] [code]
- GRA: Graph Representation Alignment for Semi-Supervised Action Recognition [paper]
- Multi-Dimensional Refinement Graph Convolutional Network with Robust Decouple Loss for Fine-Grained Skeleton-Based Action Recognition [paper]
PR
- Improving self-supervised action recognition from extremely augmented skeleton sequences [paper] [code]
- Spatiotemporal Progressive Inward-Outward Aggregation Network for skeleton-based action recognition [paper]
Neurocomputing
- A motion-aware and temporal-enhanced Spatial–Temporal Graph Convolutional Network for skeleton-based human action segmentation [paper] [code]
- Independent Dual Graph Attention Convolutional Network for skeleton-based action recognition [paper]
- Representation modeling learning with multi-domain decoupling for unsupervised skeleton-based action recognition [paper]
- Multi-scale sampling attention graph convolutional networks for skeleton-based action recognition [paper]
- Modeling the skeleton-language uncertainty for 3D action recognition [paper]
- Prompt-supervised dynamic attention graph convolutional network for skeleton-based action recognition [paper]
- Language-guided temporal primitive modeling for skeleton-based action recognition [paper]
arXiv papers
- Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition [paper] [code]
- Topological Symmetry Enhanced Graph Convolution for Skeleton-Based Action Recognition [paper] [code]
- Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation [paper] [code]
- Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition [paper] [code]
- Graph in Graph Neural Network [paper] [code]
- EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition [paper] [code]
- Language Supervised Human Action Recognition with Salient Fusion: Construction Worker Action Recognition as a Use Case [paper] [code]
- AutoGCN - Towards Generic Human Activity Recognition with Neural Architecture Search [paper] [code]
- GCN-DevLSTM: Path Development for Skeleton-Based Action Recognition [paper] [code]
- Active Generation Network of Human Skeleton for Action Recognition [paper] [code]
- STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences [paper] [code]
- Spatial Hierarchy and Temporal Attention Guided Cross Masking for Self-supervised Skeleton-based Action Recognition [paper] [code]
- TDSM:Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition [paper] [code]
- Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition [paper] [code]
- A Two-stream Hybrid CNN-Transformer Network for Skeleton-based Human Interaction Recognition [paper]
- Learning Mutual Excitation for Hand-to-Hand and Human-to-Human Interaction Recognition [paper]
- Skeleton2vec: A Self-supervised Learning Framework with Contextualized Target Representations for Skeleton Sequence [paper]
- Unsupervised Spatial-Temporal Feature Enrichment and Fidelity Preservation Network for Skeleton based Action Recognition [paper]
- Multi-Scale Spatial-Temporal Self-Attention Graph Convolutional Networks for Skeleton-based Action Recognition [paper]
- Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos [paper]
- MK-SGN: A Spiking Graph Convolutional Network with Multimodal Fusion and Knowledge Distillation for Skeleton-based Action Recognition [paper]
- An Improved Graph Pooling Network for Skeleton-Based Action Recognition [paper]
- Action-OOD: An End-to-End Skeleton-Based Model for Robust Out-of-Distribution Human Action Detection [paper]
- An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition [paper]
- LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition [paper]
- Skeleton-Based Action Recognition with Spatial-Structural Graph Convolution [paper]
- Signal-SGN: A Spiking Graph Convolutional Network for Skeletal Action Recognition via Learning Temporal-Frequency Dynamics [paper]
- TASAR: Transferable Attack on Skeletal Action Recognition [paper]
- Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment [paper]
- Language-Assisted Human Part Motion Learning for Skeleton-Based Temporal Action Segmentation [paper]
- Explaining Human Activity Recognition with SHAP: Validating Insights with Perturbation and Quantitative Measures [paper]
- Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections [paper]
- SkelMamba: A State Space Model for Efficient Skeleton Action Recognition of Neurological Disorders [paper]
CVPR
- Learning Discriminative Representations for Skeleton Based Action Recognition [paper] [code]
- Neural Koopman Pooling: Control-Inspired Temporal Dynamics Encoding for Skeleton-Based Action Recognition [paper] [code]
- Actionlet-Dependent Contrastive Learning for Unsupervised Skeleton-Based Action Recognition [paper] [code]
- HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised Learning of Actions [paper] [code]
- 3Mformer: Multi-order Multi-mode Transformer for Skeletal Action Recognition [paper]
- Unified Pose Sequence Modeling [paper]
- Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling [paper]
- Prompt-Guided Zero-Shot Anomaly Action Recognition using Pretrained Deep Skeleton Features [paper]
ICCV
- Hierarchically Decomposed Graph Convolutional Networks for Skeleton-Based Action Recognition [paper] [code]
- Leveraging Spatio-Temporal Dependency for Skeleton-Based Action Recognition [paper] [code]
- Generative Action Description Prompts for Skeleton-based Action Recognition [paper] [code]
- Masked Motion Predictors are Strong 3D Action Representation Learners [paper] [code]
- SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training [paper] [code]
- MotionBERT: A Unified Perspective on Learning Human Motion Representations [paper] [code]
- Parallel Attention Interaction Network for Few-Shot Skeleton-based Action Recognition [paper] [code]
- Modeling the Relative Visual Tempo for Self-supervised Skeleton-based Action Recognition [paper] [code]
- FSAR: Federated Skeleton-based Action Recognition with Adaptive Topology Structure and Knowledge Distillation [paper] [code]
- Hard No-Box Adversarial Attack on Skeleton-Based Human Action Recognition with Skeleton-Motion-Informed Gradient [paper] [code]
- LAC - Latent Action Composition for Skeleton-based Action Segmentation [paper] [code]
- SkeleTR: Towards Skeleton-based Action Recognition in the Wild [paper]
- Cross-Modal Learning with 3D Deformable Attention for Action Recognition [paper]
ICML
ICLR
- Graph Contrastive Learning for Skeleton-based Action Recognition [paper] [code]
- Hyperbolic Self-paced Learning for Self-supervised Skeleton-based Action Representations [paper] [code]
AAAI
- Hierarchical Consistent Contrastive Learning for Skeleton-Based Action Recognition with Growing Augmentations [paper] [code]
- Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences [paper] [code]
- Frame-Level Label Refinement for Skeleton-Based Weakly-Supervised Action Recognition [paper] [code]
- Hierarchical Contrast for Unsupervised Skeleton-based Action Representation Learning [paper] [code]
- Anonymization for Skeleton Action Recognition [paper] [code]
- Defending Black-box Skeleton-based Human Activity Classifiers [paper] [code]
- Novel Motion Patterns Matter for Practical Skeleton-based Action Recognition [paper]
- Self-Supervised Learning for Multilevel Skeleton-Based Forgery Detection via Temporal-Causal Consistency of Actions [paper]
ACM MM
- Skeleton MixFormer: Multivariate Topology Representation for Skeleton-based Action Recognition [paper] [code]
- Prompted Contrast with Masked Motion Modeling: Towards Versatile 3D Action Representation Learning [paper] [code]
- Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action Understanding [paper] [code]
- Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization [paper] [code]
- Self-Relational Graph Convolution Network for Skeleton-Based Action Recognition [paper]
- Skeletal Spatial-Temporal Semantics Guided Homogeneous-Heterogeneous Multimodal Network for Action Recognition [paper]
- Occluded Skeleton-Based Human Action Recognition with Dual Inhibition Training [paper]
IJCAI
- Part Aware Contrastive Learning for Self-Supervised Action Recognition [paper] [code]
- Action Recognition with Multi-stream Motion Modeling and Mutual Information Maximization [paper]
ICCVW
- A Lightweight Skeleton-Based 3D-CNN for Real-Time Fall Detection and Action Recognition [paper]
BMVC
- STEP CATFormer: Spatial-Temporal Effective Body-Part Cross Attention Transformer for Skeleton-based Action Recognition [paper] [code]
WACV
- Adaptive Local-Component-aware Graph Convolutional Network for One-shot Skeleton-based Action Recognition [paper]
- STAR-Transformer: A Spatio-Temporal Cross Attention Transformer for Human Action Recognition [paper]
ICIP
- Temporal-Channel Topology Enhanced Network for Skeleton-Based Action Recognition [paper] [code]
- Part Aware Graph Convolution Network with Temporal Enhancement for Skeleton-Based Action Recognition [paper]
- Skeleton Action Recognition Based on Spatio-Temporal Features [paper]
ICME
- DD-GCN: Directed Diffusion Graph Convolutional Network for Skeleton-based Human Action Recognition [paper] [code]
- Dynamic Spatial-temporal Hypergraph Convolutional Network for Skeleton-based Action Recognition [paper]
WACVW
- Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art [paper]
ICMEW
- SkeletonMAE: Spatial-Temporal Masked Autoencoders for Self-supervised Skeleton Action Recognition [paper]
ICASSP
- Body Prior Guided Graph Convolutional Neural Network for Skeleton-Based Action Recognition [paper] [code]
IROS
- Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition [paper] [code]
TPAMI
- Self-Supervised 3D Action Representation Learning with Skeleton Cloud Colorization [paper]
TIP
- DMMG: Dual Min-Max Games for Self-Supervised Skeleton-Based Action Recognition [paper]
TMM
- Delving Deep into One-Shot Skeleton-based Action Recognition with Diverse Occlusions [paper] [code]
- Temporal Decoupling Graph Convolutional Network for Skeleton-based Gesture Recognition [paper] [code]
- Skeleton-based Action Recognition through Contrasting Two-Stream Spatial-Temporal Networks [paper]
- Learning Representations by Contrastive Spatio-temporal Clustering for Skeleton-based Action Recognition [paper]
- Skeleton-Based Gesture Recognition With Learnable Paths and Signature Features [paper]
- Skeleton-Based Action Recognition with Select-Assemble-Normalize Graph Convolutional Networks [paper]
- Joints-Centered Spatial-Temporal Features Fused Skeleton Convolution Network for Action Recognition [paper]
TCSVT
- Motion Complement and Temporal Multifocusing for Skeleton-based Action Recognition [paper] [code]
- TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition [paper] [code]
TNNLS
- Spatiotemporal Decouple-and-Squeeze Contrastive Learning for Semi-Supervised Skeleton-based Action Recognition [paper]
- Learning Heterogeneous Spatial–Temporal Context for Skeleton-Based Action Recognition [paper]
- Self-Adaptive Graph With Nonlocal Attention Network for Skeleton-Based Action Recognition [paper]
PR
- Continual spatio-temporal graph convolutional networks [paper] [code]
- Relation-mining self-attention network for skeleton-based human action recognition [paper] [code]
- SpatioTemporal Focus for Skeleton-based Action Recognition [paper]
- Multi-grained clip focus for skeleton-based action recognition [paper]
Neurocomputing
- SPAR: An efficient self-attention network using Switching Partition Strategy for skeleton-based action recognition [paper] [code]
- Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition [paper]
- Spatio-temporal segments attention for skeleton-based action recognition [paper]
- STDM-transformer: Space-time dual multi-scale transformer network for skeleton-based action recognition [paper]
arXiv papers
- Language Knowledge-Assisted Representation Learning for Skeleton-Based Action Recognition [paper] [code]
- TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potential [paper] [code]
- High-Performance Inference Graph Convolutional Networks for Skeleton-Based Action Recognition [paper] [code]
- Spatial-Temporal Decoupling Contrastive Learning for Skeleton-based Human Action Recognition [paper] [code]
- Hulk: A Universal Knowledge Translator for Human-Centric Tasks [paper] [code]
- Balanced Representation Learning for Long-tailed Skeleton-based Action Recognition [paper] [code]
- Joint Adversarial and Collaborative Learning for Self-Supervised Action Recognition [paper] [code]
- Unveiling the Hidden Realm: Self-supervised Skeleton-based Action Recognition in Occluded Environments [paper] [code]
- Pyramid Self-attention Polymerization Learning for Semi-supervised Skeleton-based Action Recognition [paper] [code]
- Skeleton-based Human Action Recognition via Convolutional Neural Networks (CNN) [paper]
- Cross-view Action Recognition via Contrastive View-invariant Representation [paper]
- Attack is Good Augmentation: Towards Skeleton-Contrastive Representation Learning [paper]
- I2MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation [paper]
- Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition [paper]
- Modiff: Action-Conditioned 3D Motion Generation with Denoising Diffusion Probabilistic Models [paper]
- Skeleton-based action analysis for ADHD diagnosis [paper]
- Fine-grained Action Analysis: A Multi-modality and Multi-task Dataset of Figure Skating [paper]
CVPR
- InfoGCN: Representation Learning for Human Skeleton-based Action Recognition [paper] [code] [🔥]
- Revisiting Skeleton-based Action Recognition [paper] [code] [🔥] [⭐]
ECCV
- CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation [paper] [code]
- Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning [paper] [code]
- Collaborating Domain-shared and Target-specific Feature Clustering for Cross-domain 3D Action Recognition [paper] [code]
- Global-local Motion Transformer for Unsupervised Skeleton-based Action Learning [paper] [code]
- IGFormer: Interaction Graph Transformer for Skeleton-Based Human Interaction Recognition [paper]
- Contrastive Positive Mining for Unsupervised 3D Action Representation Learning [paper]
- Learning Spatial-Preserved Skeleton Representations for Few-Shot Action Recognition [paper]
- Uncertainty-DTW for Time Series and Sequences [paper]
AAAI
- Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition [paper] [code] [🔥]
- Topology-aware Convolutional Neural Network for Efficient Skeleton-based Action Recognition [paper] [code] [🔥] [⭐]
- Towards To-a-T Spatio-Temporal Focus for Skeleton-Based Action Recognition [paper]
ACM MM
- PYSKL: Towards Good Practices for Skeleton Action Recognition [paper] [code] [⭐]
- Shifting Perspective to See Difference: A Novel Multi-View Method for Skeleton based Action Recognition [paper] [code]
- Skeleton-based Action Recognition via Adaptive Cross-Form Learning [paper] [code]
- Global-Local Cross-View Fisher Discrimination for View-Invariant Action Recognition [paper]
CVPRW
- Bootstrapped Representation Learning for Skeleton-Based Action Recognition [paper]
ECCVW
- Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks [paper] [code]
- PSUMNet: Unified Modality Part Streams are All You Need for Efficient Pose-based Action Recognition [paper] [code]
- Strengthening Skeletal Action Recognizers via Leveraging Temporal Patterns [paper]
ACCV
- Focal and Global Spatial-Temporal Transformer for Skeleton-based Action Recognition [paper]
- Temporal-Viewpoint Transportation Plan for Skeletal Few-shot Action Recognition [paper]
WACV
- Skeleton-DML: Deep Metric Learning for Skeleton-Based One-Shot Action Recognition [paper] [code]
- Generative Adversarial Graph Convolutional Networks for Human Action Synthesis [paper] [code]
ICPR
- Skeletal Human Action Recognition using Hybrid Attention based Graph Convolutional Network [paper]
TPAMI
- Constructing Stronger and Faster Baselines for Skeleton-based Action Recognition [paper] [code] [🔥]
- Motif-GCNs With Local and Non-Local Temporal Blocks for Skeleton-Based Action Recognition [paper] [code]
- Multi-Granularity Anchor-Contrastive Representation Learning for Semi-Supervised Skeleton-Based Action Recognition [paper] [code]
IJCV
- Action2video: Generating Videos of Human 3D Actions [paper]
TIP
- Contrast-reconstruction Representation Learning for Self-supervised Skeleton-based Action Recognition [paper] [code]
- Multilevel Spatial–Temporal Excited Graph Network for Skeleton-Based Action Recognition [paper] [code]
- SMAM: Self and Mutual Adaptive Matching for Skeleton-Based Few-Shot Action Recognition [paper]
- X-Invariant Contrastive Augmentation and Representation Learning for Semi-Supervised Skeleton-Based Action Recognition [paper]
TMM
- Skeleton-Based Mutually Assisted Interacted Object Localization and Human Action Recognition [paper]
- Joint-bone Fusion Graph Convolutional Network for Semi-supervised Skeleton Action Recognition [paper]
TCSVT
- Two-person Graph Convolutional Network for Skeleton-based Human Interaction Recognition [paper] [code]
- Zoom Transformer for Skeleton-Based Group Activity Recognition [paper] [code]
- Motion Guided Attention Learning for Self-Supervised 3D Human Action Recognition [paper]
- Motion-Driven Spatial and Temporal Adaptive High-Resolution Graph Convolutional Networks for Skeleton-Based Action Recognition [paper]
- View-Normalized and Subject-Independent Skeleton Generation for Action Recognition [paper]
TNNLS
- Fusing Higher-Order Features in Graph Neural Networks for Skeleton-Based Action Recognition [paper] [code]
Neurocomputing
- Forward-reverse adaptive graph convolutional networks for skeleton-based action recognition [paper] [code]
- AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement [paper]
- Hierarchical graph attention network with pseudo-metapath for skeleton-based action recognition [paper]
- Skeleton-based similar action recognition through integrating the salient image feature into a center-connected graph convolutional network [paper]
- PB-GCN: Progressive binary graph convolutional networks for skeleton-based action recognition [paper]
arXiv papers
- Hypergraph Transformer for Skeleton-based Action Recognition [paper] [code]
- DG-STGCN: Dynamic Spatial-Temporal Modeling for Skeleton-based Action Recognition [paper] [code]
- Spatio-Temporal Tuples Transformer for Skeleton-Based Action Recognition [paper] [code]
- Contrastive Learning from Spatio-Temporal Mixed Skeleton Sequences for Self-Supervised Skeleton-Based Action Recognition [paper] [code]
- HAA4D: Few-Shot Human Atomic Action Recognition via 3D Spatio-Temporal Skeletal Alignment [paper] [code]
- Skeleton-based Action Recognition Via Temporal-Channel Aggregation [paper]
- A New Spatial Adjacency Matrix of Skeleton Data Based on Self-loop and Adaptive Weights [paper]
- View-Invariant Skeleton-based Action Recognition via Global-Local Contrastive Learning [paper]
CVPR
- 3D Human Action Representation Learning via Cross-View Consistency Pursuit [paper] [code] [🔥]
- BASAR:Black-box Attack on Skeletal Action Recognition [paper] [code]
- Understanding the Robustness of Skeleton-based Action Recognition under Adversarial Attack [paper] [code]
ICCV
- Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition [paper] [code] [🔥] [⭐]
- AdaSGN: Adapting Joint Number and Model Size for Efficient Skeleton-Based Action Recognition [paper] [code]
- Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning [paper]
- Self-supervised 3D Skeleton Action Representation Learning with Motion Consistency and Continuity [paper]
NeurIPS
AAAI
- Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition [paper] [code] [🔥]
- Spatio-Temporal Difference Descriptor for Skeleton-Based Action Recognition [paper]
ACM MM
- Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition [paper] [code]
- STST: Spatial-Temporal Specialized Transformer for Skeleton-based Action Recognition [paper] [code]
- Skeleton-Contrastive 3D Action Representation Learning [paper] [code]
- Modeling the Uncertainty for Self-supervised 3D Skeleton Action Representation Learning [paper]
CVPRW
BMVC
- UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition [paper] [code]
- Unsupervised Human Action Recognition with Skeletal Graph Laplacian and Self-Supervised Viewpoints Invariance [paper] [code]
- LSTA-Net: Long short-term Spatio-Temporal Aggregation Network for Skeleton-based Action Recognition [paper]
WACV
- JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition [paper]
ICPR
- Learning Connectivity with Graph Convolutional Networks for Skeleton-based Action Recognition [paper]
ICPRW
ICIP
ICME
- Graph Convolutional Hourglass Networks for Skeleton-Based Action Recognition [paper]
ICRA
TPAMI
- Symbiotic Graph Neural Networks for 3D Skeleton-Based Human Action Recognition and Motion Prediction [paper] [🔥]
- Tensor Representations for Action Recognition [paper]
IJCV
TIP
- Extremely Lightweight Skeleton-Based Action Recognition with ShiftGCN++ [paper] [code]
- Structural Knowledge Distillation for Efficient Skeleton-Based Action Recognition [paper] [code]
- Feedback Graph Convolutional Network for Skeleton-Based Action Recognition [paper]
- Hypergraph Neural Network for Skeleton-Based Action Recognition [paper]
TIFS
TMM
- Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton Based Action Recognition [paper] [code]
- Interaction Relational Network for Mutual Action Recognition [paper] [code]
- LAGA-Net: Local-and-Global Attention Network for Skeleton Based Action Recognition [paper]
- A Multi-Stream Graph Convolutional Networks-Hidden Conditional Random Field Model for Skeleton-Based Action Recognition [paper]
- Multi-Localized Sensitive Autoencoder-Attention-LSTM For Skeleton-based Action Recognition [paper]
- Dear-Net: Learning Diversities for Skeleton-Based Early Action Recognition [paper]
- Efficient Spatio-Temporal Contrastive Learning for Skeleton-Based 3-D Action Recognition [paper]
- GA-Net: A Guidance Aware Network for Skeleton-Based Early Activity Recognition [paper]
TCSVT
- Fuzzy Integral-Based CNN Classifier Fusion for 3D Skeleton Action Recognition [paper] [code]
- A Central Difference Graph Convolutional Operator for Skeleton-Based Action Recognition [paper] [code]
- Multi-Stream Interaction Networks for Human Action Recognition [paper]
- A Cross View Learning Approach for Skeleton-Based Action Recognition [paper]
- Symmetrical Enhanced Fusion Network for Skeleton-Based Action Recognition [paper]
- Graph2Net: Perceptually-enriched graph learning for skeleton-based action recognition [paper]
TNNLS
PR
- Arbitrary-view human action recognition via novel-view action generation [paper] [code]
- Tripool: Graph triplet pooling for 3D skeleton-based action recognition [paper]
- Action recognition using kinematics posture feature on 3D skeleton joint locations [paper]
- Scene image and human skeleton-based dual-stream human action recognition [paper]
- Dyadic relational graph convolutional networks for skeleton-based human interaction recognition [paper]
Neurocomputing
- Rethinking the ST-GCNs for 3D skeleton-based human action recognition [paper]
- Attention adjacency matrix based graph convolutional networks for skeleton-based action recognition [paper]
- Skeleton-based action recognition using sparse spatio-temporal GCN with edge effective resistance [paper]
- Integrating vertex and edge features with Graph Convolutional Networks for skeleton-based action recognition [paper]
- Adaptive multi-view graph convolutional networks for skeleton-based action recognition [paper]
- Knowledge embedded GCN for skeleton-based two-person interaction recognition [paper]
- Normal graph: Spatial temporal graph convolutional networks based prediction network for skeleton based video anomaly detection [paper]
arXiv papers
- STAR: Sparse Transformer-based Action Recognition [paper] [code]
- Self-attention based anchor proposal for skeleton-based action recognition [paper] [code]
- Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition [paper]
- 3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Na¨ıve [paper]
CVPR
- Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition [paper] [code] [🔥] [⭐]
- Skeleton-Based Action Recognition with Shift Graph Convolutional Network [paper] [code] [🔥] [⭐]
- Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition [paper] [code] [🔥] [⭐]
- PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition [paper] [code] [⭐]
- Dynamic Multiscale Graph Neural Networks for 3D Skeleton Based Human Motion Prediction [paper] [code] [🔥] [⭐]
- Context Aware Graph Convolution for Skeleton-Based Action Recognition [paper]
ECCV
- Decoupling GCN with DropGraph Module for Skeleton-Based Action Recognition [paper] [code] [🔥]
- Unsupervised 3D Human Pose Representation with Viewpoint and Pose Disentanglement [paper] [code]
- DDGCN: A Dynamic Directed Graph Convolutional Network for Action Recognition [paper]
- Adversarial Self-supervised Learning for Semi-supervised 3D Action Recognition [paper]
AAAI
- Learning Graph Convolutional Network for Skeleton-based Human Action Recognition by Neural Searching [paper] [code] [🔥] [⭐]
- Part-Level Graph Convolutional Network for Skeleton-Based Action Recognition [paper]
- Learning Diverse Stochastic Human-Action Generators by Learning Smooth Latent Transitions [paper]
ACM MM
- Stronger, Faster and More Explainable: A Graph Convolutional Baseline for Skeleton-based Action Recognition [paper] [code] [🔥]
- Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition [paper] [code] [⭐]
- Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition [paper] [code]
- MS2L: Multi-Task Self-Supervised Learning for Skeleton Based Action Recognition [paper] [code]
- Action2Motion: Conditioned Generation of 3D Human Motions [paper] [code] [⭐]
- Group-Skeleton-Based Human Action Recognition in Complex Events [paper]
- Mix Dimension in Poincaré Geometry for 3D Skeleton-based Action Recognition [paper]
NIPSW
- Contrastive Self-Supervised Learning for Skeleton Action Recognition [paper]
ACCV
- Decoupled Spatial-Temporal Attention Network for Skeleton-Based Action-Gesture Recognition [paper]
TPAMI
- Learning Multi-View Interactional Skeleton Graph for Action Recognition [paper] [code]
- Multi-Task Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition [paper] [code] [⭐]
TIP
- Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks [paper] [code] [🔥] [⭐]
TMM
- Hierarchical Soft Quantization for Skeleton-Based Human Action Recognition [paper]
- Deep Manifold-to-Manifold Transforming Network for Skeleton-Based Action Recognition [paper]
TCSVT
- Richly Activated Graph Convolutional Network for Robust Skeleton-based Action Recognition [paper] [code]
TNNLS
- Adversarial Attack on Skeleton-Based Human Action Recognition [paper]
TOMM
- A Benchmark Dataset and Comparison Study for Multi-modal Human Action Analytics [paper]
PR
- Skeleton-based action recognition with hierarchical spatial reasoning and temporal stack learning network [paper]
Neurocomputing
- Exploring a rich spatial–temporal dependent relational model for skeleton-based action recognition by bidirectional LSTM-CNN [paper]
- HDS-SP: A novel descriptor for skeleton-based human action recognition [paper]
CVPR
- Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition [paper] [code] [🔥] [⭐]
- Actional-Structural Graph Convolutional Networks for Skeleton-based Action Recognition [paper] [code] [🔥] [⭐]
- Skeleton-Based Action Recognition with Directed Graph Neural Networks [paper] [code] [🔥] [⭐]
- Bayesian Hierarchical Dynamic Model for Human Action Recognition [paper] [code]
- An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition [paper] [🔥]
ICCV
- Bayesian Graph Convolution LSTM for Skeleton Based Action Recognition [paper]
- Making the Invisible Visible: Action Recognition Through Walls and Occlusions [paper]
AAAI
- Graph CNNs with Motif and Variable Temporal Block for Skeleton-Based Action Recognition [paper] [code]
- Spatio-Temporal Graph Routing for Skeleton-Based Action Recognition [paper]
CVPRW
- Three-Stream Convolutional Neural Network With Multi-Task and Ensemble Learning for 3D Action Recognition [paper]
ICCVW
- Spatial Residual Layer and Dense Connection Block Enhanced Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition [paper]
WACV
- Unsupervised Feature Learning of Human Actions As Trajectories in Pose Embedding Manifold [paper]
ICIP
- Richly Activated Graph Convolutional Network for Action Recognition with Incomplete Skeletons [paper] [code]
ICME
- Skeleton-Based Action Recognition with Synchronous Local and Non-local Spatio-temporal Learning and Frequency Attention [paper]
- Relational Network for Skeleton-Based Action Recognition [paper]
TPAMI
- NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding [paper] [code] [🔥]
- View Adaptive Neural Networks for High Performance Skeleton-based Human Action Recognition [paper] [code] [🔥] [⭐]
TIP
- Sample Fusion Network: An End-to-End Data Augmentation Network for Skeleton-Based Human Action Recognition [paper] [code]
- View-Invariant Human Action Recognition Based on a 3D Bio-Constrained Skeleton Model [paper] [code]
- EleAtt-RNN: Adding Attentiveness to Neurons in Recurrent Neural Networks [paper]
- Learning Latent Global Network for Skeleton-Based Action Prediction [paper]
TMM
- 2-D Skeleton-Based Action Recognition via Two-Branch Stacked LSTM-RNNs [paper]
- A Cuboid CNN Model With an Attention Mechanism for Skeleton-Based Action Recognition [paper]
- Joint Learning in the Spatio-Temporal and Frequency Domains for Skeleton-Based Action Recognition [paper]
TCSVT
- Action Recognition Scheme Based on Skeleton Representation With DS-LSTM Network [paper]
TNNLS
- Graph Edge Convolutional Neural Networks for Skeleton-Based Action Recognition [paper]
Neurocomputing
- Convolutional relation network for skeleton-based action recognition [paper]
CVPR
- Recognizing Human Actions as the Evolution of Pose Estimation Maps [paper] [code]
- Independently Recurrent Neural Network (IndRNN): Building a Longer and Deeper RNN [paper] [code] [🔥] [⭐]
- 2D/3D Pose Estimation and Action Recognition Using Multitask Deep Learning [paper] [code] [🔥] [⭐]
- Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition [paper] [🔥]
ECCV
- Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack [paper] [🔥]
- Adding Attentiveness to the Neurons in Recurrent Neural Networks [paper]
AAAI
- Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition [paper [code] [🔥] [⭐]
- Unsupervised Representation Learning With Long-Term Dynamics for Skeleton Based Action Recognition [paper] [code]
- Spatio-Temporal Graph Convolution for Skeleton Based Action Recognition [paper]
ACM MM
- Optimized Skeleton-based Action Recognition via Sparsified Graph Regression [paper]
- A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition [paper]
IJCAI
- Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation [paper] [code] [🔥] [⭐]
- Memory Attention Networks for Skeleton-based Action Recognition [paper] [code]
BMVC
- Part-based Graph Convolutional Network for Action Recognition [paper] [code]
- A Fine-to-Coarse Convolutional Neural Network for 3D Human Action Recognition [paper]
ICIP
- Joints Relation Inference Network for Skeleton-Based Action Recognition [paper]
ICME
- Skeleton-Based Human Action Recognition Using Spatial Temporal 3D Convolutional Neural Networks [paper]
TIP
- Beyond Joints: Learning Representations From Primitive Geometries for Skeleton-Based Action Recognition and Detection [paper] [code]
- Learning Clip Representations for Skeleton-Based 3D Action Recognition [paper]
TMM
- Attention-Based Multiview Re-Observation Fusion Network for Skeletal Action Recognition [paper]
- Fusing Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks [paper]
TCSVT
- Skeleton-Based Action Recognition With Gated Convolutional Neural Networks [paper]
- Action Recognition With Spatio–Temporal Visual Attention on Skeleton Image Sequences [paper]
PR
- Learning content and style: Joint action recognition and person identification from human skeletons [paper]
CVPR
- Deep Learning on Lie Groups for Skeleton-based Action Recognition [paper] [code]
- Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks [paper]
- Global Context-Aware Attention LSTM Networks for 3D Action Recognition [paper] [🔥]
- A New Representation of Skeleton Sequences for 3D Action Recognition [paper] [🔥]
ICCV
- View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition from Skeleton Data [paper] [code] [🔥] [⭐]
- Ensemble Deep Learning for Skeleton-Based Action Recognition Using Temporal Sliding LSTM Networks [paper] [code]
- Learning Action Recognition Model From Depth and Skeleton Videos [paper]
AAAI
- An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data [paper] [🔥]
CVPRW
ICMEW
- Skeleton based action recognition using translation-scale invariant image mapping and multi-scale deep CNN [paper]
- Investigation of different skeleton features for CNN-based 3D action recognition [paper]
- Skeleton-based action recognition using LSTM and CNN [paper]
TPAMI
- Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates [paper] [code]
TIP
- Skeleton-Based Human Action Recognition With Global Context-Aware Attention LSTM Networks [paper] [🔥]
PR
- Learning discriminative trajectorylet detector sets for accurate skeleton-based action recognition [paper]
- Enhanced skeleton visualization for view invariant human action recognition [paper] [🔥]
CVPR
- NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis [paper] [code] [🔥]
- Rolling Rotations for Recognizing Human Actions from 3D Skeletal Data [paper]
ECCV
- Temporal segment networks: Towards good practices for deep action recognition [paper] [code] [🔥] [⭐]
- Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition [paper] [🔥]
AAAI
- Co-occurrence Feature Learning for Skeleton based Action Recognition using Regularized Deep LSTM Networks [paper] [🔥]
ACM MM
- Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks [paper]
TIP
- Representation Learning of Temporal Dynamics for Skeleton-Based Action Recognition [paper]
TMM
- Discriminative Multi-instance Multitask Learning for 3D Action Recognition [paper]
TCSVT
- Skeleton Optical Spectra-Based Action Recognition Using Convolutional Neural Networks [paper]
CVPR
- Hierarchical Recurrent Neural Network for Skeleton Based Action Recognition [paper] [🔥]
- Jointly learning heterogeneous features for RGB-D activity recognition [paper] [🔥]
ICCV
TPAMI
- Multimodal Multipart Learning for Action Recognition in Depth Videos [paper]
TMM
- Effective Active Skeleton Representation for Low Latency Human Action Recognition [paper]
Neurocomputing
- Skeleton-based action recognition with extreme learning machines [paper]
CVPR
- Cross-view Action Modeling, Learning and Recognition [paper] [🔥]
- Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group [paper] [🔥]
NeurIPS
- Two-Stream Convolutional Networks for Action Recognition in Videos [paper] [🔥]
With all the resources available on the github website, this paper list is comprehensive and recently updated.
- niais/Awesome-Skeleton-based-Action-Recognition
- Kali-Hac/Awesome-Skeleton-Based-Models
- qbxlvnf11/skeleton-based-action-recognition-methods
- cagbal/Skeleton-Based-Action-Recognition-Papers-and-Notes
- XiaoCode-er/Skeleton-Based-Action-Recognition-Papers
- leviethung2103/awesome-skeleton-based-action-recognition
- fdu-wuyuan/Siren
- manjunath5496/Skeleton-based-Action-Recognition-Papers
- liaomingg/action_recognition_and_skeleton_detection_summary
- caglarmert/MOT-Research/wiki/Awesome-Action-Recognition
- shuangshuangguo/skeleton-based-action-recognition-review