ACMMM2021

  • Wen Gao. "Video Coding for Machine" | [Home Page] | [PDF]

  • H. V. Jagadish. "Semantic Media Conversion: Possibilities and Limits" | [Home Page] | [PDF]

  • Rong Zhang,Wei Li,Yiqun Zhang,Hong Zhang,Jinhui Yu,Ruigang Yang,Weiwei Xu. "Image Re-composition via Regional Content-Style Decoupling" | [Home Page] | [PDF]

  • Hao Huang,Shinjae Yoo,Chenxiao Xu. "Deep Clustering based on Bi-Space Association Learning" | [Home Page] | [PDF]

  • Seogkyu Jeon,Kibeom Hong,Pilhyeon Lee,Jewook Lee,Hyeran Byun. "Feature Stylization and Domain-aware Contrastive Learning for Domain Generalization" | [Home Page] | [PDF]

  • Qi Zhang,Xuesong Zhang,Baoping Li,Yuzhong Chen,Anlong Ming. "HDA-Net: Horizontal Deformable Attention Network for Stereo Matching" | [Home Page] | [PDF]

  • Zhaoyang Jia,Han Fang,Weiming Zhang. "MBRS: Enhancing Robustness of DNN-based Watermarking by Mini-Batch of Real and Simulated JPEG Compression" | [Home Page] | [PDF]

  • Ye Liu,Lei Zhu,Shunda Pei,Huazhu Fu,Jing Qin,Qing Zhang,Liang Wan,Wei Feng. "From Synthetic to Real: Image Dehazing Collaborating with Unlabeled Real Data" | [Home Page] | [PDF]

  • Jiangtong Li,Wentao Wang,Junjie Chen,Li Niu,Jianlou Si,Chen Qian,Liqing Zhang. "Video Semantic Segmentation via Sparse Temporal Transformer" | [Home Page] | [PDF]

  • Yingchen Yu,Fangneng Zhan,Rongliang Wu,Jianxiong Pan,Kaiwen Cui,Shijian Lu,Feiying Ma,Xuansong Xie,Chunyan Miao. "Diverse Image Inpainting with Bidirectional and Autoregressive Transformers" | [Home Page] | [PDF]

  • Hanbang Liang,Xianxu Hou,Linlin Shen. "SSFlow: Style-guided Neural Spline Flows for Face Image Manipulation" | [Home Page] | [PDF]

  • Kotaro Kikuchi,Edgar Simo-Serra,Mayu Otani,Kota Yamaguchi. "Constrained Graphic Layout Generation via Latent Optimization" | [Home Page] | [PDF]

  • Xiaoya Zhang,Ling Zhou,Yong Li,Zhen Cui,Jin Xie,Jian Yang. "Transfer Vision Patterns for Multi-Task Pixel Learning" | [Home Page] | [PDF]

  • Yike Wu,Bo Zhang,Gang Yu,Weixi Zhang,Bin Wang,Tao Chen,Jiayuan Fan. "Object-aware Long-short-range Spatial Alignment for Few-Shot Fine-Grained Image Classification" | [Home Page] | [PDF]

  • Yunan Zhu,Haichuan Ma,Jialun Peng,Dong Liu,Zhiwei Xiong. "Recycling Discriminator: Towards Opinion-Unaware Image Quality Assessment Using Wasserstein GAN" | [Home Page] | [PDF]

  • Liangchen Song,Sheng Liu,Celong Liu,Zhong Li,Yuqi Ding,Yi Xu,Junsong Yuan. "Learning Kinematic Formulas from Multiple View Videos" | [Home Page] | [PDF]

  • Pingyue Zhang,Mengyue Wu,Heinrich Dinkel,Kai Yu. "DEPA: Self-Supervised Audio Embedding for Depression Detection" | [Home Page] | [PDF]

  • Zhaodong Kang,Jianing Li,Lin Zhu,Yonghong Tian. "Retinomorphic Sensing: A Novel Paradigm for Future Multimedia Computing" | [Home Page] | [PDF]

  • Haihan Duan,Jiaye Li,Sizheng Fan,Zhonghao Lin,Xiao Wu,Wei Cai. "Metaverse for Social Good: A University Campus Prototype" | [Home Page] | [PDF]

  • Yueqi Xie,Ka Leong Cheng,Qifeng Chen. "Enhanced Invertible Encoding for Learned Image Compression" | [Home Page] | [PDF]

  • Shihao Zhou,Mengxi Jiang,Shanshan Cai,Yunqi Lei. "DC-GNet: Deep Mesh Relation Capturing Graph Convolution Network for 3D Human Shape Reconstruction" | [Home Page] | [PDF]

  • Xun Cai,Jiajing Chai,Yanbo Gao,Shuai Li,Bo Zhu. "Deep Marginal Fisher Analysis based CNN for Image Representation and Classification" | [Home Page] | [PDF]

  • Yuanzhouhan Cao,Yidong Li,Haokui Zhang,Chao Ren,Yifan Liu. "Learning Structure Affinity for Video Depth Estimation" | [Home Page] | [PDF]

  • Jingjing Jiang,Ziyi Liu,Yifan Liu,Zhixiong Nan,Nanning Zheng. "X-GGM: Graph Generative Modeling for Out-of-distribution Generalization in Visual Question Answering" | [Home Page] | [PDF]

  • Aichun Zhu,Zijie Wang,Yifeng Li,Xili Wan,Jing Jin,Tian Wang,Fangqiang Hu,Gang Hua. "DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval" | [Home Page] | [PDF]

  • David D. Nguyen,Surya Nepal,Salil S. Kanhere. "Diverse Multimedia Layout Generation with Multi Choice Learning" | [Home Page] | [PDF]

  • Liangchen Liu,Xi Yang,Nannan Wang,Xinbo Gao. "Viewing from Frequency Domain: A DCT-based Information Enhancement Network for Video Person Re-Identification" | [Home Page] | [PDF]

  • Yingqing He,Yazhou Xing,Tianjia Zhang,Qifeng Chen. "Unsupervised Portrait Shadow Removal via Generative Priors" | [Home Page] | [PDF]

  • Yi Huang,Xiaoshan Yang,Changsheng Xu. "Multimodal Global Relation Knowledge Distillation for Egocentric Action Anticipation" | [Home Page] | [PDF]

  • Liuan Wang,Li Sun,Mingjie Zhang,Huigang Zhang,Wang Ping,Rong Zhou,Jun Sun. "Exploring Pathologist Knowledge for Automatic Assessment of Breast Cancer Metastases in Whole-slide Image" | [Home Page] | [PDF]

  • Duan Mingxing,Kenli Li,Lingxi Xie,Qi Tian,Bin Xiao. "Towards Multiple Black-boxes Attack via Adversarial Example Generation Network" | [Home Page] | [PDF]

  • Hao Feng,Yuechen Wang,Wengang Zhou,Jiajun Deng,Houqiang Li. "DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction" | [Home Page] | [PDF]

  • Yiyang Gan,Ruize Han,Liqiang Yin,Wei Feng,Song Wang. "Self-supervised Multi-view Multi-Human Association and Tracking" | [Home Page] | [PDF]

  • Hongwei Xue,Bei Liu,Huan Yang,Jianlong Fu,Houqiang Li,Jiebo Luo. "Learning Fine-Grained Motion Embedding for Landscape Animation" | [Home Page] | [PDF]

  • Ying Li,Hongwei Zhou,Yeyu Yin,Jiaquan Gao. "Multi-label Pattern Image Retrieval via Attention Mechanism Driven Graph Convolutional Network" | [Home Page] | [PDF]

  • Na Zheng,Xuemeng Song,Qingying Niu,Xue Dong,Yibing Zhan,Liqiang Nie. "Collocation and Try-on Network: Whether an Outfit is Compatible" | [Home Page] | [PDF]

  • Rishabh Baghel,Abhishek Trivedi,Tejas Ravichandran,Ravi Kiran Sarvadevabhatla. "MeronymNet: A Hierarchical Model for Unified and Controllable Multi-Category Object Generation" | [Home Page] | [PDF]

  • Akash Gupta,Padmaja Jonnalagedda,Bir Bhanu,Amit K. Roy-Chowdhury. "Ada-VSR: Adaptive Video Super-Resolution with Meta-Learning" | [Home Page] | [PDF]

  • Minha Kim,Shahroz Tariq,Simon S. Woo. "CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation" | [Home Page] | [PDF]

  • Xiaowen Ying,Xin Li,Mooi Choo Chuah. "SRNet: Spatial Relation Network for Efficient Single-stage Instance Segmentation in Videos" | [Home Page] | [PDF]

  • Zilong Shao,Siyang Song,Shashank Jaiswal,Linlin Shen,Michel Valstar,Hatice Gunes. "Personality Recognition by Modelling Person-specific Cognitive Processes using Graph Representation" | [Home Page] | [PDF]

  • Xiaopeng Guo,Zhijie Huang,Jie Gao,Mingyu Shang,Maojing Shu,Jun Sun. "Enhancing Knowledge Tracing via Adversarial Training" | [Home Page] | [PDF]

  • Gangyan Zeng,Yuan Zhang,Yu Zhou,Xiaomeng Yang. "Beyond OCR + VQA: Involving OCR into the Flow for Robust and Accurate TextVQA" | [Home Page] | [PDF]

  • Qing Guo,Xiaoguang Li,Felix Juefei-Xu,Hongkai Yu,Yang Liu,Song Wang. "JPGNet: Joint Predictive Filtering and Generative Network for Image Inpainting" | [Home Page] | [PDF]

  • Yihao Huang,Qing Guo,Felix Juefei-Xu,Lei Ma,Weikai Miao,Yang Liu,Geguang Pu. "AdvFilter: Predictive Perturbation-aware Filtering against Adversarial Attack via Multi-domain Learning" | [Home Page] | [PDF]

  • Zizheng Yan,Xianggang Yu,Yipeng Qin,Yushuang Wu,Xiaoguang Han,Shuguang Cui. "Pixel-level Intra-domain Adaptation for Semantic Segmentation" | [Home Page] | [PDF]

  • Xugong Qin,Yu Zhou,Youhui Guo,Dayan Wu,Zhihong Tian,Ning Jiang,Hongbin Wang,Weiping Wang. "Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection" | [Home Page] | [PDF]

  • Chuanjun Zheng,Daming Shi,Yukun Liu. "Windowing Decomposition Convolutional Neural Network for Image Enhancement" | [Home Page] | [PDF]

  • Weiming Zhuang,Yonggang Wen,Shuai Zhang. "Joint Optimization in Edge-Cloud Continuum for Federated Unsupervised Person Re-identification" | [Home Page] | [PDF]

  • Zehai Niu,Ke Lu,Jian Xue,Haifeng Ma,Runchen Wei. "Multi-view 3D Smooth Human Pose Estimation based on Heatmap Filtering and Spatio-temporal Information" | [Home Page] | [PDF]

  • Yu-Ke Li,Pin Wang,Mang Ye,Ching-Yao Chan. "Imitative Learning for Multi-Person Action Forecasting" | [Home Page] | [PDF]

  • Ruikang Xu,Zeyu Xiao,Mingde Yao,Yueyi Zhang,Zhiwei Xiong. "Stereo Video Super-Resolution via Exploiting View-Temporal Correlations" | [Home Page] | [PDF]

  • Jiawei Zhao,Yifan Zhao,Jia Li. "M3TR: Multi-modal Multi-label Recognition with Transformer" | [Home Page] | [PDF]

  • Luchuan Song,Bin Liu,Guojun Yin,Xiaoyi Dong,Yufei Zhang,Jia-Xuan Bai. "TACR-Net: Editing on Deep Video and Voice Portraits" | [Home Page] | [PDF]

  • Yixiong Zou,Shanghang Zhang,Guangyao Chen,Yonghong Tian,Kurt Keutzer,José M. F. Moura. "Annotation-Efficient Untrimmed Video Action Recognition" | [Home Page] | [PDF]

  • Hsiao-Han Lu,Shao-En Weng,Ya-Fan Yen,Hong-Han Shuai,Wen-Huang Cheng. "Face-based Voice Conversion: Learning the Voice behind a Face" | [Home Page] | [PDF]

  • Xiongwei Wu,Xin Fu,Ying Liu,Ee-Peng Lim,Steven C.H. Hoi,Qianru Sun. "A Large-Scale Benchmark for Food Image Segmentation" | [Home Page] | [PDF]

  • Guowen Zhang,Pingping Zhang,Jinqing Qi,Huchuan Lu. "HAT: Hierarchical Aggregation Transformers for Person Re-identification" | [Home Page] | [PDF]

  • Qinglin Liu,Haozhe Xie,Shengping Zhang,Bineng Zhong,Rongrong Ji. "Long-Range Feature Propagating for Natural Image Matting" | [Home Page] | [PDF]

  • Ansheng You,Chenglin Zhou,Qixuan Zhang,Lan Xu. "Towards Controllable and Photorealistic Region-wise Image Manipulation" | [Home Page] | [PDF]

  • Zhuangzi Li,Ge Li,Thomas Li,Shan Liu,Wei Gao. "Information-Growth Attention Network for Image Super-Resolution" | [Home Page] | [PDF]

  • Jiale Li,Hang Dai,Ling Shao,Yong Ding. "Anchor-free 3D Single Stage Detector with Mask-Guided Attention for Point Cloud" | [Home Page] | [PDF]

  • Xin Gao,Zhenjiang Liu,Zunlei Feng,Chengji Shen,Kairi Ou,Haihong Tang,Mingli Song. "Shape Controllable Virtual Try-on for Underwear Models" | [Home Page] | [PDF]

  • Zhiwei Chen,Liujuan Cao,Yunhang Shen,Feihong Lian,Yongjian Wu,Rongrong Ji. "E2Net: Excitative-Expansile Learning for Weakly Supervised Object Localization" | [Home Page] | [PDF]

  • Jiahao Wang,Yunhong Wang,Sheng Liu,Annan Li. "Few-shot Fine-Grained Action Recognition via Bidirectional Attention and Contrastive Meta-Learning" | [Home Page] | [PDF]

  • Yi Tan,Yanbin Hao,Xiangnan He,Yinwei Wei,Xun Yang. "Selective Dependency Aggregation for Action Classification" | [Home Page] | [PDF]

  • Wenbo Hu,Changgong Zhang,Fangneng Zhan,Lei Zhang,Tien-Tsin Wong. "Conditional Directed Graph Convolution for 3D Human Pose Estimation" | [Home Page] | [PDF]

  • Gangming Zhao. "Cross Chest Graph for Disease Diagnosis with Structural Relational Reasoning" | [Home Page] | [PDF]

  • Qi Wen,Shuang Li,Bingfeng Han,Yi Yuan. "ZiGAN: Fine-grained Chinese Calligraphy Font Generation via a Few-shot Style Transfer Approach" | [Home Page] | [PDF]

  • Hao Wang,Guosheng Lin,Steven C. H. Hoi,Chunyan Miao. "Cycle-Consistent Inverse GAN for Text-to-Image Synthesis" | [Home Page] | [PDF]

  • Hu Wang,Peng Chen,Bohan Zhuang,Chunhua Shen. "Fully Quantized Image Super-Resolution Networks" | [Home Page] | [PDF]

  • Haonan Zhang,Longjun Liu,Hengyi Zhou,Wenxuan Hou,Hongbin Sun,Nanning Zheng. "AKECP: Adaptive Knowledge Extraction from Feature Maps for Fast and Efficient Channel Pruning" | [Home Page] | [PDF]

  • Qiangqiang Wu,Jia Wan,Antoni B. Chan. "Dynamic Momentum Adaptation for Zero-Shot Cross-Domain Crowd Counting" | [Home Page] | [PDF]

  • Miao Zhang,Tingwei Liu,Yongri Piao,Shunyu Yao,Huchuan Lu. "Auto-MSFNet: Search Multi-scale Fusion Network for Salient Object Detection" | [Home Page] | [PDF]

  • Shengqi Huang,Wanqi Yang,Lei Wang,Luping Zhou,Ming Yang. "Few-shot Unsupervised Domain Adaptation with Image-to-Class Sparse Similarity Encoding" | [Home Page] | [PDF]

  • Xuanhan Wang,Lianli Gao,Yan Dai,Yixuan Zhou,Jingkuan Song. "Semantic-aware Transfer with Instance-adaptive Parsing for Crowded Scenes Pose Estimation" | [Home Page] | [PDF]

  • Haoyu Zhang,Meng Liu,Zan Gao,Xiaoqiang Lei,Yinglong Wang,Liqiang Nie. "Multimodal Dialog System: Relational Graph-based Context-aware Question Understanding" | [Home Page] | [PDF]

  • Jingwei Liao,Yanli Liu,Guanyu Xing,Housheng Wei,Jueyu Chen,Songhua Xu. "Shadow Detection via Predicting the Confidence Maps of Shadow Detection Methods" | [Home Page] | [PDF]

  • Pengxiang Su,Zhenguang Liu,Shuang Wu,Lei Zhu,Yifang Yin,Xuanjing Shen. "Motion Prediction via Joint Dependency Modeling in Phase Space" | [Home Page] | [PDF]

  • Hao Su,Jianwei Niu,Xuefeng Liu,Qingfeng Li,Ji Wan,Mingliang Xu. "Q-Art Code: Generating Scanning-robust Art-style QR Codes by Deformable Convolution" | [Home Page] | [PDF]

  • Wenbo Zhang,Ge-Peng Ji,Zhuo Wang,Keren Fu,Qijun Zhao. "Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection" | [Home Page] | [PDF]

  • Yixiong Zou,Shanghang Zhang,Jianpeng Yu,Yonghong Tian,José M. F. Moura. "Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition" | [Home Page] | [PDF]

  • Yuqi Sun,Ri Cheng,Bo Yan,Shili Zhou. "Space-Angle Super-Resolution for Multi-View Images" | [Home Page] | [PDF]

  • Wei Wang,Junyu Gao,Changsheng Xu. "Weakly-Supervised Video Object Grounding via Stable Context Learning" | [Home Page] | [PDF]

  • Yukun Su,Guosheng Lin,Ruizhou Sun,Yun Hao,Qingyao Wu. "Modeling the Uncertainty for Self-supervised 3D Skeleton Action Representation Learning" | [Home Page] | [PDF]

  • Rongyun Mo,Yan Yan,Jing-Hao Xue,Si Chen,Hanzi Wang. "D³Net: Dual-Branch Disturbance Disentangling Network for Facial Expression Recognition" | [Home Page] | [PDF]

  • Yukang Zhang,Yan Yan,Yang Lu,Hanzi Wang. "Towards a Unified Middle Modality Learning for Visible-Infrared Person Re-Identification" | [Home Page] | [PDF]

  • Yuhao Cui,Zhou Yu,Chunqi Wang,Zhongzhou Zhao,Ji Zhang,Meng Wang,Jun Yu. "ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration" | [Home Page] | [PDF]

  • Xuanxiang Lin,Ke Chen,Kui Jia. "Object Point Cloud Classification via Poly-Convolutional Architecture Search" | [Home Page] | [PDF]

  • Xiao Wang,Weirong Ye,Zhongang Qi,Xun Zhao,Guangge Wang,Ying Shan,Hanzi Wang. "Semantic-Guided Relation Propagation Network for Few-shot Action Recognition" | [Home Page] | [PDF]

  • Yunjie Ge,Qian Wang,Baolin Zheng,Xinlu Zhuang,Qi Li,Chao Shen,Cong Wang. "Anti-Distillation Backdoor Attacks: Backdoors Can Really Survive in Knowledge Distillation" | [Home Page] | [PDF]

  • Yinglu Liu,Mingcan Xiang,Hailin Shi,Tao Mei. "One-stage Context and Identity Hallucination Network" | [Home Page] | [PDF]

  • Zhi Chen,Yadan Luo,Sen Wang,Ruihong Qiu,Jingjing Li,Zi Huang. "Mitigating Generation Shifts for Generalized Zero-Shot Learning" | [Home Page] | [PDF]

  • Yuan Ji,Xu Jia,Huchuan Lu,Xiang Ruan. "Weakly-Supervised Temporal Action Localization via Cross-Stream Collaborative Learning" | [Home Page] | [PDF]

  • Cheng Chen,Jiayin Cai,Yao Hu,Xu Tang,Xinggang Wang,Chun Yuan,Xiang Bai,Song Bai. "Deep Interactive Video Inpainting: An Invisibility Cloak for Harry Potter" | [Home Page] | [PDF]

  • Chenchen Liu,Yadong Mu. "Searching Motion Graphs for Human Motion Synthesis" | [Home Page] | [PDF]

  • Hanbin Zhao,Xin Qin,Shihao Su,Yongjian Fu,Zibo Lin,Xi Li. "When Video Classification Meets Incremental Classes" | [Home Page] | [PDF]

  • Yulin He,Wei Chen,Zhengfa Liang,Dan Chen,Yusong Tan,Xin Luo,Chen Li,Yulan Guo. "Fast and Accurate Lane Detection via Frequency Domain Learning" | [Home Page] | [PDF]

  • Yifang Yin,Ying Zhang,Zhenguang Liu,Yuxuan Liang,Sheng Wang,Rajiv Ratn Shah,Roger Zimmermann. "Learning Multi-context Aware Location Representations from Large-scale Geotagged Images" | [Home Page] | [PDF]

  • Xiaojing Zhong,Zhonghua Wu,Taizhe Tan,Guosheng Lin,Qingyao Wu. "MV-TON: Memory-based Video Virtual Try-on network" | [Home Page] | [PDF]

  • Hao Zhang,Yanbin Hao,Chong-Wah Ngo. "Token Shift Transformer for Video Classification" | [Home Page] | [PDF]

  • Rui Wang,Jian Chen,Gang Yu,Li Sun,Changqian Yu,Changxin Gao,Nong Sang. "Attribute-specific Control Units in StyleGAN for Fine-grained Image Manipulation" | [Home Page] | [PDF]

  • Zhihao Peng,Hui Liu,Yuheng Jia,Junhui Hou. "Attention-driven Graph Clustering Network" | [Home Page] | [PDF]

  • Tianhao Fu,Yingying Li,Xiaoqing Ye,Xiao Tan,Hao Sun,Fumin Shen,Errui Ding. "Lifting the Veil of Frequency in Joint Segmentation and Depth Estimation" | [Home Page] | [PDF]

  • Joao Magalhaes,Tat-Seng Chua,Tao Mei,Alan Smeaton. "The Next Generation Multimodal Conversational Search and Recommendation" | [Home Page] | [PDF]

  • Guanze Liu,Yu Rong,Lu Sheng. "VoteHMR: Occlusion-Aware Voting Network for Robust 3D Human Mesh Recovery from Partial Point Clouds" | [Home Page] | [PDF]

  • Shao-Kui Zhang,Yi-Xiao Li,Yu He,Yong-Liang Yang,Song-Hai Zhang. "MageAdd: Real-Time Interaction Simulation for Scene Synthesis" | [Home Page] | [PDF]

  • Gaowen Liu,Hao Tang,Hugo M. Latapie,Jason J. Corso,Yan Yan. "Cross-View Exocentric to Egocentric Video Synthesis" | [Home Page] | [PDF]

  • Sachin Mehta,Amit Kumar,Fitsum Reda,Varun Nasery,Vikram Mulukutla,Rakesh Ranjan,Vikas Chandra. "EVRNet: Efficient Video Restoration on Edge Devices" | [Home Page] | [PDF]

  • Jingru Gan,Jinchang Luo,Haiwei Wang,Shuhui Wang,Wei He,Qingming Huang. "Multimodal Entity Linking: A New Dataset and A Baseline" | [Home Page] | [PDF]

  • Xichu Ma,Ye Wang,Min-Yen Kan,Wee Sun Lee. "AI-Lyricist: Generating Music and Vocabulary Constrained Lyrics" | [Home Page] | [PDF]

  • Yingjie Chen,Diqi Chen,Yizhou Wang,Tao Wang,Yun Liang. "CaFGraph: Context-aware Facial Multi-graph Representation for Facial Action Unit Recognition" | [Home Page] | [PDF]

  • Jingwei Yan,Jingjing Wang,Qiang Li,Chunmao Wang,Shiliang Pu. "Self-Supervised Regional and Temporal Auxiliary Tasks for Facial Action Unit Recognition" | [Home Page] | [PDF]

  • Ziyu Jia,Youfang Lin,Jing Wang,Zhiyang Feng,Xiangheng Xie,Caijie Chen. "HetEmotionNet: Two-Stream Heterogeneous Graph Recurrent Neural Network for Multi-modal Emotion Recognition" | [Home Page] | [PDF]

  • Xu Yan,Li-Ming Zhao,Bao-Liang Lu. "Simplifying Multimodal Emotion Recognition with Single Eye Movement Modality" | [Home Page] | [PDF]

  • Feiyu Chen,Zhengxiao Sun,Deqiang Ouyang,Xueliang Liu,Jie Shao. "Learning What and When to Drop: Adaptive Multimodal and Contextual Dynamics for Emotion Recognition in Conversation" | [Home Page] | [PDF]

  • Fan Qi,Xiaoshan Yang,Changsheng Xu. "Zero-shot Video Emotion Recognition via Multimodal Protagonist-aware Transformer Network" | [Home Page] | [PDF]

  • Hao Liu,Xin Li,Bing Liu,Deqiang Jiang,Yinsong Liu,Bo Ren,Rongrong Ji. "Show, Read and Reason: Table Structure Recognition with Flexible Context Aggregator" | [Home Page] | [PDF]

  • Di Jin,Zhongang Qi,Yingmin Luo,Ying Shan. "TransFusion: Multi-Modal Fusion for Video Tag Inference via Translation-based Knowledge Embedding" | [Home Page] | [PDF]

  • Yiqing Hu,Yan Zheng,Xinghua Jiang,Hao Liu,Deqiang Jiang,Yinsong Liu,Bo Ren,Rongrong Ji. "RecycleNet: An Overlapped Text Instance Recovery Approach" | [Home Page] | [PDF]

  • Shan An,Guangfu Che,Jinghao Guo,Haogang Zhu,Junjie Ye,Fangru Zhou,Zhaoqi Zhu,Dong Wei,Aishan Liu,Wei Zhang. "ARShoe: Real-Time Augmented Reality Shoe Try-on System on Smartphones" | [Home Page] | [PDF]

  • Yongshun Gong,Jinfeng Yi,Dong-Dong Chen,Jian Zhang,Jiayu Zhou,Zhihua Zhou. "Inferring the Importance of Product Appearance with Semi-supervised Multi-modal Enhancement: A Step Towards the Screenless Retailing" | [Home Page] | [PDF]

  • Cheng Da,Yanhao Zhang,Yun Zheng,Pan Pan,Yinghui Xu,Chunhong Pan. "AsyNCE: Disentangling False-Positives for Weakly-Supervised Video Grounding" | [Home Page] | [PDF]

  • Yupan Huang,Hongwei Xue,Bei Liu,Yutong Lu. "Unifying Multimodal Transformer for Bi-directional Image and Text Generation" | [Home Page] | [PDF]

  • Lianghua Huang,Yu Liu,Xiangzeng Zhou,Ansheng You,Ming Li,Bin Wang,Yingya Zhang,Pan Pan,Xu Yinghui. "Once and for All: Self-supervised Multi-modal Co-training on One-billion Videos at Alibaba" | [Home Page] | [PDF]

  • Yuanfeng Song,Di Jiang,Xuefang Zhao,Qian Xu,Raymond Chi-Wing Wong,Lixin Fan,Qiang Yang. "L2RS: A Learning-to-Rescore Mechanism for Hybrid Speech Recognition" | [Home Page] | [PDF]

  • Avijit Shah,Topojoy Biswas,Sathish Ramadoss,Deven Santosh Shah. "Distantly Supervised Semantic Text Detection and Recognition for Broadcast Sports Videos Understanding" | [Home Page] | [PDF]

  • Xin Jin,Zhonglan Li,Ke Liu,Dongqing Zou,Xiaodong Li,Xingfan Zhu,Ziyin Zhou,Qilong Sun,Qingyu Liu. "Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies" | [Home Page] | [PDF]

  • Haotian Zhang,Allan D. Jepson,Iqbal Mohomed,Konstantinos G. Derpanis,Ran Zhang,Afsaneh Fazly. "Personalized Multi-modal Video Retrieval on Mobile Devices" | [Home Page] | [PDF]

  • Wei Zhang,Lingxiao He,Peng Chen,Xingyu Liao,Wu Liu,Qi Li,Zhenan Sun. "Boosting End-to-end Multi-Object Tracking and Person Search via Knowledge Distillation" | [Home Page] | [PDF]

  • Li Hu,Bang Zhang,Peng Zhang,Jinwei Qi,Jian Cao,Daiheng Gao,Haiming Zhao,Xiaoduan Feng,Qi Wang,Lian Zhuo,Pan Pan,Yinghui Xu. "A Virtual Character Generation and Animation System for E-Commerce Live Streaming" | [Home Page] | [PDF]

  • Peng Qi,Juan Cao,Xirong Li,Huan Liu,Qiang Sheng,Xiaoyue Mi,Qin He,Yongbiao Lv,Chenyang Guo,Yingchao Yu. "Improving Fake News Detection by Using an Entity-enhanced Framework to Fuse Diverse Multimodal Clues" | [Home Page] | [PDF]

  • Federico Vaccaro,Marco Bertini,Tiberio Uricchio,Alberto Del Bimbo. "Fast Video Visual Quality and Resolution Improvement using SR-UNet" | [Home Page] | [PDF]

  • Yujie Zhang,Qi Yang,Yiling Xu. "MS-GraphSIM: Inferring Point Cloud Quality via Multiscale Graph Similarity" | [Home Page] | [PDF]

  • Jia-Xuan Bai,Bin Liu,Luchuan Song. "I Know Your Keyboard Input: A Robust Keystroke Eavesdropper Based-on Acoustic Signals" | [Home Page] | [PDF]

  • Jiahua Xu,Jing Li,Xingguang Zhou,Wei Zhou,Baichao Wang,Zhibo Chen. "Perceptual Quality Assessment of Internet Videos" | [Home Page] | [PDF]

  • Jonathan Carlton,Andy Brown,Caroline Jay,John Keane. "Using Interaction Data to Predict Engagement with Interactive Media" | [Home Page] | [PDF]

  • Sun-Kyung Lee,Jong-Hwan Kim. "Air-Text: Air-Writing and Recognition System" | [Home Page] | [PDF]

  • Daxin Gu,Jia Li,Yu Zhang,Yonghong Tian. "How to Learn a Domain-Adaptive Event Simulator?" | [Home Page] | [PDF]

  • Jinming Mu,Shuiping Gou,Shasha Mao,Shankui Zheng. "A Stepwise Matching Method for Multi-modal Image based on Cascaded Network" | [Home Page] | [PDF]

  • Naili Xing,Sai Ho Yeung,Cheng-Hao Cai,Teck Khim Ng,Wei Wang,Kaiyuan Yang,Nan Yang,Meihui Zhang,Gang Chen,Beng Chin Ooi. "SINGA-Easy: An Easy-to-Use Framework for MultiModal Analysis" | [Home Page] | [PDF]

  • Wanxia Deng,Yawen Cui,Zhen Liu,Gangyao Kuang,Dewen Hu,Matti Pietikäinen,Li Liu. "Informative Class-Conditioned Feature Alignment for Unsupervised Domain Adaptation" | [Home Page] | [PDF]

  • Zhaoquan Yuan,Xiao Peng,Xiao Wu,Changsheng Xu. "Hierarchical Multi-Task Learning for Diagram Question Answering with Multi-Modal Transformer" | [Home Page] | [PDF]

  • Jianming Lv,Kaijie Liu,Shengfeng He. "Differentiated Learning for Multi-Modal Domain Adaptation" | [Home Page] | [PDF]

  • Yang Jiao,Zequn Jie,Weixin Luo,Jingjing Chen,Yu-Gang Jiang,Xiaolin Wei,Lin Ma. "Two-stage Visual Cues Enhancement Network for Referring Image Segmentation" | [Home Page] | [PDF]

  • Yongyong Chen,Shuqin Wang,Chong Peng,Guangming Lu,Yicong Zhou. "Partial Tubal Nuclear Norm Regularized Multi-view Learning" | [Home Page] | [PDF]

  • Yuxing Wang,Yawen Lu,Zhihua Xie,Guoyu Lu. "Deep Unsupervised 3D SfM Face Reconstruction Based on Massive Landmark Bundle Adjustment" | [Home Page] | [PDF]

  • Zhijie Lin,Zhou Zhao,Haoyuan Li,Jinglin Liu,Meng Zhang,Xingshan Zeng,Xiaofei He. "SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory" | [Home Page] | [PDF]

  • Xiaoni Li,Yu Zhou,Yifei Zhang,Aoting Zhang,Wei Wang,Ning Jiang,Haiying Wu,Weiping Wang. "Dense Semantic Contrast for Self-Supervised Visual Representation Learning" | [Home Page] | [PDF]

  • Xingyu Wan,Sanping Zhou,Jinjun Wang,Rongye Meng. "Multiple Object Tracking by Trajectory Map Regression with Temporal Priors Embedding" | [Home Page] | [PDF]

  • Omar Mossad,Khaled Diab,Ihab Amer,Mohamed Hefeeda. "DeepGame: Efficient Video Encoding for Cloud Gaming" | [Home Page] | [PDF]

  • Takumi Kimura,Takashi Matsubara,Kuniaki Uehara. "ChartPointFlow for Topology-Aware 3D Point Cloud Generation" | [Home Page] | [PDF]

  • Cheng Tan,Jun Xia,Lirong Wu,Stan Z. Li. "Co-learning: Learning from Noisy Labels with Self-supervision" | [Home Page] | [PDF]

  • Xu Lu,Lei Zhu,Li Liu,Liqiang Nie,Huaxiang Zhang. "Graph Convolutional Multi-modal Hashing for Flexible Multimedia Retrieval" | [Home Page] | [PDF]

  • Jianming Ye,Shiliang Zhang,Jingdong Wang. "Hybrid Network Compression via Meta-Learning" | [Home Page] | [PDF]

  • Hui Cui,Lei Zhu,Jingjing Li,Zhiyong Cheng,Zheng Zhang. "Two-pronged Strategy: Lightweight Augmented Graph Network Hashing for Scalable Image Retrieval" | [Home Page] | [PDF]

  • Wenli Jiang,Chong Cao. "Reconstruction: A Motion Driven Interactive Artwork Inspired by Chinese Shadow Puppet" | [Home Page] | [PDF]

  • Predrag K. Nikolic,Ruiyang Liu,Shengcheng Luo. "Syntropic Counterpoints: Metaphysics of The Machines" | [Home Page] | [PDF]

  • Castillo Clarence Fitzgerald Gumtang,Sourav S. Bhowmick. "Kandinsky Mobile: Abstract Art-Inspired Interactive Visualization of Social Discussions on Mobile Devices" | [Home Page] | [PDF]

  • Lyn Chao-ling Chen. "Sand Scope: An Interactive Installation for Revealing the Connection Between Mental Space and Life Space in a Microcosm of the World" | [Home Page] | [PDF]

  • Lin Wang,Zhonghao Lin,Wei Cai. "Heraclitus's Forest: An Interactive Artwork for Oral History" | [Home Page] | [PDF]

  • Aiden Kang,Liang Wang,Ziyu Zhou,Zhe Huang,Robert J.K. Jacob. "Affective Color Fields: Reimagining Rothkoesque Artwork as an Interactive Companion for Artistic Self-Expression" | [Home Page] | [PDF]

  • You-Yang Hu,Chiao-Chi Chou,Chia-Wei Li. "Apercevoir: Bio Internet of Things Interactive System" | [Home Page] | [PDF]

  • Zheng Wang,Jingjing Chen,Yu-Gang Jiang. "Visual Co-Occurrence Alignment Learning for Weakly-Supervised Video Moment Retrieval" | [Home Page] | [PDF]

  • ShuBao Liu,Ke-Yue Zhang,Taiping Yao,Mingwei Bi,Shouhong Ding,Jilin Li,Feiyue Huang,Lizhuang Ma. "Adaptive Normalized Representation Learning for Generalizable Face Anti-Spoofing" | [Home Page] | [PDF]

  • Haozhe Wu,Jia Jia,Haoyu Wang,Yishun Dou,Chao Duan,Qingshan Deng. "Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis" | [Home Page] | [PDF]

  • Zhongxing Ma,Yifan Zhao,Jia Li. "Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification" | [Home Page] | [PDF]

  • Jiong Wang,Zhou Zhao,Weike Jin,Xinyu Duan,Zhen Lei,Baoxing Huai,Yiling Wu,Xiaofei He. "VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation" | [Home Page] | [PDF]

  • Lu He,Qianyu Zhou,Xiangtai Li,Li Niu,Guangliang Cheng,Xiao Li,Wenxuan Liu,Yunhai Tong,Lizhuang Ma,Liqing Zhang. "End-to-End Video Object Detection with Spatial-Temporal Transformers" | [Home Page] | [PDF]

  • Peng-Fei Zhang,Jiasheng Duan,Zi Huang,Hongzhi Yin. "Joint-teaching: Learning to Refine Knowledge for Resource-constrained Unsupervised Cross-modal Retrieval" | [Home Page] | [PDF]

  • Zhi Chen,Xiaoqing Ye,Liang Du,Wei Yang,Liusheng Huang,Xiao Tan,Zhenbo Shi,Fumin Shen,Errui Ding. "AggNet for Self-supervised Monocular Depth Estimation: Go An Aggressive Step Further" | [Home Page] | [PDF]

  • Xiaotong Luo,Qiuyuan Liang,Ding Liu,Yanyun Qu. "Boosting Lightweight Single Image Super-resolution via Joint-distillation" | [Home Page] | [PDF]

  • Shaohao Lu,Yuqiao Xian,Ke Yan,Yi Hu,Xing Sun,Xiaowei Guo,Feiyue Huang,Wei-Shi Zheng. "Discriminator-free Generative Adversarial Attack" | [Home Page] | [PDF]

  • Zengqun Zhao,Qingshan Liu. "Former-DFER: Dynamic Facial Expression Recognition Transformer" | [Home Page] | [PDF]

  • Guanyue Li,Yi Liu,Xiwen Wei,Yang Zhang,Si Wu,Yong Xu,Hau-San Wong. "Discovering Density-Preserving Latent Space Walks in GANs for Semantic Image Transformations" | [Home Page] | [PDF]

  • Yiming Wu,Xintian Wu,Xi Li,Jian Tian. "MGH: Metadata Guided Hypergraph Modeling for Unsupervised Person Re-identification" | [Home Page] | [PDF]

  • Meng-Jiun Chiou,Henghui Ding,Hanshu Yan,Changhu Wang,Roger Zimmermann,Jiashi Feng. "Recovering the Unbiased Scene Graphs from the Biased Ones" | [Home Page] | [PDF]

  • Fa-Ting Hong,Jia-Chang Feng,Dan Xu,Ying Shan,Wei-Shi Zheng. "Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization" | [Home Page] | [PDF]

  • Risheng Liu,Zhu Liu,Jinyuan Liu,Xin Fan. "Searching a Hierarchically Aggregated Fusion Architecture for Fast Multi-Modality Image Fusion" | [Home Page] | [PDF]

  • Yu Yin,Joseph P. Robinson,Songyao Jiang,Yue Bai,Can Qin,Yun Fu. "SuperFront: From Low-resolution to High-resolution Frontal Face Synthesis" | [Home Page] | [PDF]

  • Chen Jiang,Kaiming Huang,Sifeng He,Xudong Yang,Wei Zhang,Xiaobo Zhang,Yuan Cheng,Lei Yang,Qing Wang,Furong Xu,Tan Pan,Wei Chu. "Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval" | [Home Page] | [PDF]

  • Tianshu Xie,Xuan Cheng,Xiaomin Wang,Minghui Liu,Jiali Deng,Tao Zhou,Ming Liu. "Cut-Thumbnail: A Novel Data Augmentation for Convolutional Neural Network" | [Home Page] | [PDF]

  • Sheng Li,Xun Zhu,Guorui Feng,Xinpeng Zhang,Zhenxing Qian. "Diffusing the Liveness Cues for Face Anti-spoofing" | [Home Page] | [PDF]

  • Da-Wei Zhou,Han-Jia Ye,De-Chuan Zhan. "Co-Transport for Class-Incremental Learning" | [Home Page] | [PDF]

  • Fida Mohammad Thoker,Hazel Doughty,Cees G. M. Snoek. "Skeleton-Contrastive 3D Action Representation Learning" | [Home Page] | [PDF]

  • Albin Vogel,Erik Kronberg,Niklas Carlsson. "Fast-forwarding, Rewinding, and Path Exploration in Interactive Branched Video Streaming" | [Home Page] | [PDF]

  • Yunzhong Hou,Liang Zheng. "Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation)" | [Home Page] | [PDF]

  • Chang Liu,Lichen Wang,Kai Li,Yun Fu. "Domain Generalization via Feature Variation Decorrelation" | [Home Page] | [PDF]

  • Dong Jing,Shuo Zhang,Runmin Cong,Youfang Lin. "Occlusion-aware Bi-directional Guided Network for Light Field Salient Object Detection" | [Home Page] | [PDF]

  • Jiabo Ye,Xin Lin,Liang He,Dingbang Li,Qin Chen. "One-Stage Visual Grounding via Semantic-Aware Feature Filter" | [Home Page] | [PDF]

  • Chenyou Fan,Junjie Hu,Jianwei Huang. "Few-Shot Multi-Agent Perception" | [Home Page] | [PDF]

  • Bo Seok Shim,Yoo Seung Shin,Seong Wook Park,Jong-Uk Hou. "SI3DP: Source Identification Challenges and Benchmark for Consumer-Level 3D Printer Forensics" | [Home Page] | [PDF]

  • Wen Wang,Yang Cao,Jing Zhang,Fengxiang He,Zheng-Jun Zha,Yonggang Wen,Dacheng Tao. "Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers" | [Home Page] | [PDF]

  • Tianyi Xie,Liucheng Liao,Cheng Bi,Benlai Tang,Xiang Yin,Jianfei Yang,Mingjie Wang,Jiali Yao,Yang Zhang,Zejun Ma. "Towards Realistic Visual Dubbing with Heterogeneous Sources" | [Home Page] | [PDF]

  • Qianqian Wang,Wei Xia,Zhiqiang Tao,Quanxue Gao,Xiaochun Cao. "Deep Self-Supervised t-SNE for Multi-modal Subspace Clustering" | [Home Page] | [PDF]

  • Xindi Shang,Zehuan Yuan,Anran Wang,Changhu Wang. "Multimodal Video Summarization via Time-Aware Transformers" | [Home Page] | [PDF]

  • Taichi Nishimura,Atsushi Hashimoto,Yoshitaka Ushiku,Hirotaka Kameko,Shinsuke Mori. "State-aware Video Procedural Captioning" | [Home Page] | [PDF]

  • Woosung Choi,Minseok Kim,Marco A. Martínez Ramírez,Jaehwa Chung,Soonyoung Jung. "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" | [Home Page] | [PDF]

  • Sitong Su,Lianli Gao,Junchen Zhu,Jie Shao,Jingkuan Song. "Fully Functional Image Manipulation Using Scene Graphs in A Bounding-Box Free Way" | [Home Page] | [PDF]

  • Xi Zhang,Feifei Zhang,Changsheng Xu. "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning" | [Home Page] | [PDF]

  • Zhiwei Hao,Yong Luo,Han Hu,Jianping An,Yonggang Wen. "Data-Free Ensemble Knowledge Distillation for Privacy-conscious Multimedia Model Compression" | [Home Page] | [PDF]

  • Haocong Rao,Xiping Hu,Jun Cheng,Bin Hu. "SM-SGE: A Self-Supervised Multi-Scale Skeleton Graph Encoding Framework for Person Re-Identification" | [Home Page] | [PDF]

  • Sohail Ahmed Khan,Hang Dai. "Video Transformer for Deepfake Detection with Incremental Learning" | [Home Page] | [PDF]

  • Jiahao Wang,Gang Pan,Di Sun,Jiawan Zhang. "Chinese Character Inpainting with Contextual Semantic Constraints" | [Home Page] | [PDF]

  • Ji Zhang,Jingkuan Song,Yazhou Yao,Lianli Gao. "Curriculum-Based Meta-learning" | [Home Page] | [PDF]

  • Haonan Qiu,Pan He,Shuchun Liu,Weiyuan Shao,Feiyun Zhang,Jiajun Wang,Liang He,Feng Wang. "Ego-Deliver: A Large-Scale Dataset For Egocentric Video Analysis" | [Home Page] | [PDF]

  • Ping-Han Chiang,Chi-Shen Chan,Shan-Hung Wu. "Adversarial Pixel Masking: A Defense against Physical Attacks for Pre-trained Object Detectors" | [Home Page] | [PDF]

  • Li Wang,Baoyu Fan,Zhenhua Guo,Yaqian Zhao,Runze Zhang,Rengang Li,Weifeng Gong,Endong Wang. "Knowledge-Supervised Learning: Knowledge Consensus Constraints for Person Re-Identification" | [Home Page] | [PDF]

  • Qingzhe Pan,Zhifu Zhao,Xuemei Xie,Jianan Li,Yuhan Cao,Guangming Shi. "View-normalized Skeleton Generation for Action Recognition" | [Home Page] | [PDF]

  • Zheyun Qin,Xiankai Lu,Xiushan Nie,Xiantong Zhen,Yilong Yin. "Learning Hierarchical Embedding for Video Instance Segmentation" | [Home Page] | [PDF]

  • Tianhao Zhang,Hung-Yu Tseng,Lu Jiang,Weilong Yang,Honglak Lee,Irfan Essa. "Text as Neural Operator: Image Manipulation by Text Instruction" | [Home Page] | [PDF]

  • Wenhao Wu,Yuxiang Zhao,Yanwu Xu,Xiao Tan,Dongliang He,Zhikang Zou,Jin Ye,Yingying Li,Mingde Yao,Zichao Dong,Yifeng Shi. "DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning" | [Home Page] | [PDF]

  • Yulin Li,Yuxi Qian,Yuechen Yu,Xiameng Qin,Chengquan Zhang,Yan Liu,Kun Yao,Junyu Han,Jingtuo Liu,Errui Ding. "StrucTexT: Structured Text Understanding with Multi-Modal Transformers" | [Home Page] | [PDF]

  • Yudong Chen,Sen Wang,Jianglin Lu,Zhi Chen,Zheng Zhang,Zi Huang. "Local Graph Convolutional Networks for Cross-Modal Hashing" | [Home Page] | [PDF]

  • Shenhao Cao,Qin Zou,Xiuqing Mao,Dengpan Ye,Zhongyuan Wang. "Metric Learning for Anti-Compression Facial Forgery Detection" | [Home Page] | [PDF]

  • Yaqi Xia,Yan Xia,Wei Li,Rui Song,Kailang Cao,Uwe Stilla. "ASFM-Net: Asymmetrical Siamese Feature Matching Network for Point Completion" | [Home Page] | [PDF]

  • Ding Ma,Xiangqian Wu. "Capsule-based Object Tracking with Natural Language Specification" | [Home Page] | [PDF]

  • Bicheng Dai,Kaisheng Wu,Tong Wu,Kai Li,Yanyun Qu,Yuan Xie,Yun Fu. "Faster-PPN: Towards Real-Time Semantic Segmentation with Dual Mutual Learning for Ultra-High Resolution Images" | [Home Page] | [PDF]

  • Nenglun Chen,Xingjia Pan,Runnan Chen,Lei Yang,Zhiwen Lin,Yuqiang Ren,Haolei Yuan,Xiaowei Guo,Feiyue Huang,Wenping Wang. "Distributed Attention for Grounded Image Captioning" | [Home Page] | [PDF]

  • Zhiwei Liu,Xiangyu Zhu,Lu Yang,Xiang Yan,Ming Tang,Zhen Lei,Guibo Zhu,Xuetao Feng,Yan Wang,Jinqiao Wang. "Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation" | [Home Page] | [PDF]

  • Qinyan Dai,Juncheng Li,Qiaosi Yi,Faming Fang,Guixu Zhang. "Feedback Network for Mutually Boosted Stereo Image Super-Resolution and Disparity Estimation" | [Home Page] | [PDF]

  • Qijun Wang,Guodong Zheng. "Merging Multiple Template Matching Predictions in Intra Coding with Attentive Convolutional Neural Network" | [Home Page] | [PDF]

  • Hao Ni,Jingkuan Song,Xiaosu Zhu,Feng Zheng,Lianli Gao. "Camera-Agnostic Person Re-Identification via Adversarial Disentangling Learning" | [Home Page] | [PDF]

  • Uttaran Bhattacharya,Elizabeth Childs,Nicholas Rewkowski,Dinesh Manocha. "Speech2AffectiveGestures: Synthesizing Co-Speech Gestures with Generative Adversarial Affective Expression Learning" | [Home Page] | [PDF]

  • Shangzhe Di,Zeren Jiang,Si Liu,Zhaokai Wang,Leyan Zhu,Zexin He,Hongming Liu,Shuicheng Yan. "Video Background Music Generation with Controllable Music Transformer" | [Home Page] | [PDF]

  • Zhi Qiao,Yu Zhou,Jin Wei,Wei Wang,Yuan Zhang,Ning Jiang,Hongbin Wang,Weiping Wang. "PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition" | [Home Page] | [PDF]

  • Abhishek Kumar,Tristan Braud,Lik Hang Lee,Pan Hui. "Theophany: Multimodal Speech Augmentation in Instantaneous Privacy Channels" | [Home Page] | [PDF]

  • You-Yang Hu,Yao-Fu Jan,Kuan-Wei Tseng,You-Shin Tsai,Hung-Ming Sung,Jin-Yao Lin,Yi-Ping Hung. "aBio: Active Bi-Olfactory Display Using Subwoofers for Virtual Reality" | [Home Page] | [PDF]

  • Yunfei Guo,Wei Feng,Fei Yin,Tao Xue,Shuqi Mei,Cheng-Lin Liu. "Learning to Understand Traffic Signs" | [Home Page] | [PDF]

  • Yanyuan Qiao,Qi Chen,Chaorui Deng,Ning Ding,Yuankai Qi,Mingkui Tan,Xincheng Ren,Qi Wu. "R-GAN: Exploring Human-like Way for Reasonable Text-to-Image Synthesis via Generative Adversarial Networks" | [Home Page] | [PDF]

  • Chen Zhang,Runmin Cong,Qinwei Lin,Lin Ma,Feng Li,Yao Zhao,Sam Kwong. "Cross-modality Discrepant Interaction Network for RGB-D Salient Object Detection" | [Home Page] | [PDF]

  • Junda Wu,Tong Yu,Shuai Li. "Deconfounded and Explainable Interactive Vision-Language Retrieval of Complex Scenes" | [Home Page] | [PDF]

  • Junyong You. "Long Short-term Convolutional Transformer for No-Reference Video Quality Assessment" | [Home Page] | [PDF]

  • Baopu Li,Yanwen Fan,Zhihong Pan,Yuchen Bian,Gang Zhang. "Automatic Channel Pruning with Hyper-parameter Search and Dynamic Masking" | [Home Page] | [PDF]

  • Yue Zhao,Weizhi Nie,An-An Liu,Zan Gao,Yuting Su. "SVHAN: Sequential View Based Hierarchical Attention Network for 3D Shape Recognition" | [Home Page] | [PDF]

  • Jian Li,Bin Zhang,Yabiao Wang,Ying Tai,Zhenyu Zhang,Chengjie Wang,Jilin Li,Xiaoming Huang,Yili Xia. "ASFD: Automatic and Scalable Face Detector" | [Home Page] | [PDF]

  • Qi Tang,Runmin Cong,Ronghui Sheng,Lingzhi He,Dan Zhang,Yao Zhao,Sam Kwong. "BridgeNet: A Joint Learning Network of Depth Map Super-Resolution and Monocular Depth Estimation" | [Home Page] | [PDF]

  • Yuxi Li,Boshen Zhang,Jian Li,Yabiao Wang,Weiyao Lin,Chengjie Wang,Jilin Li,Feiyue Huang. "LSTC: Boosting Atomic Action Detection with Long-Short-Term Context" | [Home Page] | [PDF]

  • Taehun Kim,Hyemin Lee,Daijin Kim. "UACANet: Uncertainty Augmented Context Attention for Polyp Segmentation" | [Home Page] | [PDF]

  • Zhenquan Lin,Kailing Guo,Xiaofen Xing,Xiangmin Xu. "Weight Evolution: Improving Deep Neural Networks Training through Evolving Inferior Weight Values" | [Home Page] | [PDF]

  • Zhikang Zou,Xiaoye Qu,Pan Zhou,Shuangjie Xu,Xiaoqing Ye,Wenhao Wu,Jin Ye. "Coarse to Fine: Domain Adaptive Crowd Counting via Adversarial Scoring Network" | [Home Page] | [PDF]

  • Qiming Wu,Zhikang Zou,Pan Zhou,Xiaoqing Ye,Binghui Wang,Ang Li. "Towards Adversarial Patch Analysis and Certified Defense against Crowd Counting" | [Home Page] | [PDF]

  • Pengpeng Zeng,Lianli Gao,Xinyu Lyu,Shuaiqi Jing,Jingkuan Song. "Conceptual and Syntactical Cross-modal Alignment with Cross-level Consistency for Image-Text Matching" | [Home Page] | [PDF]

  • Yifan Zhao,Le Hui,Jin Xie. "SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering" | [Home Page] | [PDF]

  • Anupam Sobti,Vaibhav Mavi,M Balakrishnan,Chetan Arora. "VmAP: A Fair Metric for Video Object Detection" | [Home Page] | [PDF]

  • Mucong Ye,Jing Zhang,Jinpeng Ouyang,Ding Yuan. "Source Data-free Unsupervised Domain Adaptation for Semantic Segmentation" | [Home Page] | [PDF]

  • Wang Yin,Peng Lu,Zhaoran Zhao,Xujun Peng. "Yes, 'Attention Is All You Need', for Exemplar based Colorization" | [Home Page] | [PDF]

  • Jiehua Zhang,Liang Li,Chenggang Yan,Yaoqi Sun,Tao Shen,Jiyong Zhang,Zhan Wang. "Heuristic Depth Estimation with Progressive Depth Reconstruction and Confidence-Aware Loss" | [Home Page] | [PDF]

  • Jingxian Sun,Lichao Zhang,Yufei Zha,Abel Gonzalez-Garcia,Peng Zhang,Wei Huang,Yanning Zhang. "Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking" | [Home Page] | [PDF]

  • Kaiqi Dong,Wei Yang,Zhenbo Xu,Liusheng Huang,Zhidong Yu. "ABPNet: Adaptive Background Modeling for Generalized Few Shot Segmentation" | [Home Page] | [PDF]

  • Qingqing Wang,Liqiang Xiao,Yue Lu,Yaohui Jin,Hao He. "Towards Reasoning Ability in Scene Text Visual Question Answering" | [Home Page] | [PDF]

  • Jianxin Sun,Qi Li,Weining Wang,Jian Zhao,Zhenan Sun. "Multi-caption Text-to-Face Synthesis: Dataset and Algorithm" | [Home Page] | [PDF]

  • Weili Guan,Haokun Wen,Xuemeng Song,Chung-Hsing Yeh,Xiaojun Chang,Liqiang Nie. "Multimodal Compatibility Modeling via Exploring the Consistent and Complementary Correlations" | [Home Page] | [PDF]

  • Shudong Huang,Ivor W. Tsang,Zenglin Xu,Jiancheng Lv,Quanhui Liu. "CDD: Multi-view Subspace Clustering via Cross-view Diversity Detection" | [Home Page] | [PDF]

  • Yiqi Lin,Jinpeng Wang,Manlin Zhang,Andy J. Ma. "Learning Spatio-temporal Representation by Channel Aliasing Video Perception" | [Home Page] | [PDF]

  • Huanqian Yan,Xingxing Wei. "Efficient Sparse Attacks on Videos using Reinforcement Learning" | [Home Page] | [PDF]

  • Shengshan Hu,Yechao Zhang,Xiaogeng Liu,Leo Yu Zhang,Minghui Li,Hai Jin. "AdvHash: Set-to-set Targeted Attack on Deep Hashing with One Single Adversarial Patch" | [Home Page] | [PDF]

  • Dailan He,Yusheng Zhao,Junyu Luo,Tianrui Hui,Shaofei Huang,Aixi Zhang,Si Liu. "TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding" | [Home Page] | [PDF]

  • Qian He,Desen Zhou,Bo Wan,Xuming He. "Single Image 3D Object Estimation with Primitive Graph Networks" | [Home Page] | [PDF]

  • Yun Li,Chen Zhang,Shihao Han,Li Lyna Zhang,Baoqun Yin,Yunxin Liu,Mengwei Xu. "Boosting Mobile CNN Inference through Semantic Memory" | [Home Page] | [PDF]

  • Gil Shapira,Noga Levy,Ishay Goldin,Roy J. Jevnisek. "Knowing When to Quit: Selective Cascaded Regression with Patch Attention for Real-Time Face Alignment" | [Home Page] | [PDF]

  • Jianjun Chen,Shancheng Fang,Hongtao Xie,Zheng-Jun Zha,Yue Hu,Jianlong Tan. "End-to-end Boundary Exploration for Weakly-supervised Semantic Segmentation" | [Home Page] | [PDF]

  • Xiangwen Deng,Junlin Zhu,Shangming Yang. "SFE-Net: EEG-based Emotion Recognition with Symmetrical Spatial Feature Extraction" | [Home Page] | [PDF]

  • Dian Jin,Long Ma,Risheng Liu,Xin Fan. "Bridging the Gap between Low-Light Scenes: Bilevel Learning for Fast Adaptation" | [Home Page] | [PDF]

  • Liangchen Song,Jialian Wu,Ming Yang,Qian Zhang,Yuan Li,Junsong Yuan. "Handling Difficult Labels for Multi-label Image Classification via Uncertainty Distillation" | [Home Page] | [PDF]

  • Chenxi Ma,Bo Yan,Weimin Tan,Xuhao Jiang. "Perception-Oriented Stereo Image Super-Resolution" | [Home Page] | [PDF]

  • Rongkai Zhang,Lanqing Guo,Siyu Huang,Bihan Wen. "ReLLIE: Deep Reinforcement Learning for Customized Low-Light Image Enhancement" | [Home Page] | [PDF]

  • Lingbo Yang,Zhanning Gao,Siwei Ma,Wen Gao. "Intrinsic Temporal Regularization for High-resolution Human Video Synthesis" | [Home Page] | [PDF]

  • Kit Yung Lam,Lik Hang Lee,Pan Hui. "A2W: Context-Aware Recommendation System for Mobile Augmented Reality Web Browser" | [Home Page] | [PDF]

  • Changchong Sheng,Matti Pietikäinen,Qi Tian,Li Liu. "Cross-modal Self-Supervised Learning for Lip Reading: When Contrastive Learning meets Adversarial Training" | [Home Page] | [PDF]

  • Shentong Mo,Xin Miao. "OsGG-Net: One-step Graph Generation Network for Unbiased Head Pose Estimation" | [Home Page] | [PDF]

  • Xirong Li,Yang Zhou,Jie Wang,Hailan Lin,Jianchun Zhao,Dayong Ding,Weihong Yu,Youxin Chen. "Multi-Modal Multi-Instance Learning for Retinal Disease Recognition" | [Home Page] | [PDF]

  • Keyan Ding,Yi Liu,Xueyi Zou,Shiqi Wang,Kede Ma. "Locally Adaptive Structure and Texture Similarity for Image Quality Assessment" | [Home Page] | [PDF]

  • Yiyang Huang,Xuefeng Liang,Chaowei Fang. "CALLip: Lipreading using Contrastive and Attribute Learning" | [Home Page] | [PDF]

  • Yu Sugiyama,Keiji Yanai. "Cross-Modal Recipe Embeddings by Disentangling Recipe Contents and Dish Styles" | [Home Page] | [PDF]

  • Yu Zhou,Hongtao Xie,Shancheng Fang,Jing Wang,Zhengjun Zha,Yongdong Zhang. "TDI TextSpotter: Taking Data Imbalance into Account in Scene Text Spotting" | [Home Page] | [PDF]

  • Xuanyu Zhang,Qing Yang. "Position-Augmented Transformers with Entity-Aligned Mesh for TextVQA" | [Home Page] | [PDF]

  • Ye Deng,Siqi Hui,Sanping Zhou,Deyu Meng,Jinjun Wang. "Learning Contextual Transformer Network for Image Inpainting" | [Home Page] | [PDF]

  • Lei Ma,Jian Shi,Yanyun Chen. "Milliseconds Color Stippling" | [Home Page] | [PDF]

  • Longyao Liu,Bo Ma,Yulin Zhang,Xin Yi,Haozhi Li. "AFD-Net: Adaptive Fully-Dual Network for Few-Shot Object Detection" | [Home Page] | [PDF]

  • Meng Shen,Huaizheng Zhang,Yixin Cao,Fan Yang,Yonggang Wen. "Missing Data Imputation for Solar Yield Prediction using Temporal Multi-Modal Variational Auto-Encoder" | [Home Page] | [PDF]

  • Chenyi Lei,Shixian Luo,Yong Liu,Wanggui He,Jiamang Wang,Guoxin Wang,Haihong Tang,Chunyan Miao,Houqiang Li. "Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training" | [Home Page] | [PDF]

  • Hongyu Li,Jia Li,Dong Zhao,Long Xu. "DehazeFlow: Multi-scale Conditional Flow Network for Single Image Dehazing" | [Home Page] | [PDF]

  • Huan Zheng,Zhao Zhang,Yang Wang,Zheng Zhang,Mingliang Xu,Yi Yang,Meng Wang. "GCM-Net: Towards Effective Global Context Modeling for Image Inpainting" | [Home Page] | [PDF]

  • Yufei Wang,Haoliang Li,Lap-pui Chau,Alex C. Kot. "Embracing the Dark Knowledge: Domain Generalization Using Regularized Knowledge Distillation" | [Home Page] | [PDF]

  • Bingyu Hu,Zheng-Jun Zha,Jiawei Liu,Xierong Zhu,Hongtao Xie. "Cluster and Scatter: A Multi-grained Active Semi-supervised Learning Framework for Scalable Person Re-identification" | [Home Page] | [PDF]

  • Xinzhi Dong,Chengjiang Long,Wenju Xu,Chunxia Xiao. "Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning" | [Home Page] | [PDF]

  • Qilin Deng,Kai Wang,Minghao Zhao,Runze Wu,Yu Ding,Zhene Zou,Yue Shang,Jianrong Tao,Changjie Fan. "Build Your Own Bundle - A Neural Combinatorial Optimization Method" | [Home Page] | [PDF]

  • Changfeng Yu,Yi Chang,Yi Li,Xile Zhao,Luxin Yan. "Unsupervised Image Deraining: Optimization Model Driven Deep CNN" | [Home Page] | [PDF]

  • Cordelia Schmid. "Do you see what I see?: Large-scale Learning from Multimodal Videos" | [Home Page] | [PDF]

  • Jingren Zhou. "Large-scale Multi-Modality Pretrained Models: Applications and Experiences" | [Home Page] | [PDF]

  • Xiaoqi Zhao,Youwei Pang,Jiaxing Yang,Lihe Zhang,Huchuan Lu. "Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation" | [Home Page] | [PDF]

  • Changshu Liu,Liangjian Wen,Zhao Kang,Guangchun Luo,Ling Tian. "Self-supervised Consensus Representation Learning for Attributed Graph" | [Home Page] | [PDF]

  • Shuhui Qu,Yan Kang,Janghwan Lee. "Efficient Multi-Modal Fusion with Diversity Analysis" | [Home Page] | [PDF]

  • Yongming Wen,Yiquan Fang,Junhao Cai,Kimwa Tung,Hui Cheng. "GCCN: Geometric Constraint Co-attention Network for 6D Object Pose Estimation" | [Home Page] | [PDF]

  • Paul Pu Liang,Peter Wu,Liu Ziyin,Louis-Philippe Morency,Ruslan Salakhutdinov. "Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment" | [Home Page] | [PDF]

  • Yikai Wang,Wenbing Huang,Bin Fang,Fuchun Sun,Chang Li. "Elastic Tactile Simulation Towards Tactile-Visual Perception" | [Home Page] | [PDF]

  • Zan Gao,Yuxiang Shao,Weili Guan,Meng Liu,Zhiyong Cheng,Shengyong Chen. "A Novel Patch Convolutional Neural Network for View-based 3D Model Retrieval" | [Home Page] | [PDF]

  • Xu Yan,Zhengcong Fei,Zekang Li,Shuhui Wang,Qingming Huang,Qi Tian. "Semi-Autoregressive Image Captioning" | [Home Page] | [PDF]

  • Yi Zhang,Xinwang Liu,Siwei Wang,Jiyuan Liu,Sisi Dai,En Zhu. "One-Stage Incomplete Multi-view Clustering via Late Fusion" | [Home Page] | [PDF]

  • Jiyuan Liu,Xinwang Liu,Yi Zhang,Pei Zhang,Wenxuan Tu,Siwei Wang,Sihang Zhou,Weixuan Liang,Siqi Wang,Yuexiang Yang. "Self-Representation Subspace Clustering for Incomplete Multi-view Data" | [Home Page] | [PDF]

  • Meng Wang,Sen Wang,Han Yang,Zheng Zhang,Xi Chen,Guilin Qi. "Is Visual Context Really Helpful for Knowledge Graph? A Representation Learning Perspective" | [Home Page] | [PDF]

  • Yushan Zhu,Huaixiao Zhao,Wen Zhang,Ganqiang Ye,Hui Chen,Ningyu Zhang,Huajun Chen. "Knowledge Perceived Multi-modal Pretraining in E-commerce" | [Home Page] | [PDF]

  • Yipeng Yu,Zirui Tu,Longyu Lu,Xiao Chen,Hui Zhan,Zixun Sun. "Text2Video: Automatic Video Generation Based on Text Scripts" | [Home Page] | [PDF]

  • Sen Yang,Qike Zhao,Lanxin Miao,Min Chen,Lianli Gao,Jingkuan Song,Weidong Le. "A System for Interactive and Intelligent AD Auxiliary Screening" | [Home Page] | [PDF]

  • Borun Xu,Biao Wang,Jiale Tao,Tiezheng Ge,Yuning Jiang,Wen Li,Lixin Duan. "Move As You Like: Image Animation in E-Commerce Scenario" | [Home Page] | [PDF]

  • Rinita Roy,Ruben Mayer,Hans-Arno Jacobsen. "MDMS: Music Data Matching System for Query Variant Retrieval" | [Home Page] | [PDF]

  • Mu Mu,Murtada Dohan. "Community Generated VR Painting using Eye Gaze" | [Home Page] | [PDF]

  • Yuki Tajima,Toshiharu Horiuchi,Gen Hattori. "Sync Glass: Virtual Pouring and Toasting Experience with Multimodal Presentation" | [Home Page] | [PDF]

  • Yanhao Zhang,Qiang Wang,Yun Zheng,Pan Pan,Yinghui Xu. "VideoDiscovery: An Automatic Short-Video Generation System for E-commerce Live-streaming" | [Home Page] | [PDF]

  • Yuanfeng Song,Xuefang Zhao,Di Jiang,Xiaoling Huang,Weiwei Zhao,Qian Xu,Raymond Chi-Wing Wong,Qiang Yang. "SmartSales: An AI-Powered Telemarketing Coaching System in FinTech" | [Home Page] | [PDF]

  • Yuanfeng Song,Di Jiang,Xuefang Zhao,Xiaoling Huang,Qian Xu,Raymond Chi-Wing Wong,Qiang Yang. "SmartMeeting: Automatic Meeting Transcription and Summarization for In-Person Conversations" | [Home Page] | [PDF]

  • Hao Lou,Heng Huang,Chaoen Xiao,Xin Jin. "Aesthetic Evaluation and Guidance for Mobile Photography" | [Home Page] | [PDF]

  • Wenyuan Xue,Siqi Cai,Wen Wang,Qingyong Li,Baosheng Yu,Yibing Zhan,Dacheng Tao. "A Question Answering System for Unstructured Table Images" | [Home Page] | [PDF]

  • Xujian Zhao,Chongwei Wang,Peiquan Jin,Hui Zhang,Chunming Yang,Bo Li. "Post2Story: Automatically Generating Storylines from Microblogging Platforms" | [Home Page] | [PDF]

  • Tong Shen,Jiawei Zuo,Fan Shi,Jin Zhang,Liqin Jiang,Meng Chen,Zhengchen Zhang,Wei Zhang,Xiaodong He,Tao Mei. "ViDA-MAN: Visual Dialog with Digital Humans" | [Home Page] | [PDF]

  • Yupan Huang,Bei Liu,Jianlong Fu,Yutong Lu. "A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation" | [Home Page] | [PDF]

  • Maxime Grandidier,Fabien Boucaud,Indira Thouvenin,Catherine Pelachaud. "Softly: Simulated Empathic Touch between an Agent and a Human" | [Home Page] | [PDF]

  • Akihisa Ishino,Yoko Yamakata,Hiroaki Karasawa,Kiyoharu Aizawa. "RecipeLog: Recipe Authoring App for Accurate Food Recording" | [Home Page] | [PDF]

  • Matthias Springstein,Stefanie Schneider,Javad Rahnama,Eyke Hüllermeier,Hubertus Kohle,Ralph Ewerth. "iART: A Search Engine for Art-Historical Images to Support Research in the Humanities" | [Home Page] | [PDF]

  • Jardenna Mohazzab,Abe Vos,Jonathan van Westendorp,Lucas Lageweg,Dylan Prins,Aritra Bhowmik. "ArtiVisual: A Platform to Generate and Compare Art" | [Home Page] | [PDF]

  • Ivona Najdenkoska,Jeroen den Boef,Thomas Schneider,Justo van der Werf,Reinier de Ridder,Fajar Fathurrahman,Marcel Worring. "GCNIllustrator: Illustrating the Effect of Hyperparameters on Graph Convolutional Networks" | [Home Page] | [PDF]

  • Noboru Yoshida,Jianquan Liu. "On-demand Action Detection System using Pose Information" | [Home Page] | [PDF]

  • Xian Zhao,Jiaming Zhang,Xiaowen Huang. "APF: An Adversarial Privacy-preserving Filter to Protect Portrait Information" | [Home Page] | [PDF]

  • Li Hu,Jinwei Qi,Bang Zhang,Pan Pan,Yinghui Xu. "Text-driven 3D Avatar Animation with Emotional and Expressive Behaviors" | [Home Page] | [PDF]

  • Xinyan Yang,Fei Hu,Long Ye. "Text to Scene: A System of Configurable 3D Indoor Scene Synthesis" | [Home Page] | [PDF]

  • Ruiqi Wang,Long Ye,Qin Zhang. "MovieREP: A New Movie Reproduction Framework for Film Soundtrack" | [Home Page] | [PDF]

  • Li Gao,Jing Zhang,Lefei Zhang,Dacheng Tao. "DSP: Dual Soft-Paste for Unsupervised Domain Adaptive Semantic Segmentation" | [Home Page] | [PDF]

  • Yu Lin,Jinghui Guo,Yang Gao,Yi-fan Li,Zhuoyi Wang,Latifur Khan. "Generating Point Cloud from Single Image in The Few Shot Scenario" | [Home Page] | [PDF]

  • Yuqing Song,Shizhe Chen,Qin Jin,Wei Luo,Jun Xie,Fei Huang. "Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training" | [Home Page] | [PDF]

  • Yong Liu,Susen Yang,Chenyi Lei,Guoxin Wang,Haihong Tang,Juyong Zhang,Aixin Sun,Chunyan Miao. "Pre-training Graph Transformer with Multimodal Side Information for Recommendation" | [Home Page] | [PDF]

  • Minyoung Kim,Ricardo Guerrero,Vladimir Pavlovic. "Learning Disentangled Factors from Paired Data in Cross-Modal Retrieval: An Implicit Identifiable VAE Approach" | [Home Page] | [PDF]

  • Liang Peng,Shuangji Yang,Yi Bin,Guoqing Wang. "Progressive Graph Attention Network for Video Question Answering" | [Home Page] | [PDF]

  • Tao Dai,Yalei Lv,Bin Chen,Zhi Wang,Zexuan Zhu,Shu-Tao Xia. "Mix-order Attention Networks for Image Restoration" | [Home Page] | [PDF]

  • Ji Zhang,Jian-Jun Qiao,Xiao Wu,Wei Li. "Vehicle Counting Network with Attention-based Mask Refinement and Spatial-awareness Block Loss" | [Home Page] | [PDF]

  • Zhiyang Chen,Yousong Zhu,Chaoyang Zhao,Guosheng Hu,Wei Zeng,Jinqiao Wang,Ming Tang. "DPT: Deformable Patch-based Transformer for Visual Recognition" | [Home Page] | [PDF]

  • Cairong Zhao,Shuyang Feng,Brian Nlong Zhao,Zhijun Ding,Jun Wu,Fumin Shen,Heng Tao Shen. "Scene Text Image Super-Resolution via Parallelly Contextual Attention Network" | [Home Page] | [PDF]

  • Mengyuan Ding,Shanshan Zhang,Jian Yang. "Improving Pedestrian Detection from a Long-tailed Domain Perspective" | [Home Page] | [PDF]

  • Xianyong Fang,Xiaohao He,Linbo Wang,Jianbing Shen. "Robust Shadow Detection by Exploring Effective Shadow Contexts" | [Home Page] | [PDF]

  • Babak Taraghi. "End-to-end Quality of Experience Evaluation for HTTP Adaptive Streaming" | [Home Page] | [PDF]

  • Yutong Zhou. "Generative Adversarial Network for Text-to-Face Synthesis and Manipulation" | [Home Page] | [PDF]

  • Zhihang Ren. "GAN-aided Serial Dependence Study in Medical Image Perception" | [Home Page] | [PDF]

  • Ru Li. "Image Style Transfer with Generative Adversarial Networks" | [Home Page] | [PDF]

  • Yuhang Lu. "Annotation-Efficient Semantic Segmentation with Shape Prior Knowledge" | [Home Page] | [PDF]

  • Peng Dai. "Neural-based Rendering and Application" | [Home Page] | [PDF]

  • Shaoxiang Chen. "Towards Bridging Video and Language by Caption Generation and Sentence Localization" | [Home Page] | [PDF]

  • Pratibha Kumari. "Situational Anomaly Detection in Multimedia Data under Concept Drift" | [Home Page] | [PDF]

  • Guangzhi Wang. "Dynamic Knowledge Distillation with Cross-Modality Knowledge Transfer" | [Home Page] | [PDF]

  • Peidong Liu,Zibin He,Xiyu Yan,Yong Jiang,Shu-Tao Xia,Feng Zheng,Maowei Hu. "WeClick: Weakly-Supervised Video Semantic Segmentation with Click Annotations" | [Home Page] | [PDF]

  • Jinhai Yang,Hua Yang,Lin Chen. "Towards Cross-Granularity Few-Shot Learning: Coarse-to-Fine Pseudo-Labeling with Visual-Semantic Meta-Embedding" | [Home Page] | [PDF]

  • Guoqing Wang,Changming Sun,Xing Xu,Jingjing Li,Zheng Wang,Zeyu Ma. "Disentangled Representation Learning and Enhancement Network for Single Image De-Raining" | [Home Page] | [PDF]

  • Lei Zhu,Zhaojing Luo,Wei Wang,Meihui Zhang,Gang Chen,Kaiping Zheng. "Towards Robust Cross-domain Image Understanding with Unsupervised Noise Removal" | [Home Page] | [PDF]

  • Zaid Khan,Yun Fu. "Exploiting BERT for Multimodal Target Sentiment Classification through Input Space Translation" | [Home Page] | [PDF]

  • Jingran Zhang,Xing Xu,Fumin Shen,Yazhou Yao,Jie Shao,Xiaofeng Zhu. "Video Representation Learning with Graph Contrastive Augmentation" | [Home Page] | [PDF]

  • Shipeng Yan,Jiale Zhou,Jiangwei Xie,Songyang Zhang,Xuming He. "An EM Framework for Online Incremental Learning of Semantic Segmentation" | [Home Page] | [PDF]

  • Shuang Li,Bingfeng Han,Zhenjie Yu,Chi Harold Liu,Kai Chen,Shuigen Wang. "I2V-GAN: Unpaired Infrared-to-Visible Video Translation" | [Home Page] | [PDF]

  • Zitai Wang,Qianqian Xu,Zhiyong Yang,Xiaochun Cao,Qingming Huang. "Implicit Feedbacks are Not Always Favorable: Iterative Relabeled One-Class Collaborative Filtering against Noisy Interactions" | [Home Page] | [PDF]

  • Dahu Shi,Xing Wei,Xiaodong Yu,Wenming Tan,Ye Ren,Shiliang Pu. "InsPose: Instance-Aware Networks for Single-Stage Multi-Person Pose Estimation" | [Home Page] | [PDF]

  • Lufan Ma,Tiancai Wang,Bin Dong,Jiangpeng Yan,Xiu Li,Xiangyu Zhang. "Implicit Feature Refinement for Instance Segmentation" | [Home Page] | [PDF]

  • Anwen Hu,Shizhe Chen,Qin Jin. "Question-controlled Text-aware Image Captioning" | [Home Page] | [PDF]

  • Yiwei Zhang,Toshihiko Yamasaki. "Style-Aware Image Recommendation for Social Media Marketing" | [Home Page] | [PDF]

  • He Li,Mang Ye,Bo Du. "WePerson: Learning a Generalized Re-identification Model from All-weather Virtual Data" | [Home Page] | [PDF]

  • Shuai Liu,Lu Zhang,Shuai Hao,Huchuan Lu,You He. "Polar Ray: A Single-stage Angle-free Detector for Oriented Object Detection in Aerial Images" | [Home Page] | [PDF]

  • Bi'an Du,Xiang Gao,Wei Hu,Xin Li. "Self-Contrastive Learning with Hard Negative Sampling for Self-supervised Point Cloud Learning" | [Home Page] | [PDF]

  • Yi Zhang,Sheng Huang,Fengtao Zhou. "Generally Boosting Few-Shot Learning with HandCrafted Features" | [Home Page] | [PDF]

  • Tianjun Zhang,Nlong Zhao,Ying Shen,Xuan Shao,Lin Zhang,Yicong Zhou. "ROECS: A Robust Semi-direct Pipeline Towards Online Extrinsics Correction of the Surround-view System" | [Home Page] | [PDF]

  • Wen Qian,Zhiqun He,Silong Peng,Chen Chen,Wei Wu. "Pseudo Graph Convolutional Network for Vehicle ReID" | [Home Page] | [PDF]

  • Wencan Huang,Wenwen Pan,Zhou Zhao,Qi Tian. "Towards Fast and High-Quality Sign Language Production" | [Home Page] | [PDF]

  • Zhenzhong Kuang,Huigui Liu,Jun Yu,Aikui Tian,Lei Wang,Jianping Fan,Noboru Babaguchi. "Effective De-identification Generative Adversarial Network for Face Anonymization" | [Home Page] | [PDF]

  • Ricardo Guerrero,Hai X. Pham,Vladimir Pavlovic. "Cross-modal Retrieval and Synthesis (X-MRS): Closing the Modality Gap in Shared Subspace Learning" | [Home Page] | [PDF]

  • Jie Xiao,Dandan Zhan,Haoran Qi,Zhi Jin. "When Face Completion Meets Irregular Holes: An Attributes Guided Deep Inpainting Network" | [Home Page] | [PDF]

  • Zongmo Huang,Yazhou Ren,Xiaorong Pu,Lifang He. "Non-Linear Fusion for Self-Paced Multi-View Clustering" | [Home Page] | [PDF]

  • Pengzhan Sun,Bo Wu,Xunsong Li,Wen Li,Lixin Duan,Chuang Gan. "Counterfactual Debiasing Inference for Compositional Action Recognition" | [Home Page] | [PDF]

  • Yuhan Zhang,Bo Wu,Wen Li,Lixin Duan,Chuang Gan. "STST: Spatial-Temporal Specialized Transformer for Skeleton-based Action Recognition" | [Home Page] | [PDF]

  • Xinyu Liu,Baopu Li,Zhen Chen,Yixuan Yuan. "Exploring Gradient Flow Based Saliency for DNN Model Compression" | [Home Page] | [PDF]

  • Shengjie Chen,Zhenhua Guo,Bo Yuan. "An Adaptive Iterative Inpainting Method with More Information Exploration" | [Home Page] | [PDF]

  • Gonçalo Marcelino,David Semedo,André Mourão,Saverio Blasi,João Magalhães,Marta Mrak. "Assisting News Media Editors with Cohesive Visual Storylines" | [Home Page] | [PDF]

  • Yiqiang Zhao,Yiyao Zhou,Rui Chen,Bin Hu,Xiding Ai. "MM-Flow: Multi-modal Flow Network for Point Cloud Completion" | [Home Page] | [PDF]

  • Zhiliang Peng,Wei Huang,Zonghao Guo,Xiaosong Zhang,Jianbin Jiao,Qixiang Ye. "Long-tailed Distribution Adaptation" | [Home Page] | [PDF]

  • Kecheng Chen,Kun Long,Yazhou Ren,Jiayu Sun,Xiaorong Pu. "Lesion-Inspired Denoising Network: Connecting Medical Image Denoising and Lesion Detection" | [Home Page] | [PDF]

  • Fuming You,Jingjing Li,Lei Zhu,Zhi Chen,Zi Huang. "Domain Adaptive Semantic Segmentation without Source Data" | [Home Page] | [PDF]

  • Yuchen Yang,Min Wang,Wengang Zhou,Houqiang Li. "Cross-modal Joint Prediction and Alignment for Composed Query Image Retrieval" | [Home Page] | [PDF]

  • Yingjian Li,Yingnan Gao,Bingzhi Chen,Zheng Zhang,Lei Zhu,Guangming Lu. "JDMAN: Joint Discriminative and Mutual Adaptation Networks for Cross-Domain Facial Expression Recognition" | [Home Page] | [PDF]

  • Feifei Shao,Yawei Luo,Li Zhang,Lu Ye,Siliang Tang,Yi Yang,Jun Xiao. "Improving Weakly Supervised Object Localization via Causal Intervention" | [Home Page] | [PDF]

  • Xinhao Li,Jingjing Li,Lei Zhu,Guoqing Wang,Zi Huang. "Imbalanced Source-free Domain Adaptation" | [Home Page] | [PDF]

  • Zhekai Du,Jingjing Li,Ke Lu,Lei Zhu,Zi Huang. "Learning Transferrable and Interpretable Representations for Domain Generalization" | [Home Page] | [PDF]

  • Zhenyu Xie,Xujie Zhang,Fuwei Zhao,Haoye Dong,Michael C. Kampffmeyer,Haonan Yan,Xiaodan Liang. "WAS-VTON: Warping Architecture Search for Virtual Try-on Network" | [Home Page] | [PDF]

  • Yan-Jie Zhou,Shi-Qi Liu,Xiao-Liang Xie,Zeng-Guang Hou. "DFR-Net: A Novel Multi-Task Learning Network for Real-Time Multi-Instrument Segmentation" | [Home Page] | [PDF]

  • Mingrui Lao,Yanming Guo,Yu Liu,Wei Chen,Nan Pu,Michael S. Lew. "From Superficial to Deep: Language Bias driven Curriculum Learning for Visual Question Answering" | [Home Page] | [PDF]

  • Xun Gao,Yin Zhao,Jie Zhang,Longjun Cai. "Pairwise Emotional Relationship Recognition in Drama Videos: Dataset and Benchmark" | [Home Page] | [PDF]

  • Yingying Cheng,Fan Zhang,Gang Hu,Yiwen Wang,Hanhui Yang,Gong Zhang,Zhuo Cheng. "Block Popularity Prediction for Multimedia Storage Systems Using Spatial-Temporal-Sequential Neural Networks" | [Home Page] | [PDF]

  • Yang Chen,Yingwei Pan,Yu Wang,Ting Yao,Xinmei Tian,Tao Mei. "Transferrable Contrastive Learning for Visual Domain Adaptation" | [Home Page] | [PDF]

  • Rong-Cheng Tu,Xian-Ling Mao,Cihang Kong,Zihang Shao,Ze-Lin Li,Wei Wei,Heyan Huang. "Weighted Gaussian Loss based Hamming Hashing" | [Home Page] | [PDF]

  • Peng Lu,Gao Huang,Hangyu Lin,Wenming Yang,Guodong Guo,Yanwei Fu. "Domain-Aware SE Network for Sketch-based Image Retrieval with Multiplicative Euclidean Margin Softmax" | [Home Page] | [PDF]

  • Deyu Wang,Dongchao Wen,Wei Tao,Lingxiao Yin,Tse-Wei Chen,Tadayuki Ito,Kinya Osa,Masami Kato. "FTAFace: Context-enhanced Face Detector with Fine-grained Task Attention" | [Home Page] | [PDF]

  • Jingcheng Ni,Jie Qin,Di Huang. "Identity-aware Graph Memory Network for Action Detection" | [Home Page] | [PDF]

  • Wenkang Shan,Haopeng Lu,Shanshe Wang,Xinfeng Zhang,Wen Gao. "Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation" | [Home Page] | [PDF]

  • Nan Zhong,Zhenxing Qian,Xinpeng Zhang. "Deep Neural Network Retrieval" | [Home Page] | [PDF]

  • Xingcai Wu,Yucheng Xie,Jiaqi Zeng,Zhenguo Yang,Yi Yu,Qing Li,Wenyin Liu. "Adversarial Learning with Mask Reconstruction for Text-Guided Image Inpainting" | [Home Page] | [PDF]

  • Zhihao Gu,Yang Chen,Taiping Yao,Shouhong Ding,Jilin Li,Feiyue Huang,Lizhuang Ma. "Spatiotemporal Inconsistency Learning for DeepFake Video Detection" | [Home Page] | [PDF]

  • Gian-Luca Savino,Jessé Moraes Braga,Johannes Schöning. "VeloCity: Using Voice Assistants for Cyclists to Provide Traffic Reports" | [Home Page] | [PDF]

  • Qiyu Dai,Shuai Yang,Wenjing Wang,Wei Xiang,Jiaying Liu. "Edit Like A Designer: Modeling Design Workflows for Unaligned Fashion Editing" | [Home Page] | [PDF]

  • Jizhizi Li,Sihan Ma,Jing Zhang,Dacheng Tao. "Privacy-Preserving Portrait Matting" | [Home Page] | [PDF]

  • Jiaxiang You,Yuanman Li,Jiantao Zhou,Zhongyun Hua,Weiwei Sun,Xia Li. "A Transformer based Approach for Image Manipulation Chain Detection" | [Home Page] | [PDF]

  • Peng Wu,Xiangteng He,Mingqian Tang,Yiliang Lv,Jing Liu. "HANet: Hierarchical Alignment Networks for Video-Text Retrieval" | [Home Page] | [PDF]

  • Mengjing Sun,Pei Zhang,Siwei Wang,Sihang Zhou,Wenxuan Tu,Xinwang Liu,En Zhu,Changjian Wang. "Scalable Multi-view Subspace Clustering with Unified Anchors" | [Home Page] | [PDF]

  • Tao Xiang,Ying Yang,Shangwei Guo,Hangcheng Liu,Hantao Liu. "PRNet: A Progressive Recovery Network for Revealing Perceptually Encrypted Images" | [Home Page] | [PDF]

  • Run Wang,Felix Juefei-Xu,Meng Luo,Yang Liu,Lina Wang. "FakeTagger: Robust Safeguards against DeepFake Dissemination via Provenance Tracking" | [Home Page] | [PDF]

  • Yang Bai,Junyan Wang,Yang Long,Bingzhang Hu,Yang Song,Maurice Pagnucco,Yu Guan. "Discriminative Latent Semantic Graph for Video Captioning" | [Home Page] | [PDF]

  • Qichao Ying,Zhenxing Qian,Hang Zhou,Haisheng Xu,Xinpeng Zhang,Siyi Li. "From Image to Imuge: Immunized Image Generation" | [Home Page] | [PDF]

  • Sravya Vardhani Shivapuja,Mansi Pradeep Khamkar,Divij Bajaj,Ganesh Ramakrishnan,Ravi Kiran Sarvadevabhatla. "Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting" | [Home Page] | [PDF]

  • Insoo Lee,Jinsung Lee,Kyunghan Lee,Dirk Grunwald,Sangtae Ha. "Demystifying Commercial Video Conferencing Applications" | [Home Page] | [PDF]

  • Han Hu,Sheng Cheng,Xinggong Zhang,Zongming Guo. "LightFEC: Network Adaptive FEC with a Lightweight Deep-Learning Approach" | [Home Page] | [PDF]

  • Yueming Lyu,Jing Dong,Bo Peng,Wei Wang,Tieniu Tan. "SOGAN: 3D-Aware Shadow and Occlusion Robust GAN for Makeup Transfer" | [Home Page] | [PDF]

  • Yuqing Liao,Xinke Li,Zekun Tong,Yabang Zhao,Andrew Lim,Zhenzhong Kuang,Cise Midoglu. "Reproducibility Companion Paper: Campus3D: A Photogrammetry Point Cloud Benchmark for Outdoor Scene Hierarchical Understanding" | [Home Page] | [PDF]

  • Dingquan Li,Tingting Jiang,Ming Jiang,Vajira Lasantha Thambawita,Haoliang Wang. "Reproducibility Companion Paper: Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment" | [Home Page] | [PDF]

  • Serhan Gül,Sebastian Bosse,Dimitri Podborski,Thomas Schierl,Cornelius Hellge,Marc A. Kastner,Jan Zahálka. "Reproducibility Companion Paper: Kalman Filter-Based Head Motion Prediction for Cloud-Based Mixed Reality" | [Home Page] | [PDF]

  • Jari Korhonen,Yicheng Su,Junyong You,Steven Hicks,Cise Midoglu. "Reproducibility Companion Paper: Blind Natural Video Quality Prediction via Statistical Temporal Features and Deep Spatial Features" | [Home Page] | [PDF]

  • Jakub Nawala,Lucjan Janowski,Bogdan Ćmiel,Krzysztof Rusek,Marc A. Kastner,Jan Zahálka. "Reproducibility Companion Paper: Describing Subjective Experiment Consistency by p-Value P-P Plot" | [Home Page] | [PDF]

  • Li Tao,Xueting Wang,Toshihiko Yamasaki,Jingjing Chen,Steven Hicks. "Reproducibility Companion Paper: Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework" | [Home Page] | [PDF]

  • Fan Yu,Haonan Wang,Tongwei Ren,Jinhui Tang,Gangshan Wu,Jingjing Chen,Zhenzhong Kuang. "Reproducibility Companion Paper: Visual Relation of Interest Detection" | [Home Page] | [PDF]

  • Lijian Gao,Qirong Mao,Jingjing Chen,Ming Dong,Ratna Chinnam,Lucile Sassatelli,Miguel Romero Rondon,Ujjwal Sharma. "Reproducibility Companion Paper: On Learning Disentangled Representation for Acoustic Event Detection" | [Home Page] | [PDF]

  • James Lester. "AI and the Future of Education" | [Home Page] | [PDF]

  • Zhengyou Zhang. "Digital Human in an Integrated Physical-Digital World (IPhD)" | [Home Page] | [PDF]

  • Wenhang Ge,Chunyan Pan,Ancong Wu,Hongwei Zheng,Wei-Shi Zheng. "Cross-Camera Feature Prediction for Intra-Camera Supervised Person Re-identification across Distant Scenes" | [Home Page] | [PDF]

  • Xindi Shang,Yicong Li,Junbin Xiao,Wei Ji,Tat-Seng Chua. "Video Visual Relation Detection via Iterative Inference" | [Home Page] | [PDF]

  • Jiahui Li,Kun Kuang,Lin Li,Long Chen,Songyang Zhang,Jian Shao,Jun Xiao. "Instance-wise or Class-wise? A Tale of Neighbor Shapley for Concept-based Explanation" | [Home Page] | [PDF]

  • Zheyu Zhang,Yurui Zhu,Xueyang Fu,Zhiwei Xiong,Zheng-Jun Zha,Feng Wu. "Multifocal Attention-Based Cross-Scale Network for Image De-raining" | [Home Page] | [PDF]

  • Dongyang Zhang,Changyu Li,Ning Xie,Guoqing Wang,Jie Shao. "PFFN: Progressive Feature Fusion Network for Lightweight Image Super-Resolution" | [Home Page] | [PDF]

  • Mengzhu Wang,Wei Wang,Baopu Li,Xiang Zhang,Long Lan,Huibin Tan,Tianyi Liang,Wei Yu,Zhigang Luo. "InterBN: Channel Fusion for Adversarial Unsupervised Domain Adaptation" | [Home Page] | [PDF]

  • Shaozu Yuan,Ruixue Liu,Meng Chen,Baoyang Chen,Zhijie Qiu,Xiaodong He. "Learning to Compose Stylistic Calligraphy Artwork with Emotions" | [Home Page] | [PDF]

  • Athanasios Efthymiou,Stevan Rudinac,Monika Kackovic,Marcel Worring,Nachoem Wijnberg. "Graph Neural Networks for Knowledge Enhanced Visual Representation of Paintings" | [Home Page] | [PDF]

  • Mark-David Hosale,Robert Allison,Jim Madsen,Marcus Gordon. "ArtScience and the ICECUBE LED Display [ILDm^3]" | [Home Page] | [PDF]

  • Guo Li,Baoliang Chen,Lingyu Zhu,Qinwen He,Hongfei Fan,Shiqi Wang. "PUGCQ: A Large Scale Dataset for Quality Assessment of Professional User-Generated Content" | [Home Page] | [PDF]

  • Yurui Ren,Yubo Wu,Thomas H. Li,Shan Liu,Ge Li. "Combining Attention with Flow for Person Image Synthesis" | [Home Page] | [PDF]

  • Shuang Wu,Zhenguang Liu,Shijian Lu,Li Cheng. "Dual Learning Music Composition and Dance Choreography" | [Home Page] | [PDF]

  • Xin Liu,Jiancheng Li,Jiaqi Wang,Ziwei Liu. "MMFashion: An Open-Source Toolbox for Visual Fashion Analysis" | [Home Page] | [PDF]

  • Zihan Ding,Tianyang Yu,Hongming Zhang,Yanhua Huang,Guo Li,Quancheng Guo,Luo Mai,Hao Dong. "Efficient Reinforcement Learning Development with RLzoo" | [Home Page] | [PDF]

  • Yixiao Guo,Jiawei Liu,Guo Li,Luo Mai,Hao Dong. "Fast and Flexible Human Pose Estimation with HyperPose" | [Home Page] | [PDF]

  • Xuezhi Wang,Guanyu Gao. "SmartEye: An Open Source Framework for Real-Time Video Analytics with Edge-Cloud Collaboration" | [Home Page] | [PDF]

  • Tom Bartindale,Peter Chen,Harrison Marshall,Stanislav Pozdniakov,Dan Richardson. "ZoomSense: A Scalable Infrastructure for Augmenting Zoom" | [Home Page] | [PDF]

  • Jun Hu,Shengsheng Qian,Quan Fang,Youze Wang,Quan Zhao,Huaiwen Zhang,Changsheng Xu. "Efficient Graph Deep Learning in TensorFlow with tf_geometric" | [Home Page] | [PDF]

  • Jun Wang,Yinglu Liu,Yibo Hu,Hailin Shi,Tao Mei. "FaceX-Zoo: A PyTorch Toolbox for Face Recognition" | [Home Page] | [PDF]

  • Haoqi Fan,Tullie Murrell,Heng Wang,Kalyan Vasudev Alwala,Yanghao Li,Yilei Li,Bo Xiong,Nikhila Ravi,Meng Li,Haichuan Yang,Jitendra Malik,Ross Girshick,Matt Feiszli,Aaron Adcock,Wan-Yen Lo,Christoph Feichtenhofer. "PyTorchVideo: A Deep Learning Library for Video Understanding" | [Home Page] | [PDF]

  • Haocong Ying,Tie Liu,Mingxin Ai,Jiali Ding,Yuanyuan Shang. "AICoacher: A System Framework for Online Realtime Workout Coach" | [Home Page] | [PDF]

  • Zhanghui Kuang,Hongbin Sun,Zhizhong Li,Xiaoyu Yue,Tsui Hin Lin,Jianyong Chen,Huaqiang Wei,Yiqin Zhu,Tong Gao,Wenwei Zhang,Kai Chen,Wayne Zhang,Dahua Lin. "MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding" | [Home Page] | [PDF]

  • Adam Wieckowski,Christian Lehmann,Benjamin Bross,Detlev Marpe,Thibaud Biatek,Mikael Raulet,Jean Le Feuvre. "A Complete End to End Open Source Toolchain for the Versatile Video Coding (VVC) Standard" | [Home Page] | [PDF]

  • Yehao Li,Yingwei Pan,Jingwen Chen,Ting Yao,Tao Mei. "X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics" | [Home Page] | [PDF]

  • Luka Murn,Alan F. Smeaton,Marta Mrak. "Interpreting Super-Resolution CNNs for Sub-Pixel Motion Compensation in Video Coding" | [Home Page] | [PDF]

  • Yi-Geng Hong,Hui-Chu Xiao,Wan-Lei Zhao. "Towards Accurate Localization by Instance Search" | [Home Page] | [PDF]

  • Rintaro Yanagi,Ren Togo,Takahiro Ogawa,Miki Haseyama. "Database-adaptive Re-ranking for Enhancing Cross-modal Image Retrieval" | [Home Page] | [PDF]

  • Ning Han,Jingjing Chen,Guangyi Xiao,Hao Zhang,Yawen Zeng,Hao Chen. "Fine-grained Cross-modal Alignment Network for Text-Video Retrieval" | [Home Page] | [PDF]

  • Jiwei Wei,Xing Xu,Zheng Wang,Guoqing Wang. "Meta Self-Paced Learning for Cross-Modal Matching" | [Home Page] | [PDF]

  • Ruihong Qiu,Sen Wang,Zhi Chen,Hongzhi Yin,Zi Huang. "CausalRec: Causal Inference for Visual Debiasing in Visually-Aware Recommendation" | [Home Page] | [PDF]

  • Haifeng Xia,Taotao Jing,Chen Chen,Zhengming Ding. "Semi-supervised Domain Adaptive Retrieval via Discriminative Hashing Learning" | [Home Page] | [PDF]

  • Zhizhong Han,Xiyang Wang,Yu-Shen Liu,Matthias Zwicker. "Hierarchical View Predictor: Unsupervised 3D Global Feature Learning through Hierarchical Prediction among Unordered Views" | [Home Page] | [PDF]

  • Jinghao Zhang,Yanqiao Zhu,Qiang Liu,Shu Wu,Shuhui Wang,Liang Wang. "Mining Latent Structures for Multimedia Recommendation" | [Home Page] | [PDF]

  • Jiahao Xun,Shengyu Zhang,Zhou Zhao,Jieming Zhu,Qi Zhang,Jingjie Li,Xiuqiang He,Xiaofei He,Tat-Seng Chua,Fei Wu. "Why Do We Click: Visual Impression-aware News Recommendation" | [Home Page] | [PDF]

  • Jingzhi Li,Lutong Han,Ruoyu Chen,Hua Zhang,Bing Han,Lili Wang,Xiaochun Cao. "Identity-Preserving Face Anonymization via Adaptively Facial Attributes Obfuscation" | [Home Page] | [PDF]

  • Zhijian Hou,Chong-Wah Ngo,W. K. Chan. "CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval" | [Home Page] | [PDF]

  • Qianxiu Hao,Qianqian Xu,Zhiyong Yang,Qingming Huang. "Learning Unified Embeddings for Recommendation via Meta-path Semantics" | [Home Page] | [PDF]

  • Kin Wai Cheuk,Dorien Herremans,Li Su. "ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data" | [Home Page] | [PDF]

  • Ruijie Tao,Zexu Pan,Rohan Kumar Das,Xinyuan Qian,Mike Zheng Shou,Haizhou Li. "Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection" | [Home Page] | [PDF]

  • Wei-Tsung Lu,Meng-Hsuan Wu,Yuh-Ming Chiu,Li Su. "Actions Speak Louder than Listening: Evaluating Music Style Transfer based on Editing Experience" | [Home Page] | [PDF]

  • Rongjie Huang,Feiyang Chen,Yi Ren,Jinglin Liu,Chenye Cui,Zhou Zhao. "Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus" | [Home Page] | [PDF]

  • Hongyuan Zhu,Ye Niu,Di Fu,Hao Wang. "MusicBERT: A Self-supervised Learning of Music Representation" | [Home Page] | [PDF]

  • Yuanhang Zhang,Susan Liang,Shuang Yang,Xiao Liu,Zhongqin Wu,Shiguang Shan,Xilin Chen. "UniCon: Unified Context Network for Robust Active Speaker Detection" | [Home Page] | [PDF]

  • Yakun Huang,Yuanwei Zhu,Xiuquan Qiao,Zhijie Tan,Boyuan Bai. "AITransfer: Progressive AI-powered Transmission for Real-Time Point Cloud Video Streaming" | [Home Page] | [PDF]

  • Tiesong Zhao,Jielian Lin,Yanjie Song,Xu Wang,Yuzhen Niu. "Game Theory-driven Rate Control for 360-Degree Video Coding" | [Home Page] | [PDF]

  • Lei Zhang,Yanyan Suo,Ximing Wu,Feng Wang,Yuchi Chen,Laizhong Cui,Jiangchuan Liu,Zhong Ming. "TBRA: Tiling and Bitrate Adaptation for Mobile 360-Degree Video Streaming" | [Home Page] | [PDF]

  • Wanxin Shi,Qing Li,Ruishan Zhang,Gengbiao Shen,Yong Jiang,Zhenhui Yuan,Gabriel-Miro Muntean. "QoE Ready to Respond: A QoE-aware MEC Selection Scheme for DASH-based Adaptive Video Streaming to Mobile Users" | [Home Page] | [PDF]

  • Pengfei Xiong,Yu Chen. "Hierarchical Fusion for Practical Ghost-free High Dynamic Range Imaging" | [Home Page] | [PDF]

  • Xindong Zhang,Hui Zeng,Lei Zhang. "Edge-oriented Convolution Block for Real-time Super Resolution on Mobile Devices" | [Home Page] | [PDF]

  • Hanyue Tu,Li Li,Wengang Zhou,Houqiang Li. "Semantic Scalable Image Compression with Cross-Layer Priors" | [Home Page] | [PDF]

  • Weidong Chen,Guorong Li,Xinfeng Zhang,Hongyang Yu,Shuhui Wang,Qingming Huang. "Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence" | [Home Page] | [PDF]

  • Chuanyi Zhang,Yazhou Yao,Xing Xu,Jie Shao,Jingkuan Song,Zechao Li,Zhenmin Tang. "Extracting Useful Knowledge from Noisy Web Images via Data Purification for Fine-Grained Recognition" | [Home Page] | [PDF]

  • Tianyu Su,Xuemeng Song,Na Zheng,Weili Guan,Yan Li,Liqiang Nie. "Complementary Factorization towards Outfit Compatibility Modeling" | [Home Page] | [PDF]

  • Xin Dong,Hao Liu,Weiwei Cai,Pengyuan Lv,Zekuan Yu. "Open Set Face Anti-Spoofing in Unseen Attacks" | [Home Page] | [PDF]

  • Yicong Li,Xun Yang,Xindi Shang,Tat-Seng Chua. "Interventional Video Relation Detection" | [Home Page] | [PDF]

  • Yuxi Xie,Danqing Huang,Jinpeng Wang,Chin-Yew Lin. "CanvasEmb: Learning Layout Representation with Large-scale Pre-training for Graphic Design" | [Home Page] | [PDF]

  • Yizhen Lao,Jie Yang,Xinying Wang,Jianxin Lin,Yu Cao,Shien Song. "Augmenting TV Shows via Uncalibrated Camera Small Motion Tracking in Dynamic Scene" | [Home Page] | [PDF]

  • Aoxiong Yin,Zhou Zhao,Jinglin Liu,Weike Jin,Meng Zhang,Xingshan Zeng,Xiaofei He. "SimulSLT: End-to-End Simultaneous Sign Language Translation" | [Home Page] | [PDF]

  • Hongshuo Tian,Ning Xu,An-An Liu,Chenggang Yan,Zhendong Mao,Quan Zhang,Yongdong Zhang. "Mask and Predict: Multi-step Reasoning for Scene Graph Generation" | [Home Page] | [PDF]

  • Shanmin Yang,Xiao Yang,Yi Lin,Peng Cheng,Yi Zhang,Jianwei Zhang. "Heterogeneous Face Recognition with Attention-guided Feature Disentangling" | [Home Page] | [PDF]

  • Yiqi Jiang,Weihua Chen,Xiuyu Sun,Xiaoyu Shi,Fan Wang,Hao Li. "Exploring the Quality of GAN Generated Images for Person Re-Identification" | [Home Page] | [PDF]

  • Chen Zhang,Siwei Wang,Jiyuan Liu,Sihang Zhou,Pei Zhang,Xinwang Liu,En Zhu,Changwang Zhang. "Multi-view Clustering via Deep Matrix Factorization and Partition Alignment" | [Home Page] | [PDF]

  • Zhen Han,Xiangteng He,Mingqian Tang,Yiliang Lv. "Video Similarity and Alignment Learning on Partial Video Copy Detection" | [Home Page] | [PDF]

  • Jinjian Wu,Yongxu Liu,Leida Li,Weisheng Dong,Guangming Shi. "No-Reference Video Quality Assessment with Heterogeneous Knowledge Ensemble" | [Home Page] | [PDF]

  • Carlos Bermejo Fernandez,Petteri Nurmi,Pan Hui. "Seeing is Believing?: Effects of Visualization on Smart Device Privacy Perceptions" | [Home Page] | [PDF]

  • Shuai Shao,Lei Xing,Yan Wang,Rui Xu,Chunyan Zhao,Yanjiang Wang,Baodi Liu. "MHFC: Multi-Head Feature Collaboration for Few-Shot Learning" | [Home Page] | [PDF]

  • Ma Shuo,Yanli Ji,Xing Xu,Xiaofeng Zhu. "Vision-guided Music Source Separation via a Fine-grained Cycle-Separation Network" | [Home Page] | [PDF]

  • Yuchen Yang,Ye Xiang,Shuaicheng Liu,Lifang Wu,Boxuan Zhao,Bing Zeng. "GLM-Net: Global and Local Motion Estimation via Task-Oriented Encoder-Decoder Structure" | [Home Page] | [PDF]

  • Katsuyuki Nakamura,Hiroki Ohashi,Mitsuhiro Okada. "Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention" | [Home Page] | [PDF]

  • Jiguo Li,Chuanmin Jia,Xinfeng Zhang,Siwei Ma,Wen Gao. "Cross Modal Compression: Towards Human-comprehensible Semantic Compression" | [Home Page] | [PDF]

  • Yunqing Hu,Xuan Jin,Yin Zhang,Haiwen Hong,Jingfeng Zhang,Yuan He,Hui Xue. "RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition" | [Home Page] | [PDF]

  • Jiechong Song,Bin Chen,Jian Zhang. "Memory-Augmented Deep Unfolding Network for Compressive Sensing" | [Home Page] | [PDF]

  • Lihao Jiang,Yi Wang,Qi Jia,Shengwei Xu,Yu Liu,Xin Fan,Haojie Li,Risheng Liu,Xinwei Xue,Ruili Wang. "Underwater Species Detection using Channel Sharpening Attention" | [Home Page] | [PDF]

  • Junyin Zhang,Yongxin Ge,Xinqian Gu,Boyu Hua,Tao Xiang. "Self-Supervised Pre-training on the Target Domain for Cross-Domain Person Re-identification" | [Home Page] | [PDF]

  • Lei Zhang,Leiting Chen,Chuan Zhou,Fan Yang,Xin Li. "Exploring Graph-Structured Semantics for Cross-Modal Retrieval" | [Home Page] | [PDF]

  • Lei Shen,Haolan Zhan,Xin Shen,Yonghao Song,Xiaofang Zhao. "Text is NOT Enough: Integrating Visual Impressions into Open-domain Dialogue Generation" | [Home Page] | [PDF]

  • Yang Li,Shiqi Wang,Xinfeng Zhang,Shanshe Wang,Siwei Ma,Yue Wang. "Quality Assessment of End-to-End Learned Image Compression: The Benchmark and Objective Measure" | [Home Page] | [PDF]

  • Xiao Luo,Daqing Wu,Zeyu Ma,Chong Chen,Minghua Deng,Jianqiang Huang,Xian-Sheng Hua. "A Statistical Approach to Mining Semantic Similarity for Deep Unsupervised Hashing" | [Home Page] | [PDF]

  • Zi-Rong Jin,Liang-Jian Deng,Tian-Jing Zhang,Xiao-Xu Jin. "BAM: Bilateral Activation Mechanism for Image Fusion" | [Home Page] | [PDF]

  • Lei Wang,Piotr Koniusz. "Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors" | [Home Page] | [PDF]

  • Tailin Chen,Desen Zhou,Jian Wang,Shidong Wang,Yu Guan,Xuming He,Errui Ding. "Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition" | [Home Page] | [PDF]

  • Weijie Li,Xinhang Song,Yubing Bai,Sixian Zhang,Shuqiang Jiang. "ION: Instance-level Object Navigation" | [Home Page] | [PDF]

  • Shiwei Gan,Yafeng Yin,Zhiwei Jiang,Lei Xie,Sanglu Lu. "Skeleton-Aware Neural Sign Language Translation" | [Home Page] | [PDF]

  • Srinivas Kruthiventi S S,George Jose,Nitya Tandon,Rajesh Biswal,Aashish Kumar. "Fingerspelling Recognition in the Wild with Fixed-Query based Visual Attention" | [Home Page] | [PDF]

  • Qiongjie Cui,Huaijiang Sun,Yue Kong,Xiaoning Sun. "Deep Human Dynamics Prior" | [Home Page] | [PDF]

  • Jiangming Shi,Zixian Gao,Hao Liu,Zekuan Yu,Fengjun Li. "Exploiting Invariance of Mining Facial Landmarks" | [Home Page] | [PDF]

  • Jiaxiang Tang,Xiaokang Chen,Gang Zeng. "Joint Implicit Image Function for Guided Depth Super-Resolution" | [Home Page] | [PDF]

  • Ziqi Yuan,Wei Li,Hua Xu,Wenmeng Yu. "Transformer-based Feature Reconstruction Network for Robust Multimodal Sentiment Analysis" | [Home Page] | [PDF]

  • Jun Xiao,Qian Ye,Rui Zhao,Kin-Man Lam,Kao Wan. "Self-feature Learning: An Efficient Deep Lightweight Network for Image Super-resolution" | [Home Page] | [PDF]

  • Sebastian Szyller,Buse Gul Atli,Samuel Marchal,N. Asokan. "DAWN: Dynamic Adversarial Watermarking of Neural Networks" | [Home Page] | [PDF]

  • Jing Liang,Li Niu,Fengjun Guo,Teng Long,Liqing Zhang. "Visible Watermark Removal via Self-calibrated Localization and Background Refinement" | [Home Page] | [PDF]

  • Ruoxi Deng,Shengjun Liu,Jinxin Wang,Huibing Wang,Hanli Zhao,Xiaoqin Zhang. "Learning to Decode Contextual Information for Efficient Contour Detection" | [Home Page] | [PDF]

  • Yiguo Qiao,Licheng Jiao,Wenbin Li,Christian Richardt,Darren Cosker. "Fast, High-Quality Hierarchical Depth-Map Super-Resolution" | [Home Page] | [PDF]

  • Nan Xiang,Xiaosong Yang,Jian J Zhang. "TsFPS: An Accurate and Flexible 6DoF Tracking System with Fiducial Platonic Solids" | [Home Page] | [PDF]

  • Xiao Wang,Zheng Wang,Wu Liu,Xin Xu,Jing Chen,Chia-Wen Lin. "Consistency-Constancy Bi-Knowledge Learning for Pedestrian Detection in Night Surveillance" | [Home Page] | [PDF]

  • Yudong Wang,Liang-Jian Deng,Tian-Jing Zhang,Xiao Wu. "SSconv: Explicit Spectral-to-Spatial Convolution for Pansharpening" | [Home Page] | [PDF]

  • Zhengyi Liu,Yuan Wang,Zhengzheng Tu,Yun Xiao,Bin Tang. "TriTransNet: RGB-D Salient Object Detection with a Triplet Transformer Embedding Network" | [Home Page] | [PDF]

  • Pu Li,Xiaobai Liu,Xiaohui Xie. "Learning Sample-Specific Policies for Sequential Image Augmentation" | [Home Page] | [PDF]

  • Wen Yang,Jinjian Wu,Leida Li,Weisheng Dong,Guangming Shi. "Image Quality Caption with Attentive and Recurrent Semantic Attractor Network" | [Home Page] | [PDF]

  • Weizhi Nie,Jiesi Li,Ning Xu,An-An Liu,Xuanya Li,Yongdong Zhang. "Triangle-Reward Reinforcement Learning: A Visual-Linguistic Semantic Alignment for Image Captioning" | [Home Page] | [PDF]

  • Huiyuan Fu,Changhao Tian,Xin Wang,Huadong Ma. "Stacked Semantically-Guided Learning for Image De-distortion" | [Home Page] | [PDF]

  • Yudong Han,Yangyang Guo,Jianhua Yin,Meng Liu,Yupeng Hu,Liqiang Nie. "Focal and Composed Vision-semantic Modeling for Visual Question Answering" | [Home Page] | [PDF]

  • Kecheng Zheng,Cuiling Lan,Wenjun Zeng,Jiawei Liu,Zhizheng Zhang,Zheng-Jun Zha. "Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification" | [Home Page] | [PDF]

  • Jiayuan Xie,Yi Cai,Qingbao Huang,Tao Wang. "Multiple Objects-Aware Visual Question Generation" | [Home Page] | [PDF]

  • Chamara Madarasingha,Kanchana Thilakarathna. "VASTile: Viewport Adaptive Scalable 360-Degree Video Frame Tiling" | [Home Page] | [PDF]

  • Li Ding,Yongwei Wang,Xin Ding,Kaiwen Yuan,Ping Wang,Hua Huang,Z. Jane Wang. "Delving into Deep Image Prior for Adversarial Defense: A Novel Reconstruction-based Defense Framework" | [Home Page] | [PDF]

  • Yongrui Li,Shilian Wu,Jun Yu,Zengfu Wang. "Fine-Grained Language Identification in Scene Text Images" | [Home Page] | [PDF]

  • Dongjie Tang,Cathy Bao,Yong Yao,Chao Xie,Qiming Shi,Marc Mao,Randy Xu,Linsheng Li,Mohammad R. Haghighat,Zhengwei Qi,Haibing Guan. "CARE: Cloudified Android OSes on the Cloud Rendering" | [Home Page] | [PDF]

  • Shuangping Huang,Yu Luo,Zhenzhou Zhuang,Jin-Gang Yu,Mengchao He,Yongpan Wang. "Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model" | [Home Page] | [PDF]

  • Chunbin Gu,Jiajun Bu,Zhen Zhang,Zhi Yu,Dongfang Ma,Wei Wang. "Image Search with Text Feedback by Deep Hierarchical Attention Mutual Information Maximization" | [Home Page] | [PDF]

  • Hayley Hung,Cathal Gurrin,Martha Larson,Hatice Gunes,Fabien Ringeval,Elisabeth André,Louis-Philippe Morency. "Social Signals and Multimedia: Past, Present, Future" | [Home Page] | [PDF]

  • Xianqiang Lyu,Zhiyu Zhu,Mantang Guo,Jing Jin,Junhui Hou,Huanqiang Zeng. "Learning Spatial-angular Fusion for Compressive Light Field Imaging in a Cycle-consistent Framework" | [Home Page] | [PDF]

  • Jiale Li,Hang Dai,Ling Shao,Yong Ding. "From Voxel to Point: IoU-guided 3D Object Detection for Point Cloud with Voxel-to-Point Decoder" | [Home Page] | [PDF]

  • Jisheng Li,Yuze He,Jinghui Jiao,Yubin Hu,Yuxing Han,Jiangtao Wen. "Extending 6-DoF VR Experience Via Multi-Sphere Images Interpolation" | [Home Page] | [PDF]

  • Liao Wang,Ziyu Wang,Pei Lin,Yuheng Jiang,Xin Suo,Minye Wu,Lan Xu,Jingyi Yu. "iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering" | [Home Page] | [PDF]

  • Guoxing Sun,Xin Chen,Yizhang Chen,Anqi Pang,Pei Lin,Yuheng Jiang,Lan Xu,Jingyi Yu,Jingya Wang. "Neural Free-Viewpoint Performance Rendering under Complex Human-object Interactions" | [Home Page] | [PDF]

  • Hongkuan Shi,Zhiwei Wang,Jinxin Lv,Yilang Wang,Peng Zhang,Fei Zhu,Qiang Li. "Semi-supervised Learning via Improved Teacher-Student Network for Robust 3D Reconstruction of Stereo Endoscopic Image" | [Home Page] | [PDF]

  • Qiang Hou,Weiqing Min,Jing Wang,Sujuan Hou,Yuanjie Zheng,Shuqiang Jiang. "FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling Network" | [Home Page] | [PDF]

  • Jing Wang,Yuanjie Zheng,Jingqi Song,Sujuan Hou. "Cross-View Representation Learning for Multi-View Logo Classification with Information Bottleneck" | [Home Page] | [PDF]

  • Xiangjun Tang,WenXin Sun,Yong-Liang Yang,Xiaogang Jin. "Parametric Reshaping of Portraits in Videos" | [Home Page] | [PDF]

  • Anshu Singh,Shaojing Fan,Mohan Kankanhalli. "Human Attributes Prediction under Privacy-preserving Conditions" | [Home Page] | [PDF]

  • Bin Liang,Chenwei Lou,Xiang Li,Lin Gui,Min Yang,Ruifeng Xu. "Multi-Modal Sarcasm Detection with Interactive In-Modal and Cross-Modal Graphs" | [Home Page] | [PDF]

  • Shiwei Wu,Joya Chen,Tong Xu,Liyi Chen,Lingfei Wu,Yao Hu,Enhong Chen. "Linking the Characters: Video-oriented Social Graph Generation via Hierarchical-cumulative GCN" | [Home Page] | [PDF]

  • Zhenzhi Wang,Zhimin Li,Liyu Wu,Jiangfeng Xiong,Qinglin Lu. "Overview of Tencent Multi-modal Ads Video Understanding" | [Home Page] | [PDF]

  • Haoxin Zhang,Zhimin Li,Qinglin Lu. "Better Learning Shot Boundary Detection via Multi-task" | [Home Page] | [PDF]

  • Xinqi Fan,Ali Raza Shahid,Hong Yan. "Facial Micro-Expression Generation based on Deep Motion Retargeting and Transfer Learning" | [Home Page] | [PDF]

  • Chao Zhou,Wenjun Wu,Dan Yang,Tianchi Huang,Liang Guo,Bing Yu. "Deadline and Priority-aware Congestion Control for Delay-sensitive Multimedia Streaming" | [Home Page] | [PDF]

  • Wang-Wang Yu,Jingwen Jiang,Yong-Jie Li. "LSSNet: A Two-stream Convolutional Neural Network for Spotting Macro- and Micro-expression in Long Videos" | [Home Page] | [PDF]

  • Chengbo Dong,Xinru Chen,Aozhu Chen,Fan Hu,Zihan Wang,Xirong Li. "Multi-Level Visual Representation with Semantic-Reinforced Learning for Video Captioning" | [Home Page] | [PDF]

  • Yi Zhang,Youjun Zhao,Yuhang Wen,Zixuan Tang,Xinhua Xu,Mengyuan Liu. "Facial Prior Based First Order Motion Model for Micro-expression Generation" | [Home Page] | [PDF]

  • Xuansheng Wu,Feichi Yang,Tong Zhou,Xinyue Lin. "Rethinking the Impacts of Overfitting and Feature Quality on Small-scale Video Classification" | [Home Page] | [PDF]

  • Fuxing Leng. "A Gradient Balancing Approach for Robust Logo Detection" | [Home Page] | [PDF]

  • Daya Guo,Zhaoyang Zeng. "Multi-modal Representation Learning for Video Advertisement Content Structuring" | [Home Page] | [PDF]

  • Haozhe Li. "Phoenix: Combining Highest-Profit First Scheduling and Responsive Congestion Control for Delay-sensitive Multimedia Transmission" | [Home Page] | [PDF]

  • Wei Ji,Yicong Li,Meng Wei,Xindi Shang,Junbin Xiao,Tongwei Ren,Tat-Seng Chua. "VidVRD 2021: The Third Grand Challenge on Video Relation Detection" | [Home Page] | [PDF]

  • Weipeng Xu,Ye Liu,Daquan Lin. "A Simple and Effective Baseline for Robust Logo Detection" | [Home Page] | [PDF]

  • Hang Chen,Xiao Li,Zefan Wang,Xiaolin Hu. "Robust Logo Detection in E-Commerce Images by Data Augmentation" | [Home Page] | [PDF]

  • Bo Yang,Jianming Wu,Zhiguang Zhou,Megumi Komiya,Koki Kishimoto,Jianfeng Xu,Keisuke Nonaka,Toshiharu Horiuchi,Satoshi Komorita,Gen Hattori,Sei Naito,Yasuhiro Takishima. "Facial Action Unit-based Deep Learning Framework for Spotting Macro- and Micro-expressions in Long Video Sequences" | [Home Page] | [PDF]

  • Liwei Jin,Haoyue Cheng,Su Xu,Wayne Wu,Limin Wang. "NJU MCG - Sensetime Team Submission to Pre-training for Video Understanding Challenge Track II" | [Home Page] | [PDF]

  • He Yuhong. "Research on Micro-Expression Spotting Method Based on Optical Flow Features" | [Home Page] | [PDF]

  • Hao Wu,Jiajie Wang,Yuanzhe Gu,Peisen Zhao,Zhonglin Zu. "A Solution to Multi-modal Ads Video Tagging Challenge" | [Home Page] | [PDF]

  • Yifan Xu,Sirui Zhao,Huaying Tang,Xinglong Mao,Tong Xu,Enhong Chen. "FAMGAN: Fine-grained AUs Modulation based Generative Adversarial Network for Micro-Expression Generation" | [Home Page] | [PDF]

  • Yiqing Huang,Hongwei Xue,Jiansheng Chen,Huimin Ma,Hongbing Ma. "Semantic Tag Augmented XlanV Model for Video Captioning" | [Home Page] | [PDF]

  • Qin Lin,Nuo Pang,Zhiying Hong. "Automated Multi-Modal Video Editing for Ads Video" | [Home Page] | [PDF]

  • Dongyuan Su,Laizhong Cui,Lei Zhang,Yanyan Suo,Yan Qiu. "Rate Adaptation and Block Scheduling for Delay-sensitive Multimedia Applications" | [Home Page] | [PDF]

  • Kaifeng Gao,Long Chen,Yifeng Huang,Jun Xiao. "Video Relation Detection via Tracklet based Visual Transformer" | [Home Page] | [PDF]

  • Chris Birmingham,Kalin Stefanov,Maja J. Mataric. "Group-Level Focus of Visual Attention for Improved Next Speaker Prediction" | [Home Page] | [PDF]

  • Zejia Weng,Lingchen Meng,Rui Wang,Zuxuan Wu,Yu-Gang Jiang. "A Multimodal Framework for Video Ads Understanding" | [Home Page] | [PDF]

  • Beibei Zhang,Fan Yu,Yanxin Gao,Tongwei Ren,Gangshan Wu. "Joint Learning for Relationship and Interaction Analysis in Video with Multimodal Feature Fusion" | [Home Page] | [PDF]

  • Sihan Chen,Xinxin Zhu,Dongze Hao,Wei Liu,Jiawei Liu,Zijia Zhao,Longteng Guo,Jing Liu. "MM21 Pre-training for Video Understanding Challenge: Video Captioning with Pretraining Techniques" | [Home Page] | [PDF]

  • Mingkang Tang,Zhanyu Wang,Zhenhua Liu,Fengyun Rao,Dian Li,Xiu Li. "CLIP4Caption: CLIP for Video Caption" | [Home Page] | [PDF]

  • Jie Zhang,Junjie Deng,Mowei Wang,Yong Cui,Wei Tsang Ooi,Jiangchuan Liu,Xinyu Zhang,Kai Zheng,Yi Li. "The ACM Multimedia 2021 Meet Deadline Requirements Grand Challenge" | [Home Page] | [PDF]

  • Vishal Anand,Raksha Ramesh,Boshen Jin,Ziyin Wang,Xiaoxiao Lei,Ching-Yung Lin. "MultiModal Language Modelling on Knowledge Graphs for Deep Video Understanding" | [Home Page] | [PDF]

  • Eugene Yujun Fu,Michael W. Ngai. "Using Motion Histories for Eye Contact Detection in Multiperson Group Conversations" | [Home Page] | [PDF]

  • Philipp Müller,Michael Dietz,Dominik Schiller,Dominike Thomas,Guanhua Zhang,Patrick Gebhard,Elisabeth André,Andreas Bulling. "MultiMediate: Multi-modal Group Behaviour Analysis for Artificial Mediation" | [Home Page] | [PDF]

  • Vinit Veerendraveer Singh,Shivanand Venkanna Sheshappanavar,Chandra Kambhamettu. "MeshNet++: A Network with a Face" | [Home Page] | [PDF]

  • Mengshi Qi,Jie Qin,Di Huang,Zhiqiang Shen,Yi Yang,Jiebo Luo. "Latent Memory-augmented Graph Transformer for Visual Storytelling" | [Home Page] | [PDF]

  • Shunli Wang,Dingkang Yang,Peng Zhai,Chixiao Chen,Lihua Zhang. "TSA-Net: Tube Self-Attention Network for Action Quality Assessment" | [Home Page] | [PDF]

  • Xiangpeng Li,Lianli Gao,Lei Zhao,Jingkuan Song. "Exploring Contextual-Aware Representation and Linguistic-Diverse Expression for Visual Dialog" | [Home Page] | [PDF]

  • Injung Lee,Hyunchul Kim,Byungjoo Lee. "Automated Playtesting with a Cognitive Model of Sensorimotor Coordination" | [Home Page] | [PDF]

  • Yifan Ren,Xing Xu,Fumin Shen,Yazhou Yao,Huimin Lu. "CAA: Candidate-Aware Aggregation for Temporal Action Detection" | [Home Page] | [PDF]

  • Zehui Chen,Chenhongyi Yang,Qiaofei Li,Feng Zhao,Zheng-Jun Zha,Feng Wu. "Disentangle Your Dense Object Detector" | [Home Page] | [PDF]

  • Lei Hu,Shaoli Huang,Shilei Wang,Wei Liu,Jifeng Ning. "Do We Really Need Frame-by-Frame Annotation Datasets for Object Tracking?" | [Home Page] | [PDF]

  • Xu Chen,Chenqiang Gao,Feng Yang,Xiaohan Wang,Yi Yang,Yahong Han. "Video-to-Image Casting: A Flatting Method for Video Analysis" | [Home Page] | [PDF]

  • Zhirui Zhao,Changqun Xia,Chenxi Xie,Jia Li. "Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection" | [Home Page] | [PDF]

  • Kedi Lyu,Zhenguang Liu,Shuang Wu,Haipeng Chen,Xuhong Zhang,Yuyu Yin. "Learning Human Motion Prediction via Stochastic Differential Equations" | [Home Page] | [PDF]

  • Ning Wang,Guangming Zhu,Liang Zhang,Peiyi Shen,Hongsheng Li,Cong Hua. "Spatio-Temporal Interaction Graph Parsing Networks for Human-Object Interaction Recognition" | [Home Page] | [PDF]

  • Xiang Guan,Guoqing Wang,Xing Xu,Yi Bin. "Learning Hierarchal Channel Attention for Fine-grained Visual Classification" | [Home Page] | [PDF]

  • Jiuniu Wang,Wenjia Xu,Qingzhong Wang,Antoni B. Chan. "Group-based Distinctive Image Captioning with Memory Attention" | [Home Page] | [PDF]

  • Lei Li,Chun Yuan. "VQMG: Hierarchical Vector Quantised and Multi-hops Graph Reasoning for Explicit Representation Learning" | [Home Page] | [PDF]

  • Minli Li,Peilin Zhao,Yifan Zhang,Shuaicheng Niu,Qingyao Wu,Mingkui Tan. "Structure-aware Mathematical Expression Recognition with Sequence-Level Modeling" | [Home Page] | [PDF]

  • Ying Cheng,Ruize Wang,Jiashuo Yu,Rui-Wei Zhao,Yuejie Zhang,Rui Feng. "Exploring Logical Reasoning for Referring Expression Comprehension" | [Home Page] | [PDF]

  • Zeliang Song,Xiaofei Zhou,Linhua Dong,Jianlong Tan,Li Guo. "Direction Relation Transformer for Image Captioning" | [Home Page] | [PDF]

  • Tao Jin,Zhou Zhao. "Contrastive Disentangled Meta-Learning for Signer-Independent Sign Language Translation" | [Home Page] | [PDF]

  • Zeming Liao,Qingbao Huang,Yu Liang,Mingyi Fu,Yi Cai,Qing Li. "Scene Graph with 3D Information for Change Captioning" | [Home Page] | [PDF]

  • Hongying Liu,Ruyi Luo,Fanhua Shang,Mantang Niu,Yuanyuan Liu. "Progressive Semantic Matching for Video-Text Retrieval" | [Home Page] | [PDF]

  • Qing Lin,Bo Yan,Weimin Tan. "Multimodal Asymmetric Dual Learning for Unsupervised Eyeglasses Removal" | [Home Page] | [PDF]

  • Dong An,Yuankai Qi,Yan Huang,Qi Wu,Liang Wang,Tieniu Tan. "Neighbor-view Enhanced Model for Vision and Language Navigation" | [Home Page] | [PDF]

  • Yi Bin,Xindi Shang,Bo Peng,Yujuan Ding,Tat-Seng Chua. "Multi-Perspective Video Captioning" | [Home Page] | [PDF]

  • Hui Wang,Dan Guo,Xian-Sheng Hua,Meng Wang. "Pairwise VLAD Interaction Network for Video Question Answering" | [Home Page] | [PDF]

  • Yunke Zhang,Chi Wang,Miaomiao Cui,Peiran Ren,Xuansong Xie,Xian-Sheng Hua,Hujun Bao,Qixing Huang,Weiwei Xu. "Attention-guided Temporally Coherent Video Object Matting" | [Home Page] | [PDF]

  • Roy Ka-Wei Lee,Rui Cao,Ziqing Fan,Jing Jiang,Wen-Haw Chong. "Disentangling Hate in Online Memes" | [Home Page] | [PDF]

  • Jiutao Yue,Haofeng Li,Pengxu Wei,Guanbin Li,Liang Lin. "Robust Real-World Image Super-Resolution against Adversarial Attacks" | [Home Page] | [PDF]

  • Chaoning Zhang,Adil Karjauv,Philipp Benz,In So Kweon. "Towards Robust Deep Hiding Under Non-Differentiable Distortions for Practical Blind Watermarking" | [Home Page] | [PDF]

  • Liuwu Li,Yuqi Bu,Yi Cai. "Bottom-Up and Bidirectional Alignment for Referring Expression Comprehension" | [Home Page] | [PDF]

  • Lingyun Zhang,Xiuxiu Bai,Yao Gao. "SalS-GAN: Spatially-Adaptive Latent Space in StyleGAN for Real Image Embedding" | [Home Page] | [PDF]

  • Xuri Ge,Fuhai Chen,Joemon M. Jose,Zhilong Ji,Zhongqin Wu,Xiao Liu. "Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval" | [Home Page] | [PDF]

  • Clinton Mo,Kun Hu,Shaohui Mei,Zebin Chen,Zhiyong Wang. "Keyframe Extraction from Motion Capture Sequences with Graph based Deep Reinforcement Learning" | [Home Page] | [PDF]

  • Lei Shi,Kai Shuang,Shijie Geng,Peng Gao,Zuohui Fu,Gerard de Melo,Yunpeng Chen,Sen Su. "Dense Contrastive Visual-Linguistic Pretraining" | [Home Page] | [PDF]

  • Weijiang Yu,Jian Liang,Lei Ji,Lu Li,Yuejian Fang,Nong Xiao,Nan Duan. "Hybrid Reasoning Network for Video-based Commonsense Captioning" | [Home Page] | [PDF]

  • Guibao Shen,Yingkui Zhang,Jialu Li,Mingqiang Wei,Qiong Wang,Guangyong Chen,Pheng-Ann Heng. "Learning Regularizer for Monocular Depth Estimation with Adversarial Guidance" | [Home Page] | [PDF]

  • Wenyu Zhang,Qing Ding,Jian Hu,Yi Ma,Mingzhe Lu. "Pixel-wise Graph Attention Networks for Person Re-identification" | [Home Page] | [PDF]

  • Xiaomeng Chu,Jiajun Deng,Yao Li,Zhenxun Yuan,Yanyong Zhang,Jianmin Ji,Yu Zhang. "Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting" | [Home Page] | [PDF]

  • Rui Ma,Hanxiao Luo,Qingbo Wu,King Ngi Ngan,Hongliang Li,Fanman Meng,Linfeng Xu. "Remember and Reuse: Cross-Task Blind Image Quality Assessment via Relevance-aware Incremental Learning" | [Home Page] | [PDF]

  • Yajun Gao,Tengfei Liang,Yi Jin,Xiaoyan Gu,Wu Liu,Yidong Li,Congyan Lang. "MSO: Multi-Feature Space Joint Optimization Network for RGB-Infrared Person Re-Identification" | [Home Page] | [PDF]

  • Wen-xu Tao,Gang-yi Jiang,Zhi-di Jiang,Mei Yu. "Point Cloud Projection and Multi-Scale Feature Fusion Network Based Blind Quality Assessment for Colored Point Clouds" | [Home Page] | [PDF]

  • Guangjun Li,Yongxiong Wang,Fengting Zhu. "Multi-branch Channel-wise Enhancement Network for Fine-grained Visual Recognition" | [Home Page] | [PDF]

  • Bowei Zhu,Yong Liu. "General Approximate Cross Validation for Model Selection: Supervised, Semi-supervised and Pairwise Learning" | [Home Page] | [PDF]

  • Qian Ye,Jun Xiao,Kin-man Lam,Takayuki Okatani. "Progressive and Selective Fusion Network for High Dynamic Range Imaging" | [Home Page] | [PDF]

  • Changmeng Zheng,Junhao Feng,Ze Fu,Yi Cai,Qing Li,Tao Wang. "Multimodal Relation Extraction with Efficient Graph Alignment" | [Home Page] | [PDF]

  • Jia Tan,Nan Ji,Haidong Xie,Xueshuang Xiang. "Legitimate Adversarial Patches: Evading Human Eyes and Detection Models in the Physical World" | [Home Page] | [PDF]

  • Xian Zhong,Shilei Zhao,Xiao Wang,Kui Jiang,Wenxuan Liu,Wenxin Huang,Zheng Wang. "Unsupervised Vehicle Search in the Wild: A New Benchmark" | [Home Page] | [PDF]

  • Yuqian Fu,Yanwei Fu,Yu-Gang Jiang. "Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data" | [Home Page] | [PDF]

  • Jiliang Yan,Deming Zhai,Junjun Jiang,Xianming Liu. "Target-guided Adaptive Base Class Reweighting for Few-Shot Learning" | [Home Page] | [PDF]

  • Yunzhi Zhuge,Chunhua Shen. "Deep Reasoning Network for Few-shot Semantic Segmentation" | [Home Page] | [PDF]

  • Gangjian Zhang,Shikui Wei,Huaxin Pang,Yao Zhao. "Heterogeneous Feature Fusion and Cross-modal Alignment for Composed Image Retrieval" | [Home Page] | [PDF]

  • Guodun Li,Yuchen Zhai,Zehao Lin,Yin Zhang. "Similar Scenes Arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning" | [Home Page] | [PDF]

  • Danni Xu,Ruimin Hu,Zixiang Xiong,Zheng Wang,Linbo Luo,Dengshi Li. "Trajectory is not Enough: Hidden Following Detection" | [Home Page] | [PDF]

  • Yinwei Wei,Xiang Wang,Qi Li,Liqiang Nie,Yan Li,Xuanping Li,Tat-Seng Chua. "Contrastive Learning for Cold-Start Recommendation" | [Home Page] | [PDF]

  • Jixin Liu,Rui Chen,Shipeng An,Heng Zhang. "CG-GAN: Class-Attribute Guided Generative Adversarial Network for Old Photo Restoration" | [Home Page] | [PDF]

  • Zekun Zheng,Xiaodong Wang,Xinye Lin,Shaohe Lv. "Get The Best of the Three Worlds: Real-Time Neural Image Compression in a Non-GPU Environment" | [Home Page] | [PDF]

  • Ye Zheng,Xi Huang,Li Cui. "Visual Language Based Succinct Zero-Shot Object Detection" | [Home Page] | [PDF]

  • Bo Jiang,Pengfei Sun,Ziyan Zhang,Jin Tang,Bin Luo. "GAMnet: Robust Feature Matching via Graph Adversarial-Matching Network" | [Home Page] | [PDF]

  • Zhixiong Zeng,Ying Sun,Wenji Mao. "MCCN: Multimodal Coordinated Clustering Network for Large-Scale Cross-modal Retrieval" | [Home Page] | [PDF]

  • Yi Ma,Yongqi Zhai,Jiayu Yang,Chunhui Yang,Ronggang Wang. "AFEC: Adaptive Feature Extraction Modules for Learned Image Compression" | [Home Page] | [PDF]

  • Chengcheng Zhou,Zongqing Lu,Linge Li,Qiangyu Yan,Jing-Hao Xue. "How Video Super-Resolution and Frame Interpolation Mutually Benefit" | [Home Page] | [PDF]

  • Lingdong Wang,Mohammad Hajiesmaili,Ramesh K. Sitaraman. "FOCAS: Practical Video Super Resolution using Foveated Rendering" | [Home Page] | [PDF]

  • Xiangrong Zhang,Zelin Peng,Peng Zhu,Tianyang Zhang,Chen Li,Huiyu Zhou,Licheng Jiao. "Adaptive Affinity Loss and Erroneous Pseudo-Label Refinement for Weakly Supervised Semantic Segmentation" | [Home Page] | [PDF]

  • Jialin Tian,Xing Xu,Zheng Wang,Fumin Shen,Xin Liu. "Relationship-Preserving Knowledge Distillation for Zero-Shot Sketch Based Image Retrieval" | [Home Page] | [PDF]

  • Francesco Bongini,Lorenzo Berlincioni,Marco Bertini,Alberto Del Bimbo. "Partially Fake it Till you Make It: Mixing Real and Fake Thermal Images for Improved Object Detection" | [Home Page] | [PDF]

  • Tianshuo Xu,Yuhang Wu,Xiawu Zheng,Teng Xi,Gang Zhang,Errui Ding,Fei Chao,Rongrong Ji. "CDP: Towards Optimal Filter Pruning via Class-wise Discriminative Power" | [Home Page] | [PDF]

  • Tao Lu,Yuanzhi Wang,Yanduo Zhang,Yu Wang,Liu Wei,Zhongyuan Wang,Junjun Jiang. "Face Hallucination via Split-Attention in Split-Attention Network" | [Home Page] | [PDF]

  • Xianglong Feng,Yi Xie,Mengmei Ye,Zhongze Tang,Bo Yuan,Sheng Wei. "Fake Gradient: A Security and Privacy Protection Framework for DNN-based Image Classification" | [Home Page] | [PDF]

  • Zhihua Li,Xiang Deng,Xiaotian Li,Lijun Yin. "Integrating Semantic and Temporal Relationships in Facial Action Unit Detection" | [Home Page] | [PDF]

  • Md Fahim Faysal Khan,Nelson Daniel Troncoso Aldas,Abhishek Kumar,Siddharth Advani,Vijaykrishnan Narayanan. "Sparse to Dense Depth Completion using a Generative Adversarial Network with Intelligent Sampling Strategies" | [Home Page] | [PDF]

  • Siyan Xue,Shaobing Gao,Minjie Tan,Zhen He,Liangtian He. "How does Color Constancy Affect Target Recognition and Instance Segmentation?" | [Home Page] | [PDF]

  • Xinyang Feng,Dongjin Song,Yuncong Chen,Zhengzhang Chen,Jingchao Ni,Haifeng Chen. "Convolutional Transformer based Dual Discriminator Generative Adversarial Networks for Video Anomaly Detection" | [Home Page] | [PDF]

  • Yuan Chang,Yisong Chen,Guoping Wang. "Salient Error Detection based Refinement for Wide-baseline Image Interpolation" | [Home Page] | [PDF]

  • Rui Li,Yiting Wang,Bao-Liang Lu. "A Multi-Domain Adaptive Graph Convolutional Network for EEG-based Emotion Recognition" | [Home Page] | [PDF]

  • Zhenhong Sun,Zhiyu Tan,Xiuyu Sun,Fangyi Zhang,Yichen Qian,Dongyang Li,Hao Li. "Interpolation Variable Rate Image Compression" | [Home Page] | [PDF]

  • Songhe Wang,Zheng Bao,Jingtong E. "Armor: A Benchmark for Meta-evaluation of Artificial Music" | [Home Page] | [PDF]

  • Haiwen Hong,Xuan Jin,Yin Zhang,Yunqing Hu,Jingfeng Zhang,Yuan He,Hui Xue. "DRDF: Determining the Importance of Different Multimodal Information with Dual-Router Dynamic Framework" | [Home Page] | [PDF]

  • Jianjie Luo,Yehao Li,Yingwei Pan,Ting Yao,Hongyang Chao,Tao Mei. "CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising" | [Home Page] | [PDF]

  • Jiaqing Xu,Haifeng Sun,Qi Qi,Jingyu Wang,Ce Ge,Lejian Zhang,Jianxin Liao. "DLA-Net for FG-SBIR: Dynamic Local Aligned Network for Fine-Grained Sketch-Based Image Retrieval" | [Home Page] | [PDF]

  • Qianxiu Hao,Qianqian Xu,Zhiyong Yang,Qingming Huang. "Pareto Optimality for Fairness-constrained Collaborative Filtering" | [Home Page] | [PDF]

  • Yan Gao,Qimeng Wang,Xu Tang,Haochen Wang,Fei Ding,Jing Li,Yao Hu. "Decoupled IoU Regression for Object Detection" | [Home Page] | [PDF]

  • Zhuofan Zong,Qianggang Cao,Biao Leng. "RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection" | [Home Page] | [PDF]

  • Minyi Zhao,Yi Xu,Shuigeng Zhou. "Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction" | [Home Page] | [PDF]

  • Jan Zdenek,Hideki Nakayama. "JokerGAN: Memory-Efficient Model for Handwritten Text Generation with Text Line Awareness" | [Home Page] | [PDF]

  • Kede Ma,Yuming Fang. "Image Quality Assessment in the Modern Age" | [Home Page] | [PDF]

  • Xiaowen Huang,Jiaming Zhang,Yi Zhang,Xian Zhao,Jitao Sang. "Trustworthy Multimedia Analysis" | [Home Page] | [PDF]

  • Manjunath Iyer. "Multimedia Classifiers: Behind the Scenes" | [Home Page] | [PDF]

  • Jie Chen,Qixiang Ye,Xiaoshan Yang,S. Kevin Zhou,Xiaopeng Hong,Li Zhang. "Few-shot Learning for Multi-Modality Tasks" | [Home Page] | [PDF]

  • Antonio M. G. Pinheiro. "Plenoptic Quality Assessment: The JPEG Pleno Experience" | [Home Page] | [PDF]

  • Xu Tan,Xiaobing Li. "A Tutorial on AI Music Composition" | [Home Page] | [PDF]

  • Xin Wang,Peng Cui,Wenwu Zhu. "Out-of-distribution Generalization and Its Applications for Multimedia" | [Home Page] | [PDF]

  • Guo Lu,Ren Yang,Shenlong Wang,Shan Liu,Radu Timofte. "Deep Learning for Visual Data Compression" | [Home Page] | [PDF]

  • Aishan Liu,Xinyun Chen,Yingwei Li,Chaowei Xiao,Xun Yang,Xianglong Liu,Dawn Song,Dacheng Tao,Alan Yuille,Anima Anandkumar. "ADVM'21: 1st International Workshop on Adversarial Learning for Multimedia" | [Home Page] | [PDF]

  • Ricardo Guerrero,Michael Spranger,Shuqiang Jiang,Chong-Wah Ngo. "AIxFood'21: 3rd Workshop on AIxFood" | [Home Page] | [PDF]

  • Wu Liu,Xinchen Liu,Jingkuan Song,Dingwen Zhang,Wenbing Huang,Junbo Guo,John Smith. "HUMA'21: 2nd International Workshop on Human-centric Multimedia Analysis" | [Home Page] | [PDF]

  • Rainer Lienhart,Thomas B. Moeslund,Hideo Saito. "MMSports'21: 4th International Workshop on Multimedia Content Analysis in Sports" | [Home Page] | [PDF]

  • Valérie Gouet-Brunet,Margarita Khokhlova,Ronak Kosti,Li Weng. "SUMAC'21: 3rd Workshop on Structuring and Understanding of Multimedia heritAge Contents" | [Home Page] | [PDF]

  • Stevan Rudinac,Alessandro Bozzon,Tat-Seng Chua,Suzanne Little,Daniel Gatica-Perez,Kiyoharu Aizawa. "UrbanMM'21: 1st International Workshop on Multimedia Computing for Urban Data" | [Home Page] | [PDF]

  • Stefan Winkler,Weiling Chen,Abhinav Dhall,Pavel Korshunov. "ADGD'21: 1st Workshop on Synthetic Multimedia - Audiovisual Deepfake Generation and Detection" | [Home Page] | [PDF]

  • Jingting Li,Moi Hoon Yap,Wen-Huang Cheng,John See,Xiaopeng Hong,Xiaobai Li,Su-Jing Wang. "FME'21: 1st Workshop on Facial Micro-Expression: Advanced Techniques for Facial Expressions Generation and Spotting" | [Home Page] | [PDF]

  • Joao Magalhaes,Alexander G. Hauptmann,Ricardo G. Sousa,Carlos Santiago. "MuCAI'21: 2nd ACM Multimedia Workshop on Multimodal Conversational AI" | [Home Page] | [PDF]

  • Xiu-Shen Wei,Jufeng Yang,Han-Jia Ye,Jian Yang. "MULL'21: First International Workshop on Multimedia Understanding with Less Labeling" | [Home Page] | [PDF]

  • Lukas Stappen,Eva-Maria Meßner,Erik Cambria,Guoying Zhao,Björn W. Schuller. "MuSe 2021 Challenge: Multimodal Emotion, Sentiment, Physiological-Emotion, and Stress Detection" | [Home Page] | [PDF]

  • Teddy Furon,Jingen Liu,Yogesh Rawat,Wei Zhang,Qi Zhao. "Trustworthy AI'21: 1st International Workshop on Trustworthy AI for Multimedia Computing" | [Home Page] | [PDF]

  • Yueting Zhuang,Xing Tang,Guilin Wu,Yahong Han,Haihong Tang,Xiaobo Li,Xiaohan Wang,Baoming Yan,Bo Gao,Yi Yang. "WAB'21: 1st Workshop on Multimodal Product Identification in Livestreaming and WAB Challenge" | [Home Page] | [PDF]