2018-11-05~2018-11-09
这篇文章介绍43篇论文,涉及CNN、图像分类、数据增广、Face、图像分割、OCR、GAN、风格迁移、目标跟踪、数据集和姿态估计等方向。
《The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale》
IJCV
arXiv:https://arxiv.org/abs/1811.00982
数据集官网:https://storage.googleapis.com/openimages/web/index.html
注:920w 幅图像
《Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning》
CVPR 2018
arXiv:https://arxiv.org/abs/1811.02307
datasets:https://usa.honda-ri.com/hdd
《Invertible Residual Networks》
arXiv:https://arxiv.org/abs/1811.00995
《You Only Search Once: Single Shot Neural Architecture Search via Direct Sparse Optimization》
ICLR2019 Submission
arXiv:https://arxiv.org/abs/1811.01567
注:图森中实习生的work,超越NAS
《Bi-Real Net: Binarizing Deep Network Towards Real-Network Performance》
Submitted to IJCV 2018
arXiv:https://arxiv.org/abs/1811.01335
《Activation Functions: Comparison of trends in Practice and Research for Deep Learning》
arXiv:https://arxiv.org/abs/1811.03378
《Microscopic Nuclei Classification, Segmentation and Detection with improved Deep Convolutional Neural Network (DCNN) Approaches》
arXiv:https://arxiv.org/abs/1811.03447
《ColorUNet: A convolutional classification approach to colorization》
arXiv:https://arxiv.org/abs/1811.03120
《ExGate: Externally Controlled Gating for Feature-based Attention in Artificial Neural Networks》
arXiv:https://arxiv.org/abs/1811.03403
《Learning from Large-scale Noisy Web Data with Ubiquitous Reweighting for Image Classification》
arXiv:https://arxiv.org/abs/1811.00700
数据增广
《Hide-and-Seek: A Data Augmentation Technique for Weakly-Supervised Localization and Beyond》
TPAMI
arXiv:https://arxiv.org/abs/1811.02545
《Exposing DeepFake Videos By Detecting Face Warping Artifacts》
arXiv:https://arxiv.org/abs/1811.00656
《Exposing Deep Fakes Using Inconsistent Head Poses》
arXiv:https://arxiv.org/abs/1811.00661
《Fast Face Image Synthesis with Minimal Training》
WACV 2019
arXiv:https://arxiv.org/abs/1811.01474
datasets:https://cvrl.nd.edu/projects/data/
《Facial Landmark Detection for Manga Images》
arXiv:https://arxiv.org/abs/1811.03214
《Real-time Driver Drowsiness Detection for Android Application Using Deep Neural Networks Techniques》
arXiv:https://arxiv.org/abs/1811.01627
《Query-based Logo Segmentation》
arXiv:https://arxiv.org/abs/1811.01395
《Prediction Error Meta Classification in Semantic Segmentation: Detection via Aggregated Dispersion Measures of Softmax Probabilities》
arXiv:https://arxiv.org/abs/1811.00648
《Unsupervised RGBD Video Object Segmentation Using GANs》
ACCV workshop
arXiv:https://arxiv.org/abs/1811.01526
《DUNet: A deformable network for retinal vessel segmentation》
arXiv:https://arxiv.org/abs/1811.01206
《Ischemic Stroke Lesion Segmentation in CT Perfusion Scans using Pyramid Pooling and Focal Loss》
2018 MICCAI workshop
arXiv:https://arxiv.org/abs/1811.01085
《An End-to-end Approach to Semantic Segmentation with 3D CNN and Posterior-CRF in Medical Images》
NIPS 2018 Workshop
arXiv:https://arxiv.org/abs/1811.03549
《Adaptive Semantic Segmentation with a Strategic Curriculum of Proxy Labels》
arXiv:https://arxiv.org/abs/1811.03542
《Deep Semantic Instance Segmentation of Tree-like Structures Using Synthetic Data》
WACV 2019
arXiv:https://arxiv.org/abs/1811.03208
《Improving GAN with neighbors embedding and gradient matching》
AAAI 2019
arXiv:https://arxiv.org/abs/1811.01333
《A General Theory of Equivariant CNNs on Homogeneous Spaces》
arXiv:https://arxiv.org/abs/1811.02017
《Triple consistency loss for pairing distributions in GAN-based face synthesis》
arXiv:https://arxiv.org/abs/1811.03492
github:https://github.com/ESanchezLozano/GANnotation
youtube:https://youtu.be/-8r7zexg4yg
OCR
《Auto-ML Deep Learning for Rashi Scripts OCR》
arXiv:https://arxiv.org/abs/1811.01290
《Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition》
arXiv:https://arxiv.org/abs/1811.00751
《Evolvement Constrained Adversarial Learning for Video Style Transfer》
arXiv:https://arxiv.org/abs/1811.02476
《Introduction to the 1st Place Winning Model of OpenImages Relationship Detection Challenge》
arXiv:https://arxiv.org/abs/1811.00662
《Improving Multi-Person Pose Estimation using Label Correction》
arXiv:https://arxiv.org/abs/1811.03331
《High Speed Tracking With A Fourier Domain Kernelized Correlation Filter》
arXiv:https://arxiv.org/abs/1811.03236
《Model Selection for Generalized Zero-shot Learning》
arXiv:https://arxiv.org/abs/1811.03252
《SPNet: Deep 3D Object Classification and Retrieval using Stereographic Projection》
arXiv:https://arxiv.org/abs/1811.01571
《Fast Adaptive Bilateral Filtering》
TIP
arXiv:https://arxiv.org/abs/1811.02308
《Fast High-Dimensional Bilateral and Nonlocal Means Filtering》
TIP
arXiv:https://arxiv.org/abs/1811.02363
《Continual Occlusions and Optical Flow Estimation》
ACCV 2018
arXiv:https://arxiv.org/abs/1811.01602
《Texture Synthesis Guided Deep Hashing for Texture Image Retrieval》
arXiv:https://arxiv.org/abs/1811.01401
《Semantic bottleneck for computer vision tasks》
ACCV 2018
arXiv:https://arxiv.org/abs/1811.02234
《3DCapsule: Extending the Capsule Architecture to Classify 3D Point Clouds》
WACV 2019
arXiv:https://arxiv.org/abs/1811.02191
《Automatic Thresholding of SIFT Descriptors》
ICIP 2016
arXiv:https://arxiv.org/abs/1811.03173
《DragonPaint: Rule based bootstrapping for small data with an application to cartoon coloring》