This is a curated list of resources that I have found useful regarding machine learning, in particular deep learning. Only books that add significant value to understanding the topic are listed. Also, a list of good articles and some other resources.
-
Introduction to Mathematical Statistics
Robert Hogg, Joeseph McKean and Allen Craig -
Numerical Linear Algebra and Applications
Biswa Nath Datta -
Introduction to Applied Linear Algebra – Vectors, Matrices, and Least Squares
-
Introduction To Machine Learning
Ethem Alpaydin
-
Reinforcement Learning: An Introduction
Richard Sutton and Andrew Barto -
The Elements of Statistical Learning
Trevor Hastie, Robert Tibshirani and Jerome Friedman -
Pattern Recognition and Machine Learning
Christopher Bishop -
Machine Learning: A Probabilistic Perspective
Kevin Murphy
-
Deep Learning
Ian Goodfellow, Yoshua Bengio and Aaron Courville -
Deep Learning with Python
François Chollet
-
From proprioception to long-horizon planning in novel environments: A hierarchical RL model
-
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
-
Object Goal Navigation using Goal-Oriented Semantic Exploration
-
Bilateral Attention Network for RGB-D Salient Object Detection
-
GrokNet: Unified Computer Vision Model Trunk and Embeddings For Commerce
-
ROAD SEGMENTATION ON LOW RESOLUTION LIDAR POINT CLOUDS FOR AUTONOMOUS VEHICLES
-
Multimodal and multiview distillation for real-time player detection on a football field
-
Deep High-Resolution Representation Learning for Human Pose Estimation
-
Graph Neural Networks Meet Neural-Symbolic Computing: A Survey and Perspective
-
SEED RL: SCALABLE AND EFFICIENT DEEP-RL WITH ACCELERATED CENTRAL INFERENCE
-
Learning Predictive Representations for Deformable Objects Using Contrastive Estimation
-
AutoML-Zero: Evolving Machine Learning Algorithms From Scratch
-
TensorFlow Quantum: A Software Framework for Quantum Machine Learning
-
SalsaNext: Fast Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving
-
HeatNet: Bridging the Day-Night Domain Gap in Semantic Segmentation with Thermal Images
-
MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships
-
Wavesplit: End-to-End Speech Separation by Speaker Clustering
-
Roto-Translation Equivariant Convolutional Networks: Application to Histopathology Image Analysis
-
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
-
Bayesian Deep Learning and a Probabilistic Perspective of Generalization
-
Wavesplit: End-to-End Speech Separation by Speaker Clustering
-
Learning Interpretable Models in the Property Specification Language
-
Universal-RCNN: Universal Object Detector via Transferable Graph R-CNN
-
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
-
The Next Decade in AI: Four Steps Towards Robust Artificial Intelligence
-
BADGR: An Autonomous Self-Supervised Learning-Based Navigation System
-
Variational Autoencoders and Nonlinear ICA: A Unifying Framework
-
Deep Reinforcement Learning for Autonomous Driving: A Survey
-
ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes
-
Fine-Tuning a Transformer-Based Language Model to Avoid Generating Non-Normative Text
-
DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images
-
SAUNet: Shape Attentive U-Net for Interpretable Medical Image Segmentation
-
High Speed and High Dynamic Range Video with an Event Camera
-
Pose-Assisted Multi-Camera Collaboration for Active Object Tracking
-
MatrixNets: A New Scale and Aspect Ratio Aware Architecture for Object Detection
-
ON THE RELATIONSHIP BETWEEN SELF-ATTENTION AND CONVOLUTIONAL LAYERS
-
OBJECTS DETECTION FOR REMOTE SENSING IMAGES BASED ON POLAR COORDINATES
-
How neural networks find generalizable solutions: Self-tuned annealing in deep learning
-
Make a Face: Towards Arbitrary High Fidelity Face Manipulation
-
Multi-organ Segmentation over Partially Labeled Datasets with Multi-scale Feature Abstraction
-
YOUR CLASSIFIER IS SECRETLY AN ENERGY BASED MODEL AND YOU SHOULD TREAT IT LIKE ONE
-
Compositional generalization through meta sequence-to-sequence learning
-
U-Net with spatial pyramid pooling for drusen segmentation in optical coherence tomography
-
From deep learning to mechanistic understanding in neuroscience: the structure of retinal prediction
-
Interrogating theoretical models of neural computation with deep inference
-
RealMix: Towards Realistic Semi-Supervised Deep Learning Algorithms
-
Surface HOF: Surface Reconstruction from a Single Image Using Higher Order Function Networks
-
RED-NET: A RECURSIVE ENCODER-DECODER NETWORK FOR EDGE DETECTION
-
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
-
Putting An End to End-to-End: Gradient-Isolated Learning of Representations
-
DATA-EFFICIENT IMAGE RECOGNITION WITH CONTRASTIVE PREDICTIVE CODING
-
Reinforcement Learning for Market Making in a Multi-agent Dealer Market
-
A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models
-
Stealing Knowledge from Protected Deep Neural Networks Using Composite Unlabeled Data
-
PolyTransform: Deep Polygon Transformer for Instance Segmentation
-
Full-Gradient Representation for Neural Network Visualization
-
The intriguing role of module criticality in the generalization of deep networks
-
ViewAL: Active Learning with Viewpoint Entropy for Semantic Segmentation
-
SuperGlue: Learning Feature Matching with Graph Neural Networks
-
Merging-ISP: Multi-Exposure High Dynamic Range Image Signal Processing
-
Gauge Equivariant Convolutional Networks and the Icosahedral CNN
-
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
-
MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets
-
Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors
-
GLMNet: Graph Learning-Matching Networks for Feature Matching
-
Hyper-SAGNN: a self-attention based graph neural network for hypergraphs
-
GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs
-
Universal Intelligence: A Definition of Machine Intelligence
-
Momentum Contrast for Unsupervised Visual Representation Learning
-
Reconciling modern machine learning practice and the bias-variance trade-off
-
Optimizing Millions of Hyperparameters by Implicit Differentiation
-
Shaping Visual Representations with Language for Few-shot Classification
-
Self-training with Noisy Student improves ImageNet classification
-
Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale
-
Combinatorial Bayesian Optimization using the Graph Cartesian Product
-
Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks
-
GENERALIZATION THROUGH MEMORIZATION: NEAREST NEIGHBOR LANGUAGE MODELS
-
Generalization of Reinforcement Learners with Working and Episodic Memory
-
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data
-
Learning to Fix Build Errors with Graph2Diff Neural Networks
-
Image-Conditioned Graph Generation for Road Network Extraction
-
Optuna: A Next-generation Hyperparameter Optimization Framework
-
On the Interaction Between Deep Detectors and Siamese Trackers in Video Surveillance
-
Neural Ordinary Differential Equations for Semantic Segmentation of Individual Colon Glands
-
Adversarial NLI: A New Benchmark for Natural Language Understanding
-
Evaluating the Factual Consistency of Abstractive Text Summarization
-
Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
-
Neural Assistant: Joint Action Prediction, Response Generation, and Latent Knowledge Reasoning
-
A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models
-
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
-
SinGAN: Learning a Generative Model from a Single Natural Image
-
Neural Network Distiller: A Python Package For DNN Compression Research
-
DEEP CAUSAL REPRESENTATION LEARNING FOR UNSUPERVISED DOMAIN ADAPTATION
-
Learning to Predict Without Looking Ahead: World Models Without Forward Prediction
-
DIFFTAICHI: DIFFERENTIABLE PROGRAMMING FOR PHYSICAL SIMULATION
-
Capacity, Bandwidth, and Compositionality in Emergent Language Learning
-
BANANAS: Bayesian Optimization with Neural Architectures for Neural Architecture Search
-
A General Framework for Uncertainty Estimation in Deep Learning
-
ON MUTUAL INFORMATION MAXIMIZATION FOR REPRESENTATION LEARNING
-
Deep Reinforcement Learning meets Graph Neural Networks: An optical network routing use case
-
IMPROVING SAT SOLVER HEURISTICS WITH GRAPH NETWORKS AND REINFORCEMENT LEARNING
-
LEARNING PARTIAL DIFFERENTIAL EQUATIONS FROM DATA USING NEURAL NETWORKS
-
CASCADED GENERATION OF HIGH-QUALITY COLOR VISIBLE FACE IMAGES FROM THERMAL CAPTURES
-
GPU-ACCELERATED VITERBI EXACT LATTICE DECODER FOR BATCHED ONLINE AND OFFLINE SPEECH RECOGNITION
-
Teacher algorithms for curriculum learning of Deep RL in continuously parameterized environments
-
Gaze360: Physically Unconstrained Gaze Estimation in the Wild
-
Self-Supervised Sim-to-Real Adaptation for Visual Robotic Manipulation
-
Improved Sample Complexities for Deep Networks and Robust Classification via an All-Layer Margin
-
MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
-
Precise measurement of quantum observables with neural-network estimators
-
Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks
-
On the Utility of Learning about Humans for Human-AI Coordination
-
Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real Transfer
-
Restoring ancient text using deep learning: a case study on Greek epigraphy
-
MultiPath: Multiple Probabilistic Anchor Trajectory Hypotheses for Behavior Prediction
-
Landmarks-assisted Collaborative Deep Framework for Automatic 4D Facial Expression Recognition
-
Emergent properties of the local geometry of neural loss landscapes
-
Active Learning for Graph Neural Networks via Node Feature Propagation
-
Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels
-
Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs
-
A Mixed-Supervision Multilevel GAN Framework for Image Quality Enhancement
-
Exploring single-cell data with deep multitasking neural networks
-
Mish: A Self Regularized Non-Monotonic Neural Activation Function
-
Artistic Glyph Image Synthesis via One-Stage Few-Shot Learning
-
Theoretical Limits of Pipeline Parallel Optimization and Application to Distributed Deep Learning
-
Bayesian Optimization Meets Riemannian Manifolds in Robot Learning
-
LEARN TO EXPLAIN EFFICIENTLY VIA NEURAL LOGIC INDUCTIVE LEARNING
-
DIVERSITY IS ALL YOU NEED: LEARNING SKILLS WITHOUT A REWARD FUNCTION
-
Practical Posterior Error Bounds from Variational Objectives
-
FSA-Net: Learning Fine-Grained Structure Aggregation for Head Pose Estimation from a Single Image
-
Identifying Weights and Architectures of Unknown ReLU Networks
-
End-to-End Motion Planning of Quadrotors Using Deep Reinforcement Learning
-
YOLO Nano: a Highly Compact You Only Look Once Convolutional Neural Network for Object Detection
-
Bootstrapping Conditional GANs for Video Game Level Generation
-
CAUSAL INDUCTION FROM VISUAL OBSERVATIONS FOR GOAL DIRECTED TASKS
-
Improving Map Re-localization with Deep ‘Movable’ Objects Segmentation on 3D LiDAR Point Clouds
-
Accelerating Federated Learning via Momentum Gradient Descent
-
SPECTRAL INFERENCE NETWORKS: UNIFYING DEEP AND SPECTRAL LEARNING
-
Likelihood-free MCMC with Amortized Approximate Likelihood Ratios
-
DenseRaC: Joint 3D Pose and Shape Estimation by Dense Render-and-Compare
-
Efficient Graph Generation with Graph Recurrent Attention Networks
-
Fermionic neural-network states for ab-initio electronic structure
-
Hierarchical Decision Making by Generating and Following Natural Language Instructions
-
IS DEEP REINFORCEMENT LEARNING REALLY SUPER- HUMAN ON ATARI? LEVELING THE PLAYING FIELD
-
BagNet: Berkeley Analog Generator with Layout Optimizer Boosted with Deep Neural Networks
-
ALBERT: A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS
-
Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML
-
Making the Invisible Visible: Action Recognition Through Walls and Occlusions
-
JOINT GRAPH AND FEATURE LEARNING IN GRAPH CONVOLUTIONAL NEURAL NETWORKS
-
Forecaster: A Graph Transformer for Forecasting Spatial and Time-Dependent Data
-
Temporal Coding in Spiking Neural Networks with Alpha Synaptic Function
-
HoughNet: neural network architecture for vanishing points detection
-
HIERARCHICAL FORESIGHT: SELF-SUPERVISED LEARNING OF LONG-HORIZON TASKS VIA VISUAL SUBGOAL GENERATION
-
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
-
Video Interpolation and Prediction with Unsupervised Landmarks
-
Achieving Verified Robustness to Symbol Substitutions via Interval Bound Propagation
-
CTRL: A CONDITIONAL TRANSFORMER LANGUAGE MODEL FOR CONTROLLABLE GENERATION
-
On Extractive and Abstractive Neural Document Summarization with Transformer Language Models
-
rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch
-
Universal Adversarial Triggers for Attacking and Analyzing NLP
-
ParaQG: A System for Generating Questions and Answers from Paragraphs
-
Progressive Face Super-Resolution via Attention to Facial Landmark
-
Beyond Photo Realism for Domain Adaptation from Synthetic Data
-
PaperRobot: Incremental Draft Generation of Scientific Ideas
-
Y-Autoencoders: disentangling latent representations via sequential-encoding
-
MIDCURVENN: ENCODER-DECODER NEURAL NETWORK FOR COMPUTING MIDCURVE OF A THIN POLYGON
-
Chester: A Web Delivered Locally Computed Chest X-Ray Disease Prediction System
-
Adaptive Honeypot Engagement through Reinforcement Learning of Semi-Markov Decision Processes
-
Learning World Graphs to Accelerate Hierarchical Reinforcement Learning
-
Learning the Depths of Moving People by Watching Frozen People
-
A 64mW DNN-based Visual Navigation Engine for Autonomous Nano-Drones
-
Unsupervised Monocular Depth Estimation with Left-Right Consistency
-
Early Action Prediction with Generative Adversarial Networks
-
PR Product: A Substitute for Inner Product in Neural Networks
-
Multi-level Encoder-Decoder Architectures for Image Restoration
-
Few-Shot Adversarial Learning of Realistic Neural Talking Head Models
-
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
-
Multi-Sample Dropout for Accelerated Training and Better Generalization
-
SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Microcontrollers
-
DECISION-BASED ADVERSARIAL ATTACKS: RELIABLE ATTACKS AGAINST BLACK-BOX MACHINE LEARNING MODELS
-
Improving the Robustness of Deep Neural Networks via Stability Training
-
Improved Precision and Recall Metric for Assessing Generative Models
-
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
-
Fooling automated surveillance cameras: adversarial patches to attack person detection
-
Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras
-
WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation
-
Deep Learning based Large Scale Visual Recommendation and Search for E-Commerce
-
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism
-
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
-
Knowledge-based Analysis for Mortality Prediction from CT Images
-
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
-
Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity
-
Neural Inverse Knitting: From Images to Manufacturing Instructions
-
A Geometric Perspective on Optimal Representations for Reinforcement Learning
-
VAE-GANs for Probabilistic Compressive Image Recovery: Uncertainty Analysis
-
Approximating CNNs with Bag-of-local-Features models works surprisingly well on ImageNet
-
Towards reconstructing intelligible speech from the human auditory cortex
-
Deep, Skinny Neural Networks are not Universal Approximators
-
SelectiveNet: A Deep Neural Network with an Integrated Reject Option
-
Adversarial Examples Target Topological Holes in Deep Networks
-
Robust Recovery Controller for a Quadrupedal Robot using Deep Reinforcement Learning
-
Redundant Perception and State Estimation for Reliable Autonomous Racing
-
On the Effectiveness of Interval Bound Propagation for Training Verifiably Robust Models
-
Deep k-Nearest Neighbors: Towards Confident, Interpretable and Robust Deep Learning
-
Character-level Convolutional Networks for Text Classification
-
LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image
-
Negative Update Intervals in Deep Multi-Agent Reinforcement Learning
-
Global Pose Estimation with an Attention-based Recurrent Network
-
ON THE TURING COMPLETENESS OF MODERN NEURAL NETWORK ARCHITECTURES
-
Generation Meets Recommendation: Proposing Novel Items for Groups of Users
-
Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
-
Semi-supervised Tensor Factorization for Brain Network Analysis
-
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
-
OBJECT DETECTION BASED ON LIDAR TEMPORAL PULSES USING SPIKING NEURAL NETWORKS
-
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
-
Trying to Understand Recurrent Neural Networks for Language Processing
-
How deep is deep enough? - Optimizing deep neural network architecture
-
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
-
Deep Semantic Instance Segmentation of Tree-like Structures Using Synthetic Data
-
VIREL: A Variational Inference Framework for Reinforcement Learning
-
Gradient Descent Finds Global Minima of Deep Neural Networks
-
Inducing Interpretable Representations with Variational Autoencoders
-
TF-Ranking: Scalable TensorFlow Library for Learning-to-Rank
-
Counterfactuals uncover the modular structure of deep generative models
-
Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder
-
Deep Learning with Mixed Supervision for Brain Tumor Segmentation
-
A Tutorial on Deep Latent Variable Models of Natural Language
-
Analogues of mental simulation and imagination in deep learning
-
Gated-Dilated Networks for Lung Nodule Classification in CT scans
-
Machine learning & artificial intelligence in the quantum domain
-
GAN DISSECTION: VISUALIZING AND UNDERSTANDING GENERATIVE ADVERSARIAL NETWORKS
-
Non-Stationary Model for Crime Rate Inference Using Modern Urban Data
-
Macro action selection with deep reinforcement learning in StarCraft
-
Strike (with) a Pose: Neural Networks Are Easily Fooled by Strange Poses of Familiar Objects
-
Robust Artificial Intelligence and Robust Human Organizations
-
Leveraging Financial News for Stock Trend Prediction with Attention-Based Recurrent Neural Network
-
Spatio-temporal Stacked LSTM for Temperature Prediction in Weather Forecasting
-
Collaging on Internal Representations: An Intuitive Approach for Semantic Transfiguration
-
Matching Features without Descriptors: Implicitly Matched Interest Points (IMIPs)
-
Deep Learning can Replicate Adaptive Traders in a Limit-Order-Book Financial Market
-
WOULDA, COULDA, SHOULDA: COUNTERFACTUALLY-GUIDED POLICY SEARCH
-
DeepMasterPrints: Generating MasterPrints for Dictionary Attacks via Latent Variable Evolution
-
Curiosity Driven Exploration of Learned Disentangled Goal Spaces
-
Stochastic Gradient Descent Optimizes Over-parameterized Deep ReLU Networks
-
Predicting Diabetes Disease Evolution Using Financial Records and Recurrent Neural Networks
-
Solving Imperfect-Information Games via Discounted Regret Minimization
-
Stochastic Adaptive Neural Architecture Search for Keyword Spotting
-
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism
-
A Hierarchical Multi-task Approach for Learning Embeddings from Semantic Tasks
-
FUSIONSTITCHING: DEEP FUSION AND CODE GENERATION FOR TENSORFLOW COMPUTATIONS ON GPUS
-
ON THE STATISTICAL AND INFORMATION-THEORETIC CHARACTERISTICS OF DEEP NETWORK REPRESENTATIONS
-
A Convergence Theory for Deep Learning via Over-Parameterization
-
Complex-YOLO: An Euler-Region-Proposal for Real-time 3D Object Detection on Point Clouds
-
Controllable Neural Story Generation via Reinforcement Learning
-
Extending Pretrained Segmentation Networks with Additional Anatomical Structures
-
Pipe-SGD: A Decentralized Pipelined SGD Framework for Distributed Deep Net Training
-
Activation Functions: Comparison of Trends in Practice and Research for Deep Learning
-
MixTrain: Scalable Training of Formally Robust Neural Networks
-
PROGRESSIVE GROWING OF GANS FOR IMPROVED QUALITY, STABILITY, AND VARIATION
-
Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms?
-
Measuring the Effects of Data Parallelism on Neural Network Training
-
Learning Distributed Representations of Symbolic Structure Using Binding and Unbinding Operations
-
Towards a Simple Approach to Multi-step Model-based Reinforcement Learning
-
HORIZON: FACEBOOK’S OPEN SOURCE APPLIED REINFORCEMENT LEARNING PLATFORM
-
SDRL: Interpretable and Data-efficient Deep Reinforcement Learning Leveraging Symbolic Planning
-
CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images
-
Automatic Poetry Generation with Mutual Reinforcement Learning
-
Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
-
One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks
-
Dendritic cortical microcircuits approximate the backpropagation algorithm
-
Combinatorial Optimization with Graph Convolutional Networks and Guided Tree Search
-
BABYAI: FIRST STEPS TOWARDS GROUNDED LAN- GUAGE LEARNING WITH A HUMAN IN THE LOOP
-
Online Human Gesture Recognition using Recurrent Neural Networks and Wearable Sensors
-
Deep Graph Convolutional Encoders for Structured Data to Text Generation
-
LEARNED OPTIMIZERS THAT OUTPERFORM SGD ON WALL-CLOCK AND VALIDATION LOSS
-
Autowarp: Learning a Warping Distance from Unlabeled Time Series Using Sequence Autoencoders
-
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
-
BrainNet: A Multi-Person Brain-to-Brain Interface for Direct Collaboration Between Brains
-
Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks
-
Gradient Agreement as an Optimization Objective for Meta-Learning
-
Deep neural network based i-vector mapping for speaker verification using short utterances
-
Private Machine Learning in TensorFlow using Secure Computation
-
Structural Sparsity of Complex Networks: Bounded Expansion in Random Models and Real-World Graphs
-
UNSUPERVISED NEURAL MULTI-DOCUMENT ABSTRACTIVE SUMMARIZATION
-
At Human Speed: Deep Reinforcement Learning with Action Delay
-
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning
-
CURIOUS: Intrinsically Motivated Multi-Task, Multi-Goal Reinforcement Learning
-
Approximate Fisher Information Matrix to Characterise the Training of Deep Neural Networks
-
Arguing Machines: Human Supervision of Black Box AI Systems That Make Life-Critical Decisions
-
Person-Job Fit: Adapting the Right Talent for the Right Job with Joint Representation Learning
-
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
-
LARGE SCALE GAN TRAINING FOR HIGH FIDELITY NATURAL IMAGE SYNTHESIS
-
Knowing Where to Look? Analysis on Attention of Visual Question Answering System
-
DOUBLY REPARAMETERIZED GRADIENT ESTIMATORS FOR MONTE CARLO OBJECTIVES
-
Zero-Shot Skill Composition and Simulation-to-Real Transfer by Learning Task Representations
-
AutoLoss: Learning Discrete Schedules for Alternate Optimization
-
h-DETACH: MODIFYING THE LSTM GRADIENT TO- WARDS BETTER OPTIMIZATION
-
MyCaffe: A Complete C# Re-Write of Caffe with Reinforcement Learning
-
Gradient Descent Provably Optimizes Over-parameterized Neural Networks
-
A deep learning pipeline for product recognition in store shelves
-
Verification for Machine Learning, Autonomy, and Neural Networks Survey
-
MODULATED VARIATIONAL AUTO-ENCODERS FOR MANY-TO-MANY MUSICAL TIMBRE TRANSFER
-
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding
-
Learning to Remember, Forget, and Ignore using Attention Control in Memory
-
AlphaSeq: Sequence Discovery with Deep Reinforcement Learning
-
Few-Shot Goal Inference for Visuomotor Learning and Planning
-
Open Domain Question Answering Using Early Fusion of Knowledge Bases and Text
-
Net2Vec: Quantifying and Explaining how Concepts are Encoded by Filters in Deep Neural Networks
-
Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals
-
Graph Convolution over Pruned Dependency Trees Improves Relation Extraction
-
COMPUTATIONAL POWER AND THE SOCIAL IMPACT OF ARTIFICIAL INTELLIGENCE
-
A Survey of Learning Causality with Data: Problems and Methods
-
Understanding Convolutional Neural Networks for Text Classification
-
Learning Quickly to Plan Quickly Using Modular Meta-Learning
-
Solving Large Extensive-Form Games with Strategy Constraints
-
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
-
Freezing Subnetworks to Analyze Domain Adaptation in Neural Machine Translation
-
A Connectome Based Hexagonal Lattice Convolutional Network Model of the Drosophila Visual System
-
Long short-term memory and learning-to-learn in networks of spiking neurons
-
Multi-Task Learning for Email Search Ranking with Auxiliary Query Clustering
-
Automatic Judgment Prediction via Legal Reading Comprehension
-
Audio Adversarial Examples: Targeted Attacks on Speech-to-Text
-
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning
-
TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
-
The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo
-
Moments in Time Dataset: one million videos for event understanding
-
Visual Diagnostics for Deep Reinforcement Learning Policy Development
-
Textual Analogy Parsing: What’s Shared and What’s Compared among Analogous Facts
-
The Seven Tools of Causal Inference with Reflections on Machine Learning
-
Transparency and Explanation in Deep Reinforcement Learning Neural Networks
-
CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning
-
Model-Based Reinforcement Learning via Meta-Policy Optimization
-
SOLAR: Deep Structured Latent Representations for Model-Based Reinforcement Learning
-
Revisiting Character-Based Neural Machine Translation with Capacity and Compression
-
Neural Body Fitting: Unifying Deep Learning and Model-Based Human Pose and Shape Estimation
-
Learning Category-Specific Mesh Reconstruction from Image Collections
-
Question Answering by Reasoning Across Documents with Graph Convolutional Networks
-
Interpretation of Natural Language Rules in Conversational Machine Reading
-
How to Combine Tree-Search Methods in Reinforcement Learning
-
Dendritic error backpropagation in deep cortical microcircuits
-
Application of Self-Play Reinforcement Learning to a Four-Player Game of Imperfect Information
-
Twin-GAN – Unpaired Cross-Domain Image Translation with Weight-Sharing GANs
-
Time-Agnostic Prediction: Predicting Predictable Video Frames
-
robot gym: accelerated robot training through simulation in the cloud with ROS and Gazebo
-
Norm matters: efficient and accurate normalization schemes in deep networks
-
Unsupervised Learning of Syntactic Structure with Invertible Neural Projections
-
Contextual Parameter Generation for Universal Neural Machine Translation
-
LIFT: Reinforcement Learning in Computer Systems by Learning From Demonstrations
-
Sleep-Wake Classification Via Quantifying Heart Rate Variability By Convolutional Neural Network
-
TreeGAN: Syntax-Aware Sequence Generation with Generative Adversarial Networks
-
The relativistic discriminator: a key element missing from standard GAN
-
Transfer Learning for Estimating Causal Effects using Neural Networks
-
Relational inductive biases, deep learning, and graph networks
-
SeVeN: Augmenting Word Embeddings with Unsupervised Relation Vectors
-
Neural Body Fitting: Unifying Deep Learning and Model-Based Human Pose and Shape Estimation
-
Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval
-
Toward Domain-Invariant Speech Recognition Via Large Scale Training
-
Joint Training of Low-Precision Neural Network with Quantization Interval Parameters
-
3D Shape Perception from Monocular Vision, Touch, and Shape Priors
-
Anatomy Of High-Performance Deep Learning Convolutions On SIMD Architectures
-
Item Recommendation with Variational Autoencoders and Heterogeneous Priors
-
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
-
Toward Scale-Invariance and Position-Sensitive Region Proposal Networks
-
How Complex is your classification problem? A survey on measuring classification complexity
-
Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image
-
Rethinking Numerical Representations for Deep Neural Networks
-
Distributional Multivariate Policy Evaluation and Exploration with the Bellman GAN
-
TensorFuzz: Debugging Neural Networks with Coverage-Guided Fuzzing
-
The challenge of realistic music generation: modelling raw audio at scale
-
Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer
-
Reliable Uncertainty Estimates in Deep Neural Networks using Noise Contrastive Priors
-
The Variational Homoencoder: Learning to learn high capacity generative models from few examples
-
Variational Bayesian Reinforcement Learning with Regret Bounds
-
Backprop-Q: Generalized Backpropagation for Stochastic Computation Graphs
-
Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model
-
CNN Features off-the-shelf: an Astounding Baseline for Recognition
-
Multitask Reinforcement Learning for Zero-shot Generalization with Subtask Dependencies
-
An Opinionated Introduction to AutoML and Neural Architecture Search
-
ActiveStereoNet: End-to-End Self-Supervised Learning for Active Stereo Systems
-
IntroVAE: Introspective Variational Autoencoders for Photographic Image Synthesis
-
Variance Networks: When Expectation Does Not Meet Your Expectations
-
Task-Driven Convolutional Recurrent Models of the Visual System
-
Co-Training of Audio and Video Representations from Self-Supervised Temporal Synchronization
-
Dynamic Integration of Background Knowledge in Neural NLU Systems
-
The relativistic discriminator: a key element missing from standard GAN
-
Accurate Uncertainties for Deep Learning Using Calibrated Regression
-
ModaNet: A Large-Scale Street Fashion Dataset with Polygon Annotations
-
The challenge of realistic music generation: modelling raw audio at scale
-
Every Pixel Counts: Unsupervised Geometry Learning with Holistic 3D Motion Understanding
-
Differentiable Compositional Kernel Learning for Gaussian Processes
-
Global Pose Estimation with an Attention-based Recurrent Network
-
Fighting Fake News: Image Splice Detection via Learned Self-Consistency
-
Spectral Inference Networks: Unifying Spectral Methods With Deep Learning
-
Granger-causal Attentive Mixtures of Experts: Learning Important Features with Neural Networks
-
Relational inductive biases, deep learning, and graph networks
-
Multi-Resolution 3D Convolutional Neural Networks for Object Recognition
-
Structure from noise: Mental errors yield abstract represen- tations of events
-
Learning to Extract Coherent Summary via Deep Reinforcement Learning
-
MolGAN: An implicit generative model for small molecular graphs
-
Variational Inference for Data-Efficient Model Learning in POMDPs
-
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning
-
Aggregated Residual Transformations for Deep Neural Networks
-
Visceral Machines: Reinforcement Learning with Intrinsic Rewards that Mimic the Human Nervous System
-
AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
-
Progress & Compress: A scalable framework for continual learning
-
On the Practical Computational Power of Finite Precision RNNs for Language Recognition
-
Learning Intrinsic Image Decomposition from Watching the World
-
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images
-
Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond
-
Learning Intrinsic Image Decomposition from Watching the World
-
Time Series Epenthesis: Clustering Time Series Streams Requires Ignoring Some Data
-
Graph2Seq: Graph to Sequence Learning with Attention-based Neural Networks
-
Learning to Separate Object Sounds by Watching Unlabeled Video
-
Bayesian Robust Tensor Factorization for Incomplete Multiway Data
-
Quantization Mimic: Towards Very Tiny CNN for Object Detection
-
Efficient Convolutional Network Learning using Parametric Log based Dual-Tree Wavelet ScatterNet
-
Zero-shot Recognition via Semantic Embeddings and Knowledge Graphs
-
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
-
Realistic Evaluation of Deep Semi-Supervised Learning Algorithms
-
MergeNet: A Deep Net Architecture for Small Obstacle Discovery
-
Harmonic Networks: Deep Translation and Rotation Equivariance
-
ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing
-
Detecting Malicious PowerShell Commands using Deep Neural Networks
-
Regularizing Deep Networks by Modeling and Predicting Label Structure
-
Generative Adversarial Networks for Extreme Learned Image Compression
-
Concept2vec: Metrics for Evaluating Quality of Embeddings for Ontological Concepts
-
Differentiable plasticity: training plastic neural networks with backpropagation
-
sense2vec - A Fast and Accurate Method for Word Sense Disambiguation In Neural Word Embeddings
-
L4: Practical loss-based stepsize adaptation for deep learning
-
OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks
-
Complex-YOLO: An Euler-Region-Proposal for Real-time 3D Object Detection on Point Clouds
-
Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings
-
Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval
-
Deep k-Nearest Neighbors: Towards Confident, Interpretable and Robust Deep Learning
-
Simple random search provides a competitive approach to reinforcement learning
-
ChatPainter: Improving Text to Image Generation using Dialogue
-
Deep Metric Learning via Lifted Structured Feature Embedding
-
Hard Negative Mining for Metric Learning Based Zero-Shot Classification
-
Triplet-based Deep Similarity Learning for Person Re-Identification
-
Robust Monitoring of Time Series with Application to Fraud Detection
-
Neural Machine Translation by Jointly Learning to Align and Translate
-
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
-
Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings
-
Learning a Parametric Embedding by Preserving Local Structure
-
Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives
-
Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions
-
End-to-end Deep Image Reconstruction From Human Brain Activity
-
TSSD: Temporal Single-Shot Detector Based on Attention and LSTM for Robotic Intelligent Perception
-
Deep Multimodal Learning For Emotion Recognition In Spoken Language
-
On Characterizing the Capacity of Neural Networks using Algebraic Topology
-
Adversarial Examples that Fool both Human and Computer Vision
-
Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models
-
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
-
Supervised Dimensionality Reduction via Distance Correlation Maximization
-
Faster gaze prediction with dense networks and Fisher pruning
-
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
-
Deep Temporal Clustering : Fully Unsupervised Learning of Time-Domain Features
-
Regularized Evolution for Image Classifier Architecture Search
-
Speed/accuracy trade-offs for modern convolutional object detectors
-
Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks
-
Audio Adversarial Examples: Targeted Attacks on Speech-to-Text
-
ComboGAN: Unrestrained Scalability for Image Domain Translation
-
Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
-
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
-
High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs
-
Understanding deep learning requires rethinking generalization
-
VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
-
DeepFace: Closing the Gap to Human-Level Performance in Face Verification
-
Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes
-
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
-
Rich feature hierarchies for accurate object detection and semantic segmentation
-
Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches
-
Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks
-
3D Object Proposals using Stereo Imagery for Accurate Object Class Detection
-
Wider or Deeper: Revisiting the ResNet Model for Visual Recognition
-
The Cityscapes Dataset for Semantic Urban Scene Understanding
-
Superpixel Convolutional Networks Using Bilateral Inceptions
-
Aggressive Deep Driving: Combining Convolutional Neural Networks and Model Predictive Control
-
Multi-View 3D Object Detection Network for Autonomous Driving
-
High-Resolution Semantic Labeling with Convolutional Neural Networks
-
Instance-aware Semantic Segmentation via Multi-task Network Cascades
-
Vehicle Detection from 3D Lidar Using Fully Convolutional Network
-
OctNet: Learning Deep 3D Representations at High Resolutions
-
EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis
-
Hierarchical Representations for Efficient Architecture Search
-
Improving Deep Learning by Inverse Square Root Linear Units (ISRLUs)
-
Long Short-Term Memory as a Dynamically Computed Element-wise Weighted Sum
-
Soft-NMS -- Improving Object Detection With One Line of Code
-
A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection
-
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
-
Incremental Dense Semantic Stereo Fusion for Large-Scale Semantic Scene Reconstruction
-
StixelNet: A Deep Convolutional Network for Obstacle Detection and Road Segmentation
-
Semantic Image Segmentation with Deep Cconvolutional Nets and Fully Connected CRFS
-
A New Artificial Neural Network Approach in Solving Inverse Kinematics of Robotic Arm (Denso VP6242)
-
Auto-Conditioned LSTM Network for Extended Complex Human Motion Synthesis
-
DeepXplore: Automated Whitebox Testing of Deep Learning Systems
-
Real-time grasp detection using convolutional neural networks
-
Learning real manipulation tasks from virtual demonstrations using LSTM
-
A Neural Network-Based Approach for Trajectory Planning in Robot–Human Handover Tasks
-
SE3-Nets: Learning Rigid Body Motion using Deep Neural Networks
-
UnDeepVO: Monocular Visual Odometry through Unsupervised Deep Learning
-
Supersizing Self-supervision: Learning to Grasp from 50K Tries and 700 Robot Hours
-
3D Simulated Robot Manipulation Using Deep Reinforcement Learning
-
Deep-learning in Mobile Robotics - from Perception to Control Systems: A Survey on Why and Why not
-
Leveraging Deep Reinforcement Learning for Reaching Robotic Tasks
-
Imagenet classification with deep convolutional neural networks
-
Supervision via Competition: Robot Adversaries for Learning Tasks
-
Dermatologist-level classification of skin cancer with deep neural networks
-
Understanding image representations by measuring their equivariance and equivalence
-
Batch normalization: Accelerating deep network training by reducing internal covariate shift
-
Dropout: A simple way to prevent neural networks from overfitting
-
Delving deep into rectifiers: Surpassing human-level performance on imagenet classification
-
Improving neural networks by preventing co-adaptation of feature detectors
-
PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization
-
Systematic Testing of Convolutional Neural Networks for Autonomous Driving
-
Deep Visual-Semantic Alignments for Generating Image Descriptions
-
Very Deep Convolutional Networks for Large-scale Image Recognition
-
Clinical Intervention Prediction and Understanding with Deep Neural Networks
-
Connecting Images and Natural Language (Andrej Karpathy's Dissertation)
-
Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
-
U-Net: Convolutional Networks for Biomedical Image Segmentation
-
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
-
Modeling Smooth Backgrounds & Generic Localized Signals with Gaussian Processes (Kyle)
-
Adversarial Variational Optimization of Non-Differentiable Simulators (Kyle)
-
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
-
Mathematicians propose new way of using neural networks to work with noisy, high-dimensional data
-
The Evolution and Core Concepts of Deep Learning & Neural Networks
-
Deep Reinforcement Learning Based Trading Application at JP Morgan Chase
-
Randomly wired neural networks and state-of-the-art accuracy? Yes it works.
-
The Beauty of Functional Languages in Deep Learning — Clojure and Haskell
-
Extreme Rare Event Classification using Autoencoders in Keras
-
Weak Supervision: A New Programming Paradigm for Machine Learning
-
Rules of Machine Learning: Best Practices for ML Engineering
-
Play with Generative Adversarial Networks (GANs) in your browser!
-
ENet — A Deep Neural Architecture for Real-Time Semantic Segmentation
-
Hierarchical Bayesian Neural Networks with Informative Priors
-
A sane introduction to maximum likelihood estimation (MLE) and maximum a posteriori (MAP)
-
Preprocessing for deep learning: from covariance matrix to image whitening
-
InfoGAN: using the variational bound on mutual information (twice)
-
Introducing AdaNet: Fast and Flexible AutoML with Learning Guarantees
-
Training Neural Nets on Larger Batches: Practical Tips for 1-GPU, Multi-GPU & Distributed setups
-
PhD Thesis: Geometry and Uncertainty in Deep Learning for Computer Vision
-
Automatic Kernel Optimization for Deep Learning on All Hardware Platforms
-
Simple Beginner’s guide to Reinforcement Learning & its implementation
-
Efficient tuning of online systems using Bayesian optimization
-
Building a text classification model with TensorFlow Hub and Estimators
-
Generative Adversarial Networks (GANs) in 50 lines of code (PyTorch)
-
Rules of Machine Learning: Best Practices for ML Engineering
-
Fast Near-Duplicate Image Search using Locality Sensitive Hashing
-
Loc2Vec: Learning location embeddings with triplet-loss networks
-
Using Word2Vec for Better Embeddings of Categorical Features
-
Stochastic Weight Averaging — a New Way to Get State of the Art Results in Deep Learning
-
wav2letter: A Facebook AI Research(FAIR) Automatic Speech Recognition Toolkit
-
Using Evolutionary AutoML to Discover Neural Network Architectures
-
Using Tensorflow Object Detection to do Pixel Wise Classification
-
Understanding GANs through a statistical divergence perspective
-
Understanding and optimizing GANs (Going back to first principles)
-
Understanding Learning Rates and How It Improves Performance in Deep Learning
-
Only Numpy: Implementing Highway Network, OOP approach with Mini Batch with Interactive Code
-
Only Numpy: NIPS 2017 - Implementing Dilated Recurrent Neural Networks with Interactive Code
-
Annotating Large Datasets with the TensorFlow Object Detection API
-
Deep Learning Is Not Good Enough, We Need Bayesian Deep Learning for Safe AI
-
SBNet: Leveraging Activation Block Sparsity for Speeding up Convolutional Neural Networks
-
A set of Deep Reinforcement Learning Agents implemented in Tensorflow
-
Deep Learning Adversarial Examples – Clarifying Misconceptions
-
How to Define an Encoder-Decoder Sequence-to-Sequence Model for Neural Machine Translation in Keras
-
Feature Visualization - How neural networks build up their understanding of images
-
How to Develop a Character-Based Neural Language Model in Keras
-
How-To: Multi-GPU training with Keras, Python, and deep learning
-
Best Practices for Document Classification with Deep Learning
-
Leonardo Araujo dos Santos' Artificial Intelligence and Deep Learning GitBook
-
A quick complete tutorial to save and restore Tensorflow models
-
NVIDIA Apex: Tools for Easy Mixed-Precision Training in PyTorch
-
DeOldify: A Deep Learning based project for colorizing and restoring old images
-
Raincloud plots: a multi-platform tool for robust data visualization
-
The Mathematics of 2048: Optimal Play with Markov Decision Processes
-
Embed, encode, attend, predict: The new deep learning formula for state-of-the-art NLP models
-
Army develops face recognition technology that works in the dark
-
Generative Adversarial Networks for Extreme Learned Image Compression
-
Splash of Color: Instance Segmentation with Mask R-CNN and TensorFlow
-
Machine Learning is Fun Part 6: How to do Speech Recognition with Deep Learning
-
Engineering Extreme Event Forecasting at Uber with Recurrent Neural Networks
-
Releasing “Supervisely Person” dataset for teaching machines to segment humans
-
Only Numpy: Implementing Mini VGG (VGG 7) and SoftMax Layer with Interactive Code
-
An overview of word embeddings and their connection to distributional semantic models
-
NVIDIA Researchers Showcase Major Advances in Deep Learning at NIPS
-
GAN Playground - Explore Generative Adversarial Nets in your Browser
-
GAN Playground - Explore Generative Adversarial Nets in your Browser
-
Teaching a car to avoid obstacles using Reinforcement Learning
-
Creating Photorealistic Images with Neural Networks and a Gameboy Camera
-
Will Roscoe's Lane Following Autopilot with Keras & Tensorflow