This repository contains short summaries of some machine learning papers.
ARCHITECTURES
ATTENTION
Spatial Transformer Networks
LOSS FUNCTIONS
RECOGNITION
Working hard to know your neighbor’s margins: Local descriptor learning loss (thanks, alexobednikov)
FACE RECOGNITION
FACES
Neural Aggregation Network for Video Face Recognition (thanks, alexobednikov)
- Critical Learning Periods in Deep Neural Networks
GAN
SELF-DRIVING CARS
High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANsSELF-DRIVING CARS
Computer Vision for Autonomous Vehicles: Problems, Datasets and State-of-the-Art
SELF-DRIVING CARS
Systematic Testing of Convolutional Neural Networks for Autonomous DrivingSELF-DRIVING CARS
SEGMENTATION
Fast Scene Understanding for Autonomous DrivingSELF-DRIVING CARS
Arguing Machines: Perception-Control System Redundancy and Edge Case Discovery in Real-World Autonomous DrivingSELF-DRIVING CARS
GAN
REINFORCEMENT
Virtual to Real Reinforcement Learning for Autonomous DrivingSELF-DRIVING CARS
End to End Learning for Self-Driving Cars
- Snapshot Ensembles: Train 1, get M for free
- Image Crowd Counting Using Convolutional Neural Network and Markov Random Field
REINFORCEMENT
Rainbow: Combining Improvements in Deep Reinforcement LearningREINFORCEMENT
Learning to Navigate in Complex EnvironmentsGAN
Unsupervised Image-to-Image Translation NetworksRNN
Dilated Recurrent Neural NetworksOBJECT DETECTION
TRACKING
Detect to Track and Track to DetectARCHITECTURES
Dilated Residual Networks
OBJECT DETECTION
Feature Pyramid Networks for Object DetectionOBJECT DETECTION
SSD: Single Shot MultiBox DetectorOBJECT DETECTION
EFFICIENT NETWORKS
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision ApplicationsOBJECT DETECTION
Mask R-CNN
FACES
Multi-view Face Detection Using Deep Convolutional Neural Networks (aka DDFD) (thanks, arnaldog12)
GAN
On the Effects of Batch and Weight Normalization in Generative Adversarial NetworksGAN
BEGANGAN
StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial NetworksACTIVATION FUNCTIONS
Self-Normalizing Neural NetworksGAN
Wasserstein GAN (aka WGAN)
OBJECT DETECTION
YOLO9000: Better, Faster, Stronger (aka YOLOv2)OBJECT DETECTION
You Only Look Once: Unified, Real-Time Object Detection (aka YOLO)OBJECT DETECTION
PVANET: Deep but Lightweight Neural Networks for Real-time Object Detection
OBJECT DETECTION
R-FCN: Object Detection via Region-based Fully Convolutional NetworksOBJECT DETECTION
Faster R-CNNOBJECT DETECTION
Fast R-CNNOBJECT DETECTION
Rich feature hierarchies for accurate object detection and semantic segmentation (aka R-CNN)PEDESTRIANS
Ten Years of Pedestrian Detection, What Have We Learned?NEURAL STYLE
Instance Normalization: The Missing Ingredient for Fast Stylization
HUMAN POSE ESTIMATION
Stacked Hourglass Networks for Human Pose EstimationFACES
DeepFace: Closing the Gap to Human-Level Performance in Face VerificationTRANSLATION
Character-based Neural Machine Translation
HUMAN POSE ESTIMATION
Convolutional Pose MachinesFACES
HyperFace: A Deep Multi-task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender RecognitionFACES
Face Attribute Prediction Using Off-the-Shelf CNN FeaturesFACES
CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection- Conditional Image Generation with PixelCNN Decoders
GAN
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial NetsGAN
Improved Techniques for Training GANs- Synthesizing the preferred inputs for neurons in neural networks via deep generator networks
ARCHITECTURES
FractalNet: Ultra-Deep Neural Networks without Residuals- PlaNet - Photo Geolocation with Convolutional Neural Networks
OPTIMIZERS
Adam: A Method for Stochastic OptimizationGAN
RNN
Generating images with recurrent adversarial networksGAN
Adversarially Learned Inference
ARCHITECTURES
Resnet in Resnet: Generalizing Residual ArchitecturesAUTOENCODERS
Rank Ordered AutoencodersARCHITECTURES
Wide Residual NetworksARCHITECTURES
Identity Mappings in Deep Residual NetworksREGULARIZATION
Swapout: Learning an ensemble of deep architectures- Multi-Scale Context Aggregation by Dilated Convolutions
- Texture Synthesis Through Convolutional Neural Networks and Spectrum Constraints
- Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks
NEURAL STYLE
Semantic Style Transfer and Turning Two-Bit Doodles into Fine Artwork- Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis
SUPERRESOLUTION
Accurate Image Super-Resolution Using Very Deep Convolutional NetworksHUMAN POSE ESTIMATION
Joint Training of a Convolutional Network and a Graphical Model for Human Pose EstimationREINFORCEMENT
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic MotivationCOLORIZATION
Let there be Color
NEURAL STYLE
Artistic Style Transfer for Videos
REINFORCEMENT
Playing Atari with Deep Reinforcement LearningGENERATIVE
Attend, Infer, Repeat: Fast Scene Understanding with Generative ModelsARCHITECTURES
EFFICIENT NETWORKS
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model sizeACTIVATION FUNCTIONS
Noisy Activation FunctionsOBJECT DETECTION
IMAGE TO TEXT
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
REGULARIZATION
Deep Networks with Stochastic Depth
GAN
Deep Generative Image Models using a Laplacian Pyramid of Adversarial NetworksGENERATIVE
RNN
ATTENTION
DRAW A Recurrent Neural Network for Image Generation- Generating Images with Perceptual Similarity Metrics based on Deep Networks
GENERATIVE
Generative Moment Matching NetworksGENERATIVE
RNN
Pixel Recurrent Neural NetworksGAN
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
NEURAL STYLE
A Neural Algorithm for Artistic StyleNORMALIZATION
REGULARIZATION
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate ShiftARCHITECTURES
Deep Residual Learning for Image RecognitionACTIVATION FUNCTIONS
Fast and Accurate Deep Networks Learning By Exponential Linear Units (ELUs)- Fractional Max-Pooling
GAN
Generative Adversarial NetworksARCHITECTURES
Inception-v4, Inception-ResNet and the Impact of Residual Connections on LearningNORMALIZATION
Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks