- 3D Object Reconstruction from a Single Depth View with Adversarial Learning.
arxiv
code
- Abnormal Event Detection in Videos using Spatiotemporal Autoencoder.
arxiv
tensorflow
- Accurate Single Stage Detector Using Recurrent Rolling Convolution.
arxiv
code
- Active Convolution: Learning the Shape of Convolution for Image Classification.
arxiv
caffe
- A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs.
url
⭐ - [AENet] Learning Deep Audio Features for Video Analysis.
arxiv
code
⭐ - A Neural Representation of Sketch Drawings.
arxiv
pytorch
⭐ - A network of deep neural networks for distant speech recognition.
arxiv
⭐ - A New Convolutional Network-in-Network Structure and Its Applications in Skin Detection, Semantic Segmentation, and Artifact Reduction.
arxiv
- Annotating Object Instances with a Polygon-RNN.
arxiv
- Building Detection from Satellite Images on a Global Scale.
arxiv
- Cascade R-CNN: Delving into High Quality Object Detection.
arxiv
code
- Class-Weighted Convolutional Features for Visual Instance Search.
arxiv
code
- Convolutional 2D Knowledge Graph Embeddings.
arxiv
code
- CortexNet: a Generic Network Family for Robust Visual Temporal Representations.
arxiv
code
- CSVideoNet: A Real-time End-to-end Learning Framework for High-frame-rate Video Compressive Sensing.
arxiv
caffe
- Cost-Effective Active Learning for Deep Image Classification.
arxiv
- Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks.
arxiv
caffe
- DCT-like Transform for Image Compression Requires 14 Additions Only.
arxiv
- Deep Alignment Network: A convolutional neural network for robust face alignment.
arxiv
code
- Deep Bayesian Active Learning with Image Data.
arxiv
keras
- Deep Convolutional Neural Networks for Pairwise Causality.
arxiv
- DeepFix: Fixing Common C Language Errors by Deep Learning.
pdf
code
- DeepFM: A Factorization-Machine based Neural Network for CTR Prediction.
arxiv
tensorflow
- Deep Image Prior.
pdf
code
⭐ - Deep Learning Based Large-Scale Automatic Satellite Crosswalk Classification.
arxiv
code
- Deep Learning Features at Scale for Visual Place Recognition.
arxiv
- Deep learning for predicting refractive error from retinal fundus images.
arxiv
- Deep Learning for Tumor Classification in Imaging Mass Spectrometry.
arxiv
- Deep Hybrid Similarity Learning for Person Re-identification.
arxiv
- Deep Photo Style Transfer.
arxiv
code
⭐ - DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild.
arxiv
code
- Detecting Curve Text in the Wild: New Dataset and New Solution.
arxiv
code
- Detecting Oriented Text in Natural Images by Linking Segments.
arxiv
tensorflow
⭐ - Disentangled Person Image Generation.
arxiv
- Disentangling Motion, Foreground and Background Features in Videos.
arxiv
code
- Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network.
arxiv
- DR2-Net: Deep Residual Reconstruction Network for Image Compressive Sensing.
arxiv
- Dual-Path Convolutional Image-Text Embedding.
arxiv
code
- End-to-end Recovery of Human Shape and Pose.
arxiv
code
- End-to-end Training for Whole Image Breast Cancer Diagnosis using An All Convolutional Design.
arxiv
code
- End-to-end weakly-supervised semantic alignment.
arxiv
pytorch
- Estimated Depth Map Helps Image Classification.
arxiv
code
- Exercise Motion Classification from Large-Scale Wearable Sensor Data Using Convolutional Neural Networks.
arxiv
- Extreme 3D Face Reconstruction: Looking Past Occlusions.
arxiv
code
- Explaining How a Deep Neural Network Trained with End-to-End Learning Steers a Car.
arxiv
⭐ - FaceBoxes: A CPU Real-time Face Detector with High Accuracy.
arxiv
code
- Face Detection using Deep Learning: An Improved Faster RCNN Approach.
arxiv
- Fader Networks: Manipulating Images by Sliding Attributes.
arxiv
pytorch
⭐ - Fast Image Processing with Fully-Convolutional Networks.
arxiv
code
- FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras.
arxiv
- Focal Loss for Dense Object Detection.
arxiv
mxnet
tensorflow
- Im2Pano3D: Extrapolating 360 Structure and Semantics Beyond the Field of View.
arxiv
- Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation.
arxiv
- Improving Smiling Detection with Race and Gender Diversity.
arxiv
- Improved Texture Networks: Maximizing Quality and Diversity in Feed-forward Stylization and Texture Synthesis.
arxiv
code
- Joint auto-encoders: a flexible multi-task learning framework.
arxiv
- Knowledge Concentration: Learning 100K Object Classifiers in a Single CNN.
arxiv
- Large-Scale Evolution of Image Classifiers.
arxiv
pytorch
- Learning a Mixture of Deep Networks for Single Image Super-Resolution.
arxiv
code
- Learning a time-dependent master saliency map from eye-tracking data in videos.
arxiv
code
- Learning Deep Representations for Scene Labeling with Semantic Context Guided Supervision.
arxiv
- Learning Deep ResNet Blocks Sequentially using Boosting Theory.
arxiv
- Learning Feature Pyramids for Human Pose Estimation.
arxiv
code
- Learning to Estimate 3D Hand Pose from Single RGB Images.
arxiv
tensorflow
- Learning to Estimate Pose by Watching Videos.
arxiv
- Learning to Generate Posters of Scientific Papers by Probabilistic Graphical Models.
arxiv
- Learning to Learn from Noisy Web Videos.
arxiv
⭐ - Learning to Segment Every Thing.
arxiv
- Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image.
arxiv
- Light-Head R-CNN: In Defense of Two-Stage Object Detector.
arxiv
code
- Linear Disentangled Representation Learning for Facial Actions.
arxiv
code
- Loss Max-Pooling for Semantic Image Segmentation.
arxiv
pytorch
- Mask R-CNN.
arxiv
caffe
mxnet
⭐ - MentorNet: Regularizing Very Deep Neural Networks on Corrupted Labels.
arxiv
⭐ - Mix-and-Match Tuning for Self-Supervised Semantic Segmentation.
arxiv
code
- MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications.
arxiv
pytorch
keras
tensorflow
- Modeling Relational Data with Graph Convolutional Networks.
arxiv
⭐ - [MobileNets] Efficient Convolutional Neural Networks for Mobile Vision Applications.
arxiv
keras
⭐ - MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior.
arxiv
- Multi-Scale Dense Networks for Resource Efficient Image Classification.
arxiv
code
- Negative Results in Computer Vision: A Perspective.
arxiv
- Neural Motifs: Scene Graph Parsing with Global Context.
arxiv
code
- Object Detection Using Deep CNNs Trained on Synthetic Images.
arxiv
code
- Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs.
arxiv
code
- Optimizing Deep CNN-Based Queries over Video Streams at Scale.
arxiv
tensorflow
- Pedestrian Alignment Network for Large-scale Person Re-identification.
arxiv
code
- Perceptually Optimized Image Rendering.
arxiv
- PersonRank: Detecting Important People in Images.
arxiv
- Photographic Image Synthesis with Cascaded Refinement Networks.
arxiv
tensorflow
⭐ - Pixel Recursive Super Resolution.
arxiv
⭐ - Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers.
arxiv
- Receptive Field Block Net for Accurate and Fast Object Detection.
arxiv
code
- Recurrent Scale Approximation for Object Detection in CNN.
arxiv
code
- Rethinking Atrous Convolution for Semantic Image Segmentation.
arxiv
- S^3FD: Single Shot Scale-invariant Face Detector.
arxiv
pytorch
- SfM-Net: Learning of Structure and Motion from Video.
arxiv
⭐ - Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner.
arxiv
code
- Single-Shot Refinement Neural Network for Object Detection.
arxiv
caffe
- SLAM with Objects using a Nonparametric Pose Graph.
arxiv
code
- Smart, Sparse Contours to Represent and Edit Images.
arxiv
- STN-OCR: A single Neural Network for Text Detection and Text Recognition.
arxiv
code
- Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection.
arxiv
- Sparse-to-Dense: Depth Prediction from Sparse Depth Samples and a Single Image.
arxiv
code
- SphereFace: Deep Hypersphere Embedding for Face Recognition.
arxiv
code
- Spinal cord gray matter segmentation using deep dilated convolutions.
arxiv
[code](https://github.com/neuropoly/spinalcordtoolbox)
- SSPP-DAN: Deep Domain Adaptation Network for Face Recognition with Single Sample Per Person.
arxiv
- StreetStyle: Exploring world-wide clothing styles from millions of photos.
arxiv
- SuperPoint: Self-Supervised Interest Point Detection and Description.
arxiv
⭐ - Supervised Multilayer Sparse Coding Networks for Image Classification.
arxiv
- SurfaceNet: An End-to-end 3D Neural Network for Multiview Stereopsis.
arxiv
code
- SwGridNet: A Deep Convolutional Neural Network based on Grid Topology for Image Classification.
arxiv
code
- [Tacotron] Towards End-to-End Speech Synthesis.
arxiv
code
⭐ - Tangent: Automatic Differentiation Using Source Code Transformation in Python.
arxiv
code
- Time-Contrastive Networks: Self-Supervised Learning from Video.
arxiv
⭐ - Towards a Principled Integration of Multi-Camera Re-Identification and Tracking through Optimal Bayes Filters.
arxiv
- Toward Geometric Deep SLAM.
arxiv
⭐ - Towards perspective-free object counting with deep learning.
pdf
code
- Training object class detectors with click supervision.
arxiv
- TransFlow: Unsupervised Motion Flow by Joint Geometric and Pixel-level Estimation.
arxiv
code
- Using Deep Learning and Google Street View to Estimate the Demographic Makeup of the US.
arxiv
⭐ - Unsupervised Image-to-Image Translation Networks.
arxiv
tensorflow
- Unsupervised Learning by Predicting Noise.
arxiv
⭐ - Unsupervised Learning of Long-Term Motion Dynamics for Videos.
arxiv
- Variational Approaches for Auto-Encoding Generative Adversarial Networks.
arxiv
⭐ - Video-based Person Re-identification with Accumulative Motion Context.
arxiv
- Video Frame Interpolation via Adaptive Separable Convolution.
pdf
pytorch
- Video Frame Synthesis using Deep Voxel Flow.
arxiv
code
- ViP-CNN: A Visual Phrase Reasoning Convolutional Neural Network for Visual Relationship Detection.
arxiv
- Visualizing LSTM decisions.
arxiv
- Visual Attribute Transfer through Deep Image Analogy.
arxiv
pytorch
- Visual Discovery at Pinterest.
arxiv
- Visualizing Residual Networks.
arxiv
⭐ - VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection.
arxiv
- Wide-Residual-Inception Networks for Real-time Object Detection.
arxiv
⭐ - YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video.
arxiv
code
- Zoom Out-and-In Network with Recursive Training for Object Proposal.
arxiv