CV_resources
Awesome Computer Vision Resources
A curated list of resources dedicated to Face Recognition & Detection, OCR, Object Detection, GAN, 3D, Motion Tracking & Pose Estimation, ReID, NAS, Recommendation, and Model Scaling. Suggestions and pull requests are welcome.
- ReID
- GAN
- NAS
- SLAM
- Classification
- Recommendation & CTR
  - CTR
  - Recommendation
- Video Processing
  - Classification
  - Augmentation
- Building and Training
  - Optimizing
  - Architecture
  - Strategy
  - Evaluation
- Body Related
  - Face Detection
  - Face Alignment
  - Head Detection
  - Liveness Detection
  - 3D Face
- Data Processing
  - Super Resolution
  - Synthesis
  - Image Translation
  - Data Augmentation
- Object Detection & Semantic
  - Object Detection
  - Salient Object Detection
  - Segmentation
- Model Compression and Acceleration
  - Pruning
  - Accelerating
- Motion & Pose
  - Pose Estimation
  - Pose Transfer
  - Motion Tracking
  - Action Recognition
  - Keypoint Detection
- Text Detection & Recognition
  - Detection
  - Recognition
ReID
- [2019-CVPR] Bags of Tricks and A Strong Baseline for Deep Person Re-identification (Baseline)
  paper | code | paper
- [2019-CVPR] Backbone Can Not be Trained at Once: Rolling Back to Pre-trained Network for Person Re-Identification
  paper | code
- [2019-CVPR] DBC: Dispersion based Clustering for Unsupervised Person Re-identification
  paper | code
- [2019-CVPR] EANet: Enhancing Alignment for Cross-Domain Person Re-identification (***SOTA)
  paper | code
- [2019-CVPR] High-level Semantic Feature Detection: A New Perspective for Pedestrian Detection
  paper | https://github.com/liuwei16/CSP
- [2019-CVPR] Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-identification
  paper | code
- [2019-CVPR] MAR: Unsupervised Person Re-identification by Soft Multilabel Learning
  paper | code
- [2019-CVPR] SSA-CNN: Semantic Self-Attention CNN for Pedestrian Detection (SOTA)
  paper
- [2018-BMVC] Deep Association Learning for Unsupervised Video Person Re-identification
  paper | code
GAN
- [collection] Awesome Generative Adversarial Networks with TensorFlow
  code
- [framework] Implementations of a number of generative models (GAN, VAE, Seq2Seq, VAEGAN, GAIA, Spectrogram Inversion) in TensorFlow
  code
- [2019-CVPR] BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  paper | code | code-pytorch
- [2019-CVPR] StyleGan: Generator Inversion for Image Enhancement and Animation
  paper | code
- [2018-ICLR] Progressive Growing of GANs for Improved Quality, Stability, and Variation
  paper | code
NAS
- [framework] An open source AutoML toolkit for neural architecture search and hyper-parameter tuning
  code
- [2019-CVPR] AutoGrow: Automatic Layer Growing in Deep Convolutional Networks
  paper | code
- [2019-arXiv] MDENAS: Multinomial Distribution Learning for Effective Neural Architecture Search
  paper | code
- [2019-CVPR] MnasNet: Platform-Aware Neural Architecture Search for Mobile
  paper | code
- [2019-CVPR] Searching for A Robust Neural Architecture in Four GPU Hours
  paper | code
- [2019-arXiv] Single-Path Mobile AutoML: Efficient ConvNet Design and NAS Hyperparameter Optimization
  paper | code
- [2019-CVPR] Dynamic Distribution Pruning for Efficient Network Architecture Search
  paper | code
SLAM
- [ToolBox] OpenVSLAM: a Versatile Visual SLAM Framework
  code
- [2019-CVPR] AdaptForStereo: Learning to Adapt for Stereo
  paper | code
- [2019-arXiv] DISN: Deep Implicit Surface Network for High-quality Single-view 3D Reconstruction
  paper | code
- [2019-CVPR] Detailed Human Shape Estimation from a Single Image by Hierarchical Mesh Deformation
  paper | code
- [2019-CVPR] Defusr: Learning Non-volumetric Depth Fusion using Successive Reprojections
  code
- [2019-CVPR] GA-Net: Guided Aggregation Net for End-to-end Stereo Matching
  paper | code
- [2019-CVPR] MegaDepth: Learning Single-View Depth Prediction from Internet Photos
  paper
- [2019-CVPR] Neural Rerendering in the Wild
  paper | code | code
- [2019-CVPR] PyRobot: An Open-source Robotics Framework for Research and Benchmarking
  paper | code
- [2019-CVPR] Robust Point Cloud Based Reconstruction of Large-Scale Outdoor Scenes (3D reconstruction)
  paper | code
- [2019-CVPR] SGANVO: Unsupervised Deep Visual Odometry and Depth Estimation with Stacked Generative Adversarial Networks
  paper
- [2019-CVPR] Taking a Deeper Look at the Inverse Compositional Algorithm (image alignment)
  paper | code
Classification
- [ToolBox] Sandbox for training convolutional networks for computer vision (VGG, ResNet, PreResNet, ResNeXt, SENet, ResAttNet, SKNet, PyramidNet, DenseNet, BagNet, MSDNet, FishNet, SqueezeNet, SqueezeResNet, SqueezeNext, ShuffleNet, ShuffleNetV2, MENet, MobileNet, FD-MobileNet, MobileNetV2, MobileNetV3, Xception, InceptionV3, InceptionV4, InceptionResNetV2, PolyNet, NASNet-Mobile, PNASNet-Large, EfficientNet)
  code
- [ToolBox] Classification models trained on ImageNet
  code | code-keras
- [2019-CVPR] RepMet: Representative-based metric learning for classification and one-shot object detection
  paper
- [2018-CVPR] SENet: Squeeze-and-Excitation Networks (champion for ImageNet)
  paper | code | code-caffe
- [2018-CVPR] FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction
  paper | code
Recommendation & CTR
- [ToolBox] Implementation of Deep Learning based Recommender Algorithms with TensorFlow
  code
- [ToolBox] A framework for training and evaluating AI models on a variety of openly available dialogue datasets
  code
- [ToolBox] StarSpace: Embed All The Things!
  paper | code
- [ToolBox] Modular and Extendible package of deep-learning based CTR models
  code
- [collection] Classic papers and resources on recommendation
  papers
- [collection] A collection of resources for Recommender Systems
  papers
- [collection] Papers, data, and outlines for recommendation
  code | code
CTR
- [2019-arXiv] Deep Learning Recommendation Model for Personalization and Recommendation Systems (***CTR)
  paper | code
Recommendation
- [2019-arXiv] Generative Adversarial User Model for Reinforcement Learning Based Recommendation System
  paper
- [2019-arXiv] Recent Advances in Diversified Recommendation
  paper
- [2017-arXiv] Training Deep AutoEncoders for Collaborative Filtering (***SOTA)
  paper | code
Video Processing
Classification
- [2019-CVPR] Video Classification
  paper | code
- [2019-CVPR] FastDVDnet: Towards Real-Time Video Denoising Without Explicit Motion Estimation (denoising)
  paper | code
- [2019-CVPR] Hallucinating Optical Flow Features for Video Classification
  paper | code
Augmentation
- [2019-CVPR] DAVANet: Stereo Deblurring with View Aggregation (deblurring)
  paper | code
- [2019-CVPR] DVDnet: A Simple and Fast Network for Deep Video Denoising (***SOTA)
  paper | code
- [2019-CVPR] Deep Flow-Guided Video Inpainting
  paper | code
- [2019-CVPR] EDVR: Video Restoration with Enhanced Deformable Convolutional Networks
  paper | code
- [2019-CVPR] FastDVDnet: Towards Real-Time Video Denoising Without Explicit Motion Estimation (denoising)
  paper | code
- [2019-CVPR] TecoGAN: Temporally Coherent GANs for Video Super-Resolution
  paper | code
- [2018-XXXX] A Deep Learning based project for colorizing and restoring old images and video! (***)
  code
Building and Training
- [ToolBox] Pretrained EfficientNet, MobileNetV3 V2 and V1, MNASNet A1 and B1, FBNet, ChamNet, Single-Path NAS
  code
Optimizing
- [2019-CVPR] Aggregation Cross-Entropy for Sequence Recognition (the ACE loss function exhibits competitive performance to CTC)
  paper | code
- [2019-CVPR] KL-Loss: Bounding Box Regression with Uncertainty for Accurate Object Detection
  paper | code
Architecture
- [2019-CVPR] Pacnet: Pixel-Adaptive Convolutional Neural Networks (new network architecture)
  paper | code
- [2019-CVPR] ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
  paper
Strategy
- [ToolBox] A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
  code
- [2019-CVPR] mixup: Bag of Freebies for Training Object Detection Neural Networks
  paper | code
- [2019-CVPR] Improving Transferability of Adversarial Examples with Input Diversity
  paper | code
- [2019-CVPR] RePr: Improved Training of Convolutional Filters
  paper
- [2018-CVPR] Fd-mobilenet: Improved mobilenet with a fast downsampling strategy
  paper | code
Evaluation
- [2019-CVPR] TedEval: A Fair Evaluation Metric for Scene Text Detectors (***)
  paper | code
- [2019-CVPR] Tools for evaluating and visualizing results for the Multi Object Tracking and Segmentation (MOTS)
  paper | code
Body Related
- [collection] A curated list of related resources for hand pose estimation
  code
- [collection] Face Benchmark and Dataset
  code
- [ToolBox] A face recognition solution on mobile device
  code
Face Detection
- [2019-CVPR] Dense 3D Face Decoding over 2500FPS: Joint Texture & Shape Convolutional Mesh Decoders
  paper
- [2019-CVPR] DSFD: Dual Shot Face Detector
  paper | code
- [2019-CVPR] RetinaFace: Single-stage Dense Face Localisation in the Wild (***SOTA)
  paper | code
- [2019-CVPR] PyramidBox++: High Performance Detector for Finding Tiny Face (***SOTA)
  paper | code
- [2019-CVPR] SRN: Improved Selective Refinement Network for Face Detection (SOTA)
  paper | code
Face Alignment
- [2018-arXiv] Face Alignment: How far are we from solving the 2D & 3D Face Alignment problem
  paper | code
- [2018-CVPR] Look at Boundary: A Boundary-Aware Face Alignment Algorithm
  paper | code
- [2018-ECCV] Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network
  code
Head Detection
Liveness Detection
- [2019-CVPR] A Non-Intrusive Method of Face Liveness Detection Using Specular Reflection and Local Binary Patterns (Liveness Detection)
  paper
- [2019-CVPR] FeatherNets: Convolutional Neural Networks as Light as Feather for Face Anti-spoofing (***Anti-spoofing)
  paper | code
- [2019-CVPR] Liveness Detection Using Implicit 3D Features
  paper
3D Face
- [2019-CVPR] Disentangled Representation Learning for 3D Face Shape (3D face)
  paper | code
- [2019-CVPR] Expressive Body Capture: 3D Hands, Face, and Body From a Single Image
  paper | code
- [2019-CVPR] Learning to Regress 3D Face Shape and Expression From an Image Without 3D Supervision
  paper | code
- [2019-CVPR] Monocular Total Capture: Posing Face, Body and Hands in the Wild
  paper | code
- [2019-CVPR] MVF-Net: Multi-View 3D Face Morphable Model Regression (face reconstructing)
  code
Data Processing
Super Resolution
- [2019-CVPR] AdaFM: Modulating Image Restoration with Continual Levels via Adaptive Feature Modification Layers (denoising)
  paper | code
- [2019-arXiv] AWSRN: Lightweight Image Super-Resolution with Adaptive Weighted Learning Network
  paper | code
- [2019-CVPR] Deep Learning for Image Super-resolution: A Survey
  paper
- [2019-CVPR] DPSR: Deep Plug-and-Play Super-Resolution for Arbitrary Blur Kernels
  paper | code
- [2019-CVPR] Meta-SR: A Magnification-Arbitrary Network for Super-Resolution
  paper | code
- [2019-arXiv] PASSRnet: Learning Parallax Attention for Stereo Image Super-Resolution
  paper | code
- [2019-CVPR] SRNTT: Image Super-Resolution by Neural Texture Transfer
  paper | code
- [2019-CVPR] Towards Real Scene Super-Resolution with Raw Images
  paper
- [2018-CVPR] RCAN: Image Super-Resolution Using Very Deep Residual Channel Attention Networks
  paper | code
Synthesis
- [collection] Awesome Generative Adversarial Networks with TensorFlow
  code
- [framework] Implementations of a number of generative models (GAN, VAE, Seq2Seq, VAEGAN, GAIA, Spectrogram Inversion) in TensorFlow
  code
- [2019-CVPR] DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis
  paper | github.com/NVlabs/SPADE
- [2019-CVPR oral] GauGAN: Semantic Image Synthesis with Spatially-Adaptive Normalization
  paper | code
- [2019-CVPR] MSGAN: Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis
  paper | code
- [2019-arXiv] MSG-GAN: Multi-Scale Gradients GAN for more stable and synchronized multi-scale image synthesis
  paper | code
- [2019-arXiv] Self-Attention Generative Adversarial Networks
  paper | code
- [2019-CVPR] Shapes and Context: In-the-wild Image Synthesis & Manipulation (Image Synthesis)
  code | code
- [2019-CVPR] STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing
  paper | code
- [2018-CVPR] High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs
  paper | code
Image Translation
- [2019-CVPR] Image-to-Image Translation via Group-wise Deep Whitening-and-Coloring Transformation
  paper | code
- [2018-CVPR] CycleGAN: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
  paper
- [2018-CVPR] Pix2pix: Image-to-Image Translation with Conditional Adversarial Networks
  paper | code
Data Augmentation
- [2019-CVPR] A Preliminary Study on Data Augmentation of Deep Learning for Image Classification
  paper
- [2019-CVPR] Further advantages of data augmentation on convolutional neural networks
  paper
- [2019-CVPR] Learning Data Augmentation Strategies for Object Detection
  paper
- [2019-CVPR] PSIS: Data Augmentation for Object Detection via Progressive and Selective Instance-Switching
  paper | code
- [2019-CVPR] Wide-Context Semantic Image Extrapolation (expand image)
  paper | code
Object Detection & Semantic
- [ToolBox] A Simple and Versatile Framework for Object Detection and Instance Recognition
  code
- [ToolBox] Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch
  code
- [ToolBox] Object detection with yolov2, tiny yolov3, mobilenet, mobilenetv2, shufflenet(g2), shufflenetv2(1x), squeezenext(1.0-SqNxt-23v5), light xception, xception
  code
- [ToolBox] MMDetection: Open MMLab Detection Toolbox and Benchmark
  paper | code
- [ToolBox] Semantic Segmentation on PyTorch (includes FCN, PSPNet, Deeplabv3, DANet, DenseASPP, BiSeNet, EncNet, DUNet, ICNet, ENet, OCNet, CCNet, PSANet, CGNet, ESPNet, LEDNet)
  code | code
- [ToolBox] Segmentation models with pretrained backbones
  code
Object Detection
- [2019-CVPR] Activity Driven Weakly Supervised Object Detection
  code
- [2019-CVPR] CenterNet: Objects as Points (***)
  paper | code
- [2019-CVPR] Cascade R-CNN: High Quality Object Detection and Instance Segmentation (***SOTA)
  paper | code | code-Caffe
- [2019-CVPR] CornerNet-Lite: Efficient Keypoint Based Object Detection (SOTA)
  paper | code
- [2019-CVPR] DFPN: Efficient Object Detection Model for Real-Time UAV Applications
  paper | code | code-Caffe
- [2019-CVPR] Distilling Object Detectors with Fine-grained Feature Imitation
  code
- [2019-CVPR] ExtremeNet: Bottom-up Object Detection by Grouping Extreme and Center Points (***)
  paper | code
- [2019-CVPR] FSAF: Feature Selective Anchor-Free Module for Single-Shot Object Detection (SOTA)
  paper
- [2019-CVPR] FoveaBox: Beyond Anchor-based Object Detector (SOTA)
  paper
- [2019-CVPR] FCOS: Fully Convolutional One-Stage Object Detection (***)
  paper | code
- [2019-CVPR] Grid R-CNN Plus: Faster and Better
  paper | code
- [2019-CVPR] Hybrid Task Cascade for Instance Segmentation
  paper | code
- [2019-CVPR] Locating Objects Without Bounding Boxes (***crowd count)
  paper | code
- [2019-CVPR] Learning Data Augmentation Strategies for Object Detection
  paper | code
- [2019-CVPR] LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking
  paper | code
- [2019-CVPR] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud (***SOTA)
  paper | code | code-unofficial
- [2019-CVPR] TridentNet: Scale-Aware Trident Networks for Object Detection (***SOTA)
  paper | code
- [2019-CVPR] NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection
  paper | code
- [2019-CVPR] Region Proposal by Guided Anchoring
  paper | code
- [2019-CVPR] SNIPER: Efficient Multi-Scale Training
  paper | code
- [2019-CVPR] SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection (fast and low power)
  paper
- [2019-CVPR] ScratchDet: Training Single-Shot Object Detectors from Scratch
  paper | code
- [2019-CVPR] YOLOv3+: Assisted Excitation of Activations: A Learning Technique to Improve Object Detectors
  paper | code
- [2018-ECCV] Acquisition of Localization Confidence for Accurate Object Detection
  paper | code
Salient Object Detection
- [Survey] Salient Object Detection: A Survey
  paper
- [2019-CVPR] A Mutual Learning Method for Salient Object Detection with intertwined Multi-Supervision
  code
- [2019-CVPR] AFNet: Attentive Feedback Network for Boundary-aware Salient Object Detection
  code
- [2019-CVPR] A Simple Pooling-Based Design for Real-Time Salient Object Detection
  code
- [2019-CVPR] BASNet: Boundary-Aware Salient Object Detection
  paper | code
- [2019-CVPR] Contrast Prior and Fluid Pyramid Integration for RGBD Salient Object Detection
  paper | code
- [2019-CVPR] CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection
  paper | code
- [2019-CVPR] Cascaded Partial Decoder for Fast and Accurate Salient Object Detection (***)
  code
- [2019-CVPR] LFNet: Light Field Saliency Detection with Deep Convolutional Networks
  paper | code
- [2019-CVPR] Pyramid Feature Attention Network for Saliency detection (***)
  paper | code
- [2019-CVPR] Shifting More Attention to Video Salient Object Detection
  paper | code
Segmentation
- [2019-CVPR oral] CLAN: Category-level Adversaries for Semantics Consistent
  paper | code
- [2019-CVPR] BRS: Interactive Image Segmentation via Backpropagating Refinement Scheme (***)
  paper | code
- [2019-CVPR] DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation (used in camera)
  paper | code
- [2019-CVPR] DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency
  paper | code
- [2019-CVPR] Domain Adaptation (reducing the domain shift)
  paper
- [2019-CVPR] ELKPPNet: An Edge-aware Neural Network with Large Kernel Pyramid Pooling for Learning Discriminative Features in Semantic Segmentation
  paper | code
- [2019-CVPR oral] GLNet: Collaborative Global-Local Networks for Memory-Efficient Segmentation of Ultra-High Resolution Images
  paper | code
- [2019-CVPR] Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth (***SOTA)
  paper | code
- [2019-ECCV] ICNet: Real-Time Semantic Segmentation on High-Resolution Images
  paper | code
- [2019-CVPR] LEDNet: A Lightweight Encoder-Decoder Network for Real-Time Semantic Segmentation (***SOTA)
  paper | code
- [2019-arXiv] LightNet++: Boosted Light-weighted Networks for Real-time Semantic Segmentation
  paper | code
- [2019-CVPR] PTSNet: A Cascaded Network for Video Object Segmentation
  paper | code
- [2019-CVPR] PPGNet: Learning Point-Pair Graph for Line Segment Detection
  paper | code
- [2019-CVPR] Show, Match and Segment: Joint Learning of Semantic Matching and Object Co-segmentation
  paper | code
- [2019-CVPR] Video Instance Segmentation
  paper | code
- [2018-ECCV] BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation
  paper | [code](https://
Model Compression and Acceleration
- [collection] Collection of recent methods on DNN compression and acceleration
  https://github.com/MingSun-Tse/EfficientDNNs
- [collection] A curated list of neural network pruning resources
  https://github.com/he-y/Awesome-Pruning
- [collection] Model compression and acceleration research papers
  https://github.com/cedrickchee/awesome-ml-model-compression
- [ToolBox] Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research
  code
Pruning
- [2019-CVPR] An Improved Trade-off Between Accuracy and Complexity with Progressive Gradient Pruning (Prune)
  paper | code
- [2019-ICML] EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
  paper | code | code
- [2019-CVPR] FPGM: Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration
  paper | code
- [2019-CVPR] Importance Estimation for Neural Network Pruning
  code
Accelerating
- [2019-CVPR] SKNet: Selective Kernel Networks
  paper | code
- [2019-CVPR] SENet: Squeeze-and-Excitation Networks
  paper | code
- [2019-CVPR] ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
  paper
Motion & Pose
Pose Estimation
- [2019-CVPR] AlphaPose: Real-Time and Accurate Multi-Person Pose Estimation & Tracking System
  paper | code
- [2019-CVPR] CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark
  paper | code
- [2019-CVPR] Efficient Online Multi-Person 2D Pose Tracking with Recurrent Spatio-Temporal Affinity Fields (Oral)
  paper | code
- [2019-CVPR] EpipolarPose: Self-Supervised Learning of 3D Human Pose using Multi-view Geometry
  paper | code
- [2019-CVPR] Exploiting Temporal Context for 3D Human Pose Estimation in the Wild
  paper | code
- [2019-CVPR] Generating Multiple Hypotheses for 3D Human Pose Estimation With Mixture Density Network (SOTA)
  paper | code
- [2019-CVPR] Fast Human Pose Estimation (pytorch)
  paper | code
- [2019-CVPR] High-Resolution Representation Learning for Human Pose Estimation (SOTA)
  paper | code
- [2019-CVPR] Hand Shape and Pose Estimation from a Single RGB Image
  paper | code
- [2019-CVPR] In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations
  paper
- [2019-CVPR] VideoPose3D: 3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training
  code
- [2019-CVPR] XNect: Real-time Multi-person 3D Human Pose Estimation with a Single RGB Camera
  paper
Pose Transfer
Motion Tracking
- [2019-CVPR] ATOM: Accurate Tracking by Overlap Maximization (***SOTA)
  paper | code
- [2019-IEEE] FANTrack: 3D Multi-Object Tracking with Feature Association Network
  paper | code
- [2019-CVPR] Joint Monocular 3D Vehicle Detection and Tracking (***)
  paper | code
- [2019-CVPR] Leveraging Shape Completion for 3D Siamese Tracking
  paper | code
- [2019-CVPR Oral] Graph Convolutional Tracking (SOTA)
  code
- [2019-arXiv] Instance-Aware Representation Learning and Association for Online Multi-Person Tracking
  paper
- [2019-Github] Multi-people tracking (CenterNet-based person detector + Deep SORT algorithm with PyTorch) (SOTA)
  code
- [2019-CVPR] PoseFix: Model-agnostic General Human Pose Refinement Network
  paper | code
- [2019-CVPR Oral] Progressive Pose Attention Transfer for Person Image Generation
  paper | code
- [2019-CVPR] PifPaf: Composite Fields for Human Pose Estimation
  paper | code | code
- [2019-CVPR] SemGCN: Semantic Graph Convolutional Networks for 3D Human Pose Regression
  paper | code
- [2019-CVPR] MVPOSE: Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views (multi-person)
  paper | code
- [2019-CVPR] SiamMask: Fast Online Object Tracking and Segmentation: A Unifying Approach (***SOTA)
  paper | code
- [2019-CVPR] SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks (***SOTA)
  paper | code
Action Recognition
Keypoint Detection
- [2018-CVPR] OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation (***)
  code
Text Detection & Recognition
Detection
- [2019-CVPR] Arbitrary Shape Scene Text Detection with Adaptive Text Region Representation
  paper
- [2019-CVPR] A Multitask Network for Localization and Recognition of Text in Images (end-to-end)
  paper
- [2019-CVPR] AFDM: Handwriting Recognition in Low-resource Scripts using Adversarial Learning (data augmentation)
  paper | code
- [2019-CVPR] CRAFT: Character Region Awareness for Text Detection
  paper | code
- [2019-CVPR] Data Extraction from Charts via Single Deep Neural Network (*)
  paper
- [2019-CVPR] E2E-MLT: an Unconstrained End-to-End Method for Multi-Language Scene Text
  paper
- [2019-arXiv] FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition
  paper
- [2019-CVPR] Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes
  paper
- [2019-CVPR] PSENET: Shape Robust Text Detection with Progressive Scale Expansion Network
  paper
- [2019-CVPR] PMTD: Pyramid Mask Text Detector
  paper | code
- [2019-CVPR] Spatial Fusion GAN for Image Synthesis (word synthesis)
  [paper](https://arxiv.org/abs/1812.05840) | code
- [2019-CVPR] Scene Text Detection with Supervised Pyramid Context Network
  paper
- [2019-arXiv] TextField: Learning A Deep Direction Field for Irregular Scene Text Detection
  paper | code
- [2019-CVPR] Typography with Decor: Intelligent Text Style Transfer
  paper | code
- [2019-CVPR] TIOU: Tightness-aware Evaluation Protocol for Scene Text Detection (new evaluation tool)
  paper | code
- [2019-arXiv] MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition
  paper | code
- [2019-CVPR] Scene Text Magnifier
  paper
- [2018-CVPR] Pixel-Anchor: A Fast Oriented Scene Text Detector with Combined Networks
  paper
- [2018-ECCV] Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
  paper | code
- [2018-AAAI] PixelLink: Detecting Scene Text via Instance Segmentation
  paper | code
- [2018-CVPR] RRPN: Arbitrary-Oriented Scene Text Detection via Rotation Proposals
  paper | code