CV Code | 計算機視覺開源周報20191001期

  • 2019 年 12 月 27 日
  • 筆記

本圖出自OpenVSLAM‍

‍總結了過去一周新出的計算機視覺開源代碼。

ICCV 2019 臨近,不少論文和相應代碼公布,也包括其中的WorkShop的工作。

涵蓋的方向包括視覺SLAM、基於標記的SLAM、3D 重建、視線跟蹤、植物蟲害圖像檢測識別、人體姿態估計、視頻目標分割、語義分割等。

SLAM

OpenVSLAM: A Versatile Visual SLAM Framework

Shinya Sumikura, Mikiya Shibuya, Ken Sakurada

ACM Multimedia 2019 Open Source Software Competition

https://arxiv.org/abs/1910.01122v1

https://github.com/xdspacelab/openvslam

SLAM

TagSLAM: Robust SLAM with Fiducial Markers

Bernd Pfrommer, Kostas Daniilidis

https://arxiv.org/abs/1910.00679v1

https://berndpfrommer.github.io/tagslam_web

3D 重建

Learning Continuous 3D Reconstructions for Geometrically Aware Grasping

Mark Van der Merwe, Qingkai Lu, Balakumar Sundaralingam, Martin Matak, Tucker Hermans

ICRA 2020

https://arxiv.org/abs/1910.00983v1

視線跟蹤

RITnet: Real-time Semantic Segmentation of the Eye for Gaze Tracking

Aayush K.Chaudhary, Rakshit Kothari, Manoj Acharya, Shusil Dangi, Nitinraj Nair, Reynold Bailey, Christopher Kanan, Gabriel Diaz, Jeff B. Pelz

ICCV 2019 Workshop OpenEDS Semantic Segmentation Challenge for Eye images

https://arxiv.org/abs/1910.00694v1

https://bitbucket.org/eye-ush/ritnet/

植物蟲害圖像檢測與識別

Research on insect pest image detection and recognition based on bio-inspired methods

Loris Nanni, Gianluca Maguolo, Fabio Pancino

https://arxiv.org/abs/1910.00296v1

https://github.com/LorisNanni/

機械人推動行為數據集

Omnipush: accurate, diverse, real-world dataset of pushing dynamics with RGB-D video

Maria Bauza, Ferran Alet, Yen-Chen Lin, Tomas Lozano-Perez, Leslie P. Kaelbling, Phillip Isola, Alberto Rodriguez

IROS 2019

https://arxiv.org/abs/1910.00618v1

https://web.mit.edu/mcube/omnipush-dataset/

基於膠囊網絡的半監督視頻目標分割

CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing

Kevin Duarte, Yogesh S Rawat, Mubarak Shah

ICCV 2019

https://arxiv.org/abs/1910.00132v1

https://github.com/KevinDuarte/CapsuleVOS

物體 6D 姿態估計

CullNet: Calibrated and Pose Aware Confidence Scores for Object Pose Estimation

Kartik Gupta, Lars Petersson, Richard Hartley

ICCV Workshop on Recovering 6D Object Pose, 2019

https://arxiv.org/abs/1909.13476v1

https://github.com/kartikgupta-at-anu/CullNet

用於語義分割的最大方差損失域適應

Domain Adaptation for Semantic Segmentation with Maximum Squares Loss

Minghao Chen, Hongyang Xue, Deng Cai

ICCV 2019

https://arxiv.org/abs/1909.13589v1

https://github.com/ZJULearning/MaxSquareLoss

單網絡全人體的姿態估計方法

Single-Network Whole-Body Pose Estimation

Gines Hidalgo, Yaadhav Raaj, Haroon Idrees, Donglai Xiang, Hanbyul Joo, Tomas Simon, Yaser Sheikh

ICCV 2019

https://arxiv.org/abs/1909.13423v1

https://github.com/CMU-Perceptual-Computing-Lab/openpose_train

OpenPose 升級,CMU提出首個單網絡全人體姿態估計網絡,速度大幅提高

EdgeCNN:CNN用於邊緣計算

EdgeCNN: Convolutional Neural Network Classification Model with small inputs for Edge Computing

Shunzhi Yang, Zheng Gong, Kai Ye, Yungen Wei, Zheng Huang, Zhenhua Huang

https://arxiv.org/abs/1909.13522v1

https://github.com/yangshunzhi1994/EdgeCNN

視頻目標分割 | 基於可微分掩膜匹配方法

DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation

Xiaohui Zeng, Renjie Liao, Li Gu, Yuwen Xiong, Sanja Fidler, Raquel Urtasun

ICCV 2019

https://arxiv.org/abs/1909.12471v1

https://github.com/ZENGXH/DMM_Net

實時多目標跟蹤

Towards Real-Time Multi-Object Tracking

Zhongdao Wang, Liang Zheng, Yixuan Liu, Shengjin Wang

https://arxiv.org/abs/1909.12605v1

https://github.com/Zhongdao/Towards-Realtime-MOT

業界首個實時多目標跟蹤系統開源

3D 人體姿態與形狀重建的學習方法

Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop

Nikos Kolotouros, Georgios Pavlakos, Michael J. Black, Kostas Daniilidis

ICCV 2019

https://arxiv.org/abs/1909.12828v1

https://seas.upenn.edu/~nkolot/projects/spin

一種可學習的樹濾波器,用於結構保持的特徵變換,嵌入到語義分割網絡中,有效改進了分割精度

Learnable Tree Filter for Structure-preserving Feature Transform

Lin Song, Yanwei Li, Zeming Li, Gang Yu, Hongbin Sun, Jian Sun, Nanning Zheng

NeurIPS-2019

https://arxiv.org/abs/1909.12513v1

https://github.com/StevenGrove/TreeFilter-Torch