CV Code | Computer Vision Open-Source Weekly, Issue 20191001
- December 27, 2019
- Notes

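Image source: OpenVSLAM
This issue summarizes the computer vision open-source code released over the past week.
With ICCV 2019 approaching, many papers and their code have been released, including work from the ICCV workshops.
Topics covered include visual SLAM, marker-based SLAM, 3D reconstruction, gaze tracking, plant insect pest image detection and recognition, human pose estimation, video object segmentation, and semantic segmentation.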
SLAM
OpenVSLAM: A Versatile Visual SLAM Framework
Shinya Sumikura, Mikiya Shibuya, Ken Sakurada
ACM Multimedia 2019 Open Source Software Competition
https://arxiv.org/abs/1910.01122v1
https://github.com/xdspacelab/openvslam
SLAM
TagSLAM: Robust SLAM with Fiducial Markers
Bernd Pfrommer, Kostas Daniilidis
https://arxiv.org/abs/1910.00679v1
https://berndpfrommer.github.io/tagslam_web
3D reconstruction
Learning Continuous 3D Reconstructions for Geometrically Aware Grasping
Mark Van der Merwe, Qingkai Lu, Balakumar Sundaralingam, Martin Matak, Tucker Hermans
ICRA 2020
https://arxiv.org/abs/1910.00983v1
Gaze tracking
RITnet: Real-time Semantic Segmentation of the Eye for Gaze Tracking
Aayush K. Chaudhary, Rakshit Kothari, Manoj Acharya, Shusil Dangi, Nitinraj Nair, Reynold Bailey, Christopher Kanan, Gabriel Diaz, Jeff B. Pelz
ICCV 2019 Workshop OpenEDS Semantic Segmentation Challenge for Eye images
https://arxiv.org/abs/1910.00694v1
https://bitbucket.org/eye-ush/ritnet/
Plant insect pest image detection and recognition
Research on insect pest image detection and recognition based on bio-inspired methods
Loris Nanni, Gianluca Maguolo, Fabio Pancino
https://arxiv.org/abs/1910.00296v1
https://github.com/LorisNanni/
Robot pushing dataset
Omnipush: accurate, diverse, real-world dataset of pushing dynamics with RGB-D video
Maria Bauza, Ferran Alet, Yen-Chen Lin, Tomas Lozano-Perez, Leslie P. Kaelbling, Phillip Isola, Alberto Rodriguez
IROS 2019
https://arxiv.org/abs/1910.00618v1
https://web.mit.edu/mcube/omnipush-dataset/
Semi-supervised video object segmentation with capsule networks
CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing
Kevin Duarte, Yogesh S Rawat, Mubarak Shah
ICCV 2019
https://arxiv.org/abs/1910.00132v1
https://github.com/KevinDuarte/CapsuleVOS
6D object pose estimation
CullNet: Calibrated and Pose Aware Confidence Scores for Object Pose Estimation
Kartik Gupta, Lars Petersson, Richard Hartley
ICCV Workshop on Recovering 6D Object Pose, 2019
https://arxiv.org/abs/1909.13476v1
https://github.com/kartikgupta-at-anu/CullNet
Domain adaptation for semantic segmentation with a maximum squares loss
Domain Adaptation for Semantic Segmentation with Maximum Squares Loss
Minghao Chen, Hongyang Xue, Deng Cai
ICCV 2019
https://arxiv.org/abs/1909.13589v1
https://github.com/ZJULearning/MaxSquareLoss
Single-network whole-body pose estimation
Single-Network Whole-Body Pose Estimation
Gines Hidalgo, Yaadhav Raaj, Haroon Idrees, Donglai Xiang, Hanbyul Joo, Tomas Simon, Yaser Sheikh
ICCV 2019
https://arxiv.org/abs/1909.13423v1
https://github.com/CMU-Perceptual-Computing-Lab/openpose_train
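An upgrade to OpenPose: CMU proposes the first single-network whole-body pose estimation network, with a large speed improvement
EdgeCNN: a CNN for edge computing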
EdgeCNN: Convolutional Neural Network Classification Model with small inputs for Edge Computing
Shunzhi Yang, Zheng Gong, Kai Ye, Yungen Wei, Zheng Huang, Zhenhua Huang
https://arxiv.org/abs/1909.13522v1
https://github.com/yangshunzhi1994/EdgeCNN
Video object segmentation | A differentiable mask-matching method
DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation
Xiaohui Zeng, Renjie Liao, Li Gu, Yuwen Xiong, Sanja Fidler, Raquel Urtasun
ICCV 2019
https://arxiv.org/abs/1909.12471v1
https://github.com/ZENGXH/DMM_Net
Real-time multi-object tracking
Towards Real-Time Multi-Object Tracking
Zhongdao Wang, Liang Zheng, Yixuan Liu, Shengjin Wang
https://arxiv.org/abs/1909.12605v1
https://github.com/Zhongdao/Towards-Realtime-MOT
Learning-based reconstruction of 3D human pose and shape
Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop
Nikos Kolotouros, Georgios Pavlakos, Michael J. Black, Kostas Daniilidis
ICCV 2019
https://arxiv.org/abs/1909.12828v1
https://seas.upenn.edu/~nkolot/projects/spin
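A learnable tree filter for structure-preserving feature transforms; embedded in semantic segmentation networks, it effectively improves segmentation accuracy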
Learnable Tree Filter for Structure-preserving Feature Transform
Lin Song, Yanwei Li, Zeming Li, Gang Yu, Hongbin Sun, Jian Sun, Nanning Zheng
NeurIPS 2019
https://arxiv.org/abs/1909.12513v1
https://github.com/StevenGrove/TreeFilter-Torch


