[計算機視覺] 入門學習資料

時間 2021-08-13

標籤 php html c++ git 程序員 github web 算法 spring apache 欄目快樂工作简体版

原文原文鏈接

本文將介紹計算機視覺相關的經典書籍，頂級期刊/會議，在線學習課程，經常使用開源庫和安利小工具等。php

簡介html

計算機視覺（Computer Vision）c++

計算機視覺是一個跨學科領域, 涉及如何使計算機可以得到從數字圖像或視頻的高層次理解。從工程學的角度來看, 它尋求自動化人類視覺系統能夠作的任務。git

Reference：https://en.wikipedia.org/wiki/Computer_vision程序員

計算視覺相關條目github

Outline of computer visionweb

Reference：https://en.wikipedia.org/wiki/Outline_of_computer_vision算法

Category:Computer visionspring

Reference：https://en.wikipedia.org/wiki/Category:Computer_visionapache

經典書籍
《Multiple View Geometry in Computer Vision》Second Edition

計算機視覺中的一個基本問題是瞭解真實世界場景的幾個圖像的結構。解決這個問題的技術來自投影幾何和攝影測量。在這裏，做者按照相機投影矩陣，基本矩陣和三焦張量來描述幾何原理和它們的代數表示。這些實體的計算理論和方法都是用實例來討論的，正如它們用於重建多個圖像的場景同樣。新版本的特色是對書中的關鍵概念進行了擴展介紹（本書已經更新了其餘示例和附錄）以及自初版以來出現的重要新結果。提供了全面的背景材料，所以熟悉線性代數和基本數值方法的讀者能夠理解所提出的投影幾何和估計算法，並直接從書中實現算法。

注：理解3D到2D變換的必備書籍，如homography推導

Reference：http://www.robots.ox.ac.uk/~vgg/hzbook/

《Computer Vision: Algorithms and Applications》

此書是計算機視覺的聖經，介紹CV諸多方向，內容很全，十分適合做爲入門和進階書籍。介紹的內容有特徵檢測、匹配，分割，SFM，圖像拼接，立體視覺，三維重建等。

Reference：http://szeliski.org/Book/

《Computer Vision: Models, Learning, and Inference》

本書也是計算機視覺經典書籍，其以最小的先決條件爲基礎，從機率和模型擬合的基本知識入手，並提供可供讀者實施和修改以構建有用視覺系統的實例。主要面向高級本科生和研究生，詳細的方法論演示對計算機視覺從業者也頗有用。本書涵蓋尖端技術，包括圖形切割，機器學習和多視圖幾何的方法，顯示重要計算機視覺問題的解決方案，例如相機校準，人臉識別和對象跟蹤，描述了70多種算法。

Reference：http://www.computervisionmodels.com/

《An Invitation to 3d vision》

本書介紹了三維視覺的幾何結構，即從一系列二維圖像重建物體的三維模型。它詳細描述了兩個視圖幾何的經典理論。

Reference：http://vision.ucla.edu/MASKS/

《Concise Computer Vision: An Introduction into Theory and Algorithms》

本書重點突出重要算法和數學概念的做用。

Reference：http://www.springer.com/us/book/9781447163190

《Robot Vision》

This book presents a coherent approach to the fast moving field of machine vision, using a consistent notation based on a detailed understanding of the image formation process.

Reference：https://mitpress.mit.edu/books/robot-vision

《Computer Vision: A Modern Approach》

《Learning OpenCV》

OpenCV1.x教材書，本書於2008年出版（有中文版），是OpenCV1.x最佳書籍，早期學習OpenCV的程序員必備書籍。

Reference：http://it-ebooks.info/book/299/

《Learning OpenCV3》

OpenCV3.x教程書，本書於2016年出版（目前尚未中文版），是OpenCV3.x最佳書籍，如今學習OpenCV的程序可選書籍。

Reference：http://shop.oreilly.com/product/0636920044765.do

《數字圖像處理》岡薩雷斯

頂級CV期刊/會議
期刊

TPAMI：https://www.computer.org/web/tpami

IJCV：https://link.springer.com/journal/11263

會議

CVPR：http://cvpr2018.thecvf.com/

ICCV ：http://iccv2017.thecvf.com/

ECCV ：https://eccv2018.org/

SIGGRAPH ：https://www.siggraph.org//

NIPS：https://nips.cc/

在線學習課程
Annotated Computer Vision Bibliography: Table of Contents：超全的計算機視覺知識網址，羅列近20年的知識（目前還在更新中）

Reference：http://www.visionbib.com/

CVonline：很是全面的資源連接

Reference：http://homepages.inf.ed.ac.uk/rbf/CVonline/

CS131: Computer Vision: Foundations and Applications

Reference：http://vision.stanford.edu/teaching/cs131_fall1718/

CS231n: Convolutional Neural Networks for Visual Recognition

很是好的教程，CV與DL結合，深刻淺出！

Reference：http://cs231n.stanford.edu/CS231M: Mobile Computer Vision

CSE576: Computer Vision

Reference：http://courses.cs.washington.edu/courses/cse576/08sp/

Learn OpenCV

Reference：https://www.learnopencv.com/

經常使用開源庫
OpenCV：計算機視覺庫（支持C++/Python）

Reference：https://opencv.org/

EmguCV：主要支持.NET語言，如C#/VB/VC++

Reference：http://www.emgu.com/wiki/index.php/Main_Page

TensorFlow：TensorFlow™ 是一個採用數據流圖（data flow graphs），用於數值計算的開源軟件庫。節點（Nodes）在圖中表示數學操做，圖中的線（edges）則表示在節點間相互聯繫的多維數據數組，即張量（tensor）。它靈活的架構讓你能夠在多種平臺上展開計算，例如臺式計算機中的一個或多個CPU（或GPU），服務器，移動設備等等。TensorFlow 最初由Google大腦小組（隸屬於Google機器智能研究機構）的研究員和工程師們開發出來，用於機器學習和深度神經網絡方面的研究，但這個系統的通用性使其也可普遍用於其餘計算領域。

Reference：https://www.tensorflow.org/

Caffe：Caffe，全稱Convolutional Architecture for Fast Feature Embedding。是一種經常使用的深度學習框架，在視頻、圖像處理方面應用較多。

Reference：http://caffe.berkeleyvision.org/

Caffe2：Caffe2 aims to provide an easy and straightforward way for you to experiment with deep learning and leverage community contributions of new models and algorithms. You can bring your creations to scale using the power of GPUs in the cloud or to the masses on mobile with Caffe2's cross-platform libraries.

Reference：https://caffe2.ai/

Pytorch：PyTorch是使用GPU和CPU優化的深度學習張量庫。

Reference：http://pytorch.org/

Torch：Torch is a scientific computing framework with wide support for machine learning algorithms that puts GPUs first. It is easy to use and efficient, thanks to an easy and fast scripting language, LuaJIT, and an underlying C/CUDA implementation.

Reference：http://torch.ch/

MxNet：一個用於深度學習的靈活且高效的庫

Reference：http://mxnet.incubator.apache.org/index.html

Theano：Theano是一個Python庫, 它容許你有效地定義、優化和評估涉及多維數組的數學表達式。

Reference：http://deeplearning.net/software/theano/

Keras：Keras 是一種高層次的神經網絡 API，它以 Python 編寫，可以在 TensorFlow、CNTK 或 Theano上運行。

Reference：https://keras.io/

Dlib：Dlib 是一個c++工具包，包含機器學習算法和工具，用於在 c++中建立複雜的軟件以解決現實世界中的問題。它在工業和學術界都有普遍的應用，包括機器人、嵌入式設備、移動電話和大型高性能計算環境。Dlib 的開源受權容許您在任何應用程序中無償使用它。

Reference：http://dlib.net/

安利小工具

Google Scholar：強大的論文搜索引擎

Reference：https://scholar.glgoo.org/