[CVPR2015] Is object localization for free? – Weakly-supervised learning with convolutional neural n

時間 2019-11-17

標籤 cvpr2015 cvpr object localization free weakly supervised learning convolutional neural 简体版

原文原文鏈接

Is object localization for free? –Weakly-supervised learning with convolutional neural networks. Maxime Oquab, Leon Bottou, Ivan Laptev, Josef Sivicweb

http://www.di.ens.fr/~josef/publications/Oquab15.pdf網絡

亮點app

一個好名字給了讓讀者開始閱讀的理由
global max pooling over sliding window的定位方法值得借鑑

方法spa

本文的目標是：設計一個弱監督分類網絡，注意本文的目標主要是提高分類。由於是2015年的文章，方法比較簡單原始。設計

Following three modifications to a classification network.blog

Treat the fully connected layers as convolutions, which allows us to deal with nearly arbitrary-sized images as input.

The aim is to apply the network to bigger images in a sliding window manner thus extending its output to n×m× K, where n and m denote the number of sliding window positions in the x- and y- direction in the image, respectively.
3xhxw —> convs —> kxmxn (k: number of classes)

Explicitly search for the highest scoring object position in the image by adding a single global max-pooling layer at the output.

kxmxn —> kx1x1
The max-pooling operation hypothesizes the location of the object in the image at the position with the maximum score

Use a cost function that can explicitly model multiple objects present in the image.

由於圖中可能有不少物體，因此多類的分類loss不適用。做者把這個任務視爲多個二分類問題，loss function和分類的分數以下three

trainingip

muti-scale testci

實驗rem

classification

mAP on VOC 2012 test: ＋3.1% compared with [56]
mAP on VOC 2012 test: ＋7.6% compared with kx1x1 output and single scale training
mAP on VOC: ＋2.6% compared with RCNN
mAP on COCO 62.8%

Localisation

Metric: if the maximal response across scales falls within the ground truth bounding box of an object of the same class within 18 pixels tolerance, we label the predicted location as correct. If not, then we count the response as a false positive (it hit the background), and we also increment the false negative count (no object was found).
metric on VOC 2012 val: -0.3% compared with RCNN
mAP on COCO 41.2%

缺點

定位評測的metric不具備權威性
max pooling改成average pooling會不會對於多個instance的狀況更好一些

相關標籤/搜索

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。