讲解人 : 崔振 2010.9.17 Supervised Translation-Invariant Supervised Translation-Invariant...

讲解人 : 崔振2010.9.17

Supervised Translation-Supervised Translation-InvariantInvariant

Sparse CodingSparse Coding

[Jianchao Yang, Kai Yu, Thomas Huang]

提纲

•作者信息•文章信息•拟解决的问题•本文的方法•实验•结论

提纲

Jianchao Yang

jyang29 @ifp.uiuc.edu

Image Formation & Processsing Group (IFP), University of Illinois at Urbana-Champaign (UIUC)

Ph.D. Candidate (06-Present, ECE, UIUC) ; Ph.D. Adviser: Prof. Thomas S. Huang

B.Eng (02-06, EEIS, USTC)

Publication（第一作者） CVPR ： 4篇， 2 篇 oral TIP ： 2篇 ECCV10 ， 1篇 ICIP,1篇

Homepage: http://www.ifp.illinois.edu/~jyang29/

Kai Yu

Machine Learning researcher and the Head of Media Analytics Department at NEC Laboratories America. Inc..

Ph.D. Computer Science, University of Munich,Germany, January 2001 – July 2004.

B.Sc and M.Sc, Nanjing University.

Research Interests Areas: machine learning, data mining, information

retrieval, computer vision CVPR(4),ECCV(4+),ICML(8+),NIPS(10+),…

http://www.dbs.informatik.uni-muenchen.de/~yu_k/

Thomas Huang

Beckman Institute Image Formation and Processing and Artificial Intelligence groups.

William L. Everitt Distinguished Professor in the U of I Department of Electrical and Computer Engineering and the Coordinated Science Lab (CSL);

Sc.D. from MIT in 1963

computer vision, image compression and enhancement, pattern recognition, and multimodal signal processing.

http://www.beckman.illinois.edu/directory/t-huang1

提纲

文章信息

文章出处 CVPR10 （ oral）

相关文章 Yang et al. Linear spatial pyramid matching using

sparse coding for image classification. CVPR’09.

Abstract

In this paper, we propose a novel supervised hierarchical sparse coding model based on local image descriptors for classification tasks. The supervised dictionary training is performed via back-projection, by minimizing the training error of classifying the image level features, which are extracted by max pooling over the sparse codes within a spatial pyramid. Such a max pooling procedure across multiple spatial scales offer the model translation invariant properties, similar to the Convolutional Neural Network (CNN). Experiments show that our supervised dictionary improves the performance of the proposed model significantly over the unsupervised dictionary, leading to state-of-the-art performance on diverse image databases. Further more, our supervised model targets learning linear features, implying its great potential in handling large scale datasets in real applications.

摘要

针对分类任务，提出了一种新颖的基于局部图像描述子的监督分级稀疏编码模型。

通过 back-projection方法，以最小化在图像层级特征(image level features)的分类误差训练监督词典。其中图像层级特征是以空间金字塔为结构max pooling稀疏编码。在多种空间尺度下max pooling方法具有平移不变的特性，如同 CNN(Convolutional Neural Network)一样。

实验证明，与无监督词典相比，监督词典明显地改善了模型的性能，并且在多个图像数据库拥有最好的表现。

另外，监督模型目标是学习线性特征，它蕴含了一个巨大潜能 -实时地处理大规模数据库。

提纲

拟解决的问题

Image classification To find a generic feature representation Interested in linear prediction model

Sparse Coding for Image Classification

Sparse Coding Unsupervised Supervised

Sparse coding on holistic image

-Linear model assumption

-Sensitive to image misalignment

-Limited applications

D. Bradley et al. ‘08

J. Wright et al. ’09

A. Wagner et al.’09

D. Bradley et al. ‘08

J. Marialet al. ’08

Q. Zhang. CVPR10

Sparse coding on local descriptors

-Break linear model assumption for the image space

-Robust to image misalignment

-Applicableto generic image

classification

R. Rainaet al. ’07

J. Yang et al. ’09

J. Yang et al. ’10

提纲

本文的方法框架相关知识本文模型求解方法

框架

Bag of coordinatedLocal descriptors

High-dimensionalsparse codes

Imagerepresentation

It must be a cool Cat!

Descriptor extraction

nonlinear coding

feature pooling

classification

J. Yang et al. Linear spatial pyramid matching using sparse coding for image classification. CVPR’09.

Yang. CVPR09

已有方法

Histogram-based SPM feature Step 1: local descriptor extraction Step 2: vector quantization (e.g.k-means) Step 3: hierarchical average pooling Step 4: nonlinear SVM

The framework of ScSPM （ CVPR09） Step 1: local descriptor extraction Step 2: sparse coding (无监督词典 ) Step 3: hierarchical max pooling Step 4: linear SVM

讲解人 : 崔 振 2010.9.17 Supervised Translation-Invariant Supervised Translation-Invariant...

Documents

Transcript of 讲解人 : 崔 振 2010.9.17 Supervised Translation-Invariant Supervised Translation-Invariant...

translation-invariant operators on spaces of vector-valued functions

INVARIANT DIFFERENTIAL OPERATORS AND MEIXNER …

PERBANDINGAN METODE KLASIFIKASI SUPERVISED …sinasinderaja.lapan.go.id/files/prosiding/2014/bukuprosiding_505... · Klasifikasi supervised maximum likelihood merupakan klasifikasi

INVARIANT MEASURES AND ARITHMETIC QUANTUM UNIQUE …

METODE MOMENT INVARIANT DAN BACKPRORAGATION …

SELF SUPERVISED REPRESENTATION LEARNING WITH RELATIVE ...

Semi-Supervised SVM

2.supervised learning(epoch#2)-2

Semi supervised learning Türkçe

Grayscale Template-Matching Invariant to Rotation, … Template-Matching Invariant to Rotation, Scale, Translation, Brightness and Contrast Hae Yong Kim and Sidnei Alves de Araújo

Semi-Supervised Learning

Bounded Invariant Equivalence Relationsrzepecki/pubdir/prace/thesis_111018.… · Bounded Invariant Equivalence Relations doctoral thesis supervised by prof. Krzysztof Krupin ski

Linear Time-Invariant Dynamical Systems

Supervised Injecting Facilities ‘The case’

Semi-supervised Active Learning Survey

Translation invariant operators on Lorentz spacesarchive.numdam.org › article › ASNSP_1987_4_14_2_257_0.pdf · 2019-05-10 · Translation Invariant Operators on Lorentz Spaces

Chirurgie et second invariant de Yamabe

N-Tupling Transformations and Invariant Deﬁnite · N-Tupling Transformations and Invariant Deﬁnite ... isa“doubling”ofthegraphof F(x) ... the ψ in (4) is just a translation,

Scale-Invariant Feature Transform (SIFT)

75502390 Invariant Theory 1

讲解人 : 崔振 2010.9.17 Supervised Translation-Invariant Supervised Translation-Invariant...

Transcript of 讲解人 : 崔振 2010.9.17 Supervised Translation-Invariant Supervised Translation-Invariant...