Lecture’13:’k,means’and’’...

Lecture 13 - Fei-Fei Li

Lecture 13: k-‐means and mean-‐shi4 clustering

Juan Carlos Niebles Stanford AI Lab

Professor Fei-‐Fei Li Stanford Vision Lab

3-‐Nov-‐15 1

Recap: Gestalt Theory •  Gestalt: whole or group

–  Whole is greater than sum of its parts –  RelaLonships among parts can yield new properLes/features

•  Psychologists idenLfied series of factors that predispose set of elements to be grouped (by human visual system)

Untersuchungen zur Lehre von der Gestalt, Psychologische Forschung, Vol. 4, pp. 301-350, 1923 http://psy.ed.asu.edu/~classics/Wertheimer/Forms/forms.htm

“I stand at the window and see a house, trees, sky. Theoretically I might say there were 327 brightnesses and nuances of colour. Do I have "327"? No. I have sky, house, and trees.”

Max Wertheimer (1880-1943)

3-‐Nov-‐15 2

Recap: Gestalt Factors

•  These factors make intuiLve sense, but are very difficult to translate into algorithms.

3-‐Nov-‐15 3

Recap: Image SegmentaLon

•  Goal: idenLfy groups of pixels that go together

3-‐Nov-‐15 4

What will we learn today?

•  K-‐means clustering •  Mean-‐shi4 clustering

3-‐Nov-‐15 5

Reading: [FP] Chapters: 14.2, 14.4 D. Comaniciu and P. Meer, Mean Shi4: A Robust Approach toward Feature Space Analysis, PAMI 2002.

3-‐Nov-‐15 6

Image SegmentaLon: Toy Example

•  These intensiLes define the three groups. •  We could label every pixel in the image according to which

of these primary intensiLes it is. –  i.e., segment the image based on the intensity feature.

•  What if the image isn’t quite so simple?

intensity input image

black pixels gray pixels

white pixels

Slide credit: Kristen Grauman

3-‐Nov-‐15 7

Pixel cou

Input image

Input image Intensity

Pixel cou

Intensity

3-‐Nov-‐15 8

•  Now how to determine the three main intensiLes that define our groups?

•  We need to cluster.

Input image Intensity

Pixel cou

3-‐Nov-‐15 9

•  Goal: choose three “centers” as the representaLve intensiLes, and label every pixel according to which of these centers it is nearest to.

•  Best cluster centers are those that minimize Sum of Square Distance (SSD) between all points and their nearest cluster center ci:

0 190 255

Intensity

3-‐Nov-‐15 10

SSD = x − ci( )2x∈clusteri∑

clusteri∑

Clustering for SummarizaLon

Goal: cluster to minimize variance in data given clusters – Preserve informaLon

Whether is assigned to

Cluster center Data

Slide: Derek Hoiem

c*, δ* = argminc, δ

∑ ci − x j( )2

x j ci

3-‐Nov-‐15 11

Clustering •  With this objecLve, it is a “chicken and egg” problem: –  If we knew the cluster centers, we could allocate points to groups by assigning each to its closest center.

–  If we knew the group memberships, we could get the centers by compuLng the mean per group.

3-‐Nov-‐15 12

K-‐means clustering 1.  IniLalize ( ): cluster centers

2.  Compute : assign each point to the closest center –  denotes the set of assignment for each to cluster at iteraLon t

3.  Computer : update cluster centers as the mean of the points

4.  Update , Repeat Step 2-‐3 Lll stopped

Slide: Derek Hoiem

c1,...,cK

δ t = argminδ

δ t−1iji

∑ ct−1i − x j( )2

ct = argminc

δ tiji

∑ ct−1i − x j( )2

t = t +1

technical

x j ciδ t

3-‐Nov-‐15 13

K-‐means clustering 1.  IniLalize ( ): cluster centers

2.  Compute : assign each point to the closest center

3.  Computer : update cluster centers as the mean of the points

4.  Update , Repeat Step 2-‐3 Lll stopped

Slide: Derek Hoiem

c1,...,cKtechn

note t = 0

•  Commonly used: random iniLalizaLon •  Or greedily choose K to minimize residual

•  Typical distance measure: •  Euclidean •  Cosine •  Others

•  doesn’t change anymore. ct

sim(x, !x ) = xT !xsim(x, !x ) = xT !x x ⋅ !x( )

t = t +1

ct = argminc

δ tiji

∑ ct−1i − x j( )2

3-‐Nov-‐15 14

K-‐means clustering

Illustration Source: wikipedia

1. Initialize Cluster Centers

2. Assign Points to Clusters

3. Re-compute Means

Repeat (2) and (3)

•  Java demo: hmp://home.dei.polimi.it/mameucc/Clustering/tutorial_html/AppletKM.html

3-‐Nov-‐15 15

•  Converges to a local minimum soluLon –  IniLalize mulLple runs

•  Bemer fit for spherical data

•  Need to pick K (# of clusters)

3-‐Nov-‐15 16

K-‐means clustering techn

SegmentaLon as Clustering

K=3 img_as_col = double(im(:)); cluster_membs = kmeans(img_as_col, K); labelim = zeros(size(im)); for i=1:k inds = find(cluster_membs==i); meanval = mean(img_as_column(inds)); labelim(inds) = meanval; end

3-‐Nov-‐15 17

K-‐Means++

•  Can we prevent arbitrarily bad local minima?

1. Randomly choose first center. 2. Pick new center with prob. proporLonal to

–  (ContribuLon of x to total error) 3. Repeat unLl K centers.

•  Expected error * opLmal

Arthur & Vassilvitskii 2007

3-‐Nov-‐15 18

x − ci( )2

=O log K( )

Feature Space •  Depending on what we choose as the feature space, we can group pixels in different ways.

•  Grouping pixels based on intensity similarity

•  Feature space: intensity value (1D) Slide credit: Kristen Grauman

3-‐Nov-‐15 19

Feature Space •  Depending on what we choose as the feature space, we can

group pixels in different ways.

•  Grouping pixels based on color similarity

•  Feature space: color value (3D)

R=255 G=200 B=250

R=245 G=220 B=248

R=15 G=189 B=2

R=3 G=12 B=2

3-‐Nov-‐15 20

Feature Space •  Depending on what we choose as the feature space, we can

group pixels in different ways.

•  Grouping pixels based on texture similarity

•  Feature space: filter bank responses (e.g., 24D)

Filter bank of 24 filters

3-‐Nov-‐15 21

Smoothing Out Cluster Assignments

•  Assigning a cluster label per pixel may yield outliers:

•  How can we ensure they are spaLally smooth? 1 2

Original Labeled by cluster center’s intensity

3-‐Nov-‐15 22

SegmentaLon as Clustering •  Depending on what we choose as the feature space, we can group pixels in different ways.

•  Grouping pixels based on intensity+posi9on similarity

⇒ Way to encode both similarity and proximity. Slide credit: Kristen Grauman

Intensity

3-‐Nov-‐15 23

K-‐Means Clustering Results •  K-‐means clustering based on intensity or color is essenLally vector quanLzaLon of the image amributes –  Clusters don’t have to be spaLally coherent

Image Intensity-‐based clusters Color-‐based clusters

3-‐Nov-‐15 24

K-‐Means Clustering Results

•  K-‐means clustering based on intensity or color is essenLally vector quanLzaLon of the image amributes –  Clusters don’t have to be spaLally coherent

•  Clustering based on (r,g,b,x,y) values enforces more spaLal coherence

3-‐Nov-‐15 25

How to evaluate clusters?

•  GeneraLve – How well are points reconstructed from the clusters?

•  DiscriminaLve – How well do the clusters correspond to labels?

•  Purity – Note: unsupervised clustering does not aim to be discriminaLve

Slide: Derek Hoiem 3-‐Nov-‐15 26

How to choose the number of clusters?

•  ValidaLon set – Try different numbers of clusters and look at performance

• When building dicLonaries (discussed later), more clusters typically work bemer

Slide: Derek Hoiem 3-‐Nov-‐15 27

K-‐Means pros and cons •  Pros

•  Finds cluster centers that minimize condiLonal variance (good representaLon of data)

•  Simple and fast, Easy to implement •  Cons

•  Need to choose K •  SensiLve to outliers •  Prone to local minima •  All clusters have the same parameters

(e.g., distance measure is non-‐adapLve)

•  *Can be slow: each iteraLon is O(KNd) for N d-‐dimensional points

•  Usage •  Unsupervised clustering •  Rarely used for pixel segmentaLon

3-‐Nov-‐15 28

3-‐Nov-‐15 29

Mean-‐Shi4 SegmentaLon

•  An advanced and versaLle technique for clustering-‐based segmentaLon

hmp://www.caip.rutgers.edu/~comanici/MSPAMI/msPamiResults.html

D. Comaniciu and P. Meer, Mean Shi4: A Robust Approach toward Feature Space Analysis, PAMI 2002. Slid

3-‐Nov-‐15 30

Mean-‐Shi4 Algorithm

•  IteraLve Mode Search 1.  IniLalize random seed, and window W 2.  Calculate center of gravity (the “mean”) of W: 3.  Shi4 the search window to the mean 4.  Repeat Step 2 unLl convergence

3-‐Nov-‐15 31

Region of interest

Center of mass

Mean Shift vector

Mean-‐Shi4

Slide by Y. Ukrainitz & B. Sarel

3-‐Nov-‐15 32

Region of interest

Center of mass

Mean Shift vector

Mean-‐Shi4

3-‐Nov-‐15 33

Region of interest

Center of mass

Mean Shift vector

Mean-‐Shi4

3-‐Nov-‐15 34

Region of interest

Center of mass

Mean Shift vector

Mean-‐Shi4

3-‐Nov-‐15 35

Region of interest

Center of mass

Mean Shift vector

Mean-‐Shi4

3-‐Nov-‐15 36

Region of interest

Center of mass

Mean Shift vector

Mean-‐Shi4

3-‐Nov-‐15 37

Region of interest

Center of mass

Mean-‐Shi4

3-‐Nov-‐15 38

Tessellate the space with windows Run the procedure in parallel Slide by Y. U

krainitz & B. Sarel

Real Modality Analysis

3-‐Nov-‐15 39

The blue data points were traversed by the windows towards the mode. Slide by Y. U

krainitz & B. Sarel

Real Modality Analysis

3-‐Nov-‐15 40

Mean-‐Shi4 Clustering

•  Cluster: all data points in the amracLon basin of a mode

•  AmracLon basin: the region for which all trajectories lead to the same mode

3-‐Nov-‐15 41

Mean-‐Shi4 Clustering/SegmentaLon •  Find features (color, gradients, texture, etc) •  IniLalize windows at individual pixel locaLons •  Perform mean shi4 for each window unLl convergence •  Merge windows that end up near the same “peak” or mode

3-‐Nov-‐15 42

Mean-‐Shi4 SegmentaLon Results

hmp://www.caip.rutgers.edu/~comanici/MSPAMI/msPamiResults.html Slid

3-‐Nov-‐15 43

More Results

3-‐Nov-‐15 44

More Results

3-‐Nov-‐15 45

•  Need to shi4 many windows… •  Many computaLons will be redundant.

Problem: ComputaLonal Complexity

3-‐Nov-‐15 46

Speedups: Basin of AmracLon

3-‐Nov-‐15 47

1.  Assign all points within radius r of end point to the mode.

Speedups

3-‐Nov-‐15 48

2.  Assign all points within radius r/c of the search path to the mode -‐> reduce the number of data points to search.

Summary Mean-‐Shi4 •  Pros

–  General, applicaLon-‐independent tool –  Model-‐free, does not assume any prior shape (spherical, ellipLcal, etc.) on data clusters

–  Just a single parameter (window size h) •  h has a physical meaning (unlike k-‐means)

–  Finds variable number of modes –  Robust to outliers

•  Cons –  Output depends on window size –  Window size (bandwidth) selecLon is not trivial –  ComputaLonally (relaLvely) expensive (~2s/image) –  Does not scale well with dimension of feature space

3-‐Nov-‐15 49

Lecture 13 - Fei-Fei Li 3-‐Nov-‐15 50

Mean-‐shi4 Algorithm

technical

Comaniciu & Meer, 2002

Lecture 13 - Fei-Fei Li 3-‐Nov-‐15 51

Mean-‐shi4 Algorithm

technical

Comaniciu & Meer, 2002

What will we have learned today

3-‐Nov-‐15 52

Lecture’13:’k,means’and’’...

Documents

Transcript of Lecture’13:’k,means’and’’...

Fituici FEI

Lecture7 xing fei-fei

hewan omni aify angel fei fei p4.docx

~. r. fei,

Apostila Vibrações -FEI

Matlab Fei

Cartolina FEI

Apostila Lab Polimeros FEI

Grile Fei

Lecture3 xing fei-fei

Lecture'9'&'10:'' Stereo'Vision'vision.stanford.edu/.../lecture9_10_stereo_cs131_2016.pdfFei-Fei Li Lecture 9 & 10 - Algorithm: • Re-project image planes onto a common plane parallel

FEI -8 FEI 0.5 - 10 14.0 16.0 8.0 10.0 [RoHS2 (AWG) (22-20) · 14.0 16.0 8.0 10.0 [rohs2 (awg) (22-20) 5- (20-19) (18) (16) fei o. fei fei fei fei 75-10 16.0 14.0 16.0 14.0 16.0 10.0

shi4.eduusolie.rushi4.eduusolie.ru/files/14-15sam_ob.pdf · 1 УТВЕРЖДАЮ Директор ГОКУ «Санаторная школа-интернат № 4» _________________А.Е.

Yi Dance_Yang Xue Fei

FEI - Dominio FEI 01 en-us

FEI - Institucional 2012

FEI-2014-2015 MGT.pdf

FEI 2015 presentation

Máquinas de Fluxo - FEI

FEI katalog 2011