Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang...

33
Intelligent Database Systems Lab Advisor Dr. Hsu Graduate Yu Cheng Chen Author Yongqiang Cao Jianhong Wu 國國國國國國國國 National Yunlin University of Science and Technology Projective ART for clustering data sets in high dimensional spaces Neural Networks , 2003. Proceedings. 2002 Elsevier Sci ence Ltd

description

Intelligent Database Systems Lab Motivation Most clustering algorithms do not work efficiently for data sets in high dimensional spaces because of the inherent sparsity of data. Consequently, a clustering algorithms is often preceded by feature selection, but a feature selection procedure can lead to a significant loss of information. N.Y.U.S.T. I.M.

Transcript of Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang...

Page 1: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Advisor : Dr. HsuGraduate : Yu Cheng ChenAuthor : Yongqiang Cao

Jianhong Wu

國立雲林科技大學National Yunlin University of Science and Technology

Projective ART for clustering data sets in high dimensional spaces

Neural Networks , 2003. Proceedings. 2002 Elsevier Science Ltd

Page 2: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Outline Motivation Objective Introduction Projective adaptive resonance theory Algorithms Simulation and comparisons Conclusions Personal Opinion Review

N.Y.U.S.T.I.M.

Page 3: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Motivation Most clustering algorithms do not work efficiently for da

ta sets in high dimensional spaces because of the inherent sparsity of data.

Consequently, a clustering algorithms is often preceded by feature selection, but a feature selection procedure can lead to a significant loss of information.

N.Y.U.S.T.I.M.

Page 4: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Objective PART and the resulting algorithms are proposed to find

projected clusters for data sets in high dimensional spaces.

N.Y.U.S.T.I.M.

Page 5: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Introduction Projected Clustering The goal of Projected clustering is to find projected

clustering, each of which consists of a subset C of data points together with a subset D of dimensions such that the points in C are closely correlated in the subspace of dimensions D.

N.Y.U.S.T.I.M.

Page 6: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Introduction ART1 architecture

N.Y.U.S.T.I.M.

Page 7: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

PART architecture

N.Y.U.S.T.I.M.Introduction

Page 8: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Introduction The principal difference between PART and ART is

selectively sends signals in F1 layer to nodes in the F2 layer.

In other words, a node in the F1 layer can be active relative to some F2 nodes, but inactive relative to other F2 nodes.

An F1 node is active is determined by a similarity test between the corresponding top-down weight and the signal generated in the F1 node.

N.Y.U.S.T.I.M.

Page 9: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

We define the selective output signal of node vi to node v

j by

N.Y.U.S.T.I.M.Projective adaptive resonance

(1) )()),((),,( 1 ijjiijiijiij zlzxfhzzxhh

(2) b)d(a, if 0

b)d(a, if 1),(

bah

(3) z if 0

z if 1)(

ij

ij

ijzl

We say that vi is active to vj if hij=1,and inactive to vj if hij=0

Page 10: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

STM equations

N.Y.U.S.T.I.M.Projective adaptive resonance

iii Ix

dtdx

(4) 1 0

(5) )()1( jjjjjj JCxBJAxx

dtdx

(7) )(J

(6) )(J

2,

-j

j

Fvjkk

jj

k

xg

Txgwhere

user
Page 11: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

STM equations

N.Y.U.S.T.I.M.Projective adaptive resonance

(8) ),,(11

Fvi

jiijiijFv

ijijj zzxhzhzTi

(9) otherwise 0

winnera is vnode if 1)( j

2

jxf

F2 layer makes a choice by winner-take –all paradigm

Page 12: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

LTM equations

N.Y.U.S.T.I.M.Projective adaptive resonance

1,

2

(10) )],,(

),,()1)[((

Fvikjkkjkij

jiijiijjij

k

zzxhz

zzxLhzxfdt

dz

nactiveif vj is i

jve to v is inactii, but v is activejif v

to v is activei and v is activejif v

ijzX

XijzLijzdt

dz

j

ij

0

||

)1|(|)1(

Page 13: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

LTM equations

N.Y.U.S.T.I.M.Projective adaptive resonance

)11()( 12 ted is commit)] if v(xfz[xfdt

dzjijij

ji

)12()( 12 mitted is noncom)] if v(xfz[xfdt

dzjijij

ji

(14) inactive is vif 0

active is vif )(

j

j1

ijiji xfzdt

dz

(15) inactive is vif 0active is vif )(

j

i1 ijiij xfzdt

dz

Page 14: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Vigilance and reset

and we reset the winner vj if and only if

N.Y.U.S.T.I.M.Projective adaptive resonance

(16) i

ijj hr

(17) jr

Page 15: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

The extension of PART architecture: PART tree

N.Y.U.S.T.I.M.Projective adaptive resonance

Page 16: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Algorithms F1 activation and computation of hij

Here, we take f1(xi)=xi, and by Eq. (4), xi=Ii

N.Y.U.S.T.I.M.

(1) )()),((),,( 1 ijjiijiijiij zlzxfhzzxhh

(18) )(),( ijjiiij zlzIhh

(19) /),( bebabad

Page 17: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

F2 activation and selection of winner

We compute the input Tj to the committed F2 node vj

by Eq. (8), and then select the winner.

N.Y.U.S.T.I.M.Algorithms

(8) ),,(11

Fvi

jiijiijFv

ijijj zzxhzhzTi

Page 18: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Vigilance and resetWe use the vigilance and reset mechanism show in Eqs. (16) and (17). Namely winner vj is reset if and only if

N.Y.U.S.T.I.M.Algorithms

(16) i

jj hir

(17) jr

jr

Page 19: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

LearningFor the committed winning F2 node vj which has passed the vigilance test, we have

N.Y.U.S.T.I.M.Algorithms

(21) 0

) 1/(

j v toinactive is i vnode 1F if

j v toactive is i vnode 1F ifXLLz new

ij

Page 20: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

LearningFor the committed winning F2 node vj which has passed the vigilance test, we have

N.Y.U.S.T.I.M.Algorithms

(22) )1( ioldji

newji αIz-αz

Page 21: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

LearningFor a noncommitted winner vj, and for every F1 node vi we have

N.Y.U.S.T.I.M.Algorithms

(24)

(23) )1/(

inewji

newij

Iz

mLLz

Page 22: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

PART tree algorithm

N.Y.U.S.T.I.M.Algorithms

Page 23: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

N.Y.U.S.T.I.M.Simulations and comparisons

Page 24: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Simulations and comparisonsN.Y.U.S.T.

I.M.

Page 25: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Simulations and comparisonsN.Y.U.S.T.

I.M.

Page 26: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Simulations and comparisons

Data set 1 with 10,000 data points and number of clusters k=5

N.Y.U.S.T.I.M.

Page 27: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Simulations and comparisons N.Y.U.S.T.

I.M.

Page 28: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Simulations and comparisons N.Y.U.S.T.

I.M.

Page 29: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Simulations and comparisons N.Y.U.S.T.

I.M.

Page 30: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Simulations and comparisons N.Y.U.S.T.

I.M.

Page 31: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Conclusions

PART provides a solution to the feasibility-reliability dilemma in clustering data sets in high dimensional spaces.

N.Y.U.S.T.I.M.

Page 32: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

Personal OpinionN.Y.U.S.T.

I.M.

Page 33: Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.

Intelligent Database Systems Lab

ReviewN.Y.U.S.T.

I.M.