1/23 Ant Colony Optimization for Hyperbox Clustering and its Application to HPV Virus Classification...
-
Upload
abigail-conley -
Category
Documents
-
view
218 -
download
0
Transcript of 1/23 Ant Colony Optimization for Hyperbox Clustering and its Application to HPV Virus Classification...
1/23
Ant Colony Optimization for Hyperbox Clustering and its Application to HPV Virus Classificationハイパーボックス・クラスタリングのためのアント・コロニ最適化とHPVウィルス判別への応用
知能システム科学専攻 廣田研究室Guilherme Novaes RAMOS
04M35692
2/23
Pattern recognition Text Speech Image Customer profile Chemical compounds Microarrays …
Motivation
3/23
Human PapillomaVirus
HPV virus
HPV symptom
Cervical HPVs Oral HPVs
Research is not very advanced
Proper treatment
Local risk profile
Cancer
Early diagnosis
4/23
Proposal Hyperbox
5/23
Background:Ant Colony Optimization
Dorigo [IEEE, 97]
Characteristics Versatile Robust Population based
1/3
6/23
3
Background:Hyperboxes Simpson [91]
Defines a region in an n-dimensional space
Described by 2 vectors
Simplest classifierIf x H1 Then x Class 1
2/3
7/23
Background:Existing applications ACO
Cemetery approach Partition matrix
Hyperbox Min-max fuzzy neural networks
Pattern classification Clustering
Classifiers
3/3
8/23
Hyperbox clustering with Ant Colony Optimization
Ants scatter hyperboxes in the feature space
Objective: maximize hyperbox density
9/23
HACOInitialization
Start
Build solution
Local optimization
Update pheromone
Criteria?
Stop
Load data
Define C
Initialize pheromone
Y
N
1/6
Define Clusters
10/23
HACO
ExploitationExploration
Probability
Assign hyperbox
2/6
Initialization
Start
Build solution
Local optimization
Update pheromone
Criteria?
Stop
Y
N
Define Clusters
11/23
HACO Hyperbox density
Generate neighbor
Change solution
N
Y
Probability
Density?
3/6
Initialization
Start
Build solution
Local optimization
Update pheromone
Criteria?
Stop
Y
N
Define Clusters
12/23
HACO
ij : pheromone value : trail persistance best : hyperbox density
of best solution
4/6
Initialization
Start
Build solution
Local optimization
Update pheromone
Criteria?
Stop
Y
N
Define Clusters
bestijij tt )1()(
13/23
HACO Fitness (density) Number of
iterations Comparison with
previous solutions …
5/6
0.85
0.86
0.87
0.88
0.89
0.9
0.91
0.92
0.93
0.94
0.95
1 61 121181241301361 421481541601661721781841901961
HACO
0
200
400
600
800
1000
1200
1400
1 62 123184 245306367428489550611672733794 855916977
ACO
Initialization
Start
Build solution
Local optimization
Update pheromone
Criteria?
Stop
Y
N
Define Clusters
Density
Iteration
Fitness
14/23
HACO Overlapping Nearest neighbor
6/6
Initialization
Start
Build solution
Local optimization
Update pheromone
Criteria?
Stop
Y
N
Define Clusters
15/23
Specifications Pentium M 1.6GHz, 512 MB of RAM C++ Suse Linux
Data sets 3 computer generated HPV
Experiments
16/23
Experiments - Dataset 1
1/6
FCMACOHACO
150 samples, 2 dimensions
NN
17/23
Experiments - Dataset 2
2/6
FCMACOHACO
302 samples, 2 dimensions
NN
18/23
Experiments - Dataset 3
3/6
FCMACOHACO
600 samples, 2 dimensions
NN
19/23
Experiments - Results
4/6
NN FCM ACOTime
Fitness Accuracy Time Fitness Accuracy Time Fitness Accuracy
DS1 0.02
889.5 100% 0.04 913.6 74% 6.1 477.5 70.7%
DS2 0.05
19164.6
100% 0.1 24457.4
48.7% 16.2 11089.2
56%
DS3 0.10
594.1 100% 0.2 1743.7 100% 33 4754.6 72.2%
HACO
D = 1 D = 2 D = 3Time
Fitness Accuracy Time Fitness Accuracy Time Fitness Accuracy
DS1 3.5 889.5 100% 1 889.5 100% 1 489.89 65.3%
DS2 33.8
19164.6
100% 9.9 19164.6
100% 6.1 19164.6
100%
DS3 33.8
594.1 100% 10 594.1 100% 5.1 594.1 100%
20/23
Experiments - HPV Data Department of stomatology Dentistry School
Characteristics 199 samples 42 attributes
5/6
21/23
Experiments - Results
6/6
NN FCM ACOTime
Fitness Accuracy Time Fitness Accuracy Time Fitness Accuracy
HPV 0.30
49534.3
68.4% 0.70 21880.3
50.8% 42.0 12452.4
52.6%
HACO
D = 1 D = 2 D = 3Time
Fitness Accuracy Time Fitness Accuracy Time Fitness Accuracy
HPV 5.80
25555.1
71.4% 2 31918.1
68.1% 1 31522.1
67.6%
22/23
Conclusions Pattern recognition
Probable HPV risk profile
Advantages Higher accuracy Competitive runtime
ACO (HPV) 29.1% - 36.3% more accurate 82.6% - 97.6% faster
23/23
Perspectives
Test with larger data sets Automatic parameter setting Hyperbox shape optimization Compare/Apply other tools
GA SOM …
24/23
Thank you for your attention
25/23
HPV statistics
Over 100 viruses
500,000 new cases of cancer diagnosed each year
200,000 deaths each year
26/23
Parameters
27/23
Hyperbox Number
: search space ratio n : attributes Dk : k-th dimension length xk : k-th attribute of samples
n
k k
k
D
xrangeroundC
1
)(
1/6