
1

4. Data Reduction

Department of Applied Chemistry, 송상옥

2

Outline

- The need for data reduction
- The role and forms of dimension reduction
- Specific methods of dimension reduction

3

Why Is Data Reduction Needed?

Too much data:
- exceeds the capacity of the prediction program
- increases the time needed to find a solution

The appropriate amount of data:
- depends on the complexity of the concepts contained in the data (the complexity of the model)
- cannot be known before mining
- e.g., random data

4

The Role of Dimension Reduction

5

Forms of Dimension Reduction

- Delete a column (feature)
- Delete a row (case)
- Reduce the number of values in a column (smooth a feature)
- Transform to a new data set (PCA)
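A minimal sketch of these four operations, assuming pandas and scikit-learn; the column names, sizes, and random data are placeholders, not part of the original slides.

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
df = pd.DataFrame(rng.normal(size=(100, 4)), columns=["f1", "f2", "f3", "f4"])

# 1. Delete a column (drop a feature).
fewer_features = df.drop(columns=["f4"])

# 2. Delete a row (drop a case).
fewer_cases = df.drop(index=0)

# 3. Reduce the number of values in a column (smooth a feature by binning).
df["f1_binned"] = pd.cut(df["f1"], bins=5, labels=False)

# 4. Transform to a new data set with fewer columns (PCA).
new_data = PCA(n_components=2).fit_transform(df[["f1", "f2", "f3", "f4"]])
```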

6

Best Feature Selection

Finding the best feature subset exhaustively is impossible:
- the search space is too large
- the computational time is prohibitive

Approximations:
- evaluate only promising subsets
- use a simple distance measure
- use only the training error

7

Mean and Variance

- Cases are a sample from some distribution
- The spreadsheet mean and variance can be computed, BUT the distribution itself is unknown
- Mean and variance therefore serve as guidance for heuristic feature selection

8

Independent Features

- Classification problem
- k-class classification: k pairwise comparisons
- Regression treated as pseudo-classification

Significance test for a single feature between classes A and B:

sig = \frac{\left|\mathrm{mean}_A - \mathrm{mean}_B\right|}{se(A, B)}

se(A, B) = \sqrt{\frac{\mathrm{var}_A}{n_A} + \frac{\mathrm{var}_B}{n_B}}
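A minimal sketch of this significance test in NumPy (an assumed implementation, not from the slides); the function name and sample data are placeholders.

```python
import numpy as np

def significance(a: np.ndarray, b: np.ndarray) -> float:
    """Return |mean_A - mean_B| / se(A, B) for one feature."""
    se = np.sqrt(a.var(ddof=1) / len(a) + b.var(ddof=1) / len(b))
    return abs(a.mean() - b.mean()) / se

# Example: one feature measured for two classes.
rng = np.random.default_rng(1)
class_a = rng.normal(loc=0.0, scale=1.0, size=200)
class_b = rng.normal(loc=0.5, scale=1.0, size=150)
print(significance(class_a, class_b))  # large values suggest a useful feature
```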

9

Distance Based Selection

- Combine independent-feature analysis with correlation analysis to detect redundancy
- Distance measure (shown below for independent features)

Branch-and-Bound Algorithm

D_M = (M_1 - M_2)\, C^{-1}\, (M_1 - M_2)^T

(M_1, M_2: class mean vectors; C: covariance matrix.) For independent features this reduces to

D_m = \sum_i \frac{(m_{i1} - m_{i2})^2}{\mathrm{var}_{i1} + \mathrm{var}_{i2}}

Branch-and-Bound exploits the monotonicity D_M(F) \ge D_M(F - i) to prune subsets.
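A minimal sketch of the independent-feature distance D_m in NumPy (an assumed implementation, not from the slides); the function name is a placeholder.

```python
import numpy as np

def dm_distance(x1: np.ndarray, x2: np.ndarray) -> float:
    """x1, x2: arrays of shape (n_cases, n_features) for class 1 and class 2."""
    m1, m2 = x1.mean(axis=0), x2.mean(axis=0)
    v1, v2 = x1.var(axis=0, ddof=1), x2.var(axis=0, ddof=1)
    return float(np.sum((m1 - m2) ** 2 / (v1 + v2)))

# Larger D_m over a candidate feature subset means better class separation;
# a branch-and-bound search can prune subsets whose D_m is already too small.
```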

10

Heuristic Feature Selection

Comparison measures:
- significance test (sig)
- distance measure (D_m)
- F-test

11

Principal Components

Merging features:
- a new set of fewer columns
- keep only the first k components

First principal component:
- minimum euclidean distance to the cases

A feature with a large variance has excellent chances for separating classes or groups of case values.

New data set: the original cases projected onto the principal components.
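A minimal sketch with scikit-learn (not from the slides) of keeping the first k principal components; the 90% variance threshold and the random data are placeholder assumptions.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(2)
data = rng.normal(size=(200, 6))          # placeholder data set
pca = PCA().fit(data)

# Choose k so that, e.g., 90% of the variance is retained.
cumulative = np.cumsum(pca.explained_variance_ratio_)
k = int(np.searchsorted(cumulative, 0.90) + 1)
reduced = PCA(n_components=k).fit_transform(data)   # new set of fewer columns
```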

12

Decision Trees

Dynamic logic approach:
- feature selection is coordinated with the search for a solution
- advantageous in large feature spaces
- recursive partitioning
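A minimal sketch with scikit-learn (not from the slides): a decision tree selects features dynamically while it searches for a solution, so features it never splits on can be dropped. The data and labels are placeholders.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(3)
X = rng.normal(size=(300, 10))                        # placeholder features
y = (X[:, 0] + X[:, 3] > 0).astype(int)               # only f0 and f3 matter

tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)
used = np.flatnonzero(tree.feature_importances_ > 0)  # features actually split on
print(used)
```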

13

Reducing Values Problem

Reducing the number of values in a feature is essentially a clustering problem.

14

Rounding

Rounding x_i to k digits (replace x_i by the nearest multiple of 10^k):

1. y_i = \mathrm{int}(x_i / 10^k)
2. if \mathrm{mod}(x_i, 10^k) \ge 10^k / 2 then y_i = y_i + 1
3. x_i = y_i \cdot 10^k
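A minimal NumPy sketch of this rounding procedure (an assumed implementation, not from the slides), for non-negative values; the function name and example values are placeholders.

```python
import numpy as np

def round_to_k_digits(x: np.ndarray, k: int) -> np.ndarray:
    """Round each non-negative value to the nearest multiple of 10**k."""
    scale = 10 ** k
    y = (x // scale).astype(np.int64)                 # step 1: y_i = int(x_i / 10^k)
    y = np.where(x % scale >= scale / 2, y + 1, y)    # step 2: round up if remainder >= 10^k / 2
    return y * scale                                  # step 3: x_i = y_i * 10^k

print(round_to_k_digits(np.array([1234, 1250, 1249]), 2))  # -> [1200 1300 1200]
```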

15

K-Means Clustering
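A minimal sketch with scikit-learn (not from the slides) of reducing a feature's values with k-means: each value is replaced by the mean of its cluster. The cluster count and data are placeholder assumptions.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(4)
feature = rng.normal(size=(500, 1))                       # placeholder feature column

km = KMeans(n_clusters=5, n_init=10, random_state=0).fit(feature)
smoothed = km.cluster_centers_[km.labels_].ravel()        # 500 values -> 5 distinct values
```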

16

Class Entropy

ent(k) = -\sum_i \Pr(C_i) \log \Pr(C_i) \qquad \text{(class entropy of cluster } k\text{)}

Err = \sum_k ent(k) \cdot \frac{n_k}{N}

where n_k is the number of cases in cluster k and N is the total number of cases.
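A minimal NumPy sketch of this class-entropy error (an assumed implementation, not from the slides); the function names and the example bins are placeholders.

```python
import numpy as np

def entropy(labels: np.ndarray) -> float:
    """ent(k): class entropy of one bin/cluster of cases."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return float(-np.sum(p * np.log2(p)))

def partition_error(bins: list[np.ndarray]) -> float:
    """Err: entropy of each bin weighted by its share of the N cases."""
    n_total = sum(len(b) for b in bins)
    return sum(entropy(b) * len(b) / n_total for b in bins)

# bins: class labels of the cases that fall into each bin of the reduced feature
bins = [np.array([0, 0, 0, 1]), np.array([1, 1, 0])]
print(partition_error(bins))
```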

17

How many Cases?

- The appropriate sample size depends on the complexity of the problem and is closely tied to the prediction method
- Goal: an adequate solution within a short time, hence case reduction
- Basic approach: random sampling (sketched below)
  - incremental samples
  - average samples
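A minimal sketch of incremental random sampling (not from the slides): train on growing random samples and stop once the error stops improving. The starting size, doubling schedule, and case count are placeholder assumptions.

```python
import numpy as np

def incremental_sample_sizes(n_cases: int, start: int = 500):
    """Yield growing sample sizes, e.g. 500, 1000, 2000, ... up to n_cases."""
    size = start
    while size < n_cases:
        yield size
        size *= 2
    yield n_cases

rng = np.random.default_rng(5)
n_cases = 20000
for size in incremental_sample_sizes(n_cases):
    idx = rng.choice(n_cases, size=size, replace=False)   # random subsample of cases
    # Fit the prediction method on the cases in idx, measure the test error,
    # and stop when the error no longer improves significantly.
```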

18

A Single Sample

19

Incremental Samples

20

Average Samples

- Reduces the variance error without introducing additional bias
- Often the approach that yields the best solution
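A minimal sketch with scikit-learn (not from the slides): fit the same method on several random samples and average the predictions, which lowers variance error without adding bias. The regression data, sample count, and sample size are placeholder assumptions.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(6)
X = rng.normal(size=(10000, 3))                     # placeholder cases
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(scale=0.1, size=10000)
X_new = rng.normal(size=(5, 3))

preds = []
for _ in range(5):                                  # several independent samples
    idx = rng.choice(len(X), size=1000, replace=False)
    model = LinearRegression().fit(X[idx], y[idx])
    preds.append(model.predict(X_new))

averaged = np.mean(preds, axis=0)                   # averaged prediction over samples
```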

21

Specialized Techniques

Sequential sampling over time:
- time-dependent data
- optimize the trade-off between the sampling period and the feature measurements

Strategic sampling of key events:
- sample when the net change exceeds a threshold (regression)

Adjusting prevalence:
- replicate cases of low-prevalence classes (see the sketch below)
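A minimal NumPy sketch of adjusting prevalence (not from the slides): replicate the cases of a rare class so the prediction method sees it more often. The function name and replication factor are placeholder assumptions.

```python
import numpy as np

def oversample_rare_class(X, y, rare_label, factor=5, seed=0):
    """Repeat the rare class's cases `factor` times in total, then reshuffle."""
    rng = np.random.default_rng(seed)
    rare = np.flatnonzero(y == rare_label)
    idx = np.concatenate([np.arange(len(y))] + [rare] * (factor - 1))
    rng.shuffle(idx)
    return X[idx], y[idx]
```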