Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now:...

50
Copyright©2018 NTT corp. All Rights Reserved. Causal Inference in Time Series via Supervised Learning (IJCAI2018, to appear) Yoichi Chikahara, Akinori Fujino NTT Communication Science Laboratories Kyoto, Japan

Transcript of Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now:...

Page 1: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Copyright©2018 NTT corp. All Rights Reserved.

Causal Inference in Time Series

via Supervised Learning

(IJCAI2018, to appear)

Yoichi Chikahara, Akinori Fujino

NTT Communication Science Laboratories

Kyoto, Japan

Page 2: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

2 Copyright©2018 NTT corp. All Rights Reserved.

A bit about myself

• Name: Yoichi Chikahara (近原 鷹一)

• Contact: [email protected]

• Education:

2013.03: B. Sc. from Keio University

2015.03: M. Info. Sci. & Tech. from University of Tokyo

DNA information analysis lab. (Miyano lab.) in Dept. of Computer Science

2015.04 – Now: Researcher @ NTT Communication Science

Laboratories

• Research:

Machine Learning, Bioinformatics / Systems Biology

Page 3: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Causal Inference

in Time Series

Page 4: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

4 Copyright©2018 NTT corp. All Rights Reserved.

Causal inference in time series

• Given time series data

• Infer causal relationships between variables

Input: Time Series Data

X

Y

X Y

cause effect

Output: Causal Relationships

Page 5: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

5 Copyright©2018 NTT corp. All Rights Reserved.

Application 1: Economics

• Finding that R&D expenditures influences

total sales is useful for companies

total sales

R&D

expenditures

cause effect

Page 6: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

6 Copyright©2018 NTT corp. All Rights Reserved.

Application 2: Bioinformatics

• Discovering gene regulatory relationships is

useful for drug discovery

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

Page 7: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

What is “causal relationship”?

How can we define causal

relationships between variables?

Page 8: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

8 Copyright©2018 NTT corp. All Rights Reserved.

A definition of temporal causality

X is the cause of Y

if the past values of X are helpful in predicting

the future values of Y

Clive W. J. Granger (1934-2009)

Granger causality [Granger1969]

Page 9: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

9 Copyright©2018 NTT corp. All Rights Reserved.

Existing approach:

Compare prediction errors with/without using values of X

If errors are significantly reduced

by using values of X,

X Y

cause effect

Y

X

Y

t

(Two) Regression

Models

Y

X

Y

Predicted Values Prediction Errors

Page 10: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

10 Copyright©2018 NTT corp. All Rights Reserved.

In Summary,

Regression

Models X Y Y

X

Page 11: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

11 Copyright©2018 NTT corp. All Rights Reserved.

Weakness: Model selection problem

X Y

Data1

Which regression model

should I use ?

X Y

X Y

Data2

Data3

VAR

Model

Gaussian

Processes

GAM

Page 12: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

12 Copyright©2018 NTT corp. All Rights Reserved.

Weakness: Model selection problem

• Problem

Selecting appropriate regression models is difficult

(needs a deep understanding of data analysis)

It is known that existing approach does not work

when regression models cannot be well fitted to data

Page 13: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

13 Copyright©2018 NTT corp. All Rights Reserved.

Our approach:

Causal inference via classification

X Y

Data1

Same

Classifier

X Y

X Y

Data2

Data3

No need to select

regression models!

Page 14: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

14 Copyright©2018 NTT corp. All Rights Reserved.

Related work [ICML15, JMLR15, CVPR17]: Causal inference from i.i.d. data via classification

• In fact, in case of i.i.d. data, there are several

existing methods based on classification

Copyright©2016 NTT corp. All Rights Reserved .

X→Y

X←Y

C , -‐‐‐ 2 05 5  0 1 .

Classifier

i.i.d. data

Page 15: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

15 Copyright©2018 NTT corp. All Rights Reserved.

Related work [ICML15, JMLR15, CVPR17]:

1) Train a classifier

Copyright©2016 NTT corp. All Rights Reserved .

X→Y

X←Y

C , -‐‐‐ 2 05 5  0 1 .

Classifier

Copyright©2016 NTT corp. All Rights Reserved .

X→Y

X←Y

C , -‐‐‐ 2 05 5  0 1 .

...

Training data

(Data where causal relationships are known)

Page 16: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

16 Copyright©2018 NTT corp. All Rights Reserved.

Copyright©2016 NTT corp. All Rights Reserved .

X→Y

X←Y

C , -‐‐‐ 2 05 5  0 1 .

Test Data

Related work [ICML15, JMLR15, CVPR17]: 2) Infer causal relationship by using trained classifier

Trained

Classifier

(Data where causal relationships

are unknown)

Page 17: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

17 Copyright©2018 NTT corp. All Rights Reserved.

Classifier

Test Data

Training Data

...

Our approach:

Causal inference from time series data via supervised learning

Page 18: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Classification approach seems good,

but how can we solve

Granger causality identification problem

via classification?

Page 19: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Classification approach seems good,

but how can we solve

Granger causality identification problem

via classification?

Page 20: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

20 Copyright©2018 NTT corp. All Rights Reserved.

• Granger causality assumes that

(Our method also uses the assumption)

X

Y X Y

At any time point t, the causal direction is the same

Revisiting assumption of Granger causality:

Causal direction never changes over time

Page 21: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

21 Copyright©2018 NTT corp. All Rights Reserved.

Revisiting definition of Granger causality

if the following holds:

X Y

cause effect

X

Y

t

at any time point t

Page 22: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

22 Copyright©2018 NTT corp. All Rights Reserved.

Revisiting definition of Granger causality

if the following holds:

X Y

cause effect

X

Y

t

Distribution of Yt+1

given past values of Y

Distribution of Yt+1

given past values of Y and X ≠

SX is useful in prediction!

Page 23: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

23 Copyright©2018 NTT corp. All Rights Reserved.

Revisiting definition of Granger causality

if

X Y

X Y

if

Page 24: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

24 Copyright©2018 NTT corp. All Rights Reserved.

Classifier

Building a classifier for Granger causality identification

If

If

If

X Y

Y X

X Y

Y X

X Y

Y X

, then assign

, then assign

, then assign

Label Assignment Rules

Page 25: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

25 Copyright©2018 NTT corp. All Rights Reserved.

Classifier

Building a classifier for Granger causality identification

If

If

If

X Y

Y X

X Y

Y X

X Y

Y X

, then assign

, then assign

, then assign

Label Assignment Rules

Test Data

Page 26: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

26 Copyright©2018 NTT corp. All Rights Reserved.

Building a classifier for Granger causality identification

If

If

If

then

then

then

Label Assignment Rules

Page 27: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Key information lies in distributions

-> To determine whether or not

the two distributions are identical,

how do we obtain feature vectors

for classification?

Page 28: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Key information lies in distributions

-> To determine whether or not

the two distributions are identical,

how do we obtain feature vectors

for classification?

Page 29: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

29 Copyright©2018 NTT corp. All Rights Reserved.

Representing features of distributions

E[Z]

to represent mean

Page 30: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

30 Copyright©2018 NTT corp. All Rights Reserved.

Representing features of distributions

E[Z]

E[Z2]

to represent

mean & variance

Page 31: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

31 Copyright©2018 NTT corp. All Rights Reserved.

Feature Space

(RKHS)

Representing features of distributions

• Kernel mean embedding: map a distribution

to a point in feature space called RKHS

When using Gaussian kernel,

Page 32: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

32 Copyright©2018 NTT corp. All Rights Reserved.

• By mapping distributions to points,

label assignment rules can be rephrased as

Feature Space

Reformulating label assignment rules

If

If

If

then

then

then

Feature Space

Page 33: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

33 Copyright©2018 NTT corp. All Rights Reserved.

Feature representation

• We only have to determine whether or not

the two points are equal over time t

• We obtain feature vectors by using the distance between the points (called maximum mean discrepancy (MMD) [Gretton+ NIPS2007]

in kernel method community)

Page 34: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

34 Copyright©2018 NTT corp. All Rights Reserved.

Feature representation

... (Since MMDs are finite sample estimates,

they cannot become exactly zero)

• By utilizing MMDs, we can obtain feature vectors

that are sufficiently different depending on Granger

causality

Page 35: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

35 Copyright©2018 NTT corp. All Rights Reserved.

Classifier (Random Forest)

Experiments

Test Data

Training Data

• linear time series from VAR model

• Nonlinear time series from VAR + sigmoid

...

Page 36: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

36 Copyright©2018 NTT corp. All Rights Reserved.

Experiment 1: Synthetic test data

• Prepare 300 pairs of bivariate time series

• Evaluate the number of time series whose causal

relationships are correctly inferred (i.e., Test Accuracy)

Linear Test Data

-- generated from VAR model

Nonlinear Test Data

-- generated from

Page 37: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

37 Copyright©2018 NTT corp. All Rights Reserved.

Test accuracy

Linear Test Data Nonlinear Test Data

Page 38: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

38 Copyright©2018 NTT corp. All Rights Reserved.

Test accuracy

Linear Test Data Nonlinear Test Data

Existing Granger causality methods

Test accuracy strongly depends on the regression model

Page 39: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

39 Copyright©2018 NTT corp. All Rights Reserved.

Test accuracy

Linear Test Data Nonlinear Test Data

GCKER < GCGAM

Kernel regression cannot be well fitted since time series are too short

Page 40: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

40 Copyright©2018 NTT corp. All Rights Reserved.

Test accuracy

Linear Test Data Nonlinear Test Data

Proposed > Existing classification approach for i.i.d. data

Our feature representation is effective

Page 41: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

41 Copyright©2018 NTT corp. All Rights Reserved.

Classifier

Real-world

Test Data

Synthetic

Training Data

...

e.g., River Runoff

X: Precipitation

Y: River runoff

(※truth: )

Experiment 2: Real-world test data

True causal directions are given in literatures

Page 42: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

42 Copyright©2018 NTT corp. All Rights Reserved.

Test accuracy

Our Proposed sufficiently worked better

than other methods

Page 43: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

How can we extend proposed approach

to multivariate time series?

Page 44: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

44 Copyright©2018 NTT corp. All Rights Reserved.

Granger causality definition for

multivariate time series

• Conditional Granger causality [Geweke JASA1984]:

compare two conditional distributions given past

values of the third variable Z

if X Y

X Y if

Z

Z

Page 45: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

45 Copyright©2018 NTT corp. All Rights Reserved.

Feature representation

• Similarly, we map conditional distributions to points

in feature spaces and measure the distance

• By using additional MMDs, we formulate feature

representation for multivariate time series

Page 46: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

46 Copyright©2018 NTT corp. All Rights Reserved.

Classifier

Real-world

Test Data

Synthetic

Training Data

Yeast cell cycle gene expression data [Spellman+ 1998]

14 variables (genes)

Experiment 3: Multivariate real-world data

True causal directions are given in database

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value

0 20 40 60 80 100

−1.0−0.5

0.00.5

1.01.5

2.0

time

value X Y

Z ...

Page 47: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

47 Copyright©2018 NTT corp. All Rights Reserved.

Macro F1 score and micro F1 score

※Higher is better

Page 48: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

48 Copyright©2018 NTT corp. All Rights Reserved.

Macro F1 score and micro F1 score

※Higher is better

Proposed with extended feature representation

worked better

Page 49: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

49 Copyright©2018 NTT corp. All Rights Reserved.

Conclusion

• Classification approach to Granger causality identification

Requires no selection of regression models

Performs sufficiently better than existing model-based approach

Can be applied to multivariate time series

• Future work:

Addressing more complicated setting

e.g., causal direction changes over time t

Page 50: Causal Inference in Time Series via Supervised Learning … · 2018-12-01 · 2015.04 – Now: Researcher @ NTT Communication Science Laboratories ... Causal inference from time series

Questions ?