GLM I: Introduction to Generalized Linear Models

By Curtis Gary Dean, Vice President and Actuary

Classical Multiple Linear Regression

Yi = a0 + a1Xi1 + a2Xi2 …+ amXim + ei

Yi are the response variables and Xij are predictors

ei is random error term: Normally distributed

E[ei] = 0 and Var(ei) = σ² is constant

Classical Multiple Linear Regression

μi = E[Yi] = a0 + a1Xi1 + …+ amXim

Yi is Normally distributed random variable with variance σ²

Want to estimate μi = E[Yi] for each i
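
As a minimal sketch of the classical model, the least squares fit below estimates a0, a1, a2 and the fitted means μi. The data are synthetic: the sample size, the seed, and the "true" coefficients are assumptions made up for illustration and do not come from the slides.

```python
import numpy as np

# Synthetic data: sizes, seed and "true" coefficients are illustrative assumptions.
rng = np.random.default_rng(0)
n = 200
X = rng.normal(size=(n, 2))                   # predictors X_i1, X_i2
true_a = np.array([2.0, 0.5, -1.5])           # a0, a1, a2 (assumed values)
design = np.column_stack([np.ones(n), X])     # column of ones carries a0

# Y_i = a0 + a1*X_i1 + a2*X_i2 + e_i, with e_i ~ Normal(0, sigma^2 = 1)
y = design @ true_a + rng.normal(scale=1.0, size=n)

# Least squares estimate of (a0, a1, a2)
a_hat, *_ = np.linalg.lstsq(design, y, rcond=None)

# Fitted means mu_i = E[Y_i] for each i
mu_hat = design @ a_hat

print("estimated coefficients:", a_hat)
```

At the least squares solution the residuals y - mu_hat are orthogonal to every column of the design matrix, which is a quick check that the fit has been computed correctly.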

Response Yi has Normal Distribution

[Figure: Normal density curve of Yi centered at μi]

Problems with Traditional Model

Number of claims is discrete

Claim sizes are skewed to the right

Probability of an event is in [0,1]

Variance is not constant across data points i

Nonlinear relationship between X’s and Y’s

Generalized Linear Models - GLMs

Fewer restrictions

Y can model number of claims, probability of renewing, loss severity, loss ratio, etc.

Large and small policies can be put into one model

Y can be nonlinear function of X’s

Classical linear regression model is a special case

g(μi) = a0 + a1Xi1 + …+ amXim

g( ) is the link function

E[Yi] = μi = g⁻¹(a0 + a1Xi1 + …+ amXim)

Yi can be Normal, Poisson, Gamma, Binomial, Compound Poisson, …

Variance can be modeled

Exponential Family of Distributions

$f(y; \theta, \Phi) = \exp\left[ \frac{y\theta - b(\theta)}{a(\Phi)} + c(y, \Phi) \right]$

$Var[Y] = b''(\theta)\, a(\Phi)$

$a(\Phi) = \Phi / w$ is a common assumption

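Normal Distribution in Exponential Family

$f(y; \mu, \sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left[ -\frac{(y-\mu)^2}{2\sigma^2} \right] = \exp\left[ \frac{y\mu - \mu^2/2}{\sigma^2} - \frac{1}{2}\left( \frac{y^2}{\sigma^2} + \ln(2\pi\sigma^2) \right) \right]$

$\theta = \mu$, $b(\theta) = \theta^2/2$ and $a(\Phi) = \sigma^2$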

Some Math Rules: Refresher

(1) $\exp(x) = e^x$

(2) $\ln[\exp(x)] = \exp[\ln(x)] = x$

(3) $\ln(xy) = \ln x + \ln y$

(4) $\ln(x^a) = a \ln x$

(5) $\ln(1/x) = \ln(x^{-1}) = -\ln x$

(6) $\ln(x/y) = \ln x - \ln y$

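Poisson Distribution in Exponential Family

$\Pr[Y = y] = \frac{\mu^y e^{-\mu}}{y!} = \exp\left[ \ln\left( \frac{\mu^y e^{-\mu}}{y!} \right) \right]$

$\Pr[Y = y] = \exp\left[ \frac{y \ln\mu - \mu}{1} - \ln(y!) \right]$

$\theta = \ln\mu$, $b(\theta) = e^{\theta}$ and $a(\Phi) = 1$

$Var[Y] = b''(\theta)\, a(\Phi) = \frac{d^2 e^{\theta}}{d\theta^2} \cdot 1 = e^{\theta} = \mu$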
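
The Poisson identities on this slide can be checked numerically. The sketch below compares the usual Poisson probability with its exponential family form (θ = ln μ, b(θ) = exp(θ), a = 1) and confirms that the mean and variance both equal μ. The value of μ and the truncation point of the sums are arbitrary choices for illustration.

```python
import math

def poisson_pmf(y, mu):
    """Pr[Y = y] in the usual form mu^y * e^(-mu) / y!."""
    return mu ** y * math.exp(-mu) / math.factorial(y)

def poisson_expfam(y, mu):
    """Same probability written as exp[(y*theta - b(theta))/a + c(y)]."""
    theta = math.log(mu)          # canonical parameter theta = ln(mu)
    b = math.exp(theta)           # b(theta) = e^theta
    a = 1.0                       # a(Phi) = 1 for the Poisson
    c = -math.lgamma(y + 1)       # c(y) = -ln(y!), since lgamma(y + 1) = ln(y!)
    return math.exp((y * theta - b) / a + c)

mu = 1.7                          # arbitrary illustrative mean
support = range(60)               # truncation; the tail beyond 60 is negligible here

mean = sum(y * poisson_pmf(y, mu) for y in support)
var = sum((y - mean) ** 2 * poisson_pmf(y, mu) for y in support)

print(poisson_pmf(3, mu), poisson_expfam(3, mu))
print(mean, var)
```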

Compound Poisson Distribution

Y = C1 + C2 + . . . + CN

N is Poisson random variable

Ci are i.i.d. with Gamma distribution

This is an example of a Tweedie distribution

Y is member of Exponential Family
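
A short simulation can make the compound Poisson construction concrete. In this sketch the Poisson mean and the Gamma shape and scale are assumed values chosen only for illustration. It uses the fact that the sum of N i.i.d. Gamma(shape, scale) variables is Gamma(N × shape, scale), and that Y = 0 whenever N = 0, which gives the distribution a point mass at zero.

```python
import math
import numpy as np

rng = np.random.default_rng(42)

# Illustrative parameters (assumptions, not taken from the slides)
lam = 2.0                      # Poisson mean of the claim count N
shape, scale = 2.0, 500.0      # Gamma severity C_i with mean shape * scale = 1000

def simulate_compound_poisson(n_sims):
    """Draw Y = C_1 + ... + C_N with N ~ Poisson(lam), C_i ~ Gamma(shape, scale)."""
    counts = rng.poisson(lam, size=n_sims)
    totals = np.zeros(n_sims)            # Y = 0 when N = 0
    positive = counts > 0
    # Sum of N i.i.d. Gamma(shape, scale) draws is Gamma(N * shape, scale)
    totals[positive] = rng.gamma(shape * counts[positive], scale)
    return totals

y = simulate_compound_poisson(200_000)

print("simulated mean:", y.mean(), "theoretical:", lam * shape * scale)
print("share of zeros:", (y == 0).mean(), "theoretical:", math.exp(-lam))
```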

Members of the Exponential Family

• Normal
• Poisson
• Binomial
• Gamma
• Inverse Gaussian
• Compound Poisson

Variance Structure

E[Yi] = μi = b'(θi) → θi = (b')⁻¹(μi)

Var[Yi] = a(Φi) b''(θi) = a(Φi) V(μi)

Common form: Var[Yi] = Φ V(μi)/wi

Φ is constant across data but weights applied to data points

Variance Functions V(μ)

Distribution        V(μ)
Normal              μ^0 = 1
Poisson             μ
Binomial            μ(1 − μ)
Tweedie             μ^p, 1 < p < 2
Gamma               μ^2
Inverse Gaussian    μ^3

Recall: Var[Yi] = Φ V(μi)/wi
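
The variance functions and the common form Var[Yi] = Φ V(μi)/wi translate directly into code. This is a minimal sketch; the function and dictionary names are hypothetical, and the Tweedie power of 1.5 is just one example value inside 1 < p < 2.

```python
def power_variance(p):
    """Return the power-type variance function V(mu) = mu**p."""
    return lambda mu: mu ** p

# Variance functions by distribution (names are illustrative)
V = {
    "Normal": power_variance(0),
    "Poisson": power_variance(1),
    "Tweedie": power_variance(1.5),           # any p with 1 < p < 2; 1.5 is an example
    "Gamma": power_variance(2),
    "Inverse Gaussian": power_variance(3),
    "Binomial": lambda mu: mu * (1.0 - mu),   # mu is a probability in (0, 1)
}

def variance(dist, mu, phi=1.0, w=1.0):
    """Var[Y_i] = phi * V(mu_i) / w_i."""
    return phi * V[dist](mu) / w

print(variance("Poisson", 4.0))
print(variance("Gamma", 3.0, phi=0.5, w=2.0))
```

A larger weight wi lowers the variance of that data point, which is how large and small policies can share one model.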

Variance at Point and Fit

A Practitioner’s Guide to Generalized Linear Models: A CAS Study Note

Why Exponential Family?

Distributions in Exponential Family can model a variety of problems

Standard algorithm for finding coefficients a0, a1, …, am

Modeling Number of Claims

Policy   Sex   Territory   Number of Claims in 5 Years
1        M     02          0
2        F     01          0
3        F     01          0
4        F     02          1
5        F     01          0
6        F     02          1
7        M     02          2
8        M     02          2
9        M     02          1
10       F     01          1
:        :     :           :

Assume a Multiplicative Model

μi = expected number of claims in five years

μi = BF,01 x CSex(i) x CTerr(i)

If i is Female and Terr 01

→ μi = BF,01 x 1.00 x 1.00

Multiplicative Model

μi = exp(a0 + aSXS(i) + aTXT(i))

μi = exp(a0) x exp(aSXS(i)) x exp( aTXT(i))

i is Female → XS(i) = 0; Male → XS(i) = 1

i is Terr 01 → XT(i) = 0; Terr 02 → XT(i) = 1

Natural Log Link Function

ln(μi) = a0 + aSXS(i) + aTXT(i)

μi is in (0 , ∞ )

ln(μi) is in ( - ∞ , ∞ )

$\Pr[Y_i = y_i] = \exp\left[ \frac{y_i \ln\mu_i - \mu_i}{1} - \ln(y_i!) \right]$

Natural Log is Canonical Link for Poisson

θi = ln(μi)

θi = a0 + aSXS(i) + aTXT(i)

Estimating Coefficients a1, a2, …, am

Classical linear regression uses least squares

GLMs use Maximum Likelihood Method

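Likelihood and Log Likelihood

$L(y_1, y_2, \ldots; \theta_1, \theta_2, \ldots) = \prod_i f(y_i; \theta_i)$

$l(y_1, y_2, \ldots; \theta_1, \theta_2, \ldots) = \ln[L(y_1, y_2, \ldots; \theta_1, \theta_2, \ldots)]$

$l(y_1, y_2, \ldots; \theta_1, \theta_2, \ldots) = \sum_i \ln[f(y_i; \theta_i)]$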

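Find a0, aS, and aT for Poisson

$l(y_1, y_2, \ldots; \theta_1, \theta_2, \ldots) = \sum_i \left[ y_i \theta_i - e^{\theta_i} - \ln(y_i!) \right]$

$= \sum_i \left[ y_i (a_0 + a_S X_{S(i)} + a_T X_{T(i)}) - e^{a_0 + a_S X_{S(i)} + a_T X_{T(i)}} - \ln(y_i!) \right]$

Maximize:

$\frac{\partial l}{\partial a_0} = \sum_i (y_i - \mu_i) = 0$

$\frac{\partial l}{\partial a_S} = \sum_i (y_i - \mu_i) X_{S(i)} = 0$

$\frac{\partial l}{\partial a_T} = \sum_i (y_i - \mu_i) X_{T(i)} = 0$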

Iterative Numerical Procedure to Find ai’s

Use statistical package or actuarial software

Specify link function and distribution type
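
One common iterative procedure is Newton-Raphson, which for a canonical link coincides with Fisher scoring (iteratively reweighted least squares). The sketch below applies it to a Poisson model with log link using only the ten policies visible in the claims table. The slides do not say which algorithm or software produced their results, and the full data set is not shown, so these estimates are not expected to match the solution quoted in the deck. The function name and starting values are assumptions.

```python
import numpy as np

# Ten visible policies; columns: intercept, X_S (1 = Male), X_T (1 = Terr 02)
X = np.array([
    [1, 1, 1], [1, 0, 0], [1, 0, 0], [1, 0, 1], [1, 0, 0],
    [1, 0, 1], [1, 1, 1], [1, 1, 1], [1, 1, 1], [1, 0, 0],
], dtype=float)
y = np.array([0, 0, 0, 1, 0, 1, 2, 2, 1, 1], dtype=float)  # claims in 5 years

def fit_poisson_log_link(X, y, tol=1e-10, max_iter=100):
    """Maximize the Poisson log likelihood with log link by Newton-Raphson."""
    a = np.zeros(X.shape[1])
    a[0] = np.log(y.mean())                 # start from the overall mean
    for _ in range(max_iter):
        mu = np.exp(X @ a)                  # mu_i = exp(a0 + aS*X_S + aT*X_T)
        score = X.T @ (y - mu)              # gradient: sum of (y_i - mu_i) * x_i
        info = X.T @ (X * mu[:, None])      # information matrix X' diag(mu) X
        step = np.linalg.solve(info, score)
        a = a + step
        if np.max(np.abs(step)) < tol:
            break
    return a

a0, aS, aT = fit_poisson_log_link(X, y)
mu_hat = np.exp(X @ np.array([a0, aS, aT]))

print("coefficients:", a0, aS, aT)
print("multiplicative factors:", np.exp([a0, aS, aT]))
```

At convergence the score vector X'(y - μ) is zero, which is exactly the set of three maximization equations for a0, aS, and aT.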

Solution to Our Example

a0 = -.288 → exp(-.288) = .75

aS = .262 → exp(.262) = 1.3

aT = .095 → exp(.095) = 1.1

μi = .75 x 1.3^XS(i) x 1.1^XT(i)

i is Male, Terr 01 → μi = .75 x 1.3^1 x 1.1^0
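
The fitted model can be evaluated for every combination of sex and territory. This sketch uses the three coefficients quoted in the solution; the function name is hypothetical.

```python
import math

# Coefficients quoted in the solution
a0, aS, aT = -0.288, 0.262, 0.095

def expected_claims(is_male, is_terr02):
    """mu_i = exp(a0 + aS*X_S(i) + aT*X_T(i)), with 0/1 indicator inputs."""
    return math.exp(a0 + aS * is_male + aT * is_terr02)

for sex, x_s in (("Female", 0), ("Male", 1)):
    for terr, x_t in (("Terr 01", 0), ("Terr 02", 1)):
        print(sex, terr, round(expected_claims(x_s, x_t), 3))
```

Because the link is the natural log, the same numbers come from multiplying the base rate exp(a0) by the factors exp(aS) and exp(aT) wherever the corresponding indicator equals 1.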

Other Common Link Functions

Identity link: canonical link for Normal

Logit link ln[ μ / (1 - μ) ]: canonical link for Binomial; maps (0,1) onto (-∞, ∞)

Reciprocal 1/μ: canonical link for Gamma

But, you do not have to match link and distribution. Choose link and distribution that make sense!
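
Each link g comes paired with its inverse, which converts the linear predictor back into a mean. The sketch below lists the links named in this deck with their inverses; the dictionary layout is an illustrative choice, not a library API.

```python
import math

# name -> (g, g_inverse); g maps the mean mu to the linear predictor eta
LINKS = {
    "identity":   (lambda mu: mu,                      lambda eta: eta),
    "log":        (lambda mu: math.log(mu),            lambda eta: math.exp(eta)),
    "logit":      (lambda mu: math.log(mu / (1 - mu)), lambda eta: 1 / (1 + math.exp(-eta))),
    "reciprocal": (lambda mu: 1 / mu,                  lambda eta: 1 / eta),
}

mu = 0.3   # a value valid for all four links
for name, (g, g_inv) in LINKS.items():
    eta = g(mu)
    print(name, "eta =", eta, "back to mu =", g_inv(eta))
```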

Measuring Goodness of Fit

Deviance is measure of goodness of fit

Compare different models

Compare effect of dropping/adding Xij’s

Also called log likelihood ratio statistic
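
For a Poisson model the unscaled deviance is D = 2 Σ [ yi ln(yi/μi) − (yi − μi) ], where the term yi ln(yi/μi) is taken as 0 when yi = 0. As a sketch, the code below compares an intercept-only model with a model that adds Sex, using the ten policies visible in the claims table. It relies on the fact that, with a log link and a single categorical predictor, the fitted means are the group averages. The variable names are illustrative.

```python
import numpy as np

def poisson_deviance(y, mu):
    """Unscaled Poisson deviance: 2 * sum(y*ln(y/mu) - (y - mu))."""
    y = np.asarray(y, dtype=float)
    mu = np.asarray(mu, dtype=float)
    ratio = np.where(y > 0, y / mu, 1.0)    # ln(1) = 0 covers the y = 0 case
    return 2.0 * np.sum(y * np.log(ratio) - (y - mu))

# Ten visible policies from the claims table
y = np.array([0, 0, 0, 1, 0, 1, 2, 2, 1, 1], dtype=float)
male = np.array([1, 0, 0, 0, 0, 0, 1, 1, 1, 0])

# Intercept-only model: every policy gets the overall mean
mu_null = np.full_like(y, y.mean())

# Model with Sex only: fitted means are the averages within each group
mu_sex = np.where(male == 1, y[male == 1].mean(), y[male == 0].mean())

d_null = poisson_deviance(y, mu_null)
d_sex = poisson_deviance(y, mu_sex)
print("deviance, intercept only:", d_null)
print("deviance, with Sex:", d_sex)
print("reduction from adding Sex:", d_null - d_sex)
```

A lower deviance indicates a closer fit, and the drop in deviance from adding a predictor is the quantity used to judge whether that predictor is worth keeping.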
