Hypothesis Testing and Model Complexity

Post on 11-Jan-2016



Domain of groundwater model: topographic contours, a dam, irrigated area, channel system, extraction bores, native woodland, observation bores.

Supplied "from outside" the model:

Inflow from uphill

Groundwater interaction with dam

Groundwater interaction with rivers

Leakage from channels

Groundwater recharge

Aquifer extraction

More often than not, a definitive model cannot be built.

Recognize this, focus on the question that is being asked and, if necessary, use the model for hypothesis testing.

Remember that model calibration is a form of data interpretation. The whole modelling process is simply advanced data processing.


Cattle Creek Catchment

Soils and current land use

Model grid; fixed head and drainage cells shown coloured

Groundwater levels in June 1996

Groundwater levels in January 1991

Modelled and observed water levels after model calibration.

Calibrated transmissivities (figure: zoned values ranging from about 2 to 1000)

Cattle Creek Catchment

CANE EXPANSION (figure: current and new development areas)

Predictive runs for increased cane production. Leakage from balancing storage at calibration and for prediction:

Run        At calibration   For prediction   Notes
46R10P8    2.5 mm/d         2.5 mm/d
46R15P8    2.5 mm/d         2.5 mm/d
48R14P8    2.5 mm/d         2.5 mm/d         Zone 17 absent
46R3P7     0.0 mm/d         0.0 mm/d
46R4P7     0.0 mm/d         0.0 mm/d
48R8P7     0.0 mm/d         0.0 mm/d         Zone 17 absent
46R10P10   2.5 mm/d         2.5 mm/d
46R11P10   2.5 mm/d         2.5 mm/d
48R14P10   2.5 mm/d         2.5 mm/d         Zone 17 absent

Simple Model (figure: P, E, d, M, Ks, runoff)

• M: Soil Moisture Capacity (mm/m depth)
• d: Effective Rooting Depth
• Ki: Initial loss
• fcap: Field Capacity
• Ks: Saturated Hydraulic Conductivity

"Fixing" a parameter (figure: a probability contour in (p1, p2) parameter space)

This has the potential to introduce bias into key model predictions.

Also, what if this parameter is partly a surrogate for an unrepresented process?

• Not only does uncertainty arise from parameter nonuniqueness; it also arises from lack of certainty in model inputs/outputs and model boundary conditions.

• The model can be used as an instrument for data interpretation, allowing various hypotheses concerning inputs/outputs and boundary conditions to be tested.

• Where did the idea ever come from that there should be one calibrated model?

modeller → construction → calibration → prediction → "the deliverable"

"Dual calibration"

A River Valley (figure: observation bore and pumped bore; two zones with K = 5, Sy = 0.1 and one with K = 25, Sy = 0.3; inflow = 2750; fixed head = 50)

[Figures, 0–300 days: recharge rate (×10⁻³), discharge, pumping rate, water levels; borehole hydrographs]

The finite-difference grid and parameter zonation

Calibrated parameters: K = 5, Sy = 0.1; K = 5, Sy = 0.1; K = 25, Sy = 0.3

Field and model-generated borehole hydrographs (field data and model-calculated)

Calibrated parameters: K = 10.2, Sy = 0.21; K = 10.2, Sy = 0.21; K = 18.8, Sy = 0.21

Field and model-generated borehole hydrographs (field data and model-calculated)

Simulation of Drought Conditions

• Decrease inflow from left from 2750 to 2200 m³/day.

• Increase pumping from left bore from (1500, 1000, 0, 1500) to 2000 m³/day.

• Increase pumping from right bore from (2000, 1000, 500, 1500) to 3000 m³/day.

• Run model for 91 days.

• Same initial heads, i.e. 54 m.

For “true parameters”, water level in right bore after this run is 43.9m.

Is it possible that the water level in the left bore will be as low as 42m?

Use PEST with “model” comprised of two MODFLOW runs, one under calibration conditions and one under predictive conditions.

In the latter case there is only one "observation", viz. that the water level in the right pumped cell is 42 m at the end of the run (its weight is the sum of the weights used for all water levels over the calibration period).

Methodology (figure: PEST writes model input files and reads model output files; the composite "model" comprises one run under calibration conditions and one under predictive conditions)
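The two-run composite scheme can be sketched numerically. The following is a minimal illustration in the spirit of PEST's predictive analyser, using a hypothetical two-parameter linear model rather than MODFLOW (all names and numbers here are invented for illustration): calibration data constrain one combination of the parameters, and a single heavily weighted "predictive observation" pushes the poorly constrained combination toward the value being tested.

```python
import numpy as np

# Hypothetical two-parameter model: calibration observations depend
# on p1 + p2, while the prediction of interest depends on p1 - p2
# and is therefore poorly constrained by the calibration data.
rng = np.random.default_rng(0)
p_true = np.array([3.0, 1.0])
obs = (p_true[0] + p_true[1]) + rng.normal(0.0, 0.1, size=20)

def phi_cal(p):
    """Calibration objective: sum of squared misfits."""
    return float(np.sum((p[0] + p[1] - obs) ** 2))

def predict(p):
    """The prediction whose extremes we want to explore."""
    return p[0] - p[1]

# One extra "observation": can the prediction be as low as -2?
# Its weight equals the sum of the calibration weights, as in the text.
target = -2.0
w_pred = float(len(obs))

def phi_dual(p):
    return phi_cal(p) + w_pred * (predict(p) - target) ** 2

# Crude coordinate search standing in for PEST's gradient-based scheme.
p = np.zeros(2)
for _ in range(50):
    for j in range(2):
        grid = p[j] + np.linspace(-0.5, 0.5, 101)
        candidates = [np.array([g, p[1]]) if j == 0 else np.array([p[0], g])
                      for g in grid]
        p = min(candidates, key=phi_dual)

# p now fits the calibration data nearly as well as the best-fit
# parameters, while driving the prediction to the target value.
```

If the composite objective can be driven low, a parameter set exists that respects the calibration data while producing the extreme prediction; if it cannot, the extreme prediction can be judged incompatible with the data.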

Calibrated parameters: K = 22, Sy = 0.14; K = 16, Sy = 0.16; K = 9.8, Sy = 0.28

Field and model-generated borehole hydrographs over calibration period (field data and model-calculated)

Water level in right pumped bore at end of drought = 42 m.

Is it possible that the water level in the left bore will be as low as 40m?

Calibrated parameters: K = 22, Sy = 0.14; K = 16, Sy = 0.16; K = 9.8, Sy = 0.28

Field and model-generated borehole hydrographs over calibration period (field data and model-calculated)

Water level in right pumped bore at end of drought = 40 m.

Calibrated parameters: K = 5, Sy = 0.099; K = 14, Sy = 0.11; K = 20, Sy = 0.32; K = 4.6, Sy = 0.090

Field and model-generated borehole hydrographs over calibration period (field data and model-calculated)

Water level in right pumped bore at end of drought = 40 m.

Is it possible that the water level in the left bore will be as low as 36m?

Calibrated parameters: K = 8.8, Sy = 0.13; K = 15, Sy = 0.14; K = 18, Sy = 0.29; K = 2.7, Sy = 0.19

Field and model-generated borehole hydrographs over calibration period (field data and model-calculated)

Water level in right pumped bore at end of drought = 36 m.

We are not calibrating a groundwater model. We are calibrating our regularisation methodology.

Some Lessons

• if possible, include in the calibration dataset measurements of the type that you need to predict

• intuition and knowledge of an area play just as important a part in modelling as does the model itself

• focus on what the model needs to predict when building the model…..

There should be no such thing as a model for an area, only for a specific problem.

So how should we model?

A model area (figure: open cut mines, underground mines, waterholes, extraction bores, monitoring bores, gauging stations)

Sources of Uncertainty Close to Waterholes

• conductance of bed (and heterogeneity thereof)

• change in bed conductance with wetted perimeter

• change in bed conductance with inflow/outflow and season

• relationship between area and level

• relationship between level and flow

• rate of evaporation

• hydraulic properties of rocks close to ponds

• behaviour during flood events

• change in hydraulic characteristics after flood events

• uncertainty in future flows

• inflow to ponds from neighbouring surface catchment

• lack of borehole data to define groundwater mounds

• uncertainties in streamflow

Let’s start again…..

Complexity leads to parameter uncertainty.

Parameter correlation can be enormous due to inadequate data.

Parameter uncertainty may lead to predictive uncertainty.

The more that the prediction depends on system “fine detail”, the more this is likely to occur.

Predictive uncertainty must be analysed.

Complexity must be “focussed” - dispense with non-essential complexity.

No model should be built independently of the prediction which it has to make.

Sensitive area (figure: the model area with open cut mines, underground mines, waterholes, and a sensitive area highlighted)

A model is not a database! A model is a data processor.

Ubiquitous complexity in a "do-everything model"

Focussed complexity in a prediction-specific model

Model Complexity

For reasons which we have already discussed, a complex model is really a simple model in disguise.

Complex models:
• have more parameters
• have longer run times
• are more prone to numerical instability
• are more costly
• destroy the user's intuition

The level of complexity is set by system properties to which the prediction is most sensitive.

Objective function contours, linear model (figure: the objective function minimum and probability contours in (p1, p2) space, with the contour's principal axes labelled p1+p2 and p1-p2)
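The elongated probability contour can be reproduced with a toy linear analysis (illustrative numbers only, not from any real model): when nearly every observation is sensitive only to p1 + p2, the parameter covariance matrix has a short axis along p1 + p2 and a long axis along p1 - p2.

```python
import numpy as np

# Toy probability-contour calculation with made-up numbers: 20
# observations, each sensitive (almost) only to p1 + p2.
J = np.ones((20, 2))          # Jacobian: every row sees p1 + p2 equally
J[0, 1] = 0.99                # almost, but not exactly, degenerate
cov = np.linalg.inv(J.T @ J)  # linear posterior covariance (unit noise)

eigvals, eigvecs = np.linalg.eigh(cov)
axis_ratio = eigvals[-1] / eigvals[0]   # long axis vs short axis

# Correlation between p1 and p2 is close to -1: the data fix the sum
# p1 + p2 tightly while leaving the difference p1 - p2 almost free.
corr = cov[0, 1] / np.sqrt(cov[0, 0] * cov[1, 1])
long_axis = eigvecs[:, -1]    # points along the (1, -1) direction
```

The near-perfect negative correlation is exactly the "enormous parameter correlation due to inadequate data" discussed later in these notes.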

Ideally, simplification of a model should be done in such a way that only the parameters that “don’t matter” are dispensed with.

There are many cases where a specific prediction depends on the values of individual parameters, the very parameters that cannot be resolved by the parameter estimation process.

In fact, that is often why we are using a physically based model; if calibration alone sufficed for full parameterisation, then a black box would be all we need.
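A toy calculation with hypothetical numbers makes the bias mechanism concrete: fix one member of a highly correlated pair at a wrong value, and the calibration still looks excellent because the other member compensates, but any prediction that depends on the individual parameter values inherits the error.

```python
import numpy as np

# Hypothetical demonstration that "fixing" a correlated parameter can
# bias a prediction. Calibration data inform only p1 + p2; the
# prediction of interest is p1 itself.
rng = np.random.default_rng(1)
p1_true, p2_true = 3.0, 1.0
obs = (p1_true + p2_true) + rng.normal(0.0, 0.05, size=50)

# Fix p2 at a plausible-looking but wrong value, then "calibrate" p1.
p2_fixed = 0.0
p1_fit = obs.mean() - p2_fixed   # least-squares estimate of p1 given p2

fit_misfit = float(np.mean((p1_fit + p2_fixed - obs) ** 2))  # fit is excellent...
prediction_bias = p1_fit - p1_true                           # ...prediction is biased
```

The misfit is at the noise level, so nothing in the calibration warns of the roughly one-unit error that p1 has absorbed on behalf of the fixed p2.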

Over-simplified model design introduces bias, for we are effectively assuming values for unrepresented parameters.

"Fixing" a parameter (figure: a probability contour in (p1, p2) space)

Increasing model complexity (figures: potential error in prediction versus complexity, with curves for bias and predictive uncertainty; at the point marked, these levels are equal)

But we don't know how much bias we are introducing.

The point where no further complexity is warranted is the point where the uncertainty of a specific model prediction no longer rises.

Essential and non-essential complexity are prediction-dependent.

Complexity does not guarantee the “right answer” - it guarantees that the right answer will lie within the limits of predictive uncertainty.

Complexity without uncertainty analysis is a waste of time. A complex model can be just as biased as a simple model.

Use a simple model and add the “predictive noise” – far cheaper.

A complex model allows you to replace "predictive noise" with science. But if you don't do that, what is the point of a complex model?

An Example….

NORTH CAROLINA: Neuse River basin, Contentnea Creek watershed (figure: NC county boundaries; catchments Sandy Run (77 km²), Middle Swamp (140 km²), Little Contentnea (470 km²), Contentnea (2600 km²), Neuse (14500 km²))

[Figures: observed and modelled flows, 1-Jan-83 to 1-Jan-84; observed and modelled monthly volumes, 1970–1986; observed and modelled exceedence fractions versus flow (cu ft/sec)]

Parameter values:
LZSN    2.0
UZSN    2.0
INFILT  0.0526
BASETP  0.200
AGWETP  0.00108
LZETP   0.50
INTFW   10.0
IRC     0.677
AGWRC   0.983

[Figures: observed and modelled flows, 1-Jan-83 to 1-Jan-84; observed and modelled monthly volumes, 1970–1986; observed and modelled exceedence fractions versus flow (cu ft/sec)]

Parameter  Set 1    Set 2   Set 3   Set 4   Set 5   Set 6
LZSN       2.0      2.0     2.0     2.0     2.0     2.0
UZSN       2.0      1.79    2.0     2.0     1.76    2.0
INFILT     0.0526   0.0615  0.0783  0.0340  0.0678  0.0687
BASETP     0.200    0.182   0.199   0.115   0.179   0.200
AGWETP     0.00108  0.0186  0.0023  0.0124  0.0247  0.0407
LZETP      0.50     0.50    0.20    0.72    0.50    0.50
INTFW      10.0     3.076   1.00    4.48    4.78    2.73
IRC        0.677    0.571   0.729   0.738   0.759   0.320
AGWRC      0.983    0.981   0.972   0.986   0.981   0.966
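Treating the model as a data processor, the six equally acceptable parameter sets above can themselves be interrogated. A short script (values transcribed from the table) shows which parameters the calibration data actually pin down:

```python
# Six equally well calibrated parameter sets, transcribed from the
# table above; the max/min ratio per parameter shows how loosely
# each one is constrained by the calibration data.
sets = {
    "LZSN":   [2.0, 2.0, 2.0, 2.0, 2.0, 2.0],
    "UZSN":   [2.0, 1.79, 2.0, 2.0, 1.76, 2.0],
    "INFILT": [0.0526, 0.0615, 0.0783, 0.0340, 0.0678, 0.0687],
    "BASETP": [0.200, 0.182, 0.199, 0.115, 0.179, 0.200],
    "AGWETP": [0.00108, 0.0186, 0.0023, 0.0124, 0.0247, 0.0407],
    "LZETP":  [0.50, 0.50, 0.20, 0.72, 0.50, 0.50],
    "INTFW":  [10.0, 3.076, 1.00, 4.48, 4.78, 2.73],
    "IRC":    [0.677, 0.571, 0.729, 0.738, 0.759, 0.320],
    "AGWRC":  [0.983, 0.981, 0.972, 0.986, 0.981, 0.966],
}
spread = {name: max(v) / min(v) for name, v in sets.items()}
```

INTFW ranges over a factor of ten between equally good calibrations, while LZSN and AGWRC barely move: a prediction sensitive to INTFW is highly uncertain despite the good fits.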

[Figures: observed and modelled flows over validation period, 1-Jan-93 to 1-Jan-94; observed and modelled monthly volumes over validation period, 1986–1995; observed and modelled exceedence fractions over validation period versus flow (cu ft/sec)]

Parameterisation using PEST's predictive analyser

[Figures: observed and modelled flows over the validation period, 1-Jan-93 to 1-Jan-94, and over the calibration period, 1-Jan-83 to 1-Jan-84]

Adjustable parameters: LZSN, UZSN, INFILT, BASETP, AGWETP, LZETP, INTFW, IRC, AGWRC; then with DEEPFR added.