Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 ·...

32
Towards the Science of Network Measurement Rocky K. C. Chang The Internet Infrastructure and Security Laboratory November 20, 2012

Transcript of Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 ·...

Page 1: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Towards the Science of Network Measurement

Rocky K. C. Chang The Internet Infrastructure and Security Laboratory

November 20, 2012

Page 2: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Network measurement problems

• Topology characterization

• Geolocation problems

• Performance problems

• Reliability problems

• Routing problems

• Security problems

• …

清华大学网络运行与管理技术研究室 2

Page 3: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Why measuring network path? P

erf

orm

ance

met

rics

Latency

Delay variation (jitter)

Connectivity

Packet loss/reordering

Link/path capacity

Available Bandwidth

TCP throughput

Router hop (count)

Packet duplication

Ap

plic

atio

ns

Traffic engineering

• Network tomography

• Path fingerprinting

• Routing optimization

• QoS routing, admission control, channel assignment in WLAN

User profiling

• Network resource planning

• SLA verification

Application performance tuning

• Rate adaption for VoIP/video streaming apps

• Distance/location prediction for overlay networks, P2Ps, CDNs

清华大学网络运行与管理技术研究室 3

Page 4: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Approaches to path performance measurement

• Passive – Per flow

– Per packet

• Active – Client side vs server side

– One-sided vs two-sided

• Passive-active – Passively waiting for incoming packet for active

measurement

清华大学网络运行与管理技术研究室 4

Page 5: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Current state of active measurement

• Two-sided: OWAMP and TWAMP

• One sided: Best-effort measurement (e.g., ping, ping, ping …)

– Connectionless

– Not reliable in terms of measurability and accuracy

– Measuring the wrong thing

清华大学网络运行与管理技术研究室 5

Page 6: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Best-effort measurement

• Best-effort measurement is designed for reachability test.

• Wrongly extending reachability test performance test: – ICMP packets measure IP’s control plane (not the data

plane)

– TCP SYN/RST segments measure TCP’s control plane (not the data plane)

• Do not differentiate between system delay and network delay.

清华大学网络运行与管理技术研究室 6

Page 7: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Beyond best-effort measurement

• Measuring the data path – In-band vs out-of-band – Transport/application specific – Load-balancing/traffic engineering below L3

• Measuring the network part – Mitigate the impacts of the network nodes – Measuring paths to proxies or original servers

• The manner of measurement – Sampling patterns and rates – Avoid self-induced measurement results – Choice of packet sizes

清华大学网络运行与管理技术研究室 7

Page 8: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Where to start?

• A possibility is a two-side measurement tool, such as OWAMP in perfSONAR.

– A complete control of the measurement parameters

– But not measuring application-specific data paths

– Deployment is costly.

清华大学网络运行与管理技术研究室 8

Page 9: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Our starting point

• OneProbe: A TCP-data-channel measurement approach – Stateful measurement

– Can control the size of the probe and response data packets

– Can control sampling rate and pattern by using multiple TCP connections

– A single observation based on • Two probe data packets and elicited response data

packets

清华大学网络运行与管理技术研究室 9

Page 10: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

清华大学网络运行与管理技术研究室 10

Active

measurement

Overt

(e.g., OWAMP)

Covert

Non-data

channel

(e.g., PPing)

Data

channel

Application

specific

(e.g., BitProbes)

Application

non-specific

TCP data

probes

UDP data

probes (e.g.,

BADABING)

TCP data responses

(e.g., OneProbe )

TCP ACK responses

(e.g., Sting)

Page 11: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

OneProbe’s primitive operation • Send two back-to-back probe data packets.

– Capacity measurement based on packet-pair dispersion

– At least two packets for packet reordering

– Determine which packet is lost.

11 清华大学网络运行与管理技术研究室

Page 12: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

The probe design (cont’d)

• Similarly for the response packets

• Each probe packet elicits a response packet. – Adv. Window = 2 and acknowledge only 1 packet.

12 清华大学网络运行与管理技术研究室

Page 13: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Bootstrapping and continuous monitoring

13 清华大学网络运行与管理技术研究室

Page 14: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Loss and reordering measurement via response diversity

14

Page 15: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

18 possible path events

15 清华大学网络运行与管理技术研究室

Page 16: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Based on their response packets

16 清华大学网络运行与管理技术研究室

Page 17: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Our research model

清华大学网络运行与管理技术研究室 17

Devise a

measurement

method

Development of a

tool and analysis

methods

Validation of the

method in a test -

bed & Internet

Deploy in our

measurement

platfoms

Network data -

base

Service and

development

Discover new

applications

Datasets for

publicationsResearch

Research Research

Development

Page 18: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Measurement methods

• RTT, bi-directional loss rate, bi-directional reordering rate, and delay jitter – Proc. USENIX Annual Tech. Conf. 2009.

• Bi-directional bottleneck capacity – Proc. ACM CoNEXT 2011 – Proc. ACM CoNEXT 2009

• Loss-delay analysis – ACM/USENIX IMC 2010

• Fast available bandwidth estimate – ACM Multimedia Systems Conf. 2012

清华大学网络运行与管理技术研究室 18

Page 19: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Datasets are used in

• “An Efficient Approach to Multi-level Route Analytics,” Proc. IFIP/IEEE IM 2013.

• “MonoScope: Automated Network Faults Diagnosis Based on Active Measurements,” Proc. IFIP/IEEE IM 2013.

• “Characterizing Inter-domain Rerouting after Japan Earthquake,” Proc. IFIP NETWORKING 2012.

• “Non-cooperative Diagnosis of Submarine Cable Faults,” Proc. PAM 2011.

• “Could Ash Cloud or Deep-Sea Current Overwhelm the Internet?" Proc. USENIX HotDep 2010.

清华大学网络运行与管理技术研究室 19

Page 20: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Measurement platforms

• “Performance Monitoring and Measurement of HARNET," funded by the Joint Universities Computer Centre, since January 2009.

• “Performance Monitoring of Critical Network and Service Infrastructure in Hong Kong” 2013.

清华大学网络运行与管理技术研究室 20

Page 21: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

HARNET measurement platform

清华大学网络运行与管理技术研究室 21

Page 22: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

On

ePro

be

@H

KU

On

ePro

be

@C

UH

K

On

ePro

be

@C

ityU

On

ePro

be

@Po

lyU

On

ePro

be

@B

U

On

ePro

be

@H

KU

ST

On

ePro

be

@H

KIE

D

On

ePro

be

@LU

40+ web servers selected by the JUCC

Planetopus, database, etc

HKU CUHK PolyU CityU BU HKUST LU HKIED

Mea

sure

me

nt

sid

e U

ser

sid

e

Page 23: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

清华大学网络运行与管理技术研究室 23

Page 24: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Time-series plots

清华大学网络运行与管理技术研究室 24

Page 25: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Time-series heat map

清华大学网络运行与管理技术研究室 25

Page 26: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

清华大学网络运行与管理技术研究室 26

Page 27: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Offering network path measurement as a service

• “Design and Implementation of a Unified Box for Offering Network Path Measurement as a Service,” Funded by ITF

• Major deliverables:

– Novel network measurement boxes

– Novel network measurement platforms

• Residential broadband measurement

• IPv6 measurement

清华大学网络运行与管理技术研究室 27

Page 28: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

New measurement platforms

清华大学网络运行与管理技术研究室 28

Page 29: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

A service and research platform

• Performance problems – E.g., QoE measurement of HTTP video (“QDASH: A

QoE-Aware DASH System”)

• Reliability problems – E.g., fault localization (“MonoScope: Automated

Network Faults Diagnosis Based on Active Measurements”)

• Routing problems

• Security problems

清华大学网络运行与管理技术研究室 29

Page 30: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Conclusions

• Network measurement is a primitive in network science and applications.

• But the current status is very much best-effort measurement.

• Not enough skepticism on the measurement accuracy

• What we need are reliable measurement apparatus and platform.

• Network science =? Network data science

清华大学网络运行与管理技术研究室 30

Page 31: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

清华大学网络运行与管理技术研究室 31

Page 32: Towards the Science of Network Measurementcsrchang/Towards the science of... · 2013-07-09 · –Proc. ACM CoNEXT 2011 –Proc. ACM CoNEXT 2009 •Loss-delay analysis –ACM/USENIX

Thanks (oneprobe.org)