A Dynamic Mobility Histogram Construction Method Based on Markov Chains

Yoshiharu Ishikawa (Nagoya University)

Yoji Machida (University of Tsukuba)

Hiroyuki Kitagawa (University of Tsukuba)


Outline

• Background and Objectives
• Modeling Movement Patterns
• Mobility Histogram: Logical Structure
• Mobility Histogram: Physical Structure
• Experimental Results
• Conclusions

Background

• Advances in GPS and communication technology have enabled the tracking of moving objects
  – Example: a taxi company in Tokyo continually monitors more than 200 taxi cabs
• Movement data is delivered as a data stream

[Figure: Moving objects deliver movement data as a data stream to a moving object database.]

Objectives

• Construction and maintenance of a mobility histogram
  – Compact summary of movement data for a specific time period
  – Used for mobility analysis and estimation
• Problems
  – Concrete definition of a mobility histogram
    • How to model movement patterns
  – Compact representation
    • Tradeoff with accuracy
  – Efficient construction and maintenance
    • Incremental processing for streamed data

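[Figure: Basic Idea. Movement data (as a data stream) enters the histogram maintenance module, which applies incremental updates to the mobility histogram. The mobility analysis / estimation module receives requests for analysis or estimation, issues queries for estimation against the histogram, and returns results.]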


Approach

• 2-D movement area
• Uniform cell decompositions
  – But allow multiple spatial granularities (e.g., 4 x 4, 16 x 16)
• A movement pattern is represented as a sequence of cell numbers
• Based on the Markov chain model
  – Treats a movement pattern as a Markov chain sequence
  – Well-known model in traffic modeling

Movement Patterns: Example (1)

Movement pattern of A: 2 2 0 0
Movement pattern of B: 3 3 1 1
Movement pattern of C: 0 2 2 3

Movement Patterns: Example (2)

• Cell partitioning with different granularities

Movement pattern of A: 11 9 3 1

Cell Numbering Scheme (1)

• Based on the Z-ordering method
  – Simple encoding method
  – Assigns similar values to neighboring cells
  – Translation to different granularities is easy
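The numbering properties listed above can be illustrated with a minimal Python sketch. The function names `z_encode` and `coarsen` are hypothetical, and the bit order inside each 2-bit pair (y bit above x bit) is an assumption, since the slides do not state which convention is used. Translation to a coarser granularity depends only on dropping the low 2 bits per level, so it holds under either convention.

```python
def z_encode(x: int, y: int, level: int) -> int:
    """Interleave the low `level` bits of y and x into a Z-order cell number.

    Assumption: within each 2-bit pair the y bit is the high bit.
    """
    cell = 0
    for bit in range(level - 1, -1, -1):
        cell = (cell << 2) | (((y >> bit) & 1) << 1) | ((x >> bit) & 1)
    return cell


def coarsen(cell: int, from_level: int, to_level: int) -> int:
    """Translate a cell number to a coarser level by dropping 2 bits per level."""
    if to_level > from_level:
        raise ValueError("can only translate to an equal or coarser level")
    return cell >> (2 * (from_level - to_level))


# Movement pattern of A on the 4 x 4 (level-2) grid, taken from the slides.
pattern_a_level2 = [11, 9, 3, 1]
pattern_a_level1 = [coarsen(c, 2, 1) for c in pattern_a_level2]
print(pattern_a_level1)  # [2, 2, 0, 0]
```

The coarsened result matches the level-1 pattern of A given in Example (1), which is why a single fine-grained sequence can serve every coarser granularity.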

Cell Numbering Scheme (2)

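[Figure: Z-order cell numbers written in binary for the level-1 (2^1 x 2^1) decomposition and the level-2 (2^2 x 2^2) decomposition. A number written c(k) denotes cell c at level k.]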

Markov Chain Model (example: order = 2)

           Step 0   Step 1   Step 2
Level 1    2(1)     3(1)     1(1)
Level 2    9(2)     12(2)    6(2)
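An order-n chain is described by sequences of n + 1 consecutive steps (step 0 to step n). A minimal sketch of how such sequences could be cut from a longer movement pattern follows. The helper name `transitions` and the sliding-window extraction are assumptions made for illustration; the slides only show a single three-step example.

```python
from typing import Iterator, Sequence, Tuple


def transitions(cells: Sequence[int], order: int) -> Iterator[Tuple[int, ...]]:
    """Yield every run of (order + 1) consecutive cell numbers."""
    width = order + 1
    for start in range(len(cells) - width + 1):
        yield tuple(cells[start:start + width])


# Level-2 movement pattern of A from the slides, cut into order-2 sequences.
print(list(transitions([11, 9, 3, 1], order=2)))  # [(11, 9, 3), (9, 3, 1)]
```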


Mobility Histogram as a Data Cube

• Representing order-n Markov chain statistics as an (n + 1)-dimensional data cube

Example: 1(1) 1(1) 0(1)
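To make the cube view concrete, here is a small sketch in which each cube cell is addressed by an (n + 1)-tuple of cell numbers and holds a count. The class name `SparseCube` is hypothetical, and storing only non-zero cells in a `Counter` is an illustrative choice, not the logical structure itself, which is a dense cube.

```python
from collections import Counter
from typing import Sequence


class SparseCube:
    """Counts of order-n sequences at one partition level (illustrative only)."""

    def __init__(self, level: int, order: int) -> None:
        self.cells_per_step = 4 ** level  # 2^level x 2^level grid cells
        self.order = order
        self.counts: Counter = Counter()

    def add(self, seq: Sequence[int]) -> None:
        if len(seq) != self.order + 1:
            raise ValueError("sequence must have order + 1 steps")
        if any(not 0 <= c < self.cells_per_step for c in seq):
            raise ValueError("cell number out of range for this level")
        self.counts[tuple(seq)] += 1

    def count(self, seq: Sequence[int]) -> int:
        return self.counts[tuple(seq)]  # Counter returns 0 for unseen keys

    def dense_cells(self) -> int:
        """Number of cells a dense (n + 1)-dimensional cube would need."""
        return self.cells_per_step ** (self.order + 1)


cube = SparseCube(level=1, order=2)
cube.add((1, 1, 0))  # the example sequence 1(1) 1(1) 0(1)
cube.add((1, 1, 0))
cube.add((2, 3, 1))
print(cube.count((1, 1, 0)), cube.dense_cells())  # 2 64
```

The dense cell count grows as (4^level)^(n + 1), which is the space problem the physical structure addresses.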

Histogram Maintenance

[Figure: Movement data enters the histogram maintenance module, which applies incremental updates to the mobility histogram; the mobility analysis / estimation module issues queries for analysis against it.]

• Periodic reconstruction
  – To cope with non-stationary movement patterns
  – Ease of maintenance
  – Old histograms are written to disk


Mobility Histogram: Physical Structure

• Problems in the logical structure: huge space
  – 2 GB (!) for a typical parameter setting
  – Needs multiple cubes for multiple spatial granularities
  – Data cubes are sparse: most mobility patterns rarely occur
• Solution: tree-based representation
  – Unification of quad-tree, k-d tree, and trie
  – Integration of cubes in multiple granularities
  – Selective allocation of nodes
    • Saves memory space

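Insertion of 3(2) 6(2) 12(2): BASE method

[Figure: Tree after inserting the sequence 3(2) 6(2) 12(2). The cell number of each step (step 0, step 1, step 2) is shown in binary (for example, step 2 = 12), and the tree has a level-1 part and a level-2 part with 2-bit edge labels. Legend: x = counter; visited edges and non-visited edges are drawn differently.]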
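The insertion can be sketched as a trie over 2-bit edge labels. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `Node`, `edge_labels`, `insert`, and `lookup` are hypothetical, a counter is kept in every node, and edges are ordered level by level, cycling through steps 0 to n within each level (a k-d-tree-like ordering).

```python
from typing import Dict, Iterator, Sequence


class Node:
    def __init__(self) -> None:
        self.count = 0
        self.children: Dict[int, "Node"] = {}  # allocated only when visited


def edge_labels(seq: Sequence[int], level: int) -> Iterator[int]:
    """2-bit labels of a level-`level` sequence: level 1 first, then level 2, ..."""
    for depth in range(1, level + 1):
        shift = 2 * (level - depth)
        for cell in seq:
            yield (cell >> shift) & 0b11


def insert(root: Node, seq: Sequence[int], level: int) -> None:
    node = root
    node.count += 1
    for label in edge_labels(seq, level):
        node = node.children.setdefault(label, Node())
        node.count += 1


def lookup(root: Node, seq: Sequence[int], level: int) -> int:
    """Count of a sequence given at any level up to the inserted one."""
    node = root
    for label in edge_labels(seq, level):
        if label not in node.children:
            return 0
        node = node.children[label]
    return node.count


root = Node()
insert(root, (3, 6, 12), level=2)
print(lookup(root, (3, 6, 12), level=2))  # 1
print(lookup(root, (0, 1, 3), level=1))   # 1, the same path at level 1
```

Because the level-1 labels form a prefix of the level-2 path, one tree answers queries at both granularities, and only visited paths consume memory.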

Approximated Histogram (APR)

• Problem of the BASE method
  – Memory size requirement is still high
• Approximated method (APR)
  – Compact histogram construction by adaptive tree expansion
    • Allocate a buffer for each leaf node
    • If skew is observed, the leaf node is expanded
    • The χ² statistic is used to check the non-uniformity
  – Inherits the idea from decision tree construction from streamed data (e.g., VFDT)

Node Expansion

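[Figure: Node expansion. Before expansion, leaf nodes under the root each hold a buffer. When skew is detected in a leaf, the leaf is expanded into an internal node whose children are internal or leaf nodes with their own buffers. Labels in the figure: trans_seq[0], trans_seq[1], root, internal node, leaf node, buffer; edge labels 00, 01, 10.]

Quit expansion when the number of nodes has reached a given constant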
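The expansion rule can be sketched roughly as follows. Much of this is assumption rather than something the slides specify: the names `AprNode` and `AprTree`, the pluggable `is_skewed` predicate, creating all four children at once, redistributing buffered sequences to the children, and clearing the buffer after every check are illustrative choices only.

```python
from collections import Counter
from typing import Callable, Dict, List, Sequence, Tuple


class AprNode:
    def __init__(self) -> None:
        self.count = 0
        self.buffer: List[Tuple[int, ...]] = []  # unconsumed 2-bit labels
        self.children: Dict[int, "AprNode"] = {}


class AprTree:
    def __init__(self, is_skewed: Callable[[List[int]], bool],
                 buffer_size: int = 100, max_nodes: int = 1000) -> None:
        self.root = AprNode()
        self.node_count = 1
        self.is_skewed = is_skewed
        self.buffer_size = buffer_size
        self.max_nodes = max_nodes

    def insert(self, labels: Sequence[int]) -> None:
        """Insert one sequence given as its fixed-length list of 2-bit labels."""
        node, depth = self.root, 0
        node.count += 1
        while node.children:  # follow already expanded nodes
            node = node.children[labels[depth]]
            node.count += 1
            depth += 1
        rest = tuple(labels[depth:])
        if rest:
            node.buffer.append(rest)
            if len(node.buffer) >= self.buffer_size:
                self._maybe_expand(node)

    def _maybe_expand(self, leaf: AprNode) -> None:
        tally = Counter(rest[0] for rest in leaf.buffer)
        observed = [tally.get(label, 0) for label in range(4)]
        within_budget = self.node_count + 4 <= self.max_nodes
        if within_budget and self.is_skewed(observed):
            leaf.children = {label: AprNode() for label in range(4)}
            self.node_count += 4
            for rest in leaf.buffer:
                child = leaf.children[rest[0]]
                child.count += 1
                if len(rest) > 1:
                    child.buffer.append(rest[1:])
        leaf.buffer.clear()  # assumption: restart sampling after each check


# Toy predicate for the demo only: one label holds at least half the buffer.
tree = AprTree(is_skewed=lambda obs: max(obs) >= 0.5 * sum(obs), buffer_size=4)
for _ in range(5):
    tree.insert((0, 1, 3))
print(tree.node_count)  # 9: the root and then its child 0 were expanded
```

In practice the predicate would be a statistical non-uniformity test, and `max_nodes` plays the role of the given constant that stops expansion.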

Non-uniformity Check

• Use of the χ² test for goodness of fit
• Null hypothesis: the distribution is uniform
• If the χ² value > 7.815, the distribution is non-uniform at the 5% significance level

Example: 100 sequences in the buffer

[Figure: Buffered sequences such as 5(2) 12(2) 9(2), 7(2) 13(2) 15(2), and 4(2) 12(2) 6(2) are tallied by the 2-bit label of the next step (00, 01, 10, 11), giving the counts x00, x01, x10, x11. Two example distributions of next steps are shown, one uniform and one non-uniform.]
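The check can be written out in a few lines. This sketch assumes four next-step categories (00, 01, 10, 11), hence 3 degrees of freedom, for which 7.815 is the standard 5% critical value; the function names are hypothetical.

```python
from typing import Sequence

CHI2_CRITICAL_DF3_5PCT = 7.815  # threshold quoted in the slides


def chi_square_uniform(observed: Sequence[int]) -> float:
    """Goodness-of-fit statistic against a uniform distribution."""
    total = sum(observed)
    if total == 0:
        raise ValueError("no observations")
    expected = total / len(observed)
    return sum((x - expected) ** 2 / expected for x in observed)


def is_non_uniform(observed: Sequence[int],
                   critical: float = CHI2_CRITICAL_DF3_5PCT) -> bool:
    return chi_square_uniform(observed) > critical


# 100 buffered sequences tallied as [x00, x01, x10, x11].
print(chi_square_uniform([25, 25, 25, 25]))  # 0.0
print(chi_square_uniform([70, 10, 10, 10]))  # 108.0
print(is_non_uniform([70, 10, 10, 10]))      # True
```

With 100 sequences the expected count per label is 25, so the second tally deviates far beyond the threshold and the leaf would be treated as skewed.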

Problems in Statistical Test

• Problems: the χ² value is not reliable
  – when the total number is small
  – when some value(s) are close to 0
  – These situations are common in our case
• Example: counts 1, 2, 1, 4 give a total number of 1 + 2 + 1 + 4 = 8
• Solution: use non-parametric statistics while the χ² value is not reliable
  – Details are given in the paper

Use of Bitmap Cube (APR-BM)

• Minor improvement to the APR method
  – Use a small bitmap cube in addition to a tree-structured histogram
  – Represent a “correct” summary at some coarse level
  – Improvement of precision
• Example: when partition level = 3 and Markov order = 2, bitmap size = 32 KB

[Figure: A tree-based histogram (APR method) with levels 1 and 2 is combined with a small bitmap cube at a coarse level, giving accurate estimation for some queries.]
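The 32 KB figure follows from one bit per cube cell: level 3 gives 8 x 8 = 64 cells per step, order 2 gives 64^3 = 262,144 cube cells, and 262,144 bits are 32,768 bytes. A minimal sketch, assuming each bit records whether a coarse-level sequence has been observed at all (the slides do not spell out the bit semantics, and the class name `BitmapCube` is hypothetical):

```python
from typing import Sequence


class BitmapCube:
    """One bit per cell of an (order + 1)-dimensional cube at a coarse level."""

    def __init__(self, level: int, order: int) -> None:
        self.cells_per_step = 4 ** level
        self.order = order
        total_bits = self.cells_per_step ** (order + 1)
        self.bits = bytearray((total_bits + 7) // 8)

    def _index(self, seq: Sequence[int]) -> int:
        if len(seq) != self.order + 1:
            raise ValueError("sequence must have order + 1 steps")
        index = 0
        for cell in seq:
            if not 0 <= cell < self.cells_per_step:
                raise ValueError("cell number out of range for this level")
            index = index * self.cells_per_step + cell
        return index

    def mark(self, seq: Sequence[int]) -> None:
        i = self._index(seq)
        self.bits[i >> 3] |= 1 << (i & 7)

    def seen(self, seq: Sequence[int]) -> bool:
        i = self._index(seq)
        return bool(self.bits[i >> 3] & (1 << (i & 7)))


bitmap = BitmapCube(level=3, order=2)
print(len(bitmap.bits))  # 32768 bytes = 32 KB
bitmap.mark((5, 12, 9))
print(bitmap.seen((5, 12, 9)), bitmap.seen((5, 12, 10)))  # True False
```

Under this reading, a cleared bit lets an estimator answer exactly zero for a sequence that never occurred, which is one way small cell values could be estimated more accurately.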


Dataset and Environments

• Experimental data
  – Used the moving objects simulator by Brinkhoff
  – 1024×1024 cells at the finest granularity
  – 1,000 moving objects are on the map at every time instance
• Environment
  – CPU: Pentium 4, 3.2 GHz
  – Memory: 1 GB RAM
  – OS: Cygwin

Histogram Size

• Settings
  – Data size: 1K, 10K, 50K
  – Order-2 Markov transition
• Results
  – BASE method requires huge storage

Histogram Size (MB)

Data Size   BASE   APR    APR-BM
1K          0.35   0.01   0.04
10K         2.7    0.10   0.13
50K         9.4    0.52   0.55

Construction Time

• Comparison of BASE and APR
  – M: maximal partitioning level (granularity of input sequences)
• Results
  – BASE has a small construction cost
  – APR has nearly O(n^2) cost due to the non-uniformity check, but still has a small processing cost (less than 0.15 ms per input sequence)

[Figure: Two plots against Data Size (1K, 10K, 50K): "Construction Time" and "Construction Time per Sequence". Series: M = 5, BASE; M = 5, APR; M = 10, BASE; M = 10, APR.]

Query Processing Time

• Two types of queries
  – Fine-level: issue queries on the finest partitioning level (M = 10)
  – Mixed-level: issue queries on randomly mixed partitioning levels
• Results
  – Comparison of BASE and APR
  – No difference
  – Quite fast

[Figure: Query processing time of BASE and APR by query pattern (fine-level query: matches the maximal partitioning level; mixed-level query: coarser than the maximal partitioning level), for data sizes 1K, 10K, and 50K.]

Accuracy: Histogram Plot (1)

• Order-1 Markov chain histograms

• Partition level = 2

BASE (“true” count)

Accuracy: Histogram Plot (2)

Diff Count = |Base count – APR count|

Histogram Difference

Precision: Evaluation Measures

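• Distance
• Relative Error

[Equations: definitions of Distance and Relative Error as sums over histogram cells i in terms of ACTi and ESTi; the exact forms are not recoverable here.]

• ACTi: actual cell value (BASE method)
• ESTi: estimated cell value (APR and APR-BM methods)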

Evaluation of Precision

• Comparison of APR and APR-BM
  – Using “Distance” and “Relative Error”
• Results
  – Similar results for Distance
  – APR-BM is better in terms of Relative Error
    • APR-BM can estimate small cell values accurately

[Figure: Two plots, "Distance" and "Relative Error", for APR and APR-BM against Number of Nodes (1K, 2.5K, 5K, 6.692K).]


Conclusions

• Mobility histogram construction method
  – Based on the Markov chain model
  – Handling streamed trajectory sequences
  – Logical histogram: data cube
  – Physical histogram: tree structure (quad tree + k-d tree)
    • Adaptive tree growth
    • Approximated representation method
    • Use of nonparametric statistics for exceptional cases
    • Use of a bitmap cube to enhance precision
