Dynamic Programming 張智星 (Roger Jang) jang@mirlab.org 多媒體資訊檢索實驗室...

Dynamic Programming

張智星 (Roger Jang)

jang@mirlab.org

http://mirlab.org/jang

多媒體資訊檢索實驗室台灣大學資訊工程系

Dynamic ProgrammingDynamic Programming (DP)

An effective method for finding the optimum solution to a multi-stage decision problem, based on the principal of optimality

Applications: NUMEROUS! Longest common subsequence, edit distance, matrix chain products, all-pair shortest distance, dynamic time warping, hidden Markov models, …

Principal of Optimality

Richard Bellman, 1952 An optimal policy has the property that whatever the initial state and the initial decisions are, the remaining decisions must constitute an optimal policy with regard to the state resulting from the first decision.

Problems Solvable by DP

Characteristics of problems solvable by DP Decomposition: The original problem can be

expressed in terms of subproblems. Subproblem optimality: the global optimum value

of a subproblem can be defined in terms of optimal subproblems of smaller sizes.

DP Example: Optimal Path Finding

Path finding in a feed-forward network p(a,b): transition cost q(a): state cost

Goal Find the optimal path

from 0 to 7 such that the total cost is minimized.

p(3,6)=8q(3)=1

Node index

Three steps in DP Optimum-value function

t(h): the minimum cost from the start point to node h.

Recurrent formula

Answer: t(7)

( ) ( , )

min ( ) ( , ) ,

( ) ( , )

with boundary condition (0) 0.

t a p a h

t h q h t b p b h

t c p c h

Optimum-value function

Step-by-step animation of DP

Click to go through DP

Principal of Optimality: Example

In terms of the shortest path problem Any partial path of the shortest path should itself be an optimal path given the starting and ending nodes

Three Steps of DP

DP formulation involves 3 steps Define the optimum-value function Derive the recurrent formula of the optimum-value function, with boundary conditions

Specify the answer to the original task in terms of the optimum-value function.

General Approach to DP

Usually bottom-up design Start at the bottom Solve small sub-problems Store solutions Reuse previous results for solving larger sub-problems

Usually it’s reducedto table filling!

General Characteristics about DP

Some general characteristics about DP We need to store back-tracking information in

order to identify the path efficiently. Only the optimal path is found. To find the second

best, we need to invoke a more complicated n-best approach.

Comparison: Recursion, Divide & Conquer, DP

Recursion A problem of size n is solved by first solving a sub-

problem of size n-1.

Divide & conquer A problem of size n is solved by first solving a sub-

problem of size k and another of size n-k.

DP A problem of size n is solved by first solving all sub-

problems of all sizes k, where k < n.

Longest Common Subsequence Subsequence

Given a string, we can delete some elements to form a subsequence:s1=uvwxyz s2=uwyz (after deleting v and x)s2 is a subsequence of s1.

Longest common subsequence (LCS) The similarity of two string can be define as the length of

the LCS between them. Example: abcdefg and xzackdfwgh have acdfg as a

longest common subsequence

Brute-Force Approach to LCS

A Brute-force solution Enumerate all subsequences of X Test which ones are also subsequences of Y Pick the longest one.

Analysis: If X is of length n, then it has 2n subsequences This is an exponential-time algorithm!

DP for LCS: 3-step Formula

Three-step DP formula for computing ,

1. Optimum-value function

, is the length of LCS between string and .

2. Recurrent formula

, 1, if

, ,max

lcs A B

lcs a b a b

lcs a b x y

lcs ax by lcs ax b

lcs a b

Boundary conditions: ,[] [], 0.

3. Answer: ,

lcs a lcs b

lcs A B

DP for LCS: Filling the Table

DP for LCS: Filling the Table (2)

Observations LCS=‘properi’ or

‘propert’ (which is obtained by keeping multiple back-tracking paths)

A match occurs when the node has a 45-degree back-tracking path

DP for LCS: Quiz!

String1 = abouta b o u t

LCS = aou

Quiz Solution

String1 = abouta b o u t

LCS = aou To create this plot Download Machine

Learning Toolbox Run lcs('about', 'aeiopu', 1)

under MATLAB

Edit Distance

Edit distance The minimum number of the basic operations

(delete, insert, substitute) that are required to converting a string into another.

DP for Edit Distance: 3-step Formula

Three-step DP formula for computing ,

1. Optimum-value function

, is the edit distance between string and .

2. Recurrent formula

, , if

ed A B

ed a b a b

ed a b x y

ed ax bed ax by

, 1, if

Boundary conditions: ed ,[] , [], .

3. Answer: ,

ed a by x y

ed a b

a len a ed b len b

ed A B

DP for Edit Distance: Filling the Table

DP for Edit Distance: Filling the Table (2)

DP for Edit Distance: Quiz!

e x e c u t i o n

String1 = execution

Min. edit distance = 8

Matrix Chain Products (MCP) Review: Matrix Multiplication.

C = A*B A is p × q and B is q × r

O(pqr ) time

],[*],[],[q

jkBkiAjiC

for (i=0; i<p; i++) for (j=0; j<r; j++){ c[i,j]=0; for (k=0; k<q; k++) c[i,j]+=a[i,k]*b[k,j]; }

Matrix Chain-ProductsProblem definition Given n matrices A0, A1, …, An-1,

where Ai is of dimension di×di+1

How to parenthesize A0*A1*…*An-1 to minimize the overall cost?

Example of MCPThe product A (2×3), B (3×5), C (5×2), D (2×4) can be fully parenthesized in 5 distinct ways:

(A (B (C D))) 5×2×4 + 3×5×4 + 2×3×4 = 124(A ((B C) D)) 3×5×2 + 3×2×4 + 2×3×4 = 78((A B) (C D)) 2×3×5 + 5×2×4 + 2×5×4 = 110((A (B C)) D) 3×5×2 + 2×3×2 + 2×2×4 = 58(((A B) C) D) 2×3×5 + 2×5×2 + 2×2×4 = 66

The way the chain is parenthesized can have a dramatic impact on the cost of evaluating the product.

Dynamic Programming28

An Enumeration ApproachMatrix Chain-Product Alg.: Try all possible ways to parenthesize

A=A0*A1*…*An-1

Calculate total number of operations for each way

Pick the one that is best

Running time: The number of ways of parenthesizations is

equal to the number of binary trees with n nodes

It is called the Catalan number, and it is almost 4n exponential!

((A0(A1A2))A3)binary tree

Observations Leading to DP

Define subproblems: Find the best parenthesization of Ai*Ai+1*…*Aj. Let Ni,j denote the minimum number of operations

required by this subproblem. The optimal solution for the whole problem is N0,n-1.

Subproblem optimality: The optimal solution can be defined in terms of optimal subproblems

There has to be a final multiplication (root of the expression tree) for the optimal solution.

Say, the final multiply is at index i: (A0*…*Ai)*(Ai+1*…*An-1).

Then the optimal solution N0,n-1 is the sum of two optimal subproblems, N0,i and Ni+1,n-1 plus the time for the last multiply.

Three-Step DP Formula

To solve matrix chain-product with DP Optimum-value function

Ni,j: the minimum number of operations required by parenthesizing Ai*Ai+1*…*Aj.

Recurrent equation

Answer N0, n-1

iNwith

dddNNN

jkijkkijki

11,1,,

(Ai*Ai+1*…*Ak)(Ak+1*Ak+2*…*Aj)

1 ki dd 11 jk dd

Subproblem Overlap 0..3

0..0 1..3 0..1 2..3 0..2 3..3

1..1 2..3 1..2 3..3 2..2 3..30..0 1..1

2..2 3..3 1..1 2..2

(A0)( A1A2A3)

Due to the overlap,we need to keep track

of previous results

(A0A1A2)(A3)(A0 A1)( A2A3)

Table Filling for DPThe bottom-up approach fills in the upper-triangle of the n×n array by diagonals, starting from Ni,i’s.

Ni,j gets values from pervious entries in row i and column j. Filling in each entry in the N table takes O(n) time Total time O(n3)Actual parenthesization can be found by storing the best “k” for each entry

}{min 11,1,, jkijkki

jkiji dddNNN

Answer!

Easy for back tracking

Walkthrough of an MCP Example

Product of A0 (2×3), A1 (3×5), A2 (5×2), A3 (2×4)

302×5k=0

422×2k=0

582×4k=2

303×2k=1

543×4k=2

405×4k=2

jkiji dddNNN

5424030

60400min

453min

3,32,1

3,21,13,1

404030

3,32,0

3,21,0

3,10,0

A02×3

A13×5

A25×2

A32×4

A02×3

A13×5

A25×2

A32×4

4220030

12300min

232min

2,21,0

2,10,02,0

Optimum value of k(for back tracking) Solution (after back tracking)

(A0A1A2)(A3)=(A0(A1A2))(A3)

ExerciseProduct of A0 (2×3), A1 (3×5), A2 (5×2), A3 (2×4), A4 (4×1)

jkiji dddNNN

A02×3

A13×5

A25×2

A32×4

A02×3

A13×5

A25×2

A32×4

Solution

302×5k=0

422×2k=0

582×4k=2

2×4k=

303×2k=1

543×4k=2

3×4k=

405×4k=2

5×4k=

02×4 5×4

A44×1

Dynamic Time Warping (DTW)

Intro to DTWApplications

DTW for speech recognition DTW for query by singing/humming

Dynamic Programming 張智星 (Roger Jang) jang@mirlab.org 多媒體資訊檢索實驗室...

Documents

Transcript of Dynamic Programming 張智星 (Roger Jang) jang@mirlab.org 多媒體資訊檢索實驗室...

第一章 資訊與資訊系統

眾至資訊Outlook connector

Information Literacy 資訊素養

Chapter 21 資訊科技：概念與管理. “Copyright 2006 滄海書局 ” Chapter 22 本章概要 資訊系統：概念與定義 資訊系統的演進 資訊系統的分類 資訊系統的例子

一一一一、、、、資產負債資訊資產負債資訊2 一一一一、、、、資產負債資訊資產負債資訊 （（（（一一一一））））資產負債表資產負債表

資訊素養 Information Literacy

2015/6/281 MIR: Status and Trends 音樂資訊檢索的現況與未來 J.-S. Roger Jang ( 張智星 ) Multimedia Information Retrieval Lab CS Dept., Tsing Hua Univ., Taiwan jang.

Ch07 資訊管理.....

資訊概論實習課 ─ 個人資訊管理

資訊 素養 研習 _ 資訊安全

台中資訊組長研習 -- 不插電的資訊科學

陸、資訊科技與人類社會 1. 資訊科技與生活 2. 資訊科技與學習 3. 資訊社會相關議題

資訊架構(Part 1)

醫事人員資訊行為 邱子恆 2008-06-09. 資訊行為 (Information behavior) 資訊需求 (information need) 資訊尋求行為 (information seeking behavior) 資訊使用 (information

圖書資訊學概論 - Part 1 圖書資訊學導言

資 訊 安 全

資訊戰─ C4ISR 簡介

第四章 地理資訊與地理資訊系統

CH03 全球資訊網

.NET 資訊能源

第一章資訊與資訊系統

Chapter 21 資訊科技：概念與管理. “Copyright 2006 滄海書局 ” Chapter 22 本章概要資訊系統：概念與定義資訊系統的演進資訊系統的分類資訊系統的例子

一一一一、、、、資產負債資訊資產負債資訊2 一一一一、、、、資產負債資訊資產負債資訊（（（（一一一一））））資產負債表資產負債表

資訊素養研習 _ 資訊安全

醫事人員資訊行為邱子恆 2008-06-09. 資訊行為 (Information behavior) 資訊需求 (information need) 資訊尋求行為 (information seeking behavior) 資訊使用 (information

資訊安全

第四章地理資訊與地理資訊系統