L16 : Logic Level Design (2) 성균관대학교 조 준 동 교수 .

L16 : Logic Level Design (2)

성균관대학교 조 준 동 교수http://vlsicad.skku.ac.kr

Low Power Logic Gate Resynthesis on Mapped Circuit

김현상 조준동 전기전자컴퓨터공학부

성균관대학교

Transition Probability

• Transition Probability: Prob. Of a transition at the output of a gate, given a change at the inputs

• Use signal probabilities

• Example: F = X’Y + XY’– Signal Prob. Of F: Pf = Px(1-Py)+(1-Px)Py

– Transistion Prob. Of F = 2Pf(1-Pf)

– Assumption of independence of inputs

• Use BDDs to compute these

• References: Najm’91

Technology Mapping •Implementing a Boolean network in terms of gates from a given library•Popular technique: Tree-based mapping•Library gates and circuits decomposed into canonical patterns•Pattern matching and dynamic programming to find the best cover•NP-complete for general DAG circuits•Ref: Keutzer’87, Rudell’89•Idea: High transition probability points are hidden within gates

Low Power Cell Mapping

• Example of High Switching Activity Node

• Internal Mapping in Complex Gate

Signal Probability vs. Power

0.5 1.00.0signal probability : p(x )

p(x) < 0.5 p(x) > 0.5

Spatial Correlation

P(x) = 0.25

P(x) = 0.25P(z) = 0.4375

P(b) = 0.5

P(c) = 0.5

P(d) = 0.5

P(x) = 0.25

P(y) = 0.25

zP(z) = 0.375

Logic Synthesis for Low Power• Precomputation logic

– selectively precompute the output logic values

-> reduce switching activity– using predictor function

• Retiming– re-positioning the F/F in a

pipelined circuit– candidates for adding

• circuit nodes with high hazard activity

• circuit nodes with high load capacitance

)1(y 1

Logic Synthesis for Low Power• State assignment

– to minimize the switching activity on high state transition arc

– can also consider the complexity of the combinational logic

– experimental result

• 10% ~17% power reductions

• Path balancing

– reduce hazards/glitches

– key issue in the delay insertion

• to use the minimum number of delay to achieve the maximum reduction

• Multi-level network optimization– use network don’t care term

– cost function• minimize sum of the number

of product terms and the weighted switching activity

• how changes in the global function of an internal node affects the switching activity of in its transitive fanout

– experimental result• ~10% power reduction

Logic Synthesis for Low Power

• Technology decomposition– minimizes the sum of the switching

activities at the internal nodes– one method

• to inject high switching activity inputs into the tree as late as possible

• Technology mapping

– general principle

• hide nodes with high switching activity inside the gates

P(a) = 0.3P(b) = 0.4P(c) = 0.7P(d) = 0.5

E(sw) = p(ab)+p(abc)+p(abcd) = 0.246

E(sw) = p(ab)+p(cd)+p(abcd) = 0.512

LH : high transition nodeL : low transition node

Low Power Logic Synthesis

Technology IndependentOptimization

Technology Mapping

Resynthesis on MappedCircuit

Logic Equation

Connection of Gates

RTL Description

Gate Level Description

Logic Synthesis

Timing & PowerAnalysis Tools

Technology Mapping

h : high switching activity node

l : low switching activity node

Tree Decomposition

(a) (b)

Low Power

gate(AND)

primary input

critical path

f output

Huffman Algorithm

x 1 x 2 x 3 x 4

y 1 y 2

2 3 4 4

Depth-Constrained Decomposition• Algorithm• problem : minimize SUM from i=1 to m p_t (x_i ) • input : 입력 시그널 확률 (p1, p2,íñíñíñ, pn), 높이 (h), 말단 노드의 수 (n), 게이트당 fanin l

imit(k)• output : k-ary 트리 topology• Begin• sort (signal probability of p1, p2,íñíñíñ, pn);• while (n!=0) • if (h>logkn)• assign k nodes to level L(=h+1);• /* 레벨 L(=h+1) 에 노드 k 개만큼 할당 */ • h=h-1, n=n-(k-1); /*upward*/• else if (h<logkn)• assign k nodes to level L(=h+2); • /* 이전 레벨 L(=h+2) 에 노드 k 개만큼 할당 */• h=h, n=n-(k-1); /*downward*/• else (h=logkn)• assign the remaining nodes to level L(=h+1); • /*complete; 레벨 L(=h+1) 에 나머지 노드를 모두 할당하고 • complete k-ary 트리 구성 */

• for (bottom level L; L>1; L--) • min_edge_weight_matching (nodes in level L);• End

Exampleh = 1

0.1 0.2 0.1 0.2 0.3 0.4 0.1 0.2 0.3 0.4

0.5 0.6

level L =0

level L =1

level L =2

level L =3

0.1 0.2 0.3 0.4

0.5 0.6

0.1 0.4 0.2 0.3

0.5 0.6

before matching after matching

After Decomposition

h=6 h=5 h=7 h=520

h=7 h=9

Fanin, Height

SIS+OURS

Improvement Ratio

After Tech. Mapping

h=3 h=310

h=4 h=5 h=315

h=4 h=5 h=520

h=6 h=7 h=8

Fanin, Height

K 1=3, k 2=3

SIS+LEVEL MAP

SIS+OURS+LEVEL MAP

Improvement Ratio

Precomputation• Power saving

– Reduces power dissipation of combinational logic– Reduces internal power to precomputed registers

• Opportunity– Can be significant, dependent on;

• percentage of time latch precomputation is successful

• Cost– Increase area– Impact circuit timing– Increase design complexity

• number of bits to precompute– Testability

• may generate redundant logic

Precomputation

R egisterB ank

Data_out

pn/R egisterB ank

R egisterB ank

/Data_out

m R egisterB ank

R egisterB ank

Entire function is computed.

Smaller function is defined,

Enable is precomputed.

• Before Precomputation Diagram

Precomputation

a > b/

Data_out

• After Precomputation Diagram

Precomputation

a(6:0)

a > b/

Data_out

a(6: 0)

b(6:0)

b(6: 0)

b(7) /

• Before Precomputation - ReportPrecomputation

• After Precomputation - ReportPrecomputation

Precomputation Example - Before Code

Library IEEE;Use IEEE.STD_LOGIC_1164.ALL;Entity before_precomputation isport ( a,b : in std_logic_vector(7 downto

0);CLK: in std_logic; D_out: out std_logic);

end before_precomputation;

Architecture Behav of before_precomputation is

signal a_in, b_in : std_logic_vector(7 downto 0);

signal comp : std_logic;

Beginprocess (a,b,CLK)

Beginif (CLK = '1' and CLK'even

t) then a_in <= a;

b_in<= b;end if;if (a_in > b_in) then

comp <= '1';else comp <= '0';end if;if (CLK'event and CLK='1')

then D_out <= comp;

end if;end process;end Behav;

Precomputation Example - After Code

Library IEEE;Use IEEE.STD_LOGIC_1164.ALL;

Entity after_precomputation isport (a, b : in std_logic_vector(7 downto 0);

CLK: in std_logic; D_out: out std_logic);end after_precomputation;

Architecture Behav of after_precomputation is

signal a_in, b_in : std_logic_vector(7 downto 0);

signal pcom, pcom_D : std_logic; signal CLK_en, comp : std_logic;

Beginprocess(a,b,CLK)Begin

if (CLK='1' and CLK'event) thena_in(7) <= a(7);b_in(7) <= b(7);

end if;

pcom <= a xor b;

if (CLK='0') thenpcom_D <= pcom;

end if;

CLK_en <= pcom_D and CLK;

Precomputation - Example After Code

if (CLK_en='1' and CLK_en'event) then

a_in(6 downto 0) <= a(6 downto 0);

b_in(6 downto 0) <= b(6 downto 0);end if;

if (a_in > b_in) thencomp <= '1';

else comp <= '0';

end if;

if (CLK='1' and CLK'event) thenD_out <= comp;

end if;end process;end Behav;

L16 : Logic Level Design (2) 성균관대학교 조 준 동 교수 .

Documents

Transcript of L16 : Logic Level Design (2) 성균관대학교 조 준 동 교수 .

성균관대학교 의과대학

VADA Lab.SungKyunKwan Univ. 1 Lower Power Embedded Architecture Design 성균관대학교 조 준 동 교수, 1999. 8 .

L29:Lower Power Embedded Architecture Design 성균관대학교 조 준 동 교수, 1999. 8 .

Iridis-pi : a low-cost, compact demonstration cluster 윤 준 기윤 준 기.

VADA Lab.SungKyunKwan Univ. 1 L5:Lower Power Architecture Design 1999. 8.2 성균관대학교 조 준 동 교수

L16.hegde.unlocked quiz

U8 l16生字生詞

성균관대학교 채 종 서

Marketing L16 Distribuzione

성균관대학교 정보통신공학부 한정현 2003 / 11 / 15

연구책임자 : 김상은 ( 성균관대학교 의과대학 )

L21:Lower Power Layout Design 1998. 6.7 성균관대학교 조 준 동 교수 .

국내대학소개- 성균관대학교 - SKKUshb.skku.edu/_res/cacrl/etc/11.pdf · 국내대학소개- 성균관대학교 2.3 연구원구성 본연구실에서는14명의연구원(박사과정4명,

황 지 혜 ·김 준 성 성균관대학교 의과대학 삼성서울병원, Overview of …€¦ · 활·운동치료, 식도암 및 폐암 수술 후 재활, 구경부암 치료

L16 lisfranc & midfoot inj

l16. Tuhan, Manusia Dan Global

VADA Lab.SungKyunKwan Univ. 1 Lower Power Architecture Design 1999. 8.2 성균관대학교 조 준 동 교수 .

L 18 : Circuit Level Design 성균관대학교 조 준 동 교수 .

성균관대학교 태양광산업 글로벌 리더 양성 고급인력양성센터kpvs.or.kr/cpvr/download_pdf.php?u=CPVR-V01-N02-10.pdf · a-1 성균관대학교 태양광산업

성균관대학교 경영전략학회 S-ONE 22nd Recruiting