Phase Bias in Niv Sabath University of Houston Overlappin enes g.

30
Phase Bias in Niv Sabath University of Houston Overlappi n enes g

description

Overlap Length Count Long OverlapsShort Overlaps T G A T A A T A G 5’3’ Phase 1 Phase 2 Phase 0 Same-Strand Overlaps

Transcript of Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Page 1: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Phase Bias in

Niv SabathUniversity of Houston

Overlappinenes g

Page 2: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Introduction• Overlapping genes are ubiquitous, particularly in

bacteria and viruses

• Genes can overlap On the same strand (→ →) On opposite strands (→ ← or ← →)

• In bacteria~30% of the genes overlap~70% of the overlaps are on the same strand

Page 3: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Overlap Length

Count

Long Overlaps Short Overlaps

T G AT A AT A G

5’ 3’Phase 1Phase 2

Phase 0

Same-Strand Overlaps

Page 4: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Overlap Length

Count

Long Overlaps Short Overlaps

T G AT A AT A G

5’ 3’Phase 1Phase 2

Phase 0

Same-Strand Overlaps

Page 5: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Overlap Length

Count

Long Overlaps Short Overlaps

T G AT A AT A G

5’ 3’Phase 1Phase 2

Phase 0

Same-Strand Overlaps

1

Page 6: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Overlap Length

Count

Long Overlaps Short Overlaps

T G AT A AT A G

5’ 3’Phase 1Phase 2

Phase 0

Same-Strand Overlaps

2

Page 7: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Overlap Length

Count

Long Overlaps Short Overlaps

T G AT A AT A G

5’ 3’Phase 1Phase 2

Phase 0

Same-Strand Overlaps

4

Page 8: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Overlap Length

Count

Long Overlaps Short Overlaps

T G AT A AT A G

5’ 3’Phase 1Phase 2

Phase 0

Same-Strand Overlaps

5

Page 9: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

7Overlap Length

Count

Long Overlaps Short Overlaps

T G AT A AT A G

5’ 3’Phase 1Phase 2

Phase 0

Same-Strand Overlaps

Page 10: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Overlap Length

Count

Long Overlaps Short Overlaps

T G AT A AT A G

5’ 3’Phase 1Phase 2

Phase 0

Same-Strand Overlaps

8

Page 11: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

10Overlap Length

Count

Long Overlaps Short Overlaps

T G AT A AT A G

5’ 3’Phase 1Phase 2

Phase 0

Same-Strand Overlaps

Page 12: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Overlap Length

Count

Long Overlaps Short Overlaps

T G AT A AT A G

5’ 3’Phase 1Phase 2

Phase 0

Same-Strand Overlaps

11

Page 13: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Overlap Length

Count

Long Overlaps Short Overlaps

5’ 3’Phase 1Phase 2

Phase 0

T A G C A T G T A T C A T G G

“…the phase bias must be a property of gene locations.”

“We propose that through some mechanism yet to be determined, the creation of unidirectional gene overlaps of phase +1 confers some advantage.”(Cock and Whitworth 2007)

Page 14: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

170 bacterial genomes15298 long overlaps in phase 1 4153 long overlaps in phase 2

T A G C A T G T A T C A T G G

Page 15: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

170 bacterial genomes15298 long overlaps in phase 1 4153 long overlaps in phase 2

Composition?

T A G C A T G T A T C A T G G

Page 16: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Scenarios of overlap creation

5’ 3’

5’ 3’

5’ 3’

T A G C A T G T A T C A T G G

Page 17: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

HypothesisThe phase bias in overlap frequency is a

result of difference between the frequencies of initiation/termination codons in phase 1 and phase 2 reading frames

T A G C A T G T A T C A T G G

Page 18: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

We examined the frequencies of ATG, the most common start codon, and stop codons in phase 1 and phase 2 reading frames

T A G C A T G T A T C A T G G

Page 19: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

T A G C A T G T A T C A T G G

Page 20: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

T A G C A T G T A T C A T G G

Met

Met

T G T

T G C

T G A

T G G

Cys

Cys

Stop

Trp

T A T

C A T

A A T

G A T

Tyr

His

Asn

Asp

5’ 3’ Phase 0

Phase 1

Phase 2

T A G C A T G T A T C A T G G

What causes the difference in ATG frequencies?

Page 21: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

What causes the difference in ATG frequencies?

Relative abundance of amino acids

T A G C A T G T A T C A T G G

T G T

T G C

T G A

T G G

Cys

Cys

Stop

Trp

T A T

C A T

A A T

G A T

Tyr

His

Asn

Asp

Phe Leu Ile MetVal Ser Pro ThrAla Gln Lys GluArg Gly

Phase 1 Phase 2

Page 22: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

T A G C A T G T A T C A T G G

The frequencies of ATG are correlated with the expected frequencies

00GNNNAT ff

00TGNNNA ff

r2 = 0.93

r2 = 0.80

Page 23: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Amino-acid frequency

ATG frequency

Phase bias

T A G C A T G T A T C A T G G

Page 24: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Acknowledgments

Dr. Dan Graur

Dr. Giddy Landan

NSF

© Marko Posavec

Page 25: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Why was the correlation between overlap frequency and GC content unnoticed?

Long + short overlaps:

Phase 1 + phase 2

Phase 1

Phase 2 (64% ATGA)

Page 26: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

Why was the correlation between overlap frequency and GC content unnoticed?

Long + short overlaps:

Phase 1 + phase 2

Phase 1

Phase 2 (64% ATGA)

Stop codon usage:

TGA TAA TAG

Page 27: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

What is the proportion (P) of overlaps, which are created by each scenario after accounting for ATG frequencies?

Page 28: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

frequencyoverlapphasefrequencyoverlapphaseRgenes 2

1

frequencyATGphasefrequencyATGphaseRATG 2

1

frequencycodonstopphasefrequencycodonstopphaseRstop 2

1

Page 29: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

frequencyoverlapphasefrequencyoverlapphaseRgenes 2

1

frequencyATGphasefrequencyATGphaseRATG 2

1

frequencycodonstopphasefrequencycodonstopphaseRstop 2

1

stopATGgenes RPPRR 1

Page 30: Phase Bias in Niv Sabath University of Houston Overlappin enes g.

frequencyoverlapphasefrequencyoverlapphaseRgenes 2

1

frequencyATGphasefrequencyATGphaseRATG 2

1

frequencycodonstopphasefrequencycodonstopphaseRstop 2

1

stopATGgenes RPPRR 1

5.0114.67.3 PPP

The expected value under no bias for overlaps in the two scenarios