Viral Metagenomics (CABBIO 20150629 Buenos Aires)

31
Bas E. Dutilh Bacteriófagos: Aspectos básicos y moleculares. Aplicaciones Biotecnológicas Buenos Aires, June 29 th 2015 Viral metagenomics

Transcript of Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Page 1: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Bas E. DutilhBacteriófagos: Aspectos básicos y moleculares. Aplicaciones Biotecnológicas

Buenos Aires, June 29th 2015

Viral metagenomics

Page 2: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Metagenomics

Sample

Filter

Microbesor viruses

Page 3: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Viruses and phages

Page 4: Viral Metagenomics (CABBIO 20150629 Buenos Aires)
Page 5: Viral Metagenomics (CABBIO 20150629 Buenos Aires)
Page 6: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Metagenomics

Sample

Filter

Microbesor viruses

Page 7: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Who/what is there?

Page 8: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Who/what is there?

Page 9: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Metagenomics 1.0: database mapping• General database

• Custom database– MetaHIT human gut catalogue

– Omnibus of Marine Genes

Page 10: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Taxonomic and functional profiling

Sunagawa et al. Science 2015

Page 11: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Metagenomics 2.0: genomes from metagenomes• Reference databases fail for most environmental

metagenomes– “Dark matter”: sequences not in database

• Homology searches fail for many short sequencing reads– Fast read alignment tools place upper limit on

evolutionary distance

• Interpretation fails for taxonomic and functional metagenomic profiles– Do functions co-occur in a genome?– To describe interactions between species you need

species/genome-level resolution

• Solution: assembly and binning of (draft) genomes from metagenomes

Page 12: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Biological “dark matter”

“Depending on how they are viewed,the unknowns can represent either a formidable challenge

or a treasure trove for virus discovery.”Mokili, Rohwer and Dutilh Curr. Opin. Virology 2012

Page 13: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Unknowns in viral metagenomes

Mokili, Rohwer and Dutilh Curr. Opin. Virology 2012

Page 14: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

T1 T2

M

F2T1 T2

M

F4T1 T2

M

F1T1 T2

M

F3

Reyes et al. Nature 2010

Page 15: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Pathway presence profiles

Virome readsmapping to viral database

Reyes et al. Nature 2010

Page 16: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

1 2 3 4 5 6 7 8 9 10 11 121

10

100

1000

10000

Number of samples contributing reads to contig

Num

ber o

f con

tigs

De novo assembled contigs

6,988 de novo cross-contigs

Page 17: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Family 1 Family 2 Family 3

F1M F1T1 F1T2 F2M F2T1 F2T2 F3M F3T1 F3T2 F4M F4T1 F4T2

220 contigs present in ≥9 samples

Family 4

Aver

age

dept

h →

Samples →

1 2 3 4 5 6 7 8 9 10 11 121

10

100

1000

10000

Page 18: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

crAssphage97,065 bp

29.3% G+C80 ORFs

CaudoviralesDutilh et al. Nat. Comm. 2014

Page 19: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Dutilh et al. Nat. Comm. 2014

Page 20: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

%

Genomic position: 0 – 97,065 nt

2,906 metagenomes (940)blastn≥95% identity≥75 bp hitunambiguously aligned

Page 21: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

2,906 metagenomes1,193 phages (861)blastn≥95% identity≥75 bp hitunambiguously aligned

Ubiquity/abundance plot

Dutilh et al. Nat. Comm. 2014

Page 22: Viral Metagenomics (CABBIO 20150629 Buenos Aires)
Page 23: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

??????????????????????????????????????? ???????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????????

Virome readsmapping tocrAssphage

Viral database vs crAssphage

Virome readsmapping to viral database

Page 24: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Dick et al. Genome Biol. 2009

Binning

Page 25: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Paired-end reads

Iverson et al. Science 2012

Page 26: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Binning: Hi-C sequencing

Beitel et al. PeerJ 2014

• Cross-link physically proximal DNA (formalin)

• Restrict with enzyme• Ligate restricted sites• Sequence ligation sites

Page 27: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

K-mer binning

Ghai et al. Sci. Rep. 2011

Page 28: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

K-mer (k=4) maps (ESOM)

Abe et al. DNA Res. 2005

Page 29: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Depth binning

Speth et al. Frontiers Microbiol 2012

Page 30: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Depth profile binning

Wrighton et al. Science 2012

F1M F1T1 F1T2 F2M F2T1 F2T2 F3M F3T1 F3T2 F4M F4T1 F4T2

Page 31: Viral Metagenomics (CABBIO 20150629 Buenos Aires)

Partial nitritation/ anammox reactor (600 m3)

5.0 m

0.2 m

1.4 m

2.6 m

3.8 m

untreated

washed granules

12

34

56

78

washed granules

DNA isolation

Organic extraction

Powersoil kit

Organic extraction

Powersoil kit

Organic extraction

Powersoil kit

Organic extraction

Powersoil kit

Sample treatmentSample location DNA isolation

untreated

Differential sampling and DNA extraction

Speth et al. submitted