Complex genetic structure of african cassava and genetic architecture of key traits
1. Complex genetic structure of African cassava
and genetic architecture of key traits
Ismail Y. Rabbi
IITA
Ibadan – Nigeria
2014 Annual RTB Meeting
2. Team members and funding
acknowledgements
IITA
Cornell University
Peter Kulakow,
Jean-Luc Jannink,
Melaku Gedil,
Martha Hamblin,
Elizabeth Parkes,
Charlotte Acharya,
Lava Kumar,
Delphine Ly,
Nzola Mahungu,
Puna Ramu,
Rachid Hanna,
Pheneas
Ntewaruhunga,
Edward Kanju,
Morag Ferguson,
US DoE-JGI and UC
Berkeley
Dan Rokhsar,
Simon Prochnik,
Jessen Bredesen,
Cindy Ha
CIAT
Luis-Augusto Becerra
National Programs
NRCRI – Chiedozie
Egesi
NaCCRI – Robert
Kawuki, Yona
Baguma,
Funding: CRP-RTB/HarvestPlus/BMGF and DFID
3. Objective: increase genetic gain through use
of molecular markers in cassava breeding
• Limited use of
markers in cassava
research.
• Advances in next-generation
sequencing should
change (and is
changing) this
situation.
4. PstI sites ApekI sites
GBS 96-plex Protocol (cont.)
Primers
.....
.....
..
..
. ..
.
. ...
..
..
.
Clean-up
..
.. .
.
...
..
.
.
.
.
.
.
.
..
...
.
..
.
..
.
.. .
.
...
..
.
.
.
. .
.
.
.
.
.
.
..
.
.
. .
..
.
.
...
...
..
.
.
.
. .
.......
.
.
.
.
..
.
.
.
. .
.
...
.
..
.
.
..
.
..
.
...
...
.
..
...
.
.
.
.
.
.
.
......
.
...
.
.
.
..
.
.
..
.
..
.
. .
1. Plate DNA &
adapter pair
4. Pool
DNAs
Primers
..
.. .
.
.
.
.
.
.
. .
..
.
...
..
.
.
. . .
..
.
.
.
. .
.
..
.
. .
. .
.
Primers
.
..
.
...
.
.
...
...
.
.
.. .... ..
..
.
.
.
..
.
.
.
.
.
..
. .
.
.
.
5. PCR
.....
..
..
. ..
.
Sample digestion, barcode and
2. Digest DNA with RE
3. Ligate adapters
Illumina F and R adapters
(may be done simultaneously)
ApeKI (5 base-cutter)
Clean-up
Clean-up
6. Evaluate
fragment sizes
GBS 96-plex Protocol (cont.)
...
......
..
.
.
1. Plate DNA &
adapter pair
4. Pool
DNAs
5. PCR
2. Digest DNA with RE
3. Ligate adapters
(may be done simultaneously)
A
ApeKI (5 base-cutter)
6. Evaluate
fragment sizes
GBS 96-plex Protocol (cont.)
...
......
.
..
.
.
.
.
...
.
...
.
.
...
..
1. Plate DNA &
adapter pair
4. Pool
DNAs
5. PCR
2. Digest DNA with RE
3. Ligate adapters
(may be done simultaneously)
ApeKI (5 base-cutter)
6. Evaluate
fragment sizes
Sample DNA
A/G
A
A
G
G
G
G
A
A
A
A
A
A
G
A
G
G
A
A
A
A
G
G
Step 1
Sample digestion using enzyme
Step 2
Barcode adapter ligation
Step 3
Pooling of samples, PCR
amplification and
sequencing
Reference sequence
Step 4
DNA sequences aligned to reference
genome, sorted by barcodes and
SNPs called.
DNA
barcode
Universal
forward
primer
Digestion
site
Digestion
site
Pooling PCR Illumina sequencing
Sequence sorting (bioinformatics)
Technical overview
5. Application of GBS in cassava
improvement research areas
1. Understanding genetic diversity and population
structure for targeted breeding strategy (e.g.
heterotic grouping and hybrid breeding)
2. Development of genomic resources (e.g. snp
markers, annotated reference genomes, genetic
linkage maps).
3. Determining genetic architecture of target traits
(e.g. disease resistance, nutrition).
4. Genomic selection for accelerated breeding.
6. Population structure and
genetic diversity of African cassava
• Very little is known about the population structure, levels
of diversity, and ancestry of African cassava.
• Such information is required for targeted breeding
strategy (e.g. heterotic grouping and hybrid breeding).
• Most historical studies relied on at most few dozen
markers and limited germplasm set.
7. Populations genotyped so far
Population N Description
IITA improved varieties 1055 Improved genotypes cloned since 1970s
IITA-GRC 278 IITA genebank’s core collection
NRCRI 383 National Root Crops Research Institute (Nigeria)
IITA regional breeding 543
Breeding germplasm/landraces from DRC, Tanzania,
Ghana, Malawi, Zambia, Cameroon
IITA landrace collection 809
Landraces assembled from sub-Saharan Africa since
early 1980’s
CIAT* 285
Total 3353
*** Sequencing just completed, analysis in progress
60326 SNP loci from ApekI GBS
8. Population structure/ancestry of
African cassava
100%
50%
0%
West Africa, East Africa, Central Africa
• Admixture analysis detected about nine subpopulation
(divergent founders).
• Most clones derive their ancestry from more than two
subpopulations
• Improved varieties typically show more ancestries
(expected?)
13. Improvement of reference genome
using high-density linkage maps
• High-quality genome assembly needed for QTL, GS,
GWAS, genetic transformation.
• Current cassava genome is assembled into 12977
pieces (scaffolds).
• 10 high-density GBS SNP maps used to anchor the
cassava reference genome.
• 71.9% of the assembled genome has been placed on
the 18 cassava chromosomes.
• Work done as part of the cassava genetic map
consortium
14. Application of GBS in cassava
improvement
1. Understanding genetic diversity and population
structure for targeted breeding strategy (e.g.
heterotic grouping and hybrid breeding)
2. Development of genomic resources (e.g. snp
markers, annotated reference genomes, genetic
linkage maps).
3. Determining genetic architecture of target traits
(e.g. disease resistance, nutrition).
a. QTL mapping using bi-parental populations
4. Genomic selection for accelerated breeding.
10/30/2014
15. QTL mapping in bi-parental
populations
• Mapping population:
– Full-sib F1 populations derived from pairs of non-inbred parents
• Phenotyping (>2 years):
– Susceptibility to Cassava Mosaic Disease
– Carotenoid accumulation (b-carotene) in storage roots
– Anthocyanin pigmentations
– Number of harvested roots
– root weight per plot.
• Genotyping:
– GBS using PstI Restriction enzyme (n = 770 SNPs)
• Data analysis:
– Genetic map – Joinmap®
– Phenotype data analysis - R/lme4
– QTL mapping using R/qtl
16. Log of odds ratio
QTL mapping in bi-parental
populations
Rabbi et al. Crop Science vol. 54:
17. Major locus underlying carotenoid
accumulation in cassava roots
80
60
40
20
0
S7520_912057 PULPCOL Phytoene synthase II
linkage group
lod
2011
2012
combined
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
Rabbi et al. Crop Science vol. 54:
18. High-resolution mapping of the
CMD2 locus
40
30
20
10
0
CMD1S S5214_30876
linkage group
lod
CMD2 locus
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
Resistant parent Susceptible parent
19. Narrow genetic base for major gene
resistance to CMD?
a 50
S5214_780931
40
30
20
10
0
-Log10(P)
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18
50
40
30
20
10
0
0 30 60 90 120 150
IITA TMS 011412
IITA TMS 4(2)1425
scaffold 5214
Position (cM)
b
-Log10(P)
a. Genome-wide scan
for linkage between
CMD resistance and
6756 SNPs across 18
cassava linkage groups.
b. A detailed view of the
of linkage group 16
Rabbi et al. Virus Research 2014 DOI: 10.1016/j.virusres.2013.12.028
20. Anchoring previously-mapped
resistance loci
S4175_365840
S4175_207572
S4175_58407
S4175_116369
S4175_207488
S4175_241208
S4175_208108
S4175_64514
S4175_62342
S4175_181543
S4175_255324
S7933_3549
S7933_41036
S5214_1427030
S5214_1273806
S5214_1274892
S5214_1073280
S5214_1084049
S5214_831886
S5214_780931
S5214_776689
S5214_514589
S5214_472282
S5214_566592
S6906_46617
S6906_46672
S6906_119465
S6906_119507
S6906_359407
S6906_359424
S6906_359496
20
40
60
80
100
120
CMD Resistance
LG16P1F2
r^2
0 0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1 1
NS198
PVE: 11%
SSRY106
PVE: 30%
SSRY28
PVE: 60-70%
NS158 and
NS169
PVE: N/A
All markers linked to
dominant, major gene
resistance to CMD from
different genetic
backgrounds occur in
same chromosome!
Linkage group 16
21. Application of GBS in cassava
improvement
1. Understanding genetic diversity and population
structure for targeted breeding strategy (e.g.
heterotic grouping and hybrid breeding)
2. Development of genomic resources (e.g. snp
markers, annotated reference genomes, genetic
linkage maps).
3. Determining genetic architecture of target traits
(e.g. disease/pest resistance, plant morphology,
nutrition)
b. Genome-wide association
4. Genomic selection for accelerated breeding.
10/30/2014
22. What next?
• So far, most of the applications of GBS SNPs relate
to germplasm characterization and QTL discovery.
• Next step is to change gears to ‘applications’ by
using the discovered information in the crops’
improvement.
– Heterotic pattern/grouping (on-going)
– Genomic selection and MAS
• www.nextgencassava.org
• www.cassavabase.org
23. Team members and funding
acknowledgements
IITA
Cornell University
Peter Kulakow,
Jean-Luc Jannink,
Melaku Gedil,
Martha Hamblin,
Elizabeth Parkes,
Charlotte Acharya,
Lava Kumar,
Delphine Ly,
Nzola Mahungu,
Puna Ramu,
Rachid Hanna,
Pheneas
Ntewaruhunga,
Edward Kanju,
Morag Ferguson,
US DoE-JGI and UC
Berkeley
Dan Rokhsar,
Simon Prochnik,
Jessen Bredesen,
Cindy Ha
CIAT
Luis-Augusto Becerra
National Programs
NRCRI – Chiedozie
Egesi
NaCCRI – Robert
Kawuki, Yona
Baguma,
Funding: CRP-RTB/HarvestPlus/BMGF and DFID
24. Also, check out our poster on tracking of
cassava varieties in farmers fields
•
•
•
Thank you !!!
Editor's Notes
PstI[55] Providencia stuartii 5'CTGCAG
3'GACGTC 5'---CTGCA G---3'
3'---G ACGTC---5'
ApekI
GCWGC where W is A or T
> (1/4)^4
[1] 0.00390625
> (1/6)^4
[1] 0.0007716049