Family-Based Association Analyses in Plants Clay Sneller, Ohio - PowerPoint PPT Presentation

Family-Based Association Analyses in Plants Clay Sneller, Ohio State University Kevin Smith, Jon Massman, University of Minnesota

Association Analyses (AA) • Associate – to connect in the mind or imagination • Statistically associate marker and phenotypic data • Detect a physical linkage of marker and trait loci (QTL) • Normally used in complex populations: many parents • AA must deal with population structure

Population Structure: Unequal relationship between individuals 1.Between Subgroups 2. Within Subgroups AA must accommodate structure to control type I errors: Declaring linkage when none exists

Population- vs Family-Based AA Population Family Estimation Over entire Within lineages, association population between relatives parameter then compiled Population Estimated Negated by structure & modeled sampling Inference Implied by Required for of linkage significance significance

Population-Based AA • Commonly used in plants • Applicable to many population types • Common statistics – Main effect of marker: means comparison – Covariance for effect of subgroups – TASSLE+STRUCTURE, unified mixed-model of Yu et al. 2006

A B C D E F G H I J K L M N O 0 0 1 1 0 1 1 0 1 1 0 0 2 0 1 0 0 1 1 H0: X 0 X 1 X 2 0 = 1 0 = 1 1 1 1 0 0 1 00 11 0 0 0 0 1 1 0 1 0 1 0 2 1 Genotyped X 2 X 0 X 1 Phenotyped

Mean Freq “1” Freq “0” 0 1 1 0 1 0 1 0 1 0 75 0.5 0.5 50 0.1 0.9 0 0 0 0 0 1 1 1 1 1 “75” 0.1 0.9 100 0.9 0.1 “75” 0.9 0.1 Y i = u + g i + other effects X 1 > X 0 " " " Y i = u + Cov + g i + …. = " X X 1 0

Mean Freq “1” Freq “0” 0 1 0 1 0 1Q 75 0.5 0.5 1 0 1 0 1 0q X 1 > X 0 1q 1 0 1 0 1 75 0.5 0.5 0 1 0 1 0 0Q X 1 < X 0 1 = X X Y i = u + Cov + g i + …. 2

Family-Based AA • As individuals become more related, they become more similar • Estimate association parameter within lineages • Compile and test for significance

Mean Freq “1” Freq “0” 0 1 0 1 0 1Q 75 0.5 0.5 1 0 1 0 1 0q X 1 > X 0 1q 1 0 1 0 1 75 0.5 0.5 0 1 0 1 0 0Q X 1 < X 0

Haseman “Sib” Pair Regression & Elston, 1972 Hair Behavior Pigment Marker Sweet 2 A Sassy 2 A 7 B Steady

Regress Phenotypic Difference 2 on Proportion of IBD alleles at Marker P D = Mark 1 (X i – X j ) 2 IBD Pair 1 0 Shared 1 allele 2 25 0 No shared allele Regress 3 25 P D on IBD 0

25 B = -25 σ 2 P D B = -2(1-2 c ) 2 a 0 σ 2 a = 12.5 if c=0 0 0.5 1.0 IBD

Multiple Families: Lineages Family n No. Freq Freq Freq Pairs 0 1 2 Snellers 3 3 0.66 0.33 0 Vassilyev 69 2346 0.50 0.50 0 Daad 86 3655 0.35 0.55 0.10 Hatfields 35 595 0.90 0.10 0 McCoys 35 595 0.90 0.10 0 7194

Human Genetics FBAA PBAA X • Family data is hard to X collect, verify parentage X • Studied populations are not X highly structured - random X • Careful apriori sampling to X minimize effect of structure • VERY large population size

FBAA Example: 206 Barley Lines, Barley CAP • Derived from 65 biparental crosses • Average 3.1 progeny per cross • DON data from three environments – h 2 = 0.52 • Genotyped with 2924 SNP markers BOPA_C(1) • Analysis used 676 SNPs (PIC > 0.18)

PCA of Genetic Similarity Matrix 3 2 ND AB & MN 1 P C 2 S c o re s 0 -1 -2 ND -3 -2 -2 -1 -1 0 1 1 2 PC1 Scores Average GS=0.62 +/- 0.13

Developing Pairs for the Pair-Regression 3 5886 pairs 2 1 P C 2 S c o re s x 0 -1 -2 N=29 -3 -2 -2 -1 -1 0 1 1 2 PC1 Scores 3 Lineages Used Lineages with PIC >0.18 Average GS=0.62 +/- 0.13 Used pairs with GS >0.75

Models TASSLE Y i = u + Cov + g i + polygene (Q+K) STRUCTURE Intercept Genetic similarity IBD Proportion Pair Regression P D i = u + B 1 S i + B 2 I i Covariance of individuals within a lineage

Pair Reg Tassle (VAR) PR T (LOD) *** ** (VAR) PR T (LOD) *** ***** *** 7.0 *** * ***** 105 ***** ***** 10.2 ***** * ** 50 ***** 10 22 ** ***** Mark 27 * ** ***** 9 46 ** ***** * Mark 49 ** ***** * 2.8 ***** * 4 55 ***** * Mark 56 (VAR) PR T (LOD) * ** ***** ** Prob < .00001 190 ***** ***** 10.2 ***** ** Chromosome 4H

Tassle vs Pair-Regression # of QTL Tassle & Pair-Regression 16 Tassle Only 1 Pair Regression Only 4 Population well suited for both Clear lineages 3 lineages

Xsm cM Var PR T (LOD) Xsm cM Var PR T (LOD) 7H 161 43 ***** * 2.6 3H 145 46 ***** ** 3.1 148 ***** 150 ***** 155 ***** 6H 13 **** 13 58 ***** 17 * 2.7 5H 87 26 ***** 1H 51 *** 89 ***** 53 94 56 47 ***** 94 ***** 95 * 5H 173 ** 4.0

FBAA is Well Suited for Plant Breeding Populations • Populations are EXTREMELY relevant • Many lines are phenotyped annually • Multiple large lineages are present – Full Sibs – Half Sibs – Other degrees of relationship, lineages

2009 YR1 Phenotyping: FHB Index 45 FBAA to evaluate a marker 570 Lines in a breeding population: 40 47 crosses 35 12 lines/cross 1. Build lineages based on pedigree: FS, HS 2. Genotype for marker to be tested 30 F H B In d ex (% ) Many Xs seg 25 S 20 4597 Full-Sib 15 pairs 10 5 MR 0 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 Cross

Other Types of FBAA • Quantitative Inbred Pedigree Disequilibrium Test • Two-level Haseman-Elston Regression

Quick Takes on FBAA • 1 study, much more needed to see applications: simulations • Well suited for breeding populations • May circumvent some issues inherent to population-based AA • Can handle rare alleles • QTL validation & evaluation in breeding populations • Stability of QTL effects over lineages

Thanks • Kevin Smith, Jon Massman • Barley CAP folks • Dr Elston • Diane Mather

Types of Plant Populations and Association Analyses Diverse Breeding Biparental Number of Many Many 2 Parents Ancestors Elite Selected Amount of Lots Lots V Little Structure Evolution Breeding Relevance to Some Lots Variable Breeding Type of CIM Population-Based AA Analysis Family-Based AA

Association Analysis: Associate: to connect in the mind or imagination Link: to connect, to tie or bind • Associate variation of marker genotypes with variation of phenotypes • Imply linkage of marker locus and QTL M Q

P1 = 1 Q P2 = 0 q 1 Q 0 q 0 0 0 1 1 0 1 1. Pop0 and Pop1 X 0 likely equivalent 0 0 0 If large 1 1 1 1 2. Two alleles 0 0 1 1 0 0 1 3. High LD 4. Significance requires linkage X 1 Y i = u + g i + other effects

Population, Genotyped 1 marker 0 1 0 1 0 1 0 Phenotyped 1 0 1 0 1 0 1 0 1 0 1 0 1 0 Test Association: Parameters are means 0 0 0 1 1 0 1 0 0 0 X 0 X 1 1 1 1 1 H0: = 0 0 1 1 0 0 1 Y i = u + g i + other effects

Family-Based Association Analyses in Plants Clay Sneller, Ohio - PowerPoint PPT Presentation

Family-Based Association Analyses in Plants Clay Sneller, Ohio State University Kevin Smith, Jon Massman, University of Minnesota Association Analyses (AA) Associate to connect in the mind or imagination Statistically associate

About Revit Family (NAH) Project Family Management Annotation Family System Family

Setting up new production Setting up new production Young plants Young plants Young plants

BARE ROOT AND BARE ROOT AND CONTAINERIZED FOREST CONTAINERIZED FOREST PLANTS PLANTS PLANTS

Assessing life in the stream Aquatic Plants More plants = more diversity Hard to identify

Science Year 2 Science Observing Plants Lesson 2 Week beginning 4 th May Plants This week

Science Plants Year One Aim I can use my observations to explain what plants need. I can

IMMEDIATE EFFECT WITH MODULAR PLANTS Modulesof evergreen, deciduousleaf and climbing plants A

Art and Design Making Plants in Paper Success Criteria Aim I can make a paper plant. I

Family Planning Only Programs Current Family Planning Only Programs Family Planning Only

Family Wealth Continuity ENTERPRISING FAMILY INVESTMENT APPROACHES Defining Success: What is an

C5 C5 75 C5- 75: C ENTRE FOR FAMILY MEDICINE ENTRE FOR FAMILY MEDICINE C ASE

WILLISCROFT FAMILY WILLISCROFT FAMILY WILLISCROFT FAMILY WILLISCROFT FAMILY HISTORY HISTORY

Confirmatory subgroup analyses: Case Studies Frank Bretz, Gerd Rosenkranz, Emmanuel Zuber EMA

Genome Wide Haplotype analyses Genome Wide Haplotype analyses of human complex diseases with the

Multivariate Ordination Analyses: Principal Component Analysis Dilys Vela Tatiana Boza Tatiana

Baseline Analyses Using Baseline Analyses Using DBP (2006) & AMP (2008) DBP (2006) & AMP

High Dimensional Data, Covariance Matrices High Dimensional Data Examples and Application to

Carinata Summit 2016 Update on Breeding Efforts in Carinata Growth of Agrisoma & Collaborator

Molecular genetic variation in the Galway sheep breed Dawn J. Howard Ph.D. 1,2 1 Teagasc, Athenry,

Value of Y-Chromosome Markers Y Application Advantage Forensic case work on Male-specific

Should Therapy for Newly Diagnosed Multiple Myeloma Be Risk Adapted? Evangelos Terpos, MD

REPRODUCTICE TECHNOLOGY (ART) USED. Hlinka D. Svobodov L. Hrniarov E. Reichov T. Hulvert J.

Sex and Gender in Biological Weapons-related Research Nancy Connell, PhD Senior Scholar and

Feed Forward Neural Network with Genetic Algorithm Problem definition and training data

Sambuz

Useful Links

Newsletter

Mail Us

Family-Based Association Analyses in Plants Clay Sneller, Ohio - PowerPoint PPT Presentation

Family-Based Association Analyses in Plants Clay Sneller, Ohio State University Kevin Smith, Jon Massman, University of Minnesota Association Analyses (AA) Associate to connect in the mind or imagination Statistically associate

About Revit Family (NAH) Project Family Management Annotation Family System Family

Setting up new production Setting up new production Young plants Young plants Young plants

BARE ROOT AND BARE ROOT AND CONTAINERIZED FOREST CONTAINERIZED FOREST PLANTS PLANTS PLANTS

Assessing life in the stream Aquatic Plants More plants = more diversity Hard to identify

Science Year 2 Science Observing Plants Lesson 2 Week beginning 4 th May Plants This week

Science Plants Year One Aim I can use my observations to explain what plants need. I can

IMMEDIATE EFFECT WITH MODULAR PLANTS Modulesof evergreen, deciduousleaf and climbing plants A

Art and Design Making Plants in Paper Success Criteria Aim I can make a paper plant. I

Family Planning Only Programs Current Family Planning Only Programs Family Planning Only

Family Wealth Continuity ENTERPRISING FAMILY INVESTMENT APPROACHES Defining Success: What is an

C5 C5 75 C5- 75: C ENTRE FOR FAMILY MEDICINE ENTRE FOR FAMILY MEDICINE C ASE

WILLISCROFT FAMILY WILLISCROFT FAMILY WILLISCROFT FAMILY WILLISCROFT FAMILY HISTORY HISTORY

Confirmatory subgroup analyses: Case Studies Frank Bretz, Gerd Rosenkranz, Emmanuel Zuber EMA

Genome Wide Haplotype analyses Genome Wide Haplotype analyses of human complex diseases with the

Multivariate Ordination Analyses: Principal Component Analysis Dilys Vela Tatiana Boza Tatiana

Baseline Analyses Using Baseline Analyses Using DBP (2006) &amp; AMP (2008) DBP (2006) &amp; AMP

High Dimensional Data, Covariance Matrices High Dimensional Data Examples and Application to

Carinata Summit 2016 Update on Breeding Efforts in Carinata Growth of Agrisoma &amp; Collaborator

Molecular genetic variation in the Galway sheep breed Dawn J. Howard Ph.D. 1,2 1 Teagasc, Athenry,

Value of Y-Chromosome Markers Y Application Advantage Forensic case work on Male-specific

Should Therapy for Newly Diagnosed Multiple Myeloma Be Risk Adapted? Evangelos Terpos, MD

REPRODUCTICE TECHNOLOGY (ART) USED. Hlinka D. Svobodov L. Hrniarov E. Reichov T. Hulvert J.

Sex and Gender in Biological Weapons-related Research Nancy Connell, PhD Senior Scholar and

Feed Forward Neural Network with Genetic Algorithm Problem definition and training data

Sambuz

Useful Links

Newsletter

Mail Us

Baseline Analyses Using Baseline Analyses Using DBP (2006) & AMP (2008) DBP (2006) & AMP

Carinata Summit 2016 Update on Breeding Efforts in Carinata Growth of Agrisoma & Collaborator