The infinitesimal with dominance CIRM, February 2020
Recap of the additive model Trait value = genetic + non-genetic � �� � � �� � Z E I am going to ignore the environmental component E . Parental trait values z 1 , z 2 . Genetic component: Z = z 1 + z 2 + R, R ∼ N (0 , V 0 ) 2 Deterministic shared component, normally distributed residual In a large outcrossing population, V 0 = constant, otherwise decreases in proportion to relatedness, for parents i [1] , i [2] , V 0 = σ 2 A (1 − F i [1] ,i [2] ) . � M σ 2 2 χ l ) 2 ] A = l =1 E [ η l ( � M
The infinitesimal model Let 1. P ( t ) denote the pedigree relationships between all individuals up to and including generation t ; 2. Z ( t ) denote the traits of all individuals in the pedigree up to and including the t th generation. The distribution of trait values in generation t , conditional on knowing P ( t ) and Z ( t − 1) , is multivariate normal. THIS IS A STATEMENT ABOUT DISTRIBUTION WITHIN FAMILIES, NOT ACROSS THE WHOLE POPULATION The pedigree can be quite general. It can, for example, capture population structure.
Why does it work in the additive case? For simplicity, consider haploid model. In Amandine’s notation, M � 1 Z j = ¯ η l ( χ j √ z 0 + l ) . M l =1 Simplest imaginable model: suppose η l are i.i.d. with η l = ± 1 with equal probability, ¯ z 0 = 0 . Because many different combinations of alleles can lead to the same allelic state, ‘typically’, knowing the trait value of an individual tells us very little about the allelic state at any given locus.
Put more mathematically �� M � � � l =1 η l = k � η 1 = 1 √ P P [ η 1 = 1 | Z = k/ M ] = �� M � P [ η 1 = 1] P l =1 η l = k �� M � l =2 η l = ( k − 1) P �� M � = P [ η 1 = 1] l =1 η l = k P � � M − 1 1 ( M + k − 2) / 2 2 M − 1 = � P [ η 1 = 1] � 1 M 2 M ( M + k ) / 2 � � 1 + k = P [ η 1 = 1] . M
Toy example continued If scaled allelic effects are i.i.d. Bernoulli, � � � � � k 1 + k � √ η 1 = 1 � Z = = P [ η 1 = 1] . P M M √ For a ‘typical’ trait value, k/M = O (1 / M ) . For extreme values ( k = ± M ), the trait gives complete information about the allelic effect at each locus. For ‘typical’ k , the distribution of η 1 is almost unchanged because there are so many different configurations of allelic effects that correspond to the same trait value.
Dominance? Trait value: M � � � 1 η l ( χ 1 l ) + η l ( χ 2 l ) + φ l ( χ 1 l , χ 2 √ Z = ¯ z 0 + l ) . M l =1 χ 1 χ 2 For an individual in the ancestral population, � l , � l , two independent draws from � ν l . χ 1 χ 2 Conventions: E [ η l ( � χ l )] = 0 , E [ φ l ( � l , � l )] = 0 , for any value x ′ of the allelic state at locus l , E [ φ l ( � χ l , x ′ )] = 0 = E [ φ l ( x ′ , � χ l )] .
Why does this not result in a loss in generality? χ 1 χ 2 Replace φ l ( � l , � l ) by χ 1 χ 2 χ 1 χ 2 χ 1 χ 1 χ 2 χ 2 χ 1 χ 2 φ l ( � l , � l ) − E [ φ l ( � l , � l ) | � l ] − E [ φ l ( � l , � l ) | � l ] + E [ φ l ( � l , � l )] . χ 1 χ 2 Second and third terms (functions of � l , � l ) can be subsumed into η l ( � χ l ) . For any value x ′ of the allelic state at locus l : � E [ φ l ( � χ l , x ′ )] = φ l ( x, x ′ ) � ν l ( dx ) = 0 = E [ φ l ( x ′ , � χ l )] . χ 1 χ 2 With this modification, E [ φ l ( � l , � l )] = 0 . Moreover, absorbing the mean into ¯ z 0 , we may assume that E [ η l ( � χ l )] = 0 .
Decomposing the trait value Write i [1] , i [2] for parents of individual i Decompose trait value into two parts, ◮ component shared by all offspring of Z i [1] and Z i [2] ; ◮ component which is independent for each individual within the family. trait value = shared + residual � �� � � �� � � �� � Z i A i + D i R i + S i A i + R i additive component; D i + S i dominance deviation. Residuals R i , S i determined by Mendelian inheritance
Inheritance Label chromosomes in individual i , 1 and 2 , according to whether inherited from i [1] or i [2] . X i l , Y i l , independent Ber (1 / 2) r.v.’s ◮ X i l = 1 if type at locus l on chromosome 1 in individual i , inherited from chromosome 1 in i [1] ; ◮ Y i l = 1 if type at locus l on chromosome 2 in individual i inherited from chromosome 1 in i [2] . Z i = ¯ z 0 + A i + D i + R i + S i . Mirrors our approach in additive case: A i + D i centre on the l ; residuals R i + S i capture deviation of X i means of X i l , Y i l , Y i l from their means, i.e. the randomness of Mendelian inheritance.
Terms shared by all descendants of i [1] , i [2] M � � � 1 A i = η l ( χ i [1] , 1 ) + η l ( χ i [1] , 2 ) + η l ( χ i [2] , 1 ) + η l ( χ i [2] , 2 √ ) l l l l 2 M l =1 and � M � 1 D i = φ l ( χ i [1] , 1 , χ i [2] , 1 ) + φ l ( χ i [1] , 1 , χ i [2] , 2 √ ) l l l l 4 M l =1 � + φ l ( χ i [1] , 2 , χ i [2] , 1 ) + φ l ( χ i [1] , 2 , χ i [2] , 2 ) . l l l l Even if we condition on Z i [1] , Z i [2] , with nontrivial dominance, these terms are random
Residuals �� M � � � 1 � 1 l − 1 η l ( χ i [1] , 1 η l ( χ i [1] , 2 R i X i 2 − X i √ = ) + ) l l l 2 M l =1 � � � � 1 � Y i − 1 η l ( χ i [2] , 1 η l ( χ i [2] , 2 + ) 2 − Y i ) , l l 2 �� M � � 1 l − 1 φ l ( χ i [1] , 1 , χ i [2] , 1 S i X i l Y i = √ ) l l 4 M l =1 � � l ) − 1 φ l ( χ i [1] , 1 , χ i [2] , 2 X i l (1 − Y i + ) l l 4 � � l − 1 φ l ( χ i [1] , 2 , χ i [2] , 1 (1 − X i l ) Y i + ) l l 4 � � � l ) − 1 φ l ( χ i [1] , 2 , χ i [2] , 2 (1 − X i l )(1 − Y i + ) . l l 4
When will everything be well defined? By construcion, E [ R i + S i ] = 0 , and E [ η l ( � ⇒ E [ A i ] = 0 . χ l )] = 0 = � M � 1 D i = φ l ( χ i [1] , 1 , χ i [2] , 1 ) + φ l ( χ i [1] , 1 , χ i [2] , 2 √ ) l l l l 4 M l =1 � + φ l ( χ i [1] , 2 , χ i [2] , 1 ) + φ l ( χ i [1] , 2 , χ i [2] , 2 ) . l l l l χ 1 χ 2 E [ φ l ( � l , � l )] = 0 , but what if parents are related? Define inbreeding depression M � 1 √ ι = E [ φ l ( � χ l , � χ l )] M l =1 E [ D i ] = ιF i [1] ,i [2] . Need this to be finite
. . . and then it gets really messy To calculate the variance of D i , we need to know quantities like �� φ l ( χ i [1] , 1 , χ i [2] , 1 ) + φ l ( χ i [1] , 1 , χ i [2] , 2 E ) l l l l � 2 � + φ l ( χ i [1] , 2 , χ i [2] , 1 ) + φ l ( χ i [1] , 2 , χ i [2] , 2 ) l l l l and this will depend upon 4 -way identities.
Identities required in expressions for variances = P Parent 1 F 1122 Parent 2 ~ F = P 1122 Gene 1 Gene 2 1 = F P or 2 112 1 = P F or 2 122 ~ 1 = P F or 2 1212
QG coefficients required to describe variance � M σ 2 2 χ l ) 2 ] Additive variance A = l =1 E [ η l ( � M � M 1 σ 2 χ 1 χ 2 l ) 2 ] Dominance variance D = l =1 E [ φ l ( � l , � M � M 1 Inbreeding depression ι = l =1 E [ φ l ( � χ l , � χ l )] √ M � M ι ∗ = 1 χ l )] 2 Sum of squared locus-specific l =1 E [ φ l ( � χ l , � M inbreeding depressions � M � 1 σ 2 χ l ) 2 ] Variance of dominance effects DI = E [ φ l ( � χ l , � χ l )] 2 � M l =1 in inbred individuals − E [ φ l ( � χ l , � � M 2 Covariance of additive and σ ADI = l =1 E [ η l ( � χ l ) φ l ( � χ l , � χ l )] M dominance effects in inbred individuals
But it can be done . . . And, since the actual trait value depends only on the two alleles carried at each locus, Var( Z i ) only depends on pairwise identities: Var( Z i ) = σ 2 A (1 + F ii ) + σ 2 D (1 − F ii ) + ( σ 2 DI + ι ∗ ) F ii + 2 F ii σ ADI − F 2 ii ι ∗ . (We have assumed that the ancestral population is in linkage equilibrium, or there is another term.)
Conditioning on trait values of parents M � � � 1 η l ( χ 1 l ) + η l ( χ 2 l ) + φ l ( χ 1 l , χ 2 √ Z = ¯ z 0 + l ) . M l =1 Suppose η l , φ l uniformly bounded, and inbreeding depression � M 1 ι = l =1 E [ φ l ( � χ l , � χ l )] < ∞ . √ M Before conditioning, normality from a CLT. Knowing the trait values of the parents, tells us little about allelic states at any pair of loci. Mendelian inheritance independent at unlinked loci, guarantees normality residuals after conditioning much as in additive case. For A i + D i have to work harder; use a variant of Stein’s method of exchangeable pairs.
Main result ◮ Trait values across pedigree well approximated by multivariate normal; ◮ Trait can be decomposed into two (normally distributed) parts, ( A i + D i ) (shared by all individuals in family) and ( R i + S i ) (picked independently for each individual in the family); ◮ Conditioning on the parental trait values does not affect the variance of the trait values among offspring, and it shifts mean trait values in a predictable way. The effect of conditioning can be calculated in the usual way: � x A � �� µ A � � Σ AA �� Σ AB ∼ N , . x B µ B Σ BA Σ BB Then � � µ A + Σ AB Σ − 1 BB ( x B − µ B ) , Σ AA − Σ AB Σ − 1 x A | x B ∼ N BB Σ BA .
Recommend
More recommend