Web-based Supporting Materials for Power/Sample Size Calculations - PDF document

Web-based Supporting Materials for Power/Sample Size Calculations for Assessing Correlates of Risk in Clinical Efficacy Trials Peter B. Gilbert, Holly E. Janes, Yunda Huang Appendix A: Unbiased Biomarker Characterization Accounting for the Sam- pling Design of the CoR Study Consider a 2-phase sampling design (without-replacement) with K participant strata defined by variables measured in all study participants. Let N ∗ 1 k ( N ∗ 0 k ) be the number of vaccine recipient cases (controls) in stratum k at-risk at τ (i.e., with Y τ = 0), and N 1 k ( N 0 k ) be the numbers observed to be at-risk at τ (i.e., with X τ = 0), with z ≡ � K zk and N z ≡ � K N ∗ k =1 N ∗ k =1 N zk for z = 0 , 1. The unstarred quantities are not observed (unless there is no dropout by τ ) but their expectations can easily be estimated by the numbers of randomized subjects observed to be cases and controls multiplied by an estimate of the probability of primary endpoint occurrence by τ (e.g., a Kaplan- Meier estimate). Let n 1 k ( n 0 k ) be the number of vaccine recipient cases (controls) in stratum k observed to be at-risk at τ from whom immune responses are measured at τ . In practice n 1 k is set to include all N 1 k subjects who have available specimens at τ (typically slightly less than N 1 k ). Different approaches may be taken to choose the n 0 k ; � K � K for example one approach achieves an overall case-control ratio r ≡ k =1 n 0 k / k =1 n 1 k with r in the range of 2 to 5, where the n 0 k may all equal r × n 1 k or may upweight certain strata judged to be important. A consideration for the sampling design is that vaccine trials with a correlates objective also have the objective to characterize the immunogenicity of the vaccine. To represent the trial population this analysis should provide unbiased descriptive and inferential analysis for the population of vaccine recipients at-risk at τ (possibly

within strata) not conditioning on case status. Both approaches can straightforwardly be used to provide inference on parameters of interest using all n 1 ≡ � K k =1 n 1 k and � K n 0 ≡ k =1 n 0 k subjects, for example by using inverse probability weighting. However, for graphical analysis, the prospective case-cohort approach straightforwardly provides a correct random sample, whereas the outcome-dependent sampling plan does not. This problem can be remedied by defining each n IS 1 k ≤ n 1 k ( k = 1 , · · · , K ) to be the number of cases included in the immunogenicity characterization analysis selected to maintain a controls:cases ratio of sampled subjects equal to the controls:cases ratio of the entire study cohort, i.e., to satisfy the constraint � n 0 k E [ N 0 k ] = E [ N 1 k ] . (1) n IS � 1 k The estimates � E [ N 0 k ] and � E [ N 1 k ] are determined independently of considerations of the immunogenicity and correlates studies, and any choices of n 0 k and n IS 1 k satisfying (1) will allow unbiased immunogenicity analysis within each covariate subgroup k . While this approach provides unbiased immunogenicity analysis for each stratum k separately, if certain strata k are over-sampled it may provide biased analysis for the overall study population. We can obtain unbiased analysis of the overall population by 1 ≡ f k n IS including the immune response data from all n ∗ 0 controls and from n ∗ 1 k cases, where the constants f k ≤ 1 are selected to achieve each f k n ∗ 1 k being equal to an integer ≡ � K 1 = � E [ N 0 ] / � 0 /n IS E [ N 1 ], where n IS k =1 n IS and n ∗ 1 k . 1 One way to implement the above approach is to first choose the n 0 k ( k = 1 , · · · , K ) to achieve adequate power for the overall correlates analysis, which determines the n IS 1 k by equation (1) (rounding to the nearest integer). Then, if necessary for the overall analysis, add the second fix on top of this fix. This discussion shows that it is straight- forward to conduct an unbiased immunogenicity characterization study regardless of whether the correlates analysis uses prospective case-cohort or retrospective 2-phase sampling. 2

Appendix B: Selected Mathematical Details of Power Calculations Computing Sensitivity, Specificity, False Positives, and False Negatives obs , ρ , P lat 0 , P lat Given inputs σ 2 2 , P 0 , and P 2 , the following steps yield Sens, Spec FP 1 , FP 2 , FN 1 , and FN 2 defined in the main manuscript. = P ( X ∗ > θ 2 ): θ 2 = obs and solve for θ 2 in the equation P lat 1. Set σ 2 e = (1 − ρ ) σ 2 2 √ ρσ obs Φ − 1 (1 − P lat = P ( X ∗ ≤ θ 0 ): 2 ). Similarly solve for θ 0 in the equation P lat 0 θ 0 = √ ρσ obs Φ − 1 ( P lat 0 ). 2. Simulate a large number M of realizations of X ∗ and S ∗ from normal distributions N (0 , ρσ 2 obs ) and N (0 , σ 2 obs ), respectively (e.g., M = 100 , 000). 3. With P 2 ( θ 2 ) ≡ P ( S ∗ > θ 2 ) and P 0 ( θ 0 ) ≡ P ( S ∗ ≤ θ 0 ), determine the cut-points θ 2 and θ 0 that solve equations + FP 2 ∗ P lat + FP 1 ∗ P lat P 2 = Sens ∗ P lat 2 1 0 and + FN 2 ∗ P lat + FN 1 ∗ P lat P 0 = Spec ∗ P lat 0 1 2 in the main manuscript, which are the solutions to P 2 ( θ 2 ) = Sens ( θ 2 ) P lat + FP 2 ( θ 2 ) P lat + FP 1 ( θ 2 ) P lat (1) 2 1 0 P 0 ( θ 0 ) = Spec ( θ 0 ) P lat + FN 2 ( θ 0 ) P lat + FN 1 ( θ 0 ) P lat 2 . (2) 0 1 The solution θ 2 is obtained by estimating ( P 2 ( θ 2 ), Sens ( θ 2 ), FP 1 ( θ 2 ), FP 2 ( θ 2 )) for each of the M realizations and picking the θ 2 = θ that gives the closest solution. Similarly the solution θ 0 is obtained by estimating ( P 0 ( θ 0 ), Spec ( θ 0 ), FN 1 ( θ 0 ), FN 2 ( θ 0 )) for each of the M realizations and picking the θ 0 = θ that gives the closest solution. 3

4. Output the resulting solutions θ 2 and θ 0 together with P 2 ( θ 2 ) , Sens ( θ 2 ) , FP 1 ( θ 2 ) , FP 2 ( θ 2 ) evaluated at the solution θ 2 and P 0 ( θ 0 ) , Spec ( θ 0 ) , FN 1 ( θ 0 ) , FN 2 ( θ 0 ) evaluated at the solution θ 0 . Solutions α lat and β lat for a Continuous Biomarker lowestV E , V E lowest ), β lat in the model of Section 2.4 in the Given fixed ( V E , risk 0 , P lat main article can be expressed as a function of α lat by fixing x = ν . This yields � � � − α lat � β lat = 1 risk lat logit 1 ( ν ) . (3) ν Plugging (3) into the last formula in Section 2.4 for overall V E yields a zero-equation U ( α lat ) = 0 in one unknown variable α lat , � ∞ ν D ( x ; α lat ) φ ( x/ ( √ ρσ obs )) dx U ( α lat ) = (1 − V E ) − P lat lowestV E ∗ risk lat 1 ( ν ) + (4) risk 0 � � 1 ( ν )) x/ν + A ( x ; α lat ) where D ( x ; α lat ) ≡ A ( x ; α lat ) / (1 − risk lat with A ( x ; α lat ) ≡ � � x/ν . Equation (4) can be solved by a one-dimensional exp { α lat ∗ (1 − x/ν ) }∗ risk lat 1 ( ν ) line search. Then, β lat is solved by plugging α lat into equation (3). Appendix C: Estimation of the Noise Level of a Biomarker As described in model S ∗ = X ∗ + e, X ∗ ∼ N (0 , σ 2 e ∼ N (0 , σ 2 tr ) , e ) of the main article (Section 2.3), the continuous-readout biomarker S ∗ is often measured with protection-irrelevant error, denoted by e . Typically, the error is due to two major independent sources of variability: assay-related error, e assay , and trial-related error, e trial . We suppose that e assay ∼ N (0 , σ 2 e trial ∼ N (0 , σ 2 e = e assay + e trial , assay ) , trial ) , and e assay ⊥ e trial . Consequently, σ 2 e = σ 2 assay + σ 2 trial and ρ = 1 − σ 2 assay /σ 2 obs − σ 2 trial /σ 2 obs . We describe how the proportion of variability due to trial-related error, π t = σ 2 trial /σ 2 obs , and the 4

Web-based Supporting Materials for Power/Sample Size Calculations - PDF document

Web-based Supporting Materials for Power/Sample Size Calculations for Assessing Correlates of Risk in Clinical Efficacy Trials Peter B. Gilbert, Holly E. Janes, Yunda Huang Appendix A: Unbiased Biomarker Characterization Accounting for the Sam-

10 slides that always work Simple text boxes (I) Sample text Sample text Sample text

SAMPLE SIZE IN TRIAXIAL LOADS How sample size affects the frictional behavior Photo by H.

Math 1710 Class 24 Examples Power 2-Sample CIs Dr. Allen Back and HTs 2-Sample

Sample Size Power, Sample Size, and the FDR How many observations do we need? Depends on

RISK ASSESSEMENT supporting TEST supporting supporting supporting supporting REAGENTS RISK

Sample 2 Inlet in western (Sunset) Bay 0 Sample 3 Inlet behind Christian Island 1 Sample

Web Services Web Services Towards Web Services Towards Web Services Towards Web Services A

Agglomeration of Ash Particles due to Flue Gas Conditioning (a) Sample CA8S12F1 (b) Sample

Sample Preparation Sample Preparation Sample Size 6 mm x 12 mm x 50 mm 10 mm x 12 mm

(power x 0) == 1 (power x (+ n 1)) == (* (power x n) x) (power x 0) == 1 (power x (+ (* 2 m)

Sunthud Pornprasertmanit W. Joel Schneider Sample Size Estimation Approach Power

Planning Sample Size for Randomized Evaluations Jed Friedman, World Bank Based on slides from

Outline Power Law Size Distributions Distributions Power Law Size Distributions Overview

Lumber Size Lumber Size Control Control Studies Studies Lumber Size Control Lumber Size

Lab 2 discussion Last Time Debugging Its a science use experiments to refine

SEM Photographs of Activated ash samples SEM Micrographs (Original ash samples) (a) Sample S1F1

System Administration HW5 - Mini Private Lab tzute Computer Center, CS, NCTU Architecture

www.enisa.eu.int ENISA update 17 th TF-CSIRT, Amsterdam, 23.01.2006 Marco Thorbruegge

ENISA EUROPEAN NIS AGENDA AND THE PERSPECTIVE Prof. Dr. Reinhard Posch Chairperson ENISA

Exploring the Effect of an Advance Letter on Response and Eligibility Rates: A M Met eta-Ana

ETC Teaching Session Instructor: Swara Ravindranath JWST Proposal Planning Workshop May 15

Indian National System of Innovation and Globalisation: Some Lessons for African National System

CS 171: Introduction to Computer Science II Algorithm Analysis Li Xiong Today Hw1

www.transparency.org.nz Policy Implications of Transparency International Integrity Plus 2013

Sambuz

Useful Links

Newsletter

Mail Us