Evaluating the specificity of genotypic inference with TrueAllele Casework software
Jay A. Caponera, "Evaluating the specificity of genotypic inference with TrueAllele® Casework software", American Academy of Forensic Sciences 66th Annual Meeting, Seattle, WA, 20-Feb-2014.
Interpretation of low-template and complex mixed DNA profiles with the binary inclusion/exclusion approach often reduces the statistical weight that can be applied to probative evidence items, or precludes it altogether. Quantitative modeling of DNA data offers an alternative strategy that can result in more informative profiles. This study uses probabilistic genotyping software to objectively infer individual genotypes from both low-template and mixed samples with up to four contributors. An approximate limit of detection with the software was observed using DNA inputs of 15.6 pg for single-source samples, and maximum separation between known donor and non-donor genotypes was achieved with as little as 62.5 pg. Average computer-inferred genotype specificity between donor and non-donor profiles was over 13 log units for two-person mixtures, 5 log units for three-person mixtures, and 4 log units for four-person mixtures. Results from this study show that probabilistic genotyping match statistics were both reproducible and specific to all known donor profiles.
The forensic literature has increasingly recommended the use of probabilistic genotyping, including most recently a strong encouragement from the DNA Commission of the International Society for Forensic Genetics (ISFG) to adopt likelihood ratio-based approaches that account for drop-in and drop-out when interpreting mixed template samples.
TrueAllele Casework (Cybergenetics) is a fully continuous Bayesian method that uses iterative Markov chain Monte Carlo (MCMC) sampling to infer genotypes from evidentiary profiles and compute DNA match statistics, and it can readily accommodate drop-in and drop-out. By preserving more identification information, the computer can also infer genotypes with greater specificity, ultimately resulting in a high degree of separation between known donor and non-donor likelihood ratios. The high genotype specificity observed with this approach can then be translated into simplified DNA match reporting based on likelihood ratio calculations.
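To make the link between an inferred genotype and a match statistic concrete, the following is a minimal sketch rather than the software's actual implementation. It assumes that the reference genotype is known with certainty, that the population prior follows Hardy-Weinberg proportions, and that the likelihood ratio at a locus can be written as the posterior probability of the reference genotype divided by its prior probability. The function names, allele labels, frequencies, and posterior values are all hypothetical.

```python
import math

def hardy_weinberg_prior(genotype, allele_freqs):
    """Population probability of an unordered allele pair (assumes Hardy-Weinberg)."""
    a, b = genotype
    if a == b:
        return allele_freqs[a] ** 2
    return 2 * allele_freqs[a] * allele_freqs[b]

def locus_log_lr(posterior, allele_freqs, reference):
    """log10 of (posterior probability / prior probability) at the reference genotype.

    `posterior` maps genotypes (allele tuples in a fixed, sorted order) to the
    probability inferred from the evidence; genotypes not listed count as 0.
    """
    q = posterior.get(reference, 0.0)
    if q == 0.0:
        return float("-inf")  # reference genotype has no posterior support
    return math.log10(q / hardy_weinberg_prior(reference, allele_freqs))

# Hypothetical single-locus data, for illustration only.
allele_freqs = {"12": 0.1, "14": 0.2, "15": 0.7}
posterior = {("12", "14"): 0.8, ("12", "15"): 0.15, ("14", "15"): 0.05}

donor_log_lr = locus_log_lr(posterior, allele_freqs, ("12", "14"))
non_donor_log_lr = locus_log_lr(posterior, allele_freqs, ("14", "15"))
print(f"donor log(LR) = {donor_log_lr:.2f}, non-donor log(LR) = {non_donor_log_lr:.2f}")
```

In this toy example the donor genotype gains probability relative to the population (positive log(LR)) while the non-donor genotype loses it (negative log(LR)). Under an assumption of independent loci, per-locus log(LR) values would be summed to give a profile-wide statistic.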
Uncertainty exists in virtually all fields of science. In forensic STR analysis, this uncertainty may take the form of partially recovered genotypes, complex mixture profiles, or an inability to accurately provide weight of evidence. All currently used threshold-based methods attempt to address uncertainty by either discarding or altering observed DNA data, resulting in a loss of valuable genetic information with potential costs to public safety. By modeling all observed peak height variation with MCMC, computer-based genotype inference can overcome stochastic effects and produce more scientifically rigorous match results. The validation data shown here demonstrate how likelihood ratio calculations based on quantitative peak height information may be used to measure the extent of separation between individual known donor and non-donor genotypes. Results indicate that TrueAllele Casework is highly specific and can reproducibly discriminate between matching and non-matching reference profiles.
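The separation between donor and non-donor genotypes described above can be summarized with a few simple statistics over log(LR) values. The sketch below is illustrative only: the function name, the choice of summary statistics, and the input values are assumptions, not figures or methods from the validation study.

```python
from statistics import mean

def summarize_separation(donor_log_lrs, non_donor_log_lrs):
    """Summarize how far donor log(LR) values sit from non-donor values."""
    return {
        "mean_donor": mean(donor_log_lrs),
        "mean_non_donor": mean(non_donor_log_lrs),
        # Distance between the centers of the two distributions, in log units.
        "mean_separation": mean(donor_log_lrs) - mean(non_donor_log_lrs),
        # Worst-case gap: weakest donor match versus strongest non-donor match.
        "min_gap": min(donor_log_lrs) - max(non_donor_log_lrs),
    }

# Hypothetical log10(LR) values, for illustration only.
donor_values = [12.0, 9.5, 14.5]
non_donor_values = [-6.0, -3.5, -5.5]

summary = summarize_separation(donor_values, non_donor_values)
for name, value in summary.items():
    print(f"{name}: {value:.1f}")
```

A positive `min_gap` means that no non-donor comparison scored as high as the weakest donor comparison in the sample, which is one way to read "complete separation" between the two groups.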
- The fully continuous approach to probabilistic genotyping can preserve more information than current inclusion/exclusion methods, resulting in highly specific genotype inference.
- Single-source data suggest a donor limit of detection of approximately 15 pg input DNA, although clear separation between donor and non-donor log(LR) values may be obtained below that amount.
- Specificity decreases as the number of contributors and mixture complexity increase. However, an average separation of 4 log units between donor and non-donor LR values was still observed across the four-person mixture data for unrelated individuals.
- The high genotype specificity obtained in validation comparisons allows an objective, standardized approach to DNA match reporting based on log(LR) values as shown in the schematic below.