Last update: 20220131 Last update: 20241205
Table of contents
Interpretation of variants by ACMG standards and guidelines
Classification criteria
Extensive annotation is applied during our genomics analysis. Interpretation of genetic determinants of disease is based on many evidence sources. One important source of interpretation comes from the Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology, Richards et al. 1. See also Li et al., 2017 2 and Riggs et al., 2020 3. The following tables are provided as they appear in the initial steps of our filtering protocol for the addition of ACMG-standardised labels to candidate causal variants.
For reference, alternative public implementations of ACMG guidelines can be found in Li et al., 2017 4 and Xavier et al., 2019 5; please note these tools have not implemented here nor is any assertion of their quality offered. Examples of effective variant filtering and expected candidate variant yield in studies of rare human disease are provided by Pedersen et al., 2021 6.
Main criteria for classifications
Evidence type | label | Evidence | Manual adjustment | ACGM label | Caveat checks | Criteria |
---|---|---|---|---|---|---|
pathogenicity | VS1 | very_strong | PVS1 | 4 | null variant (nonsense, frameshift, canonical +- 2 splice sites, initiation codon, single or multiexon deletion) in a gene where LOF is a known mechanism of disease | |
pathogenicity | S1 | strong | PS1 | 1 | Same amino acid change as a previously established pathogenic variant regardless of nucleotide change | |
pathogenicity | S2 | strong | manual | PS2 | 1 | De novo (both maternity and paternity confirmed) in a patient with the disease and no family history |
pathogenicity | S3 | strong | manual | PS3 | 1 | Well-established in vitro or in vivo functional studies supportive of a damaging effect on the gene or gene product |
pathogenicity | S4 | strong | PS4 | 2 | The prevalence of the variant in affected individuals is significantly increased compared with the prevalence in controls | |
pathogenicity | S5 | other | PS5 | The user has additional (value) strong pathogenic evidence | ||
pathogenicity | M1 | moderate | PM1 | Located in a mutational hot spot and/or critical and well-established functional domain (e.g., active site of an enzyme) without benign variation | ||
pathogenicity | M2 | moderate | PM2 | 1 | Absent from controls (or at extremely low frequency if recessive) in Exome Sequencing Project, 1000 Genomes Project, or Exome Aggregation Consortium | |
pathogenicity | M3 | moderate | manual | PM3 | 1 | For recessive disorders, detected in trans with a pathogenic variant |
pathogenicity | M4 | moderate | PM4 | Protein length changes as a result of in-frame deletions/insertions in a nonrepeat region or stop-loss variants | ||
pathogenicity | M5 | moderate | PM5 | 1 | Novel missense change at an amino acid residue where a different missense change determined to be pathogenic has been seen before | |
pathogenicity | M6 | moderate | PM6 | Assumed de novo, but without confirmation of paternity and maternity | ||
pathogenicity | M7 | moderate | other | PM7 | The user has additional (value) moderate pathogenic evidence | |
pathogenicity | P1 | supporting | manual | PP1 | Cosegregation with disease in multiple affected family members in a gene definitively known to cause the disease | |
pathogenicity | P2 | supporting | PP2 | Missense variant in a gene that has a low rate of benign missense variation and in which missense variants are a common mechanism of disease | ||
pathogenicity | P3 | supporting | PP3 | 1 | Multiple lines of computational evidence support a deleterious effect on the gene or gene product (conservation, evolutionary, splicing impact, etc.) | |
pathogenicity | P4 | supporting | manual | PP4 | Patient’s phenotype or family history is highly specific for a disease with a single genetic etiology | |
pathogenicity | P5 | supporting | PP5 | Reputable source recently reports variant as pathogenic, but the evidence is not available to the laboratory to perform an independent evaluation | ||
pathogenicity | P6 | supporting | other | PP6 | The user has additional (value) supporting pathogenic evidence | |
benign | A1 | stand_alone | BA1 | Allele frequency is >5% in Exome Sequencing Project, 1000 Genomes Project, or Exome Aggregation Consortium | ||
benign | S1 | strong | BS1 | Allele frequency is greater than expected for disorder | ||
benign | S2 | strong | BS2 | Observed in a healthy adult individual for a recessive (homozygous), dominant (heterozygous), or X-linked (hemizygous) disorder, with full penetrance expected at an early age | ||
benign | S3 | strong | manual | BS3 | Well-established in vitro or in vivo functional studies show no damaging effect on protein function or splicing | |
benign | S4 | strong | manual | BS4 | 1 | Lack of segregation in affected members of a family |
benign | S5 | strong | other | BS5 | The user has additional (value) strong benign evidence | |
benign | P1 | supporting | BP1 | Missense variant in a gene for which primarily truncating variants are known to cause disease | ||
benign | P2 | supporting | manual | BP2 | Observed in trans with a pathogenic variant for a fully penetrant dominant gene/disorder or observed in cis with a pathogenic variant in any inheritance pattern | |
benign | P3 | supporting | BP3 | In-frame deletions/insertions in a repetitive region without a known function | ||
benign | P4 | supporting | BP4 | 1 | Multiple lines of computational evidence suggest no impact on gene or gene product (conservation, evolutionary, splicing impact, etc.) | |
benign | P5 | supporting | manual | BP5 | Variant found in a case with an alternate molecular basis for disease | |
benign | P6 | supporting | BP6 | Reputable source recently reports variant as benign, but the evidence is not available to the laboratory to perform an independent evaluation | ||
benign | P7 | supporting | BP7 | A synonymous (silent) variant for which splicing prediction algorithms predict no impact to the splice consensus sequence nor the creation of a new splice site AND the nucleotide is not highly conserved | ||
benign | P8 | supporting | other | BP8 | The user has additional (value) supporting benign evidence |
Caveats implementing filters
Implementing the guidelines for interpretation of annotation requires multiple programmatic steps. The number of individual caveat checks indicate the number of bioinformatic filter functions used. Unnumbered caveat checks indicate that only a single filter function is required during reference to annotation databases. However, each function depends on reference to either one or several evidence source databases (approximately 150 sources) which are not shown here.
Evidence type | label | Evidence | Manual adjustment | ACGM label | Caveat number | Caveat |
---|---|---|---|---|---|---|
pathogenicity | VS1 | very_strong | PVS1 | 1 | LoF not disease causing for gene | |
pathogenicity | VS1 | very_strong | PVS1 | 2 | LoF at 3prime end (loftee) | |
pathogenicity | VS1 | very_strong | PVS1 | 3 | exon skipping splce that leave functional protein | |
pathogenicity | VS1 | very_strong | PVS1 | 4 | multiple transcript check | |
pathogenicity | S1 | strong | PS1 | 1 | Assess for splicing vs amino acid change | |
pathogenicity | S2 | strong | manual | PS2 | 1 | “Do not assume maternity or paternity i.e. egg donor, surrogate” |
pathogenicity | S3 | strong | manual | PS3 | 1 | Quality of functional evidence |
pathogenicity | S4 | strong | PS4 | 1 | “Assess RR, OR, CI for case/control evidence” | |
pathogenicity | S4 | strong | PS4 | 2 | Rare variant in multiple indipendent cases can guide | |
pathogenicity | S5 | other | PS5 | |||
pathogenicity | M1 | moderate | PM1 | |||
pathogenicity | M2 | moderate | PM2 | 1 | Indels in population reference may not match your protocol | |
pathogenicity | M3 | moderate | manual | PM3 | 1 | Pedigree sequencing is ideal. Phasing (GATK) may guide. Ldlink may guide |
pathogenicity | M4 | moderate | PM4 | |||
pathogenicity | M5 | moderate | PM5 | 1 | Assess for splicing vs amino acid change | |
pathogenicity | M6 | moderate | PM6 | |||
pathogenicity | M7 | moderate | other | PM7 | ||
pathogenicity | P1 | supporting | manual | PP1 | ||
pathogenicity | P2 | supporting | PP2 | |||
pathogenicity | P3 | supporting | PP3 | 1 | “Prediction methods may have used the same data sources, do not assume each as independent evidence” | |
pathogenicity | P4 | supporting | manual | PP4 | ||
pathogenicity | P5 | supporting | PP5 | |||
pathogenicity | P6 | supporting | other | PP6 | ||
benign | A1 | stand_alone | BA1 | |||
benign | S1 | strong | BS1 | |||
benign | S2 | strong | BS2 | |||
benign | S3 | strong | manual | BS3 | ||
benign | S4 | strong | manual | BS4 | 1 | Question the accuracy of segregation. Presence of phenocopies can mimic lack of segregation. Additional pathogenic variants may produce autosomal dominant pattern. |
benign | S5 | strong | other | BS5 | ||
benign | P1 | supporting | BP1 | |||
benign | P2 | supporting | manual | BP2 | ||
benign | P3 | supporting | BP3 | |||
benign | P4 | supporting | BP4 | 1 | “Prediction methods may have used the same data sources, do not assume each as independent evidence” | |
benign | P5 | supporting | manual | BP5 | ||
benign | P6 | supporting | BP6 | |||
benign | P7 | supporting | BP7 | |||
benign | P8 | supporting | other | BP8 |
Scoring point system
Table 3 from Tavtigian et al. 2020 table 2 7: Point values for ACMG/AMP strength of evidence categories
Evidence | Point scale | Pathogenic | Benign |
---|---|---|---|
Indeterminate | 0 | 0 | 0a |
Supporting | 1 | 1 | -1 |
Moderate | 2 | 2 | -2b |
Strong | 4 | 4 | -4 |
Very strong | 8 | 8 | -8b |
a Note is made that Richards et al. did not specifically recognize indeterminate evidence. Nonetheless, if one thinks of the odds in favor of pathogenicity as a continuous variable, there exists a range that falls between Supporting Benign and Supporting Pathogenic. This is Indeterminate.
b Note is also made that Richards et al. did not specify benign evidence at the moderate or very strong levels. Nevertheless, the point system would readily support the addition of such criteria.
Table 4. from Tavtigian et al. 2020 table 2 7: Point-based variant classification categories
Category | Point ranges | Notes |
---|---|---|
Pathogenic | ≥10 | |
Likely Pathogenic | 6 to 9a | |
Uncertain | 0 to 5 | |
Likely Benign | −1 to −6a | |
Benign | ≤ −7 |
a Operationally, the prior probability should be understood to be infinitesimally >0.10. This has two effects. First, it makes the posterior probability of the American College of Medical Genetics (ACMG) Likely Pathogenic combining rules infinitesimally greater than 0.90, so that the Likely Pathogenic rules work properly. A specific value of 0.102 would have the added benefit that seven points would meet the IARC (International Agency for Research on Cancer) Likely Pathogenic threshold of 0.95. Second, it enforces a requirement for some evidence of benign effect for sequence variants to be classified as Likely Benign. One could also argue that the point threshold for Likely Benign should really be −2. This would match the ACMG rule “Likely Benign (ii)” rather than the simple numerical requirement that the posterior probability be <0.10.
For reference, alternative public implementations of ACMG guidelines can be found in Li & Wang, 2017 and Xavier et al., 2019; please note these tools have not been implemented here nor is any assertion of their quality offered. Examples of effective variant filtering and expected candidate variant yield in studies of rare human disease are provided by Pedersen et al., 2021.
We use the evaluation of in silico predictors from Varsome. However, this paper discusses it also Wilcox et al., 2022 8.
ACMGuru code example
The code from ACMGuru runs these runs using a range of functions and then performs a final tally. Some excerts from the code are shown here:
# acmg_filters ----
# PVS1 ----
# PVS1 are null variants where IMPACT=="HIGH" and inheritance match,
# in gene where LoF cause disease.
df$ACMG_PVS1 <- NA
df <- df %>% dplyr::select(ACMG_PVS1, everything())
df$ACMG_PVS1 <- ifelse(df$IMPACT == "HIGH" &
df$genotype == 2, "PVS1", NA) # homozygous
df$ACMG_PVS1 <- ifelse(df$IMPACT == "HIGH" &
df$Inheritance == "AD", "PVS1", df$ACMG_PVS1) # dominant
# All functions for classification ...
# acmg tally ----
# List of all ACMG labels
# acmg_labels <- c("ACMG_PVS1", "ACMG_PS1", "ACMG_PS2", "ACMG_PS3",
"ACMG_PS4", "ACMG_PS5", "ACMG_PM1", "ACMG_PM2",
"ACMG_PM3", "ACMG_PM4", "ACMG_PM5", "ACMG_PM6",
"ACMG_PM7", "ACMG_PP1", "ACMG_PP2", "ACMG_PP3",
"ACMG_PP4")
# Transform 'Evidence_type' to 'P' for pathogenic and 'B' for benign
df_acmg$code_prefix <- ifelse(df_acmg$Evidence_type == "pathogenicity", "P", "B")
# Create the ACMG code by combining the new prefix and the label, prepending 'ACMG_'
df_acmg$ACMG_code <- paste0("ACMG_", df_acmg$code_prefix, df_acmg$label)
acmg_labels <- df_acmg$ACMG_code
print(acmg_labels)
# df_acmg$ACMG_label
names(df) |> head(30) |> as.character()
# Check if each ACMG column exists, if not create it and fill with NA
for (acmg_label in acmg_labels) {
if (!acmg_label %in% names(df)) {
print("missing label")
df[[acmg_label]] <- NA
}
}
# Then use coalesce to find the first non-NA ACMG label
df$ACMG_highest <- dplyr::coalesce(!!!df[acmg_labels])
df <- df %>% dplyr::select(ACMG_highest, everything())
# Count the number of non-NA values across the columns
df$ACMG_count <- rowSums(!is.na(df[, acmg_labels ]))
df <- df %>% dplyr::select(ACMG_count, everything())
# df$ACMG_count[df$ACMG_count == 0] <- NA
# ACMG Verdict----
# Define scores for Pathogenic criteria
pathogenic_scores <- c(
"PVS1" = 8,
setNames(rep(4, 5), paste0("PS", 1:5)),
setNames(rep(2, 7), paste0("PM", 1:7)),
"PP3" = 1
)
# Define scores for Benign criteria
benign_scores <- c(
"BA1" = -8,
setNames(rep(-4, 5), paste0("BS", 1:5)),
setNames(rep(-1, 8), paste0("BP", 1:8))
)
# Combine both scoring systems into one vector
acmg_scores <- c(pathogenic_scores, benign_scores)
# Print the complete ACMG scoring system
print(acmg_scores)
# Create ACMG_score column by looking up ACMG_highest in acmg_scores
df$ACMG_score <- acmg_scores[df$ACMG_highest]
# If there are any ACMG labels that don't have a corresponding score,
# these will be NA. You may want to set these to 0.
df$ACMG_score[is.na(df$ACMG_score)] <- 0
df <- df |> dplyr::select(ACMG_score, everything())
# Total ACMG score ----
# Mutate all ACMG columns
df <- df %>%
mutate_at(acmg_labels, function(x) acmg_scores[x])
# Replace NAs with 0 in ACMG columns only
df[acmg_labels] <- lapply(df[acmg_labels], function(x) ifelse(is.na(x), 0, x))
# Calculate total ACMG score
df$ACMG_total_score <- rowSums(df[acmg_labels])
df <- df |> dplyr::select(ACMG_total_score, everything())
References
Richards, S. et al., 2015. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genetics in Medicine, 17(5), pp.405–423. DOI: 10.1038/gim2015.30. ↩
Li, M.M. et al., 2017. Standards and guidelines for the interpretation and reporting of sequence variants in cancer: a joint consensus recommendation of the Association for Molecular Pathology, American Society of Clinical Oncology, and College of American Pathologists. The Journal of Molecular Diagnostics, 19(1), pp.4–23. DOI: 10.1016/j.jmoldx.2016.10.002. ↩
Riggs, E.R. et al., 2020. Technical standards for the interpretation and reporting of constitutional copy-number variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMGe and the Clinical Genome Resource (ClinGen). Genetics in Medicine, 22(2), pp.245–257. DOI: 10.1038/s41436-019-0686-8. ↩
Li, Q. and Wang, K., 2017. InterVar: clinical interpretation of genetic variants by the 2015 ACMG-AMP guidelines. The American Journal of Human Genetics, 100(2), pp.267–280. DOI: 10.1016/j.ajhg.2017.01.004. ↩
Xavier, A. et al., 2019. TAPES: A tool for assessment and prioritisation in exome studies. PLoS Computational Biology, 15(10), e1007453. DOI: 10.1371/journal.pcbi.1007453. ↩
Pedersen, B.S. et al., 2021. Effective variant filtering and expected candidate variant yield in studies of rare human disease. NPJ Genomic Medicine, 6(1), pp.1–8. DOI: 10.1038/s41525-021-00227-3. ↩
Tavtigian SV, Harrison SM, Boucher KM, Biesecker LG. (2020). Fitting a naturally scaled point system to the ACMG/AMP variant classification guidelines. Human Mutation, 41, 1734–1737. https://doi.org/10.1002/humu.24088 ↩ ↩2
Emma H. Wilcox, Mahdi Sarmady, Bryan Wulf, Matt W. Wright, Heidi L. Rehm, Leslie G. Biesecker, Ahmad N. Abou Tayoun. (2022). Evaluating the impact of in silico predictors on clinical variant classification. Genetics in Medicine, Volume 24, Issue 4, Pages 924-930, ISSN 1098-3600, https://doi.org/10.1016/j.gim.2021.11.018. ↩