Skip to main content Link Menu Expand (external link) Document Search Copy Copied

Last update: 20220131 Last update: 20241205

Table of contents

Interpretation of variants by ACMG standards and guidelines

Classification criteria

Extensive annotation is applied during our genomics analysis. Interpretation of genetic determinants of disease is based on many evidence sources. One important source of interpretation comes from the Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology, Richards et al. 1. See also Li et al., 2017 2 and Riggs et al., 2020 3. The following tables are provided as they appear in the initial steps of our filtering protocol for the addition of ACMG-standardised labels to candidate causal variants.

For reference, alternative public implementations of ACMG guidelines can be found in Li et al., 2017 4 and Xavier et al., 2019 5; please note these tools have not implemented here nor is any assertion of their quality offered. Examples of effective variant filtering and expected candidate variant yield in studies of rare human disease are provided by Pedersen et al., 2021 6.

Main criteria for classifications

Evidence typelabelEvidenceManual adjustmentACGM labelCaveat checksCriteria
pathogenicityVS1very_strong PVS14null variant (nonsense, frameshift, canonical +- 2 splice sites, initiation codon, single or multiexon deletion) in a gene where LOF is a known mechanism of disease
pathogenicityS1strong PS11Same amino acid change as a previously established pathogenic variant regardless of nucleotide change
pathogenicityS2strongmanualPS21De novo (both maternity and paternity confirmed) in a patient with the disease and no family history
pathogenicityS3strongmanualPS31Well-established in vitro or in vivo functional studies supportive of a damaging effect on the gene or gene product
pathogenicityS4strong PS42The prevalence of the variant in affected individuals is significantly increased compared with the prevalence in controls
pathogenicityS5other PS5 The user has additional (value) strong pathogenic evidence
pathogenicityM1moderate PM1 Located in a mutational hot spot and/or critical and well-established functional domain (e.g., active site of an enzyme) without benign variation
pathogenicityM2moderate PM21Absent from controls (or at extremely low frequency if recessive) in Exome Sequencing Project, 1000 Genomes Project, or Exome Aggregation Consortium
pathogenicityM3moderatemanualPM31For recessive disorders, detected in trans with a pathogenic variant
pathogenicityM4moderate PM4 Protein length changes as a result of in-frame deletions/insertions in a nonrepeat region or stop-loss variants
pathogenicityM5moderate PM51Novel missense change at an amino acid residue where a different missense change determined to be pathogenic has been seen before
pathogenicityM6moderate PM6 Assumed de novo, but without confirmation of paternity and maternity
pathogenicityM7moderateotherPM7 The user has additional (value) moderate pathogenic evidence
pathogenicityP1supportingmanualPP1 Cosegregation with disease in multiple affected family members in a gene definitively known to cause the disease
pathogenicityP2supporting PP2 Missense variant in a gene that has a low rate of benign missense variation and in which missense variants are a common mechanism of disease
pathogenicityP3supporting PP31Multiple lines of computational evidence support a deleterious effect on the gene or gene product (conservation, evolutionary, splicing impact, etc.)
pathogenicityP4supportingmanualPP4 Patient’s phenotype or family history is highly specific for a disease with a single genetic etiology
pathogenicityP5supporting PP5 Reputable source recently reports variant as pathogenic, but the evidence is not available to the laboratory to perform an independent evaluation
pathogenicityP6supportingotherPP6 The user has additional (value) supporting pathogenic evidence
benignA1stand_alone BA1 Allele frequency is >5% in Exome Sequencing Project, 1000 Genomes Project, or Exome Aggregation Consortium
benignS1strong BS1 Allele frequency is greater than expected for disorder
benignS2strong BS2 Observed in a healthy adult individual for a recessive (homozygous), dominant (heterozygous), or X-linked (hemizygous) disorder, with full penetrance expected at an early age
benignS3strongmanualBS3 Well-established in vitro or in vivo functional studies show no damaging effect on protein function or splicing
benignS4strongmanualBS41Lack of segregation in affected members of a family
benignS5strongotherBS5 The user has additional (value) strong benign evidence
benignP1supporting BP1 Missense variant in a gene for which primarily truncating variants are known to cause disease
benignP2supportingmanualBP2 Observed in trans with a pathogenic variant for a fully penetrant dominant gene/disorder or observed in cis with a pathogenic variant in any inheritance pattern
benignP3supporting BP3 In-frame deletions/insertions in a repetitive region without a known function
benignP4supporting BP41Multiple lines of computational evidence suggest no impact on gene or gene product (conservation, evolutionary, splicing impact, etc.)
benignP5supportingmanualBP5 Variant found in a case with an alternate molecular basis for disease
benignP6supporting BP6 Reputable source recently reports variant as benign, but the evidence is not available to the laboratory to perform an independent evaluation
benignP7supporting BP7 A synonymous (silent) variant for which splicing prediction algorithms predict no impact to the splice consensus sequence nor the creation of a new splice site AND the nucleotide is not highly conserved
benignP8supportingotherBP8 The user has additional (value) supporting benign evidence

Caveats implementing filters

Implementing the guidelines for interpretation of annotation requires multiple programmatic steps. The number of individual caveat checks indicate the number of bioinformatic filter functions used. Unnumbered caveat checks indicate that only a single filter function is required during reference to annotation databases. However, each function depends on reference to either one or several evidence source databases (approximately 150 sources) which are not shown here.

Evidence typelabelEvidenceManual adjustmentACGM labelCaveat numberCaveat
pathogenicityVS1very_strong PVS11LoF not disease causing for gene
pathogenicityVS1very_strong PVS12LoF at 3prime end (loftee)
pathogenicityVS1very_strong PVS13exon skipping splce that leave functional protein
pathogenicityVS1very_strong PVS14multiple transcript check
pathogenicityS1strong PS11Assess for splicing vs amino acid change
pathogenicityS2strongmanualPS21“Do not assume maternity or paternity i.e. egg donor, surrogate”
pathogenicityS3strongmanualPS31Quality of functional evidence
pathogenicityS4strong PS41“Assess RR, OR, CI for case/control evidence”
pathogenicityS4strong PS42Rare variant in multiple indipendent cases can guide
pathogenicityS5other PS5  
pathogenicityM1moderate PM1  
pathogenicityM2moderate PM21Indels in population reference may not match your protocol
pathogenicityM3moderatemanualPM31Pedigree sequencing is ideal. Phasing (GATK) may guide. Ldlink may guide
pathogenicityM4moderate PM4  
pathogenicityM5moderate PM51Assess for splicing vs amino acid change
pathogenicityM6moderate PM6  
pathogenicityM7moderateotherPM7  
pathogenicityP1supportingmanualPP1  
pathogenicityP2supporting PP2  
pathogenicityP3supporting PP31“Prediction methods may have used the same data sources, do not assume each as independent evidence”
pathogenicityP4supportingmanualPP4  
pathogenicityP5supporting PP5  
pathogenicityP6supportingotherPP6  
benignA1stand_alone BA1  
benignS1strong BS1  
benignS2strong BS2  
benignS3strongmanualBS3  
benignS4strongmanualBS41Question the accuracy of segregation. Presence of phenocopies can mimic lack of segregation. Additional pathogenic variants may produce autosomal dominant pattern.
benignS5strongotherBS5  
benignP1supporting BP1  
benignP2supportingmanualBP2  
benignP3supporting BP3  
benignP4supporting BP41“Prediction methods may have used the same data sources, do not assume each as independent evidence”
benignP5supportingmanualBP5  
benignP6supporting BP6  
benignP7supporting BP7  
benignP8supportingotherBP8  

Scoring point system

Table 3 from Tavtigian et al. 2020 table 2 7: Point values for ACMG/AMP strength of evidence categories

EvidencePoint scalePathogenicBenign
Indeterminate000a
Supporting11-1
Moderate22-2b
Strong44-4
Very strong88-8b

a Note is made that Richards et al. did not specifically recognize indeterminate evidence. Nonetheless, if one thinks of the odds in favor of pathogenicity as a continuous variable, there exists a range that falls between Supporting Benign and Supporting Pathogenic. This is Indeterminate.

b Note is also made that Richards et al. did not specify benign evidence at the moderate or very strong levels. Nevertheless, the point system would readily support the addition of such criteria.

Table 4. from Tavtigian et al. 2020 table 2 7: Point-based variant classification categories

CategoryPoint rangesNotes
Pathogenic≥10 
Likely Pathogenic6 to 9a 
Uncertain0 to 5 
Likely Benign−1 to −6a 
Benign≤ −7 

a Operationally, the prior probability should be understood to be infinitesimally >0.10. This has two effects. First, it makes the posterior probability of the American College of Medical Genetics (ACMG) Likely Pathogenic combining rules infinitesimally greater than 0.90, so that the Likely Pathogenic rules work properly. A specific value of 0.102 would have the added benefit that seven points would meet the IARC (International Agency for Research on Cancer) Likely Pathogenic threshold of 0.95. Second, it enforces a requirement for some evidence of benign effect for sequence variants to be classified as Likely Benign. One could also argue that the point threshold for Likely Benign should really be −2. This would match the ACMG rule “Likely Benign (ii)” rather than the simple numerical requirement that the posterior probability be <0.10.

For reference, alternative public implementations of ACMG guidelines can be found in Li & Wang, 2017 and Xavier et al., 2019; please note these tools have not been implemented here nor is any assertion of their quality offered. Examples of effective variant filtering and expected candidate variant yield in studies of rare human disease are provided by Pedersen et al., 2021.

We use the evaluation of in silico predictors from Varsome. However, this paper discusses it also Wilcox et al., 2022 8.

ACMGuru code example

The code from ACMGuru runs these runs using a range of functions and then performs a final tally. Some excerts from the code are shown here:

# acmg_filters ----
# PVS1 ----
# PVS1 are null variants where IMPACT=="HIGH" and inheritance match, 
# in gene where LoF cause disease.
df$ACMG_PVS1 <- NA
df <- df %>% dplyr::select(ACMG_PVS1, everything())
df$ACMG_PVS1 <- ifelse(df$IMPACT == "HIGH" & 
                        df$genotype == 2, "PVS1", NA) # homozygous
df$ACMG_PVS1 <- ifelse(df$IMPACT == "HIGH" & 
                        df$Inheritance == "AD", "PVS1", df$ACMG_PVS1) # dominant

# All functions for classification ...

# acmg tally  ----
# List of all ACMG labels
# acmg_labels <- c("ACMG_PVS1", "ACMG_PS1", "ACMG_PS2", "ACMG_PS3", 
"ACMG_PS4", "ACMG_PS5", "ACMG_PM1", "ACMG_PM2", 
"ACMG_PM3", "ACMG_PM4", "ACMG_PM5", "ACMG_PM6", 
"ACMG_PM7", "ACMG_PP1", "ACMG_PP2", "ACMG_PP3", 
"ACMG_PP4")

# Transform 'Evidence_type' to 'P' for pathogenic and 'B' for benign
df_acmg$code_prefix <- ifelse(df_acmg$Evidence_type == "pathogenicity", "P", "B")

# Create the ACMG code by combining the new prefix and the label, prepending 'ACMG_'
df_acmg$ACMG_code <- paste0("ACMG_", df_acmg$code_prefix, df_acmg$label)

acmg_labels <- df_acmg$ACMG_code
print(acmg_labels)

# df_acmg$ACMG_label
names(df) |> head(30) |> as.character()

# Check if each ACMG column exists, if not create it and fill with NA
for (acmg_label in acmg_labels) {
  if (!acmg_label %in% names(df)) {
      print("missing label")
          df[[acmg_label]] <- NA
            }
            }

# Then use coalesce to find the first non-NA ACMG label
df$ACMG_highest <- dplyr::coalesce(!!!df[acmg_labels])
df <- df %>% dplyr::select(ACMG_highest, everything())

# Count the number of non-NA values across the columns
df$ACMG_count <- rowSums(!is.na(df[, acmg_labels ]))
df <- df %>% dplyr::select(ACMG_count, everything())
# df$ACMG_count[df$ACMG_count == 0] <- NA

# ACMG Verdict----
# Define scores for Pathogenic criteria
pathogenic_scores <- c(
 "PVS1" = 8,
  setNames(rep(4, 5), paste0("PS", 1:5)),
    setNames(rep(2, 7), paste0("PM", 1:7)),
      "PP3" = 1
      )

      # Define scores for Benign criteria
      benign_scores <- c(
        "BA1" = -8,
          setNames(rep(-4, 5), paste0("BS", 1:5)),
            setNames(rep(-1, 8), paste0("BP", 1:8))
            )

            # Combine both scoring systems into one vector
            acmg_scores <- c(pathogenic_scores, benign_scores)

            # Print the complete ACMG scoring system
            print(acmg_scores)

            # Create ACMG_score column by looking up ACMG_highest in acmg_scores
            df$ACMG_score <- acmg_scores[df$ACMG_highest]

# If there are any ACMG labels that don't have a corresponding score, 
# these will be NA. You may want to set these to 0.
df$ACMG_score[is.na(df$ACMG_score)] <- 0
df <- df |> dplyr::select(ACMG_score, everything())

# Total ACMG score ----

# Mutate all ACMG columns
df <- df %>% 
  mutate_at(acmg_labels, function(x) acmg_scores[x])

# Replace NAs with 0 in ACMG columns only
df[acmg_labels] <- lapply(df[acmg_labels], function(x) ifelse(is.na(x), 0, x))

# Calculate total ACMG score
df$ACMG_total_score <- rowSums(df[acmg_labels])

df <- df |> dplyr::select(ACMG_total_score, everything())

References

  1. Richards, S. et al., 2015. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genetics in Medicine, 17(5), pp.405–423. DOI: 10.1038/gim2015.30

  2. Li, M.M. et al., 2017. Standards and guidelines for the interpretation and reporting of sequence variants in cancer: a joint consensus recommendation of the Association for Molecular Pathology, American Society of Clinical Oncology, and College of American Pathologists. The Journal of Molecular Diagnostics, 19(1), pp.4–23. DOI: 10.1016/j.jmoldx.2016.10.002

  3. Riggs, E.R. et al., 2020. Technical standards for the interpretation and reporting of constitutional copy-number variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMGe and the Clinical Genome Resource (ClinGen). Genetics in Medicine, 22(2), pp.245–257. DOI: 10.1038/s41436-019-0686-8

  4. Li, Q. and Wang, K., 2017. InterVar: clinical interpretation of genetic variants by the 2015 ACMG-AMP guidelines. The American Journal of Human Genetics, 100(2), pp.267–280. DOI: 10.1016/j.ajhg.2017.01.004

  5. Xavier, A. et al., 2019. TAPES: A tool for assessment and prioritisation in exome studies. PLoS Computational Biology, 15(10), e1007453. DOI: 10.1371/journal.pcbi.1007453

  6. Pedersen, B.S. et al., 2021. Effective variant filtering and expected candidate variant yield in studies of rare human disease. NPJ Genomic Medicine, 6(1), pp.1–8. DOI: 10.1038/s41525-021-00227-3

  7. Tavtigian SV, Harrison SM, Boucher KM, Biesecker LG. (2020). Fitting a naturally scaled point system to the ACMG/AMP variant classification guidelines. Human Mutation, 41, 1734–1737. https://doi.org/10.1002/humu.24088  2

  8. Emma H. Wilcox, Mahdi Sarmady, Bryan Wulf, Matt W. Wright, Heidi L. Rehm, Leslie G. Biesecker, Ahmad N. Abou Tayoun. (2022). Evaluating the impact of in silico predictors on clinical variant classification. Genetics in Medicine, Volume 24, Issue 4, Pages 924-930, ISSN 1098-3600, https://doi.org/10.1016/j.gim.2021.11.018