We analyzed somatic mutations and DNA methylation in 5243 cancers of types for which tobacco smoking confers an elevated risk. Thank you for visiting nature.com. Hung,Catherine J. Kennedy&Anna deFazio, Korea Advanced Institute of Science and Technology, Daejeon, South Korea, Jung Kyoon Choi,Young Seok Ju&Christopher J. Yoon, Electronics and Telecommunications Research Institute, Daejeon, South Korea, Wan Choi,Seung-Hyup Jeon,Hyunghwan Kim&Youngchoon Woo, Institut National du Cancer (INCA), Boulogne-Billancourt, France, Department of Genetics, Informatics Institute, University of Alabama at Birmingham, Birmingham, AL, USA, Division of Medical Oncology, National Cancer Centre, Singapore, Singapore, Medical Oncology, University and Hospital Trust of Verona, Verona, Italy, Department of Pediatrics, University Hospital Schleswig-Holstein, Kiel, Germany, Hepatobiliary/Pancreatic Surgical Oncology Program, University Health Network, Toronto, ON, Canada, Sean Cleary,Ashton A. Connor&Steven Gallinger, School of Biological Sciences, University of Auckland, Auckland, New Zealand, Department of Surgery, University of Melbourne, Parkville, VIC, Australia, The Murdoch Childrens Research Institute, Royal Childrens Hospital, Parkville, VIC, Australia, Walter and Eliza Hall Institute, Parkville, VIC, Australia, Vancouver Prostate Centre, Vancouver, Canada, Colin C. Collins,Nilgun Donmez,Faraz Hach,Salem Malikic,S. Cenk Sahinalp,Iman Sarrafi&Raunak Shrestha, Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada, Ashton A. Connor,Steven Gallinger,Robert C. Grant,Treasa A. McPherson&Iris Selander, Norfolk and Norwich University Hospital NHS Trust, Norwich, UK, Matthew G. Cordes,Catrina C. Fronick&Tom Roques, Victorian Institute of Forensic Medicine, Southbank, VIC, Australia, Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA, Isidro Corts-Ciriano,Jake June-Koo Lee&Peter J. Merlevede, J. et al. In both cases, SigProfiler is a fully functional, free and open-source tool distributed under the permissive 2-Clause BSD License. receives research funds from IBM and Pharmacyclics and is an inventor on patent applications related to MuTect, ABSOLUTE, MutSig, MSMuTect and POLYSOLVER. Nature Somatic ERCC2 mutations are associated with a distinct genomic signature in urothelial tumors. The suffix P on a SignatureAnalyzer signature name indicates a signature extracted from non-hypermutated, non-melanoma tumours. N.J.H. SBS7c and SBS7d (consisting predominantly of T>A at NTT and T>C at NTT, respectively43) may be due to low frequencies of the misincorporation of T and G opposite to thymines in pyrimidine dimers. Saini, N. et al. Article It contributes to multiple types of cancer, but its similarity to SBS5 renders the extent of this contribution uncertain. They can be identified using mutational signatures, due to their unique mutational pattern and specific activity on the genome. 1). The nature of the mutational processes that underlie these signatures in human cancers that are unrelated to smoking, and in healthy mice, is unknown. EBioMedicine 20, 3949 (2017). Hotspots 3). 2). The Repertoire of Mutational Signatures in Human Cancer The topography of mutational processes in breast cancer genomes. Bioinformatics 33, 816 (2017). 23, 228235 (2013). was partially funded by the Molecular Biophysics Training Grant NIH/ NIGMS T32 GM008313 (PI: Venkatesh N. Murthy).We acknowledge the contributions of the many clinical networks across the ICGC and TCGA who provided samples and data to the PCAWG Consortium, and the contributions of the Technical Working Group and the Germline Working Group of the PCAWG Consortium for collation, realignment and harmonized variant calling of the cancer genomes used in this study. Genomic alterations in cancer cells consist of two major categories: (i) small scale alterations that include single base substitutions (SBSs) and small insertion and WebHere, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. The substantial size of our dataset, compared with previous analyses3,4,5,6,7,8,9,10,11,12,13,14,15, enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associatedbut distinctDNA damage, repair and/or replication mechanisms. These mutations are retained in the premalignant clone and cancer, along with additional mutations acquired during tumorigenesis. Our goal was to evaluate SignatureAnalyzer and SigProfiler on realistic synthetic data to identify any potential limitations of these two methods. Excluding cancers associated with exposure to UV light also yielded a signature (DBS11) that was characterized predominantly by CC>TT mutations, but only contributing tens of mutations in many samples from multiple types of cancer (Figs. The release of the Catalogue of Alexandrov, L. B. et al. Blue mutations present in the non-malignant normal cell. Matsuda, T., Kawanishi, M., Yagi, T., Matsui, S. & Takebe, H. Specific tandem GG to TT base substitutions induced by acetaldehyde are due to intra-strand crosslinks between adjacent guanine bases. In step 2, additional signatures unique to hypermutated samples were extracted (again selecting the highest posterior probability over ten runs) while allowing all signatures found in the samples with low mutation burden, to explain some of the spectra of hypermutated samples. Comprehensive topography analysis of mutational signatures encompassing 82,890,857 somatic mutations in 5,120 whole-genome sequenced tumours integrated with 516 tissue-matched topographical features from the ENCODE project allows researchers to explore the interactions between somatic mutational processes and genome architecture Article 17, 99 (2016). However, the many-fold-greater numbers of somatic mutations in whole genomes provide substantially increased power for signature decomposition, enabling the better separation of partially correlated signatures and the extraction of signatures that contribute relatively small numbers of mutations. Resolving mutational signatures in cancer development Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-Way Data Analysis and Blind Source Separation (John Wiley & Sons, 2009). Alexandrov, L. B., Nik-Zainal, S., Wedge, D. C., Campbell, P. J. WebG.G and J.K were partially supported by the National Cancer Institute grants U24CA210999 and U24CA143845. Somatic rearrangements across cancer reveal classes of samples with distinct patterns of DNA breakage and rearrangement-induced hypermutability. Nat. Mutational Signatures Funnell, T. et al. The 23,829sampleswhich include most types of cancer, and comprise the 2,780PCAWG whole genomes2, 1,865additional whole genomes and 19,184exomesyielded 79,793,266somatic SBSs, 814,191doublet-base substitutions (DBSs) and 4,122,233small indels that were analysed for mutational signatures, about 10-fold-more mutations than any previous study of which we are aware (syn11801889)6. Article WebMutational signatures can leave their mark in the form of differential mutational frequencies between the two DNA regions: genic regions and intergenic regions. Nature https://doi.org/10.1038/s41586-019-1907-7 (2020). Commun. 7, 10767 (2016). The repertoire of mutational signatures in human cancer The Repertoire of Mutational Signatures in Human Cancer In step 1 of the two-step extraction process, global signature extraction was performed for the samples with a low mutation burden (n=2,624). was partially supported by the Paul C. Zamecnick, MD, Chair in Oncology at the Massachusetts General Hospital Cancer Center. & Stratton, M. R. Deciphering signatures of mutational processes operative in human cancer. & Fvotte, C. Automatic relevance determination in nonnegative matrix factorization with the -divergence. The International Cancer Genome Consortium and TCGA tumour specific providers provided tumour and matched non-tumour samples, and the PCAWG Technical Working Group, the PCAWG Quality Control Working Group and the PCAWG Novel Somatic Mutation Calling Methods Working Group provided standardized mutation calls for the 2,780PCAWG whole genomes. The Repertoire of Mutational Signatures in Human Cancer. It is widely understood that the choice of the number of latent variables (for our purposes, the number of mutational signatures) is rarely amenable to complete automation. WebMutational signatures were predicted using the deconstructSigs package and illustrated according to the 96 substitution classification defined by the substitution classes (C>A, C>G, C>T, T>A, T>C, and T > G bins). Anyone you share the following link with will be able to read this content: Sorry, a shareable link is not currently available for this article. The repertoire of mutational signatures in human cancer G.G. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer. Oncogene 36, 746755 (2017). Characterizing mutational signatures in human cancer cell lines reveals episodic APOBEC mutagenesis. Any two signatures with sample attributions that significantly correlated with R2>0.3 (SigProfiler) (a) or>0.5 (SignatureAnalyzer) (b) are connected by edges. Mutational signatures as a collection of operative mutational processes. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. In addition to applying SigProfiler and SignatureAnalyzer to cancer data, the tools were also applied to realistic synthetic data with known solutions. PCAWG mutational signatures working group, G. Getz, S.G. Rozen, M.R. WebHere, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. However, the analysis of other classes of mutation has been relatively limited3,11,31,32,33. 2, of which three have previously been reported33,48. Abstract. They can be identified using mutational signatures, due to their unique mutational pattern and specific activity on the genome. Mutational signature analysis identifies MUTYH deficiency in colorectal cancers and adrenocortical carcinomas. Using 84,729,690 somatic mutations from 4,645 whole cancer genome and 19,184 exome sequences & Mustonen, V. EMu: probabilistic inference of mutational processes and their localization in the cancer genome. Frontiers | Integrative Genomic Analyses of 1,145 Patient Samples FIGURE 1.Mutational signatures over time. 2076: and S.G.R. PLoS Comput. ADS An overview of mutational and copy number signatures in 2016), suggesting that NER-deficiency might be associated with other mutational signatures as well. A similar strategy was used for signature attribution: we performed a separate attribution process for low- and hypermutated samples in all COMPOSITE, SBS, DBS and indel signatures. Nature https://doi.org/10.1038/s41586-020-1969-6 (2020). This is illustrated in the figure The repertoire of mutational signatures in human cancer. The repertoire of mutational signatures in human cancer Pan-cancer analysis of whole genomes. 27, 14751486 (2017). Marat Kazanov. CAS The results confirm that use of NMF-based approaches for extracting mutational signatures is not a purely algorithmic process, but also requires consideration of evidence from experimentally determined mutational signatures and the DNA damage and repair literature, prior evidence of biological plausibility and human-guided sensitivity analysis confirming that extractions from different groupings of tumours yield consistent results. Genome rearrangement signatures have also previously been described11,25,28,29,30. Mutational signatures IEEE Signal Process. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate. Author Correction: The repertoire of mutational signatures in human cancer | Nature article Author Correction Open Access Published: 25 January 2023 Author and S.M.A.I. Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature1. 29, 521531 (2019). PCAWG Mutational Signatures Working Group, https://www.synapse.org/#!Synapse:syn11801889, https://cancer.sanger.ac.uk/cosmic/signatures/SBS, https://cancer.sanger.ac.uk/cosmic/signatures/DBS, https://cancer.sanger.ac.uk/cosmic/signatures/ID, https://cancer.sanger.ac.uk/cosmic/signatures_v2, https://cancer.sanger.ac.uk/cosmic/signatures, https://cancer.sanger.ac.uk/cosmic/signatures/SBS/, https://www.mathworks.com/help/stats/robustfit.html, https://dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=login, https://cancer.sanger.ac.uk/cosmic/signatures/, https://www.synapse.org/#!Synapse:syn11726601/wiki/513478, https://dcc.icgc.org/releases/PCAWG/mutational_signatures/, https://www.mathworks.com/matlabcentral/fileexchange/38724-sigprofiler, https://github.com/AlexandrovLab/SigProfilerExtractor, https://github.com/broadinstitute/getzlab-SignatureAnalyzer, https://github.com/steverozen/SynSig/tree/v0.2.0, https://dockstore.org/search?search=pcawg, https://doi.org/10.1038/s41586-022-05600-5, https://doi.org/10.1038/s41586-020-1969-6, https://doi.org/10.1038/s41586-019-1913-9, https://doi.org/10.1038/s41586-019-1907-7. Signatures of mutational processes in human cancer. SignatureAnalyzers reference signature profiles were based on COMPOSITE signatures, consisting of 1,536 types ofstrand-agnostic SBSs in pentanucleotide context, 78 types ofDBSs and 83 types of small indels, for a total of 1,697 mutation types. The repertoire of mutational signatures in human cancer. The Repertoire of Mutational Signatures in Human Cancer For SBSs, the 1,536 SBS COMPOSITE signatures are preferred; for DBSs and indels, the separately extracted signatures are preferred. Results are available at syn12030687 and syn20317940. Finally, there were differences in signature attributions to individual samples. The newly derived signatures showed much improved separation from each other and more-distinct signature profiles, as compared with COSMIC v.2 signatures (see Better separation compared to COSMICv.2 signatures in Supplementary Note 2 for more information). Cell. WebSignatures of mutational processes in human cancer. Carcinogenesis 37, 817826 (2016). Nature https://doi.org/10.1038/s41586-019-1913-9 (2020). In this talk, I will present mutational signatures analyses encompassing 23,517 cancer genomes across 40 distinct types of human cancer revealing more than 60 different signatures of mutational processes. The first column of the plot corresponds to the fraction of mutations assigned by one method (summed across samples and mutation types) that was also assigned by the other method. Mutational signatures are depicted based on the trinucleotide frequency of the whole human genome. Some previous signatures split into multiple constituent signatures: these were numbered as in the previous annotation, but with additional letter suffixes (for example, SBS17 was split into SBS17a and SBS17b). This work introduces a computational framework to identify common and rare mutational signatures and CAS Each signature was allocated an identifier consistent with, and extending, the COSMICv.2 annotation. In addition, to access somatic single nucleotide variants derived from TCGA donors, researchers will also need to obtain dbGaP authorization. Signatures ID8, which is predominantly composed of 5-bp deletions with no or 1bp of microhomology at their boundaries, is probably due to DNA double-strand breaks repaired by a non-homologous-end-joining mechanism. This file contains Supplementary Table 1: Summary of datasets used in this paper. However, with many signatures and/or heterogeneous mutation burdens across samples, the mutations observed in a particular sample can be reconstructed in multiple waysoften with small and/or biologically implausible contributions from many signatures. Some were rare (SBS31, SBS32, SBS35, SBS36, SBS42 and SBS44). In-depth characterization of the cisplatin mutational signature in human cell lines and in esophageal and liver tumors. Google Scholar. Nature. Occasional cases with these signatures additionally showed large numbers of indels attributed to ID7 (syn11738668), and rare samples showed large numbers of ID4, ID11, ID14, ID15, ID16 or ID17 mutations but did not show large numbers of ID1 and ID2 mutations or the SBS signatures associated with deficiency in DNA mismatch repair. Mutational Signatures The likely DNA-damaging agents are known for SBS4 (tobacco mutagens), SBS7a, SBS7b, SBS7c and SBS7d (UV light), SBS22 (aristolochic acid), SBS24 (aflatoxin), SBS25 (chemotherapy), SBS31 and SBS35 (platinum compounds), SBS32 (azathioprine) and SBS42 (haloalkanes). Steven G. Rozen or Michael R. Stratton. Both mechanisms result in more mutations of damaged bases on untranscribed than on transcribed strands of genes. a, b, Comparison of the attributions for corresponding SigProfiler (a) and SignatureAnalyzer (b) signatures. Mutational Signature Requires Python 3.6.0 or higher. SignatureAnalyzer used more signatures to reconstruct the mutational profiles (Extended Data Fig. Some mutational processes generate base substitutions that cluster in small genomic regions. The Repertoire of Mutational Signatures in Human Cancer A detailed description of each of the 11 sets of synthetic data and the results from applying SigProfiler and SignatureAnalyzer are provided in Supplementary Note 2. The aim of this study was to determine the pan-cancer landscape of MUTYH alterations and the relationship between MUTYH mutations and potentially The classifications of each mutation type (SBS, 96classes; DBS, 78classes; and indels, 83classes)are described in the main text. However, many signatures are of unknown cause. In addition, this approach provided information about which signatures are unique to the hypermutated samples, which was later used when attributing signatures to samples. Mutational Signature These datasets include (1) CSV files comprising all catalogues of observed mutational spectra that were used as input to signature extraction (syn11801889), (2) CSV files and plots of signatures extracted by SigProfiler (syn11738306) and SignatureAnalyzer (syn11738307), (3) CSV files with estimates of the numbers of mutations generated by each signature in individual tumours (syn11804065), (4) estimates of the probability that each signature was responsible for each mutational type (for example, CTG>CAG) in individual tumours (syn11804068) and (5) synthetic test input data plus the results of tests of signature extraction (discovery) on the synthetic test data (syn18497223). A compendium of mutational signatures of environmental agents. Nik-Zainal, S. et al. 7, 11383 (2016). Nature. Shiraishi, Y., Tremmel, G., Miyano, S. & Stephens, M. A simple model-based approach to inferring and visualizing cancer mutation signatures. Therefore, each method has developed a separate procedure for estimating the contributions of signatures to each sample (Methods). AML, acute myeloid leukaemia; liposarc, liposarcoma; MDS, myelodysplastic syndrome. Biol. A signature similar to DBS2 contributed hundreds of mutations to liver cancers and tens of mutations to other types ofcancer without evidence of exposure to tobacco smoke. Trends Cell Biol. Correspondence to Nature 578, 94101 (2020). Mutat. This analysis provides a systematic perspective on the repertoire of mutational processes contributing to the SBS7 has now been decomposed into four constituent signatures. The assigned signatures reconstruct well the mutational spectra of the cancer samples (in PCAWG samples, the median cosine similarity was 0.97; 96.3% of samples with cosine similarity >0.90): Fig. Extended Data Fig. Magnified versions of signatures SBS4, DBS2 and ID3 (all of which are associated with tobacco smoking) are shown to illustrate the positions of each mutation subtype on each plot. G.G. was partially supported by the Paul C. Zamecnick, MD, Chair in Oncology at the Massachusetts General Hospital Cancer Center. Webparticular, interrogating cancer genomes has unveiled mutational signatures that associate with specic etiologies and mutagenic mechanisms (Alexandrov et al., 2020). Pattern Anal. Longer indels were classified as occurring at repeats or with overlapping microhomology at deletion boundaries, and according to the size of indel, repeat and microhomology (83 classes; available at https://cancer.sanger.ac.uk/cosmic/signatures/ID). 47, 14021407 (2015). We used these types of evidence and approaches in determining the signature profiles reported here. In principle, our framework can be applied to a wider repertoire of mutation types than the Using 84,729,690 somatic mutations from 4,645 whole cancer genome and 19,184 exome sequences encompassing. Nature 2013; 500:415. WebHere, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium 2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. 7 Mutational signatures extracted from the COMPOSITE feature set consisting ofthe concatenation of SBSs in pentanucleotide context, DBSs and indels. Signatures of mutational processes in human cancer. Signatures with no significant correlation to any other signature above the relevant threshold are not shown. To obtain Internet Explorer). 2020;578(7793):94-101. doi: 10.1038/s41586-020-1943-3 PubMed Google Scholar Crossref X Article CAS PubMed PubMed Central Google Scholar Lee CA, Abd Petljak, M. & Alexandrov, L. B. & Stratton, M. R. Mutational signatures: the patterns of somatic mutations hidden in cancer genomes. 15, e1006799 (2019). ID6 and ID8 were both characterized predominantly by 5-bp deletions (Fig. 11, e1005657 (2015). and M.R.S. A specific mutational signature associated with DNA 8-oxoguanine persistence in MUTYH-defective colorectal cancer. Both deletion patterns may be characteristic of DNA double-strand-break repair by non-homologous-recombination-based end-joining mechanisms andif sothis suggests that at least two distinct forms are operative in human cancer53. Extracted mutational signatures and their activities in individual samples are saved into a set (S). The code used to generate the synthetic data and summarize SignatureAnalyzer and SigProfiler results is open source and freely available as the SynSig package: https://github.com/steverozen/SynSig/tree/v0.2.0 under the GNU General Public License v.3.0. Several base substitution signatures showed transcriptional strand bias, which may be attributable to transcription-coupled nucleotide excision repair acting on DNA damage and/or to an excess of DNA damage on untranscribed strands of genes44. Distinct mutational signatures characterize concurrent loss of polymerase proofreading and mismatch repair. 2, 3). The algorithm continues to step C when step A cannot find any stable signatures. We found that the Pathways in cancer, Herpes simplex virus 1 infection, Human papillomavirus infection were Ng, A. W., et al. Blokzijl, F., Janssen, R., van Boxtel, R. & Cuppen, E. MutationalPatterns: comprehensive genome-wide analysis of mutational processes. SigProfilerExtraction is an extension of a previous framework for the analysis of mutational signatures11,17. Signatures 2, 4, 5, 13 and 16 are extracted from cancers associated with tobacco smoking. We also derived a classification for DBSs (78 classes; available at https://cancer.sanger.ac.uk/cosmic/signatures/DBS). These excluded hypermutated tumours: those with putative polymerase epsilon (POLE) defects or mismatch repair defects (microsatellite instable tumours), skin tumours (which had intense UV-light mutagenesis) and one tumour with temozolomide (TMZ) exposure. Mutational Signatures Nature 578, 94101 (2020). However, with increasing numbers of signatures and differences of multiple orders of magnitude in mutation burdens between some signatures, prior knowledge has helped to avoid biologically implausible results. Haradhvala, N. J. et al. Opin. Alexandrov, L. B. 2). WebHere, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. Mutational Google Scholar. The genome as a record of environmental exposure. 6, 8866 (2015). Therefore, the uniformly processed and highly curated sets of all classes of somatic mutations from the 2,780cancer genomes of the PCAWG project2, combined with most other suitable cancer genomes (accession code syn11801889, available at https://www.synapse.org/#!Synapse:syn11801889), present a notable opportunity to establish the repertoire of mutational signatures and determine their activities across different types of cancer. 4 Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, Singapore, Singapore. The activity of any signature that does not increase the cosine similarity of a sample by>0.01 was removed from the sample (assigned a value of 0). However, deletions of cytosine at cytosinecytosine dinucleotides did not feature strongly in ID13, which may reflect the predominance of thymine compared to cytosine dimers induced by UV light52. COSMIC | Signatures of Mutational Processes in Human Cancer This work was supported by Wellcome grant reference 206194 (M.R.S. ), the European Research Council Consolidator Grant 682398 (N.L.-B. Hypermutation and unique mutational signatures of occupational cholangiocarcinoma in printing workers exposed to haloalkanes.

23 Ludlow Ave, Belle Mead, Nj, Articles T

the repertoire of mutational signatures in human cancer

the repertoire of mutational signatures in human cancer

Scroll to top