The Cancer Genome Atlas (TCGA) is a publicly available resource of omics data covering over 20,000 primary cancer and matched normal samples across 33 cancer types. It is a joint effort between the National Cancer Institute and the National Human Genome Research Institute. See the links below for more information:
Below are some papers you can read for a better understanding of TCGA:
See this page for a list of computational tools developed by researchers using TCGA data.
See this page for a user’s guide to the Genomic Data Commons (GDC) data portal, which hosts the TCGA data.
TCGAbiolinks is an R/Bioconductor package for integrative analysis of Genomic Data Commons (GDC) data. It handles both data retrieval and downstream analysis. You can learn more about its aim and install it here.
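As a rough illustration of the retrieval step, the sketch below installs the package through BiocManager and wraps the usual `GDCquery()`, `GDCdownload()`, `GDCprepare()` sequence in a helper. The helper name `fetch_tcga_expression`, the choice of the TCGA-BRCA project, and the specific category/type/workflow values are assumptions made for this example, not recommendations from the package authors; check the package documentation for the values that match your data. The download is wrapped in a function so that nothing is fetched until you call it.

```r
# One-time installation (commented out so the script does not reinstall on every run):
# if (!requireNamespace("BiocManager", quietly = TRUE)) install.packages("BiocManager")
# BiocManager::install("TCGAbiolinks")

# Example query parameters (assumed for illustration): gene expression
# counts for the TCGA breast cancer project.
query_args <- list(
  project       = "TCGA-BRCA",
  data.category = "Transcriptome Profiling",
  data.type     = "Gene Expression Quantification",
  workflow.type = "STAR - Counts"
)

# Hypothetical helper: query the GDC, download the matching files,
# and read them into a single R object.
fetch_tcga_expression <- function(args = query_args) {
  query <- do.call(TCGAbiolinks::GDCquery, args)  # search the GDC
  TCGAbiolinks::GDCdownload(query)                # download the matching files
  TCGAbiolinks::GDCprepare(query)                 # load the files into R
}

# Uncomment to run; this needs network access and can download a lot of data.
# brca_expression <- fetch_tcga_expression()
```

Changing `project` in `query_args` (for example to another TCGA project ID) is usually enough to repeat the same retrieval for a different cancer type.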
After installation, work through this webpage for a fuller introduction to the package. It also links to case studies and workshops that might be helpful.
This vignette is another walkthrough option for learning the TCGA Workflow.
You can also reference these YouTube videos from the creators of the package:
recount2 is a resource that allows access to thousands of RNA-seq samples from TCGA, GTEx, and the Sequence Read Archive. It can be accessed through the recount2 website and the recount Bioconductor package. See below for some helpful resources:
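For access through the Bioconductor package, a minimal sketch might look like the following. It uses `download_study()` and `scale_counts()` from the recount package; the helper name `fetch_recount_gene_counts` and the example study ID `SRP009615` (a small Sequence Read Archive study) are assumptions chosen for illustration, so substitute the project you actually need.

```r
# One-time installation (commented out so the script does not reinstall on every run):
# if (!requireNamespace("BiocManager", quietly = TRUE)) install.packages("BiocManager")
# BiocManager::install("recount")

# Example project ID (assumed for illustration).
example_project <- "SRP009615"

# Hypothetical helper: download the gene-level summary for one study
# and return scaled counts.
fetch_recount_gene_counts <- function(project_id, outdir = project_id) {
  # Downloads rse_gene.Rdata into `outdir`
  recount::download_study(project_id, type = "rse-gene", outdir = outdir)

  # The file defines an object called `rse_gene`; load it into a private
  # environment so it does not overwrite anything in the workspace.
  env <- new.env()
  load(file.path(outdir, "rse_gene.Rdata"), envir = env)

  # recount2 stores base-pair coverage counts; scale them before analysis
  recount::scale_counts(env$rse_gene)
}

# Uncomment to run; this needs network access.
# rse <- fetch_recount_gene_counts(example_project)
```

The returned object is a RangedSummarizedExperiment, so the usual accessors such as `assay()` and `colData()` from the SummarizedExperiment package apply.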