Notes on the RNAsum Rmd structure for developers
ref_dataset.list
Processed data with following elements:
Element | Description |
---|---|
[[dataset]][["combined_data"]] |
combined read count data (ref + sample datasets)
(combineDatasets output in chunk
load_ref_data ) |
[[dataset]][["sample_annot"]] |
combined data samples annotation (“combineDatasets()” output in chunk “load_ref_data”) |
[[dataset]][["clinical_info"]] |
clinical (survival + treatment) info |
[[dataset]][["combined_data_processed"]] |
transformed, filtered and normalised data (chunks “data_transformation” and “data_normalisation”) |
[[dataset]][["batch_effect_corrected"]] |
transformed, filtered, normalised and batch effect corrected data (chunk “batch_effect_correction”) |
[[dataset]][["pca_combined_data_processed"]] |
PCA results for combined data |
[[dataset]][["pca_batch_effect_corrected"]] |
PCA results for batch-effect corrected data |
[[dataset]][["rle_combined_data_processed"]] |
RLE plot for combined data |
[[dataset]][["rle_batch_effect_corrected"]] |
RLE plot for batch-effect corrected data |
[[dataset]][["data_to_report"]] |
fully combined and processed data for reporting |
[[dataset]][["gene_annot_all"]] |
gene annotation for combined read count data, containing all input genes. Includes SYMBOL, GENEBIOTYPE, ENSEMBL, SEQNAME, GENESEQSTART, and GENESEQEND. ENSEMBL is used for rownames. |
[[dataset]][["gene_annot"]] |
gene annotation for transformed, filtered and normalised data. Includes SYMBOL, GENEBIOTYPE, ENSEMBL, SEQNAME, GENESEQSTART, GENESEQEND. SYMBOL is used for rownames. |
[[dataset]][["expr_mut_cn_data_all"]] |
combined expression, mutation and CN data. |
[[dataset]][["expr_mut_cn_data"]] |
combined expression, mutation and CN data limited to cancer genes that meet user-defined CN values threshold. |
ref_genes.list
Genes of interest with following gene sets:
Element | Description |
---|---|
[["genes_cancer"]] |
list of cancer genes derived from UMCCR Cancer Gene list (https://github.com/vladsaveliev/NGS_Utils/blob/master/ngs_utils/reference_data/key_genes/umccr_cancer_genes.2019-03-20.tsv) and OncoKB portal (http://oncokb.org/#/cancerGenes) |
[["genes_oncokb"]] |
list of cancer genes derived from OncoKB only (although genes present in the UMCCR panel are also flagged) |
[["genes_immune"]] |
list of immune reponse markers provided in the “An Immunogram for the Cancer-Immunity Cycle” paper by Karasaki at al (2017) (https://www.ncbi.nlm.nih.gov/pubmed/28088513) and OmniSeq report (https://www.omniseq.com/) |
[["genes_hrd"]] |
list of hrd genes |
[["pcgr"]] |
list and PCGR annotation of indels |
[["purple"]] |
list and PURPLE annotation of CN altered genes |
[["manta"]] |
list and Manta annotation of SVs |
[["arriba"]] |
list and Arriba annotation of gene fusion events |
[["pizzly"]] |
list and Pizzly annotation of gene fusion events |
[["summary"]] |
summary of above-mentioned gene lists, also used for expression summary tables and plots. |