Chapter 1 Load data into R environment
1.1 Organization of built-in data
The built-in datasets are constructed according to the following specifications:
① Each dataset is a SummarizedExperiment object built with the SummarizedExperiment() function from the SummarizedExperiment package.
② The assays of the SummarizedExperiment object are stored as a SimpleList containing an expression matrix, where rows represent genes and columns represent samples.
③ The colData of the SummarizedExperiment object is a table (an S4 DFrame object) whose row names must be the same as the column names of the expression matrix. The columns of the table hold the clinical information of the patients. The detailed information about the column names is presented below.
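To make this layout concrete, here is a minimal sketch that builds a toy object with the same structure. The gene names, sample names, expression values, and the assay name `expr` are made up for illustration and are not taken from the built-in data.

```r
library(SummarizedExperiment)

# Toy expression matrix: 3 genes (rows) x 4 samples (columns); values are invented.
expr <- matrix(
  c(5.1, 0.0, 2.3,
    4.8, 1.2, 2.9,
    6.0, 0.4, 3.1,
    5.5, 0.9, 2.2),
  nrow = 3,
  dimnames = list(c("GENE_A", "GENE_B", "GENE_C"), paste0("S", 1:4))
)

# Toy clinical table (DFrame); its row names match the column names of `expr`.
clinical <- S4Vectors::DataFrame(
  Treatment   = c("PRE", "PRE", "POST", "POST"),
  response_NR = c("R", "N", "R", "N"),
  row.names   = colnames(expr)
)

# assays is a SimpleList holding the matrix; colData is the per-sample table.
se <- SummarizedExperiment(
  assays  = S4Vectors::SimpleList(expr = expr),
  colData = clinical
)

# The invariant described in the third specification above.
stopifnot(identical(rownames(colData(se)), colnames(assay(se))))
```

After this, `assay(se)` returns the expression matrix and `colData(se)` returns the clinical table, which is how the built-in objects can be inspected as well.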
1.2 Obtain data from the tigeR.data package (recommended)
devtools::install_github("YuLab-SMU/tigeR.data")
library(tigeR.data) ## load all built-in datasets into the R environment
1.5 Pre-processing of custom data
To analyze custom data, you first need to pre-process it and construct a SummarizedExperiment object. Prepare at least one gene expression matrix (rows for genes, columns for samples) and a data frame containing the corresponding clinical information for the samples. The data frame should include at least the following information:
- sample id
- Treatment (PRE/POST)
- response_NR (R/N)
- overall.survival..days. (for survival analysis)
- vital.status (for survival analysis)
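Before building the object, it can help to check a clinical table against the requirements listed above. The following is a minimal sketch in base R; `check_clinical` and `required_cols` are hypothetical names and not part of any package, and the sketch assumes the sample id is carried by the row names of the data frame.

```r
# Columns named in the list above (sample id is assumed to live in the row names).
required_cols <- c("Treatment", "response_NR",
                   "overall.survival..days.", "vital.status")

# Hypothetical helper: returns a character vector of problems (empty if none).
check_clinical <- function(expr, clinical, cols = required_cols) {
  problems <- character(0)

  # Report any required column that is absent.
  missing_cols <- setdiff(cols, colnames(clinical))
  if (length(missing_cols) > 0) {
    problems <- c(problems,
                  paste("missing columns:", paste(missing_cols, collapse = ", ")))
  }

  # Row names of the clinical table must equal the matrix column names.
  if (!identical(rownames(clinical), colnames(expr))) {
    problems <- c(problems,
                  "row names of clinical table differ from matrix column names")
  }

  # Coded columns may only use the values given in the list above (NA is flagged too).
  if ("Treatment" %in% colnames(clinical) &&
      !all(clinical$Treatment %in% c("PRE", "POST"))) {
    problems <- c(problems, "Treatment has values other than PRE/POST")
  }
  if ("response_NR" %in% colnames(clinical) &&
      !all(clinical$response_NR %in% c("R", "N"))) {
    problems <- c(problems, "response_NR has values other than R/N")
  }

  problems
}

# Toy input: the survival columns are deliberately left out.
expr <- matrix(1:6, nrow = 2,
               dimnames = list(c("GENE_A", "GENE_B"), c("S1", "S2", "S3")))
clinical <- data.frame(
  Treatment   = c("PRE", "POST", "PRE"),
  response_NR = c("R", "N", "N"),
  row.names   = c("S1", "S2", "S3")
)
check_clinical(expr, clinical)
```

If no survival analysis is planned, passing `cols = c("Treatment", "response_NR")` limits the check to the columns that are always needed.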
Here is a brief example: