The TPM Normalization process (Transcripts Per Kilobase Million) is a normalization method for Count data that takes reads per kilobase (RPK) and adjusts per million scaling factor for each sample to generate the TPM.
The trimmed bam10_ctdata.sas7bdat data set shown below lists sample BAM data.
The second data set is the Experimental Design Data Set (EDDS). This required data set tells how the experiment was performed, providing information about the columns in the input data set. Note that one column in the EDDS must be named
ColumnName and the values contained in this column must exactly match the column names in the input data set.
The edf_bam_sas7bdat EDDS, shown below, corresponds to the
bam10_ctdata.sas7bdat input data set.
Refer to the TPM Normalization output documentation for detailed descriptions of the output of this process.