The BAM Input Engine imports a set of Compressed Binary Sequence Alignment Map (
.bam) files and combines them into three SAS data sets containing
chromosome, location, and sequence identity of each sample with a reference sequence. The total number of reads are reduced through binning, and read count is calculated.
Important: Before running this process, you must navigate to the
C:\Program Files\SASHome\JMPGenomics\13\Genomics\ThirdPartyAnnotation\ folder and create a new folder and name it
NextGen. You must then download SAMtools version
0.1.12 (for Windows) from
https://sourceforge.net/projects/samtools/files/samtools/0.1.12/ and save the executable files in the newly created
NextGen folder.
Important: The EDF used for importing BAM data must contain a
variable called
SampleName. The values listed in this column identify each sample in the experiment. Providing this name allows the BAM data for each sample, which are typically spread across multiple
.bam files, to be merged into one SAS data set.
The output data sets generated by this process are listed in a Results window. Refer to the
BAM Input Engine output documentation for detailed descriptions.