The SAM Input Engine imports a set of Sequence Alignment Map (
.sam) files and combines them into three SAS data sets containing
chromosome, location, and sequence identity of each sample with a reference sequence.
Important: Before running this process, you must navigate to the
C:\Program Files\SASHome\JMPGenomics\13\Genomics\ThirdPartyAnnotation\ folder and create a new folder and name it
NextGen. You must then download SAMtools version
0.1.12 (for Windows) from
https://sourceforge.net/projects/samtools/files/samtools/0.1.12/ and save the executable files in the newly created
NextGen folder.
Important: The EDF used for importing SAM data must contain a
variable called
SampleName. The values listed in this column identify each sample in the experiment. Providing this name allows the SAM data for each sample, which are typically spread across multiple
.sam files, to be merged into one SAS data set.
The following example uses .sam files for the Y chromosome of two different mice (from the list shown below) that were downloaded from
The 1000 Genomics Project and saved in the
Next-Gen\SAM\GSE18905 folder created in the JMP Genomics
Sample Data folder.
The output data sets generated by this process are listed in a Results window. Refer to the
SAM Input Engine output documentation for detailed descriptions.