The Tassel-GBS Import Engine transcodes output data from the TASSEL 3 Genotyping by Sequencing (GBS) pipeline into Input and Annotation SAS Data Sets ready for use in downstream JMP Genomics analytical processes.. The Annotation SAS Data Set contains columns of data summary and statistics relevant to downstream quality control analyses. The Input SAS Data Set contains columns of markers with coded
genotypes for each individual in the sample.
The gbs_2xbc1geno.sas7bdat data set contains 83 columns: loci are identified in column one (
Loci), parental genotypes at each
locus are listed in columns 2 and 3 (
P1 and
P2, respectively), and the genotypes of 80 individual progeny (
ind1 -
ind80) at each locus are listed in the remaining columns.
A second, optional, data set is the Annotation Data Set that lists marker map or other annotation information. The
gbs_2xbc1anno.sas7bdat annotation data set is shown below: