6/13/2023 0 Comments Bam file formatThis is not ideal because among other flaws, much of the metadata associated with sequencing runs cannot be stored in FASTQ files, unlike BAM files which can store more information. Most sequencing providers generate FASTQ files with the raw unmapped read sequences, so that is the most common form in which the data is input into the mapping step of the pre-processing pipeline. This is basically an "off-label" use of the BAM format (which was specifically designed to contain mapping information) that is used for data management reasons: it allows you to attach metadata to the reads from as early on in the analysis process as possible.įor this to make sense, it helps to take a step back and look at the context. UBAM is a variant form of the BAM file format in which the read data does not contain mapping information.
0 Comments
Leave a Reply. |