Page History

Next generation sequencing (NGS) data is notably huge in file size. Dealing with NGS data is not only time consuming but also puts constraints on hard disk space. This is especially true if analysis parameters need to be optimized. The Subsample FASTQ function is a very useful tool to get a subset of the raw data upon which optimization can be performed. The optimized parameters can then be saved and applied to the whole dataset

...

Aligned reads data can be converted to unaligned reads in Partek^® Flow^®. The task is available in the task menu when the user selects any aligned data node (Figure 1). It can be from a result of an aligner in Partek Flow or data already aligned before importing into a project.

Generating unaligned reads from aligned data gives you the flexibility to realign using either a different aligner or a different genome. This is particularly useful in sequencing samples from xenograft models where the same set of reads are aligned to different species. It is also helpful if the original unaligned FASTQ files are not as easily accessible to the user as the aligned BAM files.

To perform the task, select an aligned reads data node and click Convert alignments to unaligned reads task in the task menu (Figure 1).

Numbered figure captions

SubtitleText	Convert alignments to unaligned reads
AnchorName	Convert alignments to unaligned reads

A new data node will be generated containing FASTQ files (Figure 2) . Note that the files generated are compressed and the filenames are *.fastq.gz.

Numbered figure captions

SubtitleText	Unaligned reads generated from aligned reads
AnchorName	Unaligned reads generated from aligned reads

During the conversion, the BAM files are converted to FASTQ files. If the BAM file contains paired end reads, two FASTQ files will be generated for each BAM, and the files names will contain _1 and _2. In Figure 3, the example shows 18 fastq.gz files from 9 .bam files

Image Removed

Numbered figure captions

SubtitleText	Subsample FASTQ page. This option shows getting a subset of raw data by keeping one read for every 10 reads.
AnchorName	subsample-fastq

Task details of unaligned reads generated from aligned paired end samples
AnchorName	Task details of unaligned reads generated from aligned paired end samples

Additional assistance

Rate Macro

allowUsers	false

Partek Flow Documentation

Page tree

Versions Compared

Old Version 1

New Version 2

Key