ChIP-Seq and ATAC-Seq identify enriched regions or peaks in genome. Depending on the assay, the biological meaning of enrichment changes; in ChIP-Seq, enrichment indicates protein binding, while in ATAC-Seq, enrichment indicates open chromatin. To understand the importance of enriched regions in regulating gene expression, we can add information about overlapping or nearby genomic features.
Annotate peaks takes an input set of regions and checks for overlap between those regions and a gene/feature annotation. This gives regulatory context for enriched regions.
The input for Annotate peaks is a Peaks type data node.
Click the Peak analysis section in the toolbox
The Genomics overlaps parameter lets you choose one of two options (Figure 1).
User should define the transcription start site (TSS) and transcription termination site (TTS) limit in the unit of bp
Annotate peaks produces an Annotated peaks data node. The Annotated peaks task report adds a Gene section breakdown pie chart and adds columns with information about the Gene IDs, Transcript IDs, Gene section, Distance to TSS, and Distance to TTS of each peak to the standard Peaks report (Figure 2). If run with the option to report all gene sections selected, each peak will have a row for each gene section it overlaps. If run with the option to report one gene section selected, each peak will have one row with the gene section it overlaps chosen using the order of precedence.
The table can be sorted by any of its columns (Figure 3). Click on the Optional columns on the upper-left corner of the table to add more information on each region
Transcription start site (TSS) is -1000bp and +100bp (default setting) from the TSS for a transcript
Transcription termination site (TTS) is -100bp and +1000bp (default setting) from the TTS for a transcript
Coding sequence (CDS) Exon is overlapping a coding exon in a transcript
5' Untranslated Region (UTR) Exon is overlapping an exon in the 5' UTR of a transcript
3' Untranslated Region (UTR) Exon is overlapping an exon in the 3' UTR of a transcript
Intron is overlapping an intron in a transcript
Intergenic is not located within 1000bp of a transcript