Page History
...
Numbered figure captions | ||||
---|---|---|---|---|
| ||||
Searching The By sequence option uses a string search tool to return all positions in the set of genomic regions that match the given stringsstring(s). The string match is case insensitive, meaning if you search for ATCG, you may get atcG as a match. Nucleotides that are lower case have been "repeat masked", meaning they are located in a repetitive region of the genome. Your search string may contain any of the characters from the IUPAC nucleotide code . For example, if you search for WAAA, you may get back AAAA or TAAA (or any variation of upper and lower cases), because W represents A or T.
By database
The By database option uses an alignment matrix to match sequences against a motif database. We distribute the JASPAR database, but you can add any custom or public motif database.
...
Let h be the maximum of LA. The quality score of a sequence is calculated as QA(S) =LA(S)/h. A quality score of 1 corresponds to a sequence with the most likely base at each position of the alignment matrix. The user will specify a threshold QA. all All sequences that have a score TA>QA *h will be reported.
...