The name of the alphabet symbol.

[close ]

The frequency of the alphabet symbol as defined by the background model.

[close ]

CentriMo outputs a tab-separated values (TSV) file ('centrimo.tsv') that contains one line for each region found to be significantly enriched for a motif. The lines are sorted in order of decreasing statistical significance. The first line in the file contains the (tab-separated) names of the fields. Your command line is given at the end of the file in a comment line starting with the character '#'. The names and meanings of each of the fields, which depend on whether or not you provide control sequences to CentriMo, are described in the table below.

field name contents
1 db_index The index of the motif file that contains the motif. Motif files are numbered in the order the appeared in the command line.
2 motif_id The name of the motif, which is unique in the motif database file. If more than one motif has the same ID, CentriMo uses only the first such motif. The name is single-quoted and preceded with '+' or '-' if you scanned separately with the reverse complement motif (using the --sep option).
3 motif_alt_id An alternate name for the motif that may be provided in the motif database file.
4 consensus A consensus sequence computed from the motif (as described below).
5 E-value The expected number motifs that would have at least one region as enriched for best matches to the motif as the reported region (or would have optimal average distance to the sequence center as low as observed, if you used the --cd option). The E-value is the adjusted p-value multiplied by the number of motifs in the input files(s).
6 adj_p-value The statistical significance of the enrichment of the motif, adjusted for multiple tests. By default, a p-value is calculated by using the one-tailed binomial test on the number of sequences with a match to the motif that have their best match in the reported region; if you provided control sequences, the p-value of Fisher's exact test on the enrichment of best matches in the positive sequences relative to the negative sequences is computed instead; if you used the --cd option, the p-value is the probability that the average distance between the best site and the sequence center would be as low or lower than observed, computed using the cumulative Bates distribution, optimized over different score thresholds. In all cases, the reported p-value has been adjusted for the number of regions and/or score thresholds tested.
7 log_adj_p-value Log of adjusted p-value.
8 bin_location Location of the center of the most enriched region, or 0 if you used the --cd option.
9 bin_width The width (in sequence positions) of the most enriched region (default), or two times the average distance between the center of the best site and the sequence center if you used the option --cd. A best match to the motif is counted as being in the region if the center of the motif falls in the region.
10 total_width The maximum number of regions possible for this motif
   round(sequence_length - motif_length + 1)/2,
or the number of places the motif will fit if you used the --cd option.
11 sites_in_bin The number of (positive) sequences whose best match to the motif falls in the reported region (default) or anywhere in the sequence (if you used the option --cd).
Note: This number may be less than the number of (positive) sequences that have a best match in the region. The reason for this is that a sequence may have many matches that score equally best. If n matches have the best score in a sequence, 1/n is added to the appropriate bin for each match.
12 total_sites The number of sequences containing a match to the motif above the score threshold.
13 p_success The probability of a random match falling into the enriched region:
   bin_width / total_width
14 p-value The uncorrected p-value before it gets adjusted for the number of multiple tests to give the adjusted p-value.
15 mult_tests This is the number of multiple tests (n) done for this motif. It was used to adjust the p-value of a region for multiple tests using the formula:
   p' = 1 - (1-p)n where p is the unadjusted p-value.
The number of multiple tests is the number of regions considered times the number of score thresholds considered. It depends on the motif length, sequence length, and the type of optimizations being done (central enrichment, local enrichment, central distance or score optimization).
The following additional columns are present when you provide control sequences to CentriMo (using the --neg option).
16 neg_sites_in_bin The number of negative sequences where the best match to the motif falls in the reported region. This value is rounded but the underlying value may contain fractional counts. Note: This number may be less than the number of negative have a best match in the region. The reason for this is that a sequence may have many matches that score equally best. If n matches have the best score in a sequence, 1/n is added to the appropriate bin for each match.
17 neg_sites The number of negative sequences containing a match to the motif above the minimum score threshold. When score optimization is enabled the score threshold may be raised higher than the minimum.
18 neg_adj_pvalue The probability that any tested region in the negative sequences would be as enriched for best matches to this motif according to the Binomial test.
19 log_neg_adj_pvalue Log of negative adjusted p-value.
20 fisher_adj_pvalue Fisher adjusted p-value before it gets adjusted for the number of motifs in the input files(s).
21 log_fisher_adj_pvalue Log of Fisher adjusted p-value.

A consensus sequence is constructed from each column in a motif's frequency matrix using the "50% rule" as follows:

  1. The letter frequencies in the column are sorted in decreasing order.
  2. Letters with frequency less 50% of the maximum are discarded.
  3. The letter used in this position in the consensus sequence is determined by the first rule below that applies:
[ close ]

CentriMo outputs a text file ('site_counts.txt') that contains, for each motif, pairs of values (bin_position, site_count), or triples of values (bin_position, site_count, neg_site_count) if you provided control sequences to CentriMo. This data can be used to plot the density of motif best matches (sites) along the input sequences. Fractional counts are possible if multiple (n) bins contain the best match for a given sequence, with each bin receiving an incremental count of 1/n.

The data for each motif begins with a header line with the format:
   DB <db_number> MOTIF <id> <alt>
where <id> and <alt> are as described above. The following lines (up to the next header line) each contain a single value-pair or value-triple for the motif named in the header line.

[ close ]

Each "motif probability curve" shows the (estimated) probability of the best match to a given motif occurring at a given position in the input sequences. This estimated probability is based only on sequences that contain at least one match with score above than the score threshold defined for this motif, and is the maximum likelihood estimate of the conditional probability shown below.

Points (X,Y) on the plot are:
  Y = Pr(best match occurs at position X | sequence contains a match)

Note: The plots are smoothed according to the function selected from the "Smoothing" menu on the right. Setting the smoothing window size to 1 turns off smoothing.

If a negative dataset has been supplied then two curves are drawn for each motif, one for each dataset. The distribution of the motif in the primary dataset is plotted with a single unbroken curve, whereas the distribution in the negative dataset is plotted with a dashed curve.

[ close ]

This shows a listing of all motifs currently plotted on the graph.

The color used to plot a motif can be changed by clicking on the color swatch next to the motif you want to change, followed by clicking on the color swatch you wish to swap it with.

[ close ]

These are extra colors you may use for plotting motifs.

Click on the color swatch next to one of the above motifs, then click on one of these "unused color" swatches to change the color of the motif's plot.

[ close ]

These options change the display of the graph.

Smoothing:

Allows selection of the smoothing function applied to the graph.

The weighted moving average option uses weights shaped as an isosceles triangle where the central point (or points in an even sized window) get the maximum weight.

The moving average simply weights all points in the smoothing window equally.

Note: Setting the smoothing window size to 1 turns off smoothing.

Window

The window size used to smooth the graph. The larger the smoothing window size, the smoother the graph, at the cost of hiding detail.

Below a smoothing window size of 10, thinner lines are used on the graph to allow more detail to be visible.

Note: Remember to press "return" or "enter" after changing the number in the input box in order to see the effect of the new smoothing window size.

Legend

Choose to display/disable the on-graph legend. The legend can be moved by clicking on the graph.

Negative Sequences

Choose whether to plot the motif probability curve(s) for the negative sequences (if provided). The curve(s) are plotted as dashed lines, using the same color as the corresponding curve for the positive sequences.

Zoom
Drag a range on the graph to zoom into that section. Clicking "Undo Zoom" will return the view to the previously displayed part of the graph and clicking "Center on 0" will move the view so 0 is in the center.
Download EPS

Download the graph that you are currently viewing as an encapsulated postscript (EPS) image. EPS images are scalable making them suitable for publication.

[ close ]

List only enriched motifs that meet the selected filter criteria below.

Selected motifs are always listed; deselect all motifs first by clicking on the "X" above the color swatches if you wish to filter all motifs.

To filter on "ID" or "Name", you can enter any Javascript regular expression pattern. See here for documentation on Javascript regular expression patterns.

[ close ]

Sorting is applied after filtering where possible (the exception being the "Top" filter) so the filters applied will affect the sort. You can choose the motif sorting feature using the "Motifs:" menu.

If CentriMo is searching for locally enriched regions (not just centrally enriched regions), then multiple regions may be found per motif, and the "Regions:" menu will also be displayed. In this case, CentriMo first sorts all regions using the feature shown in the "Regions:" menu, and then it sorts the highest-ranked region of each motif according to the feature shown in the "Motifs:" menu.

Unless you check the box next to the "Regions:" menu, it will automatically show the same feature as the "Motifs:" menu (or "E-value" if a motif-only feature is chosen in the "Motifs:" menu).

Note:The motif p-value shown in the plot legend will always be for the region with the lowest p-value, and therefore may not match the value shown in the table "p-value" column when the "Regions:" menu is not set to "p-value".

[ close ]

The name of a file of motifs ("motif database file") that contains motif.

[ close ]

The name of the motif, which is unique in the motif database file.

If more than one motif has the same ID, CentriMo uses only the first such motif.

[ close ]

An alternate name for the motif that may be provided in the motif database file.

[ close ]

A consensus sequence is constructed from each column in a motif's frequency matrix using the "50% rule" as follows:

  1. The letter frequencies in the column are sorted in decreasing order.
  2. Letters with frequency less 50% of the maximum are discarded.
  3. The letter used in this position in the consensus sequence is determined by the first rule below that applies:
[ close ]

The expected number motifs that would have least one region as comparatively enriched for best matches to the motif as the reported region in the positive sequences compared with the negative sequences.

The Fisher E-value is the (one-sided) p-value of the one-sided Fisher's exact test that at least as many best matches in the region in the positive sequences that contain at least one match, multiplied by the number of motifs in the input database(s). The Fisher's exact test p-value is corrected for the number of regions and score thresholds tested ("Multiple Tests").

Fisher's exact test assumes that the probability that the best match (if any) falls into a given region is the same for all positive and negative sequences.

[ close ]

The statistical significance of the enrichment of the motif, adjusted for multiple tests.

The enrichment p-value of a motif is calculated by using the one-tailed binomial test on the number of sequences with a match to the motif ("Sequence Matches") that have their best match in the reported region ("Region Matches"), corrected for the number of regions and score thresholds tested ("Multiple Tests"). The test assumes that the probability that the best match in a sequence falls in the region is the region width divided by the number of places a motif can align in the sequence (sequence length minus motif width plus 1).

[ close ]

The expected number motifs that would have at least one region as enriched for best matches to the motif as the reported region. The E-value is the adjusted p-value multiplied by the number of motifs in the input files(s).

[ close ]

The Matthew's Correlation Coefficient (MCC) gives a measure of the ability of the motif to discriminate the positive sequences from the negative sequences:

MCC = [TP*TN - FP*FN] / [(TP + FP) * (TP + FN) * (TN + FP) * (TN + FN)]
where
TP is the number of positive sequences with a best match in the reported region,
FP is the number of negative sequences with a best match in the reported region,
TN is the number of negative sequences without a best match in the reported region, and
FN is the number of positive sequences without a best match in the reported region.

MCC ranges from -1 to +1, where a +1 result indicates that the occurrence of the best match to the motif in the reported region perfectly discriminates positive sequences from negative sequences.

[ close ]

The width (in sequence positions) of the most enriched region (default), or two times the average distance between the center of the best site and the sequence center if you used the option --cd. A best match to the motif is counted as being in the region if the center of the motif falls in the region.

[ close ]

The number of (positive) sequences whose best match to the motif is in the reported region.
Note: This number may be less than the number of (positive) sequences that have a best match in the region. The reason for this is that a sequence may have many matches that score equally best. If n matches have the best score in a sequence, 1/n is added to the appropriate bin for each match.

[ close ]

The number of negative sequences where the best match to the motif falls in the reported region. This value is rounded but the underlying value may contain fractional counts.

Note: This number may be less than the number of negative have a best match in the region. The reason for this is that a sequence may have many matches that score equally best. If n matches have the best score in a sequence, 1/n is added to the appropriate bin for each match.

[ close ]

The number of sequences containing a match to the motif above the score threshold ("Score Threshold").

[ close ]

This is the score threshold (bits) for determining if a sequence contains a match to this motif.

[ close ]

The number of negative sequences containing a match to the motif above the score threshold. When score optimization is enabled the score threshold may be raised higher than the minimum.

[ close ]

The probability that any tested region in the negative sequences would be as enriched for best matches to this motif according to the Binomial test.

Use the filter to display only motifs differentially enriched in both datasets (low p-value and high negative p-value).

[ close ]

The maximum probability that the best match occurs at any single sequence position. If the smoothing window size ("Window:", to right of graph) is set to "1", then this is value is the maximum value of the match-probability curve.

[ close ]

Concentration is defined as the total probability of all the positions in the central region whose width is the same as the size of the "smoothing window". You can change the size of the smoothing window using the "Window:" input field in the Graph options section, above. (A value of "NaN" indicates that the smoothing window size is too large for the motif.)

The "concentration" of the motif sites in the central window can somtimes be more informative than the E-value. For example, in some ChIP-seq datasets, motifs for cofactors show more significant enrichment overall (smaller E-value), but are less concentrated in a small (20 to 50bp) window than the motif for the ChIP-ed transcription factor. In such cases, you may wish to sort the motifs by Concentration, using the Sort menu, below.

[ close ]

This is the number of multiple tests (n) done for this motif. It was used to adjust the p-value of a region for multiple tests using the formula:
   p' = 1 - (1-p)n where p is the unadjusted p-value.
The number of multiple tests is the number of regions considered times the number of score thresholds considered. It depends on the motif length, sequence length, and the type of optimizations being done (central enrichment, local enrichment, central distance or score optimization).

[ close ]

Location of the center of the most enriched region.

[ close ]

The text box lists the sequence identifiers for sequences that have at least one of their best matches in the most significant region of all the selected motifs.

The "Intersection" subheading gives the number of identifiers in the text box and their percentage out of the total number of input sequences.

The "Union" subheading lists the number and percentage of sequences that have at least one of their best matches in the most significant region of any of the selected motifs and their percentage out of the total number of input sequences.

Note that the number of sequences with a match to a given motif in its best region may be larger than the value of "Region Matches". This is because a sequence may have multiple equally best matches and in that case a fractional match count is assigned to each of them when "Region Matches" is computed.

[ close ]

When more than one significant, non-overlapping region is found, they can be shown (and hidden again) by clicking the arrow.

By default the regions are sorted by E-value, but this can be changed by the menu on the right of the page.

[ close ]

Sequence position where the (unsmoothed) match-probability curve for this motif attains its maximum. Set the smoothing window size ("Window:", to right of graph) to "1" to see the unsmoothed match probability curve.

[ close ]

For further information on how to interpret these results please access https://meme-suite.org/meme/doc/centrimo-output-format.html.
To get a copy of the MEME software please access https://meme-suite.org.

If you use CentriMo in your research, please cite the following paper:
Timothy L. Bailey and Philip Machanick, "Inferring direct DNA binding from ChIP-seq", Nucleic Acids Research, 40:e128, 2012. [Full Text]

Motif Probability Graph   |   Enriched Motifs   |   Input Files   |   Program information   |   Results in TSV Format 
  |   Sequence position vs. number of matches for each motif 

Your browser does not support canvas!

Results

Motif Probability Graph (motif score ≥ 5

Options

Plotting 
MA0130.1
Unused Colors 
Graph 

Enriched motifs (E-value ≤ 10 using the binomial test )

Database 
ID 
Alt ID 
Consensus 
Concentration 
E-value 
Fisher E-value 
p-value 
Negative p-value 
MCC 
Region Center 
Region Width 
Region Matches 
Sequence Matches 
Negative Region Matches 
Negative Sequence Matches 
Max Probability 
Max Probability Location 
Multiple Tests 
Score Threshold 
Other Regions 
memeACRDACGGWGGRWNGMEME-1ACRDACGGWGGRWNG0.50743.6e-5681.7e-57101470915470.1083-1.51925.00e+0-
streme1-AYCCACCGTHYGYBSTREME-1AYCCACCGTHYGYB0.47056.9e-5373.3e-54001573117160.087201935.00e+0-
jolma2013CTCF_fullWGCGCCMYCTAGYGGYV0.01695.6e-892.7e-9201685025930.0202-59.51915.00e+0-
streme7-CCACAGGGTGGCSTREME-7CCACAGGGTGGC0.10018.7e-654.2e-68017011340.061701945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0139.1CTCFYGRCCASYAGRKGGCRSYR0.05035.5e-612.6e-64016875210740.0149-59.51905.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0695.1ZBTB7CRCGACCACCGAN0.15391.2e-495.8e-5301723316370.0281-31945.00e+0-
jolma2013ZBTB7C_fullRCGACCACCGAN0.15402.3e-491.1e-5201723316430.0280-31945.00e+0-
streme3-CCAGMAGRGGGCRSTREME-3CCAGMAGRGGGCR0.01683.8e-451.8e-4801784545970.0201-59.51935.00e+0-
jolma2013ZBTB7B_fullRCGACCACCGAA0.19205.4e-382.6e-410171397970.0439-31945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1930.1CTCFCTGCAGTKCCNVCHNNYRGCCASYAGRKGGCRSYN0.03554.3e-362.1e-3901564947330.016443.51825.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0694.1ZBTB7BRCGACCACCGAA0.19283.4e-351.6e-380171287260.0455-31945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1929.1CTCFCTGCAGTKCCNNNNNYNRCCASYAGRKGGCRSYV0.04153.4e-341.6e-37018376911080.0117361835.00e+0-
streme2-ACGCACGCAHGYGSTREME-2ACGCACGCAHGYG0.16683.8e-331.8e-3602619710520.0233-4.51935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1568.1TCF21CACCATATGKYR0.15095.2e-322.5e-35079510270.040921945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1468.1ATOH7AVCATATGBY0.12107.7e-323.7e-350711514920.033821955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1638.1HAND2NVCAGATGNN0.12381.3e-296.1e-3301409130.043801955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0818.2BHLHE22AMCATATGKY0.19219.1e-284.4e-3105534450.075321955.00e+0-
jolma2013MYBL1_DBD_3NHAACCGTTAA0.14029.6e-284.6e-3101211710200.0373-0.51945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1717.1ZNF784GTACYTACCGH0.16631.3e-276.5e-310201438900.03372.51945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1997.1Olig2NVCAGCTGBN0.12891.8e-258.5e-2901347290.046601955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0633.2Twist2NVCAGCTGBN0.10982.4e-231.2e-2601326740.046701955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1629.1Zic2NDCACAGCAGGDRG0.07281.1e-225.2e-26014114700.027901935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1993.1Neurod2VVCAGCTGBB0.11038.0e-223.8e-2501306530.045901955.00e+0-
jolma2013OLIG3_DBDAMCATATGBY0.11102.8e-211.4e-24056910770.034421955.00e+0-
jolma2013NEUROD2_fullRMCATATGBY0.12043.4e-201.6e-23078912960.027421955.00e+0-
jolma2013BHLHE23_DBDAMCATATGBT0.13984.2e-202.0e-2305516010.054921955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0130.1ZNF354CMTCCAC0.10825.6e-202.7e-2301718718700.017951975.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0827.1OLIG3AMCATATGBY0.10821.2e-196.0e-23056610540.034221955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1472.2Bhlha15NVACAGCTGTBN0.06921.3e-196.3e-23013612360.028701945.00e+0-
uniprobe mouseUP00005_2Tcfap2a_secondaryYHRCCYBWGGKCWB0.09061.5e-197.2e-230912819200.0146-41935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0521.2Tcf12NNACAGCTGTNN0.06683.5e-191.7e-22013713690.026701945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1655.1ZNF341NRGAACAGCCNN0.09586.0e-192.9e-22058617580.035021945.00e+0-
uniprobe mouseUP00006_1Zic3_primaryBMCCCCCGGGGGGGB0.09067.1e-193.4e-22046412580.0191-0.51925.00e+0-
jolma2013OLIG2_fullAMCATATGKT0.12609.8e-194.7e-2205557420.046521955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0002.2Runx1BBYTGTGGTTT0.08791.0e-184.8e-220811719050.05203.51945.00e+0-
uniprobe mouseUP00084_1Gmeb1_primaryBHBBKKACGTMMNWNNN0.13701.3e-186.3e-220261508980.0145-5.51915.00e+0-
uniprobe mouseUP00057_1Zic2_primaryNMCCCCCGGGGGGGT0.09147.5e-183.6e-21046212360.0186-0.51925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0615.1Gmeb1BHBBKKACGTMMNWNNN0.13498.9e-184.3e-210261488970.0134-5.51915.00e+0-
uniprobe mouseUP00102_1Zic1_primaryBHCCCCCGGGGGGG0.08751.5e-177.3e-21035312110.017311935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0678.1OLIG2AMCATATGKT0.12867.7e-173.7e-2005516960.046721955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0744.2SCRT2NNWGCAACAGGTGDNN0.07401.9e-169.0e-20057113640.041121925.00e+0-
jolma2013BHLHA15_DBDACCATATGGT0.14323.3e-151.6e-1805384120.058321955.00e+0-
jolma2013ZBTB7A_DBDBGCGACCACCGA0.13065.3e-152.5e-1801912510180.021651945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1102.2CTCFLNSCAGGGGGCGS0.04627.3e-153.5e-18015573314080.0107481945.00e+0-
jolma2013ZNF784_fullGTACCTACCT0.16908.1e-153.9e-18021844970.040231955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0751.1ZIC4GRCCCCCCGCKGYGH0.09623.3e-141.6e-17066711330.01942.51925.00e+0-
jolma2013ETV1_DBDACCGGAAGTD0.10206.3e-143.0e-17078113920.0115-11955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1619.1Ptf1ANNACAGCTGTNN0.06401.1e-135.2e-17013012100.024401945.00e+0-
uniprobe mouseUP00053_2Rxra_secondaryKCRCRWAGKTYRNDHN0.06651.4e-136.7e-17013618870.018801925.00e+0-
jolma2013ZIC4_DBDGRCCCCCCGCKGYGH0.09741.7e-138.1e-17066711700.0188-1.51925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1642.1NEUROG2NNVACAGATGGNN0.08681.8e-138.8e-17024516180.02600.51935.00e+0-
uniprobe mouseUP00006_2Zic3_secondaryNNDCACAGCAKGMNN0.07302.4e-131.2e-16024415480.02780.51925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0607.2BHLHA15ACCATATGGT0.13773.4e-131.6e-1605384720.046621955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0597.2THAP1VSGCAGGGCASV0.06683.7e-131.8e-16013316610.019901945.00e+0-
jolma2013GMEB2_DBD_1TTACGYAM0.12106.2e-133.0e-1602715610720.013301965.00e+0-
uniprobe mouseUP00057_2Zic2_secondaryNHDCVCAGCAKGNRW0.06917.1e-133.4e-16024315200.02700.51925.00e+0-
uniprobe mouseUP00000_1Smad3_primaryMDWNNCCAGACAKNWNH0.08057.9e-133.8e-16024415910.02640.51915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0461.2Atoh1RMCATATGBY0.09458.9e-134.3e-16055911640.028421955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1641.1MYF5NVACAGCTGTBN0.06281.7e-128.4e-16012912340.023101945.00e+0-
jolma2013Atoh1_DBDRMCATATGBY0.09222.7e-121.3e-15055911930.027721955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0511.2RUNX2WAACCGCAA0.10861.2e-115.6e-1501210012610.03494.51955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0734.3Gli2NRGACCACCCASV0.11421.2e-115.6e-1502215012610.02305.51935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0668.2Neurod2NNGRACAGATGGYNN0.08491.2e-116.0e-15024114950.02540.51925.00e+0-
jolma2013OLIG2_DBDAMCATATGKT0.09982.1e-111.0e-1405488520.032921955.00e+0-
uniprobe mouseUP00036_2Myf6_secondaryVNNRACAGVCGCRCC0.09762.3e-111.1e-14024619160.02240.51925.00e+0-
jolma2013GABPA_fullACCGGAAGTR0.10605.0e-112.4e-1407599150.013121955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0743.2SCRT1NDWKCAACAGGTGKNN0.06831.1e-105.3e-14055712010.031621925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1989.1Bcl11BNNAAACCACAARNN0.08721.2e-105.5e-1401111918230.043941935.00e+0-
jolma2013RUNX2_DBD_3WAACCGCAA0.10491.2e-105.7e-140129912870.03344.51955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1618.1Ptf1aNNACAGATGTTNN0.06402.4e-101.2e-13024116410.0207-0.51935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1621.1RbpjlNNVACACCTGTBNN0.05484.3e-102.1e-13013016510.017901935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1109.1NEUROD1NRACAGATGGYNN0.08514.8e-102.3e-13023915160.0237-0.51935.00e+0-
jolma2013MYF6_fullAACARCTGTT0.06607.2e-103.4e-1301217420.028301955.00e+0-
jolma2013RUNX3_fullWAACCRCAAA0.09648.7e-104.2e-1301310914050.034241955.00e+0-
uniprobe mouseUP00042_2Gm397_secondaryNVMVGCACACACKCVH0.10131.0e-95.0e-13097711750.030621925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0667.1MYF6AACARCTGTT0.06872.7e-91.3e-1201206990.028601955.00e+0-
jolma2013ETV5_DBDACCGGAWGYN0.08992.9e-91.4e-12077715570.010931955.00e+0-
jolma2013RUNX3_DBD_2WAACCGCAAA0.09823.2e-91.5e-120139912420.037841955.00e+0-
uniprobe mouseUP00102_2Zic1_secondaryYHDCRCAGCAGGMVD0.06733.8e-91.8e-12023815300.02290.51925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0500.2MYOGNDRCAGCTGYHN0.06683.8e-91.8e-12012411230.021401945.00e+0-
uniprobe mouseUP00094_1Zfp128_primaryNHWDDGGCGTACCCYMA0.14926.7e-93.2e-12030773820.0183-4.51915.00e+0-
jolma2013ELK3_DBDACCGGAAGTD0.09636.7e-93.2e-12076411730.013631955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0817.1BHLHE23AAACATATGBTN0.15358.2e-94.0e-1205242410.066421945.00e+0-
uniprobe mouseUP00036_1Myf6_primaryVVRSAACAGBTGTYHG0.07201.1e-85.2e-12034816400.016511925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1106.1HIF1ASBACGTGCNN0.12591.5e-87.1e-12019826630.0136-171955.00e+0-
jolma2013ETV3_DBDACCGGAAGTR0.09262.1e-89.9e-12076712960.013101955.00e+0-
jolma2013OLIG1_DBDAMCATATGKT0.11862.2e-81.0e-1105304130.048421955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0499.2MYOD1NNGCACCTGTCNB0.07152.5e-81.2e-11023715530.02060.51935.00e+0-
jolma2013MYBL2_DBD_3WHAACCGTTAH0.10162.7e-81.3e-1104419600.0208-0.51945.00e+0-
uniprobe mouseUP00047_2Zbtb7b_secondaryNNNRVGACCACCWKTRN0.10513.8e-81.8e-110107911800.0263-2.51915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1964.1SMAD2NBCCAGACDN0.09075.1e-82.5e-11034213340.030411955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1990.1Gli1NNRGACCACCCASV0.09995.5e-82.6e-1101713415360.019561935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0763.1ETV3ACCGGAAGTR0.09235.9e-82.9e-11076612790.013301955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1570.1TFAP4AHCATRTGDT0.06991.0e-75.0e-11012311950.019201955.00e+0-
jolma2013Tcf21_DBDRYAACAGCTGTTRN0.11242.1e-71.0e-1001132670.048701935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0259.1ARNT::HIF1AVBACGTGC0.11032.2e-71.1e-100191019630.0130-51965.00e+0-
jolma2013MESP1_DBDHRCACCTGBN0.06094.2e-72.0e-10012718240.014801955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1467.2Atoh1RVCAGATGGYN0.08658.2e-73.9e-10066415200.0204-0.51945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1947.1ETV5::FOXO1GTMAACAGGAWRY0.08828.6e-74.2e-1001210115200.03031.51935.00e+0-
jolma2013BHLHE22_DBDAAACATATGBTN0.11529.7e-74.7e-1005263560.047821945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1725.1ZNF189NNTGCTGTTCCHB0.08371.1e-65.3e-10066716370.02202.51935.00e+0-
jolma2013NEUROG2_DBDRACATATGYY0.09041.2e-65.5e-1005366860.030621955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0697.2Zic3CNCAGCAGGAGNN0.05791.5e-67.2e-10014876716500.0139-1.51935.00e+0-
jolma2013GLI2_DBD_1GACCACCCACRDHG0.09402.2e-61.0e-901712514730.018331935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0745.2SNAI2NBGCACCTGTMNY0.06622.8e-61.3e-9023012240.02120.51935.00e+0-
uniprobe mouseUP00097_1Mtf1_primaryDBRYCGTGTGCAAMNN0.11347.5e-63.6e-90531586790.030931925.00e+0-
jolma2013ZIC1_fullGACCCCCYGCTGTG0.10518.2e-64.0e-905263900.0462-21935.00e+0-
jolma2013TFEC_DBDRTCACRTGAB0.07611.2e-55.7e-9055414660.027621955.00e+0-
memeTRYGTGYGTRTGTGTMEME-3TRYGTGYGTRTGTGT0.17051.3e-56.4e-9022462700.0270-3.51925.00e+0-
jolma2013HSFY2_DBD_2TTTCGAACG0.09831.4e-56.7e-9012729820.02123.51955.00e+0-
jolma2013TFAP4_fullAWCAGCTGWT0.07251.6e-57.5e-901166620.024201955.00e+0-
jolma2013SCRT2_DBDRWGCAACAGGTGS0.10541.7e-58.4e-906253130.05432.51935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0826.1OLIG1AMCATATGKT0.12292.2e-51.1e-805243500.045721955.00e+0-
jolma2013SCRT1_DBDRWGCAACAGGTGGBH0.08262.9e-51.4e-804223630.04411.51925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1620.1Ptf1ANVACACCTGTNN0.04855.2e-52.5e-8012316280.014101945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1987.1ZNF701GAGCACYAARGGGGGRADDDD0.07745.2e-52.5e-8022811890.0143-0.51895.00e+0-
uniprobe mouseUP00003_1E2F3_primaryVHDADGGCGCGCSHW0.06517.0e-53.4e-801582935380.0112-65.51925.00e+0-
jolma2013Zic3_DBDGACCCCCYGCTGYGH0.08519.1e-54.4e-804234230.0402-1.51925.00e+0-
jolma2013GLI2_DBD_2GCGACCACMCTR0.09529.2e-54.4e-809569300.032341945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1730.1ZNF708NNRGCTGTRCCTCCY0.09381.5e-47.4e-803214910660.04033.51925.00e+0-
uniprobe mouseUP00087_2Tcfap2c_secondaryNYGCCCDAGGGCWD0.07271.9e-49.2e-8098517470.014321935.00e+0-
jolma2013ELK4_DBDACCGGAARTV0.09213.6e-41.7e-7075110750.013031955.00e+0-
jolma2013ZIC3_fullGACCCCCCGCTGYGH0.08257.3e-43.5e-706336180.0259-1.51925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0623.2NEUROG1RACATATGTY0.13508.0e-43.8e-707253260.039921955.00e+0-
jolma2013ETV4_DBDACCGGAAGTR0.07948.9e-44.3e-7076214680.0123-11955.00e+0-
jolma2013TFAP4_DBDAWCAGCTGAT0.06831.3e-36.2e-701157760.019301955.00e+0-
uniprobe mouseUP00001_1E2F2_primaryNHWARGGCGCGCSAH0.06861.5e-37.1e-701603466560.0122-6.51925.00e+0-
uniprobe mouseUP00065_1Zfp161_primaryKGGCGCGCGCRCHYRD0.10271.5e-37.3e-7035995940.0168-51925.00e+0-
jolma2013ESRRG_full_3TCAAGGTCAH0.05791.7e-38.3e-7011710540.016101955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1559.1SNAI3RRCAGGTGYA0.05141.8e-38.8e-701169240.017301955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0756.2ONECUT2CGATCRATA0.12791.9e-39.0e-7026583830.0144-4.51955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1648.1TCF12NNCACCTGCNN0.06822.2e-31.1e-6014272316720.009342.51945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0643.1EsrrgTCAAGGTCAH0.05712.5e-31.2e-601169460.016901955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0764.3ETV4ACCGGAAGTR0.08462.8e-31.4e-6075011110.0108-11955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0748.2YY2NVATGGCGGCS0.06153.2e-31.5e-6014454712190.0115-1.51945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0087.2Sox5VRVAACAATGGNN0.06133.4e-31.6e-6044214690.0136-0.51935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1946.1ETV5::FOXI1GTAAACAGGAWGY0.09225.4e-32.6e-60148211610.01641.51935.00e+0-
jolma2013FIGLA_DBDWMCACCTGKW0.05256.1e-32.9e-6012016100.012401955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0006.1Ahr::ArntYGCGTG0.08506.3e-33.0e-601711214850.009121975.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0506.2Nrf1SNCTGCGCMTGCGCV0.09837.2e-33.5e-60681816920.01455.51925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0442.2SOX10NDAACAAAGVN0.06139.5e-34.6e-6087116650.01773.51945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1982.1ZNF574VGSCTAGAGMGGCCS0.07231.1e-25.2e-606388710.0195-0.51925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1108.2MXI1NNCACATGBN0.08241.1e-25.5e-6054111710.026921955.00e+0-
uniprobe mouseUP00081_2Mybl1_secondaryNBACCAACTGCCGHK0.08311.3e-26.4e-6064812630.01902.51925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1646.1OSR2NNACAGAAGCNN0.05171.4e-26.9e-6012118580.011301945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0830.2TCF4NNGCACCTGCCNN0.05501.8e-28.7e-6014873116360.008642.51935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1650.1ZBTB14SSCCGCGCACVS0.09702.1e-21.0e-5021949690.0144-81945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0820.1FIGLAWMCACCTGKW0.05322.4e-21.2e-5011915870.012001955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1584.1ZIC5VGACCCCCCGCTGHGM0.09062.8e-21.3e-502291992650.0340-21925.00e+0-
uniprobe mouseUP00074_2Isgf3g_secondaryGVAAAACABDACYD0.05392.9e-21.4e-5012119300.010901935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1977.1ZNF324NNNWRGMAGHYAAGGATGGTT0.09683.0e-21.5e-508438010.01562.51895.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0028.2ELK1ACCGGAAGTR0.10783.0e-21.5e-5017484730.0169-81955.00e+0-
jolma2013Hic1_DBD_1RTGCCAVCCYRTGCCARC0.03503.4e-21.6e-501511162000.025011915.00e+0-
uniprobe mouseUP00081_1Mybl1_primaryNDDNHAACCGTTAHHHN0.09763.5e-21.7e-5014526350.0173-5.51915.00e+0-
jolma2013RUNX2_DBD_2WRACCGCADWAACCGCAD0.08663.8e-21.8e-501114850.022701915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1936.1ERF::FOXO1RTMAACAGGAARBS0.07293.9e-21.9e-50117714130.011351935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0162.4EGR1VCMCGCCCACGCVS0.09004.2e-22.0e-501910712780.012831935.00e+0-
uniprobe mouseUP00092_1Myb_primaryNTDDHAACCGTTAWHHN0.09524.3e-22.0e-5014556930.0159-0.51915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0832.1Tcf21RYAACAGCTGTTRN0.08884.8e-22.3e-50182140.037401935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1100.2ASCL1VGCAGCTGCN0.05795.2e-22.5e-5011916680.011401955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0864.2E2F2RWTTTGGCGCCAWWWY0.02316.9e-23.3e-501651963460.0145-651925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1954.1FOXO1::ELK1RWMAACAGGAAGTD0.06757.3e-23.5e-5032811400.018411935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0091.1TAL1::TCF3NSAMCATCTGKT0.06628.1e-23.9e-503227550.0159-11945.00e+0-
jolma2013RUNX3_DBD_3HRACCGCARWAACCRCAR0.09579.8e-24.7e-501115330.022591915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1580.1ZBTB32TGTACAGTAW0.09281.0e-14.9e-507223340.0195-31955.00e+0-
uniprobe mouseUP00046_1Tcfe2a_primaryNKCHVCAGGTGCRNDNW0.07501.0e-15.0e-5022312260.0131-0.51915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1731.1ZNF768YBVCYBRSCCTCTCTGDG0.06911.1e-15.1e-5096513890.0252-41915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1524.2Msgn1VRRRACAAATGGTNNN0.07151.4e-16.6e-505339230.018421925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0691.1TFAP4AWCAGCTGWT0.05141.4e-16.8e-501115640.019501955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0143.4SOX2DNACAATGGNN0.05421.6e-17.6e-5065516690.01440.51945.00e+0-
jolma2013MSC_fullAACAGCTGTT0.05341.6e-17.6e-501138330.015601955.00e+0-
uniprobe mouseUP00026_2Zscan4_secondaryHVHRGCACACAANHHN0.06951.7e-18.1e-5054715980.021921925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0632.2TCFL5KCACGCGCMC0.09341.8e-18.5e-50391056440.0124-81955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0869.2Sox11NRGAACAAAGVV0.06172.3e-11.1e-4097617830.019941945.00e+0-
jolma2013ELK1_DBD_2ACCGGAAGTR0.09802.8e-11.4e-4017505410.016631955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1628.1Zic1::Zic2CVCAGCAGGNV0.05993.1e-11.5e-4065015100.0152-0.51945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1561.1SOX12ACCGAACAATV0.07183.2e-11.5e-4084810720.02153.51945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1593.1ZNF317WVACAGCAGAYW0.06053.4e-11.6e-4011817020.010601945.00e+0-
jolma2013GMEB2_DBD_4TTACGTAA0.09363.7e-11.8e-40591446520.0161-51965.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0862.1GMEB2TTACGTAA0.09363.7e-11.8e-40591446520.0161-51965.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1950.1FLI1::FOXI1RTAAACAGGAARYN0.07003.7e-11.8e-4033013860.015911935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1514.1KLF17MMCCACGCACCCMTY0.09893.8e-11.8e-4022818090.0148-3.51925.00e+0-
uniprobe mouseUP00003_2E2F3_secondaryNNYWYGGCGCCAMDVBN0.05004.4e-12.1e-401644288400.00957.51915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0696.1ZIC1GACCCCCYGCTGTG0.09064.5e-12.2e-401451623200.0344-21935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0522.3TCF3NVCACCTGCNN0.07205.6e-12.7e-4019891616170.0105-4.51945.00e+0-
uniprobe mouseUP00066_2Hnf4a_secondaryKKYHAAAGTCCAMTNH0.04646.0e-12.9e-401139270.014001925.00e+0-
jolma2013ZNF238_fullNATCCAGATGTKB0.05226.6e-13.2e-402178040.0149-0.51935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1953.1FOXO1::ELF1RTMAACAGGAAGTN0.06236.9e-13.3e-4033014290.017511935.00e+0-
jolma2013FEV_DBDACCGGAAGTN0.07436.9e-13.3e-4074712240.013931955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0039.4KLF4NNCCCCACCCHN0.08117.7e-13.7e-402314816890.0154-31945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0484.2HNF4GDNCAAAGTCCANN0.04838.7e-14.2e-4022415100.01590.51935.00e+0-
jolma2013RUNX2_DBD_1WRACCGCAAACCGCAR0.09259.1e-14.4e-403218000.025011925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0114.4HNF4ADNCAAAGTCCANN0.05469.7e-14.7e-4022415200.01580.51935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0738.1HIC2RTGCCCRSB0.04589.9e-14.8e-4018298819120.00893.51955.00e+0-
jolma2013E2F1_DBD_3WWTTGGCGCCAAWW0.02781.36.0e-401632123960.0126-651935.00e+0-
jolma2013NRF1_fullYGCGCATGCGCA0.14461.36.3e-4071551660.0241-41945.00e+0-
jolma2013ERG_DBD_1ACCGGAARTV0.07481.36.4e-4074712360.012531955.00e+0-
jolma2013ELK1_DBD_1ACCGGAAGTD0.07001.36.5e-4074411430.014031955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1508.1IKZF1VVAACAGGAARN0.05711.46.7e-4033518860.011711945.00e+0-
uniprobe mouseUP00027_1Osr1_primaryNNNWACRGTAGCWMHR0.07441.57.1e-4011310080.012901925.00e+0-
jolma2013HIC2_DBDRTGCCCRSB0.04551.57.4e-4018298619120.00893.51955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1728.1ZNF549NNTGCTGCCCWR0.04791.67.5e-4019797017310.0081-31945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0103.3ZEB1SNCACCTGSVN0.06981.67.9e-4022314680.0136-4.51945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1716.1ZNF76NNTWCCCAYAATGCAHYGCGC0.08091.88.5e-404163830.0209-1.51895.00e+0-
jolma2013Elk3_DBDACCGGAAGTD0.08861.88.6e-407296210.017731955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1984.1ZNF667TTAAGAGCTCAV0.07581.88.6e-40138114250.040061945.00e+0-
uniprobe mouseUP00026_1Zscan4_primaryDHNATGTGCACAYAHWN0.09252.09.6e-4018678000.02624.51915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1517.1KLF6NRCCACGCCCH0.08292.21.1e-30189612850.01486.51945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0098.3ETS1ACCGGAARTR0.07442.21.1e-3074311290.012431955.00e+0-
uniprobe mouseUP00079_2Esrra_secondaryVKNNAGGGGTCAHDNBH0.03662.31.1e-3015854611470.008740.51915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1099.2HES1GGCRCGTGBC0.09392.81.4e-3021697350.0181-41955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0470.2E2F4TTTTGGCGCCAWWW0.03513.11.5e-301631292280.0175601935.00e+0-
uniprobe mouseUP00042_1Gm397_primaryNNDATGTGCACATAMDH0.08743.41.6e-3012508070.02734.51915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1542.1OSR1HGCTACYGTD0.07503.41.7e-3011311070.011701955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1560.1SOHLH2YGCACGTGCN0.08653.91.9e-30331037920.0139-51955.00e+0-
jolma2013ESRRA_DBD_1BTCAAGGTCAH0.05324.62.2e-302179390.01700.51945.00e+0-
jolma2013ETS1_DBD_1ACCGGAARTR0.07445.22.5e-3074311700.0111-11955.00e+0-
jolma2013SNAI2_DBDDRCAGGTGB0.05445.52.6e-3022518200.0088-4.51955.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0024.3E2F1TTTGGCGCCAAA0.02716.02.9e-301631953690.0136591945.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1942.1ETV2::FOXI1BGTAAACAGGAAGYR0.07596.23.0e-3043414490.01591.51925.00e+0-
uniprobe mouseUP00092_2Myb_secondaryNKACCAACTGCCRNNN0.07366.33.0e-3053512090.016521925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0116.1Znf423GSMMCCYARGGKKBM0.08336.73.2e-306142280.03071.51925.00e+0-
uniprobe mouseUP00079_1Esrra_primaryNMNYCAAGGTCAHNNNN0.05727.13.4e-3021810570.0161-0.51915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1585.1ZKSCAN1AYAGTAGGTS0.09647.43.6e-301271874460.0202-11955.00e+0-
uniprobe mouseUP00099_1Ascl2_primaryNKYARCAGCTGCYVHHK0.04587.63.7e-3017677115060.0073-4.51915.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1123.2TWIST1NNDCCAGATGTBN0.05328.13.9e-3022316180.00870.51935.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA0147.3MYCNNCCACGTGCNB0.06068.13.9e-3020570211970.0100-51945.00e+0-
uniprobe mouseUP00001_2E2F2_secondaryNNNWYGGCGCCAANNKB0.04708.44.0e-3020059810220.01277.51915.00e+0-
uniprobe mouseUP00031_2Zbtb3_secondaryCWATCACTGGYADNWD0.05438.64.1e-3054819150.008421925.00e+0-
uniprobe mouseUP00083_2Tcf7l2_secondaryVDRVATCAATCRBKHN0.05578.84.2e-3011413740.010201925.00e+0-
JASPAR2022 CORE vertebrates non-redundant v2MA1535.1NR2C1NRRGGTCAN0.05459.04.3e-3017675915130.0086-118.51955.00e+0-
jolma2013TFE3_DBDVYCACGTGAY0.05909.84.7e-3020585114740.012921955.00e+0-
jolma2013NR2F1_fullGRGGTCAA0.05359.94.8e-3018367913000.0077-471965.00e+0-

Matching sequences (out of 2000)

Union: 212 sequences (11%).
Intersection: 212 sequences (11%).

Filter & Sort

Filters 
Sort 

Columns to display

Input Files

Alphabet

    Background source: the file 'GSM4160247-ETO--BTZ_WO_meme-chip/background'

Name 
Bg. 
Bg. 
Name 
Adenine0.2614A~T0.2614Thymine
Cytosine0.2386C~G0.2386Guanine

Sequences

DatabaseSourceSequence Count
GSM4160247-ETO--BTZ WO.mm10plusrDNA.summits 200 GSM4160247-ETO--BTZ_WO_meme-chip/GSM4160247-ETO--BTZ_WO.mm10plusrDNA.summits_200.fa 2000

Motifs

DatabaseSourceMotif Count
memeGSM4160247-ETO--BTZ_WO_meme-chip/meme_out/meme.xml3
stremeGSM4160247-ETO--BTZ_WO_meme-chip/streme_out/streme.xml7
uniprobe mouse/sibcb1/wuweilab1/liangyu/meme/motif_databases/MOUSE/uniprobe_mouse.meme386
JASPAR2022 CORE vertebrates non-redundant v2/sibcb1/wuweilab1/liangyu/meme/motif_databases/JASPAR/JASPAR2022_CORE_vertebrates_non-redundant_v2.meme841
jolma2013/sibcb1/wuweilab1/liangyu/meme/motif_databases/EUKARYOTE/jolma2013.meme843

Other Settings

Objective Function central region enrichment (CE)
Convert Motifs to Different Alphabet? No
Motif Pseudo-Counts 0.1
Required sequence length 400
Site Scoring Method log-odds scores
Score Threshold 5 (bits)
Optimize Score Threshold? No
Minimum Region Size 0
Maximum Region Size 0
Strand Handling scan both strands if alphabet is complementable
Plotting of Matches on Negative Strand same as for positive strand
Sequence IDs Included in Output? Yes
CentriMo version
5.5.2 (Release date: Sun Jan 29 10:33:12 2023 -0800)
Reference
Timothy L. Bailey and Philip Machanick, "Inferring direct DNA binding from ChIP-seq", Nucleic Acids Research, 40:e128, 2012.
[Full Text]
Command line summary