CentriMo Results

field	name	contents
1	db_index	The index of the motif file that contains the motif. Motif files are numbered in the order the appeared in the command line.
2	motif_id	The name of the motif, which is unique in the motif database file. If more than one motif has the same ID, CentriMo uses only the first such motif. The name is single-quoted and preceded with '+' or '-' if you scanned separately with the reverse complement motif (using the `--sep` option).
3	motif_alt_id	An alternate name for the motif that may be provided in the motif database file.
4	consensus	A consensus sequence computed from the motif (as described below).
5	E-value	The expected number motifs that would have at least one region as enriched for best matches to the motif as the reported region (or would have optimal average distance to the sequence center as low as observed, if you used the `--cd` option). The E-value is the adjusted p-value multiplied by the number of motifs in the input files(s).
6	adj_p-value	The statistical significance of the enrichment of the motif, adjusted for multiple tests. By default, a p-value is calculated by using the one-tailed binomial test on the number of sequences with a match to the motif that have their best match in the reported region; if you provided control sequences, the p-value of Fisher's exact test on the enrichment of best matches in the positive sequences relative to the negative sequences is computed instead; if you used the `--cd` option, the p-value is the probability that the average distance between the best site and the sequence center would be as low or lower than observed, computed using the cumulative Bates distribution, optimized over different score thresholds. In all cases, the reported p-value has been adjusted for the number of regions and/or score thresholds tested.
7	log_adj_p-value	Log of adjusted p-value.
8	bin_location	Location of the center of the most enriched region, or 0 if you used the `--cd` option.
9	bin_width	The width (in sequence positions) of the most enriched region (default), or two times the average distance between the center of the best site and the sequence center if you used the option `--cd`. A best match to the motif is counted as being in the region if the center of the motif falls in the region.
10	total_width	The maximum number of regions possible for this motif round(sequence_length - motif_length + 1)/2, or the number of places the motif will fit if you used the `--cd` option.
11	sites_in_bin	The number of (positive) sequences whose best match to the motif falls in the reported region (default) or anywhere in the sequence (if you used the option `--cd`). Note: This number may be less than the number of (positive) sequences that have a best match in the region. The reason for this is that a sequence may have many matches that score equally best. If n matches have the best score in a sequence, 1/n is added to the appropriate bin for each match.
12	total_sites	The number of sequences containing a match to the motif above the score threshold.
13	p_success	The probability of a random match falling into the enriched region: bin_width / total_width
14	p-value	The uncorrected p-value before it gets adjusted for the number of multiple tests to give the adjusted p-value.
15	mult_tests	This is the number of multiple tests (n) done for this motif. It was used to adjust the p-value of a region for multiple tests using the formula: p' = 1 - (1-p)ⁿ where p is the unadjusted p-value. The number of multiple tests is the number of regions considered times the number of score thresholds considered. It depends on the motif length, sequence length, and the type of optimizations being done (central enrichment, local enrichment, central distance or score optimization).
The following additional columns are present when you provide control sequences to CentriMo (using the `--neg` option).
16	neg_sites_in_bin	The number of negative sequences where the best match to the motif falls in the reported region. This value is rounded but the underlying value may contain fractional counts. Note: This number may be less than the number of negative have a best match in the region. The reason for this is that a sequence may have many matches that score equally best. If n matches have the best score in a sequence, 1/n is added to the appropriate bin for each match.
17	neg_sites	The number of negative sequences containing a match to the motif above the minimum score threshold. When score optimization is enabled the score threshold may be raised higher than the minimum.
18	neg_adj_pvalue	The probability that any tested region in the negative sequences would be as enriched for best matches to this motif according to the Binomial test.
19	log_neg_adj_pvalue	Log of negative adjusted p-value.
20	fisher_adj_pvalue	Fisher adjusted p-value before it gets adjusted for the number of motifs in the input files(s).
21	log_fisher_adj_pvalue	Log of Fisher adjusted p-value.

field

name

contents

db_index

The index of the motif file that contains the motif. Motif files are numbered in the order the appeared in the command line.

motif_id

The name of the motif, which is unique in the motif database file. If more than one motif has the same ID, CentriMo uses only the first such motif. The name is single-quoted and preceded with '+' or '-' if you scanned separately with the reverse complement motif (using the --sep option).

motif_alt_id

An alternate name for the motif that may be provided in the motif database file.

consensus

A consensus sequence computed from the motif (as described below).

E-value

The expected number motifs that would have at least one region as enriched for best matches to the motif as the reported region (or would have optimal average distance to the sequence center as low as observed, if you used the --cd option). The E-value is the adjusted p-value multiplied by the number of motifs in the input files(s).

adj_p-value

The statistical significance of the enrichment of the motif, adjusted for multiple tests. By default, a p-value is calculated by using the one-tailed binomial test on the number of sequences with a match to the motif that have their best match in the reported region; if you provided control sequences, the p-value of Fisher's exact test on the enrichment of best matches in the positive sequences relative to the negative sequences is computed instead; if you used the --cd option, the p-value is the probability that the average distance between the best site and the sequence center would be as low or lower than observed, computed using the cumulative Bates distribution, optimized over different score thresholds. In all cases, the reported p-value has been adjusted for the number of regions and/or score thresholds tested.

log_adj_p-value

Log of adjusted p-value.

bin_location

Location of the center of the most enriched region, or 0 if you used the --cd option.

bin_width

The width (in sequence positions) of the most enriched region (default), or two times the average distance between the center of the best site and the sequence center if you used the option --cd. A best match to the motif is counted as being in the region if the center of the motif falls in the region.

total_width

The maximum number of regions possible for this motif
round(sequence_length - motif_length + 1)/2,
or the number of places the motif will fit if you used the --cd option.

sites_in_bin

The number of (positive) sequences whose best match to the motif falls in the reported region (default) or anywhere in the sequence (if you used the option --cd).
Note: This number may be less than the number of (positive) sequences that have a best match in the region. The reason for this is that a sequence may have many matches that score equally best. If n matches have the best score in a sequence, 1/n is added to the appropriate bin for each match.

total_sites

The number of sequences containing a match to the motif above the score threshold.

p_success

The probability of a random match falling into the enriched region:
bin_width / total_width

p-value

The uncorrected p-value before it gets adjusted for the number of multiple tests to give the adjusted p-value.

mult_tests

This is the number of multiple tests (n) done for this motif. It was used to adjust the p-value of a region for multiple tests using the formula:
p' = 1 - (1-p)ⁿ where p is the unadjusted p-value.
The number of multiple tests is the number of regions considered times the number of score thresholds considered. It depends on the motif length, sequence length, and the type of optimizations being done (central enrichment, local enrichment, central distance or score optimization).

The following additional columns are present when you provide control sequences to CentriMo (using the --neg option).

neg_sites_in_bin

The number of negative sequences where the best match to the motif falls in the reported region. This value is rounded but the underlying value may contain fractional counts. Note: This number may be less than the number of negative have a best match in the region. The reason for this is that a sequence may have many matches that score equally best. If n matches have the best score in a sequence, 1/n is added to the appropriate bin for each match.

neg_sites

The number of negative sequences containing a match to the motif above the minimum score threshold. When score optimization is enabled the score threshold may be raised higher than the minimum.

neg_adj_pvalue

The probability that any tested region in the negative sequences would be as enriched for best matches to this motif according to the Binomial test.

log_neg_adj_pvalue

Log of negative adjusted p-value.

fisher_adj_pvalue

Fisher adjusted p-value before it gets adjusted for the number of motifs in the input files(s).

log_fisher_adj_pvalue

Log of Fisher adjusted p-value.

Results

Enriched motifs (E-value ≤ 10 using the binomial test )

Database	ID	Alt ID	Consensus	Concentration	E-value	Fisher E-value	Region Center	Region Width	Region Matches	Negative Sequence Matches	Max Probability	Max Probability Location	Multiple Tests	Score Threshold
streme	1-AGATMGGAAGAGVGD	STREME-1	AGATMGGAAGAGVGD	0.3235	2.4e-175	1.1e-178	16	319	1018	0.2790	-7.5	192	5.00e+0	-
meme	CBCTCTTCCKMTCTN	MEME-1	CBCTCTTCCKMTCTN	0.2643	1.3e-156	6.2e-160	14	311	1235	0.2154	-6.5	192	5.00e+0	-
jolma2013	ELK1_DBD_1		ACCGGAAGTD	0.1663	3.7e-39	1.8e-42	15	146	962	0.1174	-7	195	5.00e+0	-
jolma2013	ETV5_DBD		ACCGGAWGYN	0.1385	1.7e-34	8.0e-38	15	172	1390	0.0846	-7	195	5.00e+0	-
jolma2013	Elk3_DBD		ACCGGAAGTD	0.1992	1.2e-33	5.7e-37	15	99	522	0.1667	-7	195	5.00e+0	-
jolma2013	FEV_DBD		ACCGGAAGTN	0.1483	5.5e-32	2.6e-35	15	145	1090	0.0978	-7	195	5.00e+0	-
jolma2013	ELK1_full_1		ACCGGAAGTD	0.1951	1.1e-31	5.2e-35	15	97	528	0.1553	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0759.2	ELK3	ACCGGAAGTRV	0.2635	1.3e-31	6.2e-35	16	75	296	0.2230	-7.5	194	5.00e+0	-
jolma2013	ETV1_DBD		ACCGGAAGTD	0.1433	2.3e-31	1.1e-34	15	150	1169	0.0933	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0645.1	ETV6	MSCGGAAGTR	0.1396	1.8e-30	8.7e-34	15	150	1196	0.0886	-7	195	5.00e+0	-
jolma2013	ETV4_DBD		ACCGGAAGTR	0.1385	7.5e-30	3.6e-33	15	155	1275	0.0860	-7	195	5.00e+0	-
jolma2013	ETV6_full_2		MSCGGAAGTR	0.1361	1.1e-28	5.5e-32	15	149	1227	0.0863	-7	195	5.00e+0	-
jolma2013	ELK3_DBD		ACCGGAAGTD	0.1443	1.0e-26	4.9e-30	15	128	991	0.0918	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0764.3	ETV4	ACCGGAAGTR	0.1435	9.5e-26	4.6e-29	15	122	934	0.0910	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1952.1	FOXJ2::ELF1	RTAAACMGGAAGTR	0.1094	2.6e-25	1.3e-28	11	143	1646	0.0668	-5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1935.1	ERF::FOXI1	RTAAACMGGAARTR	0.1083	1.8e-24	8.8e-28	11	147	1755	0.0513	-5	193	5.00e+0	-
uniprobe mouse	UP00015_2	Ehf_secondary	WWVDABTTCCKAWSWW	0.1169	2.3e-24	1.1e-27	15	170	1634	0.0777	-7	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1931.1	ELK1::HOXA1	ACCGGAAGTAATTA	0.2375	3.5e-24	1.7e-27	19	75	320	0.1781	-9	193	5.00e+0	-
jolma2013	ERF_DBD		ACCGGAAGTR	0.1408	5.1e-24	2.5e-27	15	120	941	0.0962	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1942.1	ETV2::FOXI1	BGTAAACAGGAAGYR	0.1161	2.9e-23	1.4e-26	10	118	1370	0.0620	-4.5	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0750.2	ZBTB7A	NVCCGGAAGTGSV	0.1221	2.9e-21	1.4e-24	18	164	1392	0.0690	-7.5	193	5.00e+0	-
jolma2013	ELK4_DBD		ACCGGAARTV	0.1354	1.4e-20	6.5e-24	15	112	923	0.0910	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0474.3	Erg	NNACAGGAAGTGVN	0.1159	2.2e-20	1.1e-23	15	162	1658	0.0600	-7	193	5.00e+0	-
jolma2013	GATA3_DBD		WGATAASV	0.0869	9.3e-19	4.5e-22	9	122	1838	0.0437	-4	196	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0062.3	GABPA	NNCACTTCCTGTNN	0.1092	1.2e-17	6.0e-21	15	162	1763	0.0519	-7	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1946.1	ETV5::FOXI1	GTAAACAGGAWGY	0.1136	3.6e-17	1.7e-20	10	94	1118	0.0555	-4.5	193	5.00e+0	-
jolma2013	ETS1_DBD_1		ACCGGAARTR	0.1220	7.1e-17	3.4e-20	15	113	1042	0.0788	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1642.1	NEUROG2	NNVACAGATGGNN	0.0805	7.9e-16	3.8e-19	4	66	1543	0.0350	-1.5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0473.3	ELF1	RDVCAGGAAGTGVN	0.1081	1.1e-15	5.1e-19	15	148	1612	0.0583	-7	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1708.1	ETV7	DGSCGGAAGTR	0.1099	2.0e-15	9.6e-19	14	138	1568	0.0565	-6.5	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0761.2	ETV1	NNACAGGAAGTGNN	0.1062	3.4e-15	1.7e-18	15	152	1692	0.0505	-7	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0037.4	Gata3	NHTCTTATCTNH	0.0845	9.7e-15	4.6e-18	9	111	1763	0.0391	-4	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1954.1	FOXO1::ELK1	RWMAACAGGAAGTD	0.1025	1.6e-14	7.7e-18	11	96	1151	0.0591	-5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1725.1	ZNF189	NNTGCTGTTCCHB	0.1142	4.1e-14	2.0e-17	24	195	1572	0.0585	-9.5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0080.6	Spi1	RRAAAGAGGAAGTGGDD	0.1032	1.0e-13	4.8e-17	14	138	1624	0.0517	-6.5	191	5.00e+0	-
jolma2013	GATA5_DBD		WGATAASR	0.0822	2.1e-13	9.9e-17	9	109	1807	0.0391	-4	196	5.00e+0	-
jolma2013	ERG_DBD_1		ACCGGAARTV	0.1120	2.1e-13	9.9e-17	15	110	1107	0.0650	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1992.1	Ikzf3	NNRCAGGAAGTGGVN	0.1040	1.3e-12	6.2e-16	18	163	1674	0.0490	-7.5	192	5.00e+0	-
jolma2013	FLI1_DBD_1		ACCGGAARTV	0.1106	3.5e-12	1.7e-15	15	106	1080	0.0653	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1467.2	Atoh1	RVCAGATGGYN	0.0817	3.8e-12	1.8e-15	6	73	1457	0.0398	-2.5	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0081.2	SPIB	TYTCACTTCCTCTTTY	0.1028	6.1e-12	2.9e-15	17	145	1525	0.0518	-7	192	5.00e+0	-
jolma2013	GATA4_DBD		WGATAASV	0.0784	1.7e-11	8.0e-15	9	106	1831	0.0361	-4	196	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1955.1	FOXO1::ELK3	RWMAACAGGAAGTN	0.1064	1.7e-11	8.2e-15	11	75	861	0.0581	-5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1109.1	NEUROD1	NRACAGATGGYNN	0.0756	1.4e-10	6.8e-14	6	69	1415	0.0332	-2.5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0668.2	Neurod2	NNGRACAGATGGYNN	0.0702	1.3e-9	6.1e-13	4	53	1397	0.0308	-1.5	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1968.1	TFCP2	AAACCGGTTY	0.0721	9.7e-9	4.7e-12	1	21	849	0.0247	0	195	5.00e+0	-
jolma2013	TFCP2_full_1		AAACCGGTTY	0.0713	1.6e-8	7.8e-12	1	21	872	0.0241	0	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1982.1	ZNF574	VGSCTAGAGMGGCCS	0.1253	2.4e-8	1.2e-11	6	44	726	0.0399	-2.5	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0765.3	ETV5	ACCGGAAGTR	0.1508	6.6e-8	3.1e-11	15	45	325	0.1046	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0098.3	ETS1	ACCGGAARTR	0.1030	1.4e-7	6.8e-11	15	88	977	0.0595	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1970.1	TRPS1	NHTCTTATCTNH	0.0759	1.6e-7	7.6e-11	9	92	1713	0.0298	-4	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0076.2	ELK4	BCRCTTCCGGB	0.1092	1.9e-7	9.3e-11	18	87	806	0.0571	-7.5	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1638.1	HAND2	NVCAGATGNN	0.0956	2.3e-7	1.1e-10	5	40	790	0.0418	-2	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0028.2	ELK1	ACCGGAAGTR	0.1479	2.7e-7	1.3e-10	15	45	338	0.1006	-7	195	5.00e+0	-
uniprobe mouse	UP00085_1	Sfpi1_primary	NDAWGVGGAAGTDN	0.0841	4.1e-7	2.0e-10	15	132	1741	0.0370	-6	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0760.1	ERF	ACCGGAAGTR	0.1051	8.2e-7	4.0e-10	15	80	875	0.0629	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0156.3	FEV	VACCGGAAGTVV	0.1398	1.6e-6	7.6e-10	15	45	354	0.1017	-7	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1950.1	FLI1::FOXI1	RTAAACAGGAARYN	0.0828	3.9e-6	1.9e-9	11	86	1346	0.0409	-5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0056.2	MZF1	NNAATCCCCANNN	0.0692	4.6e-6	2.2e-9	2	34	1590	0.0151	0.5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0598.3	EHF	NNCACTTCCTGTTNN	0.0893	6.6e-6	3.2e-9	16	131	1690	0.0337	-6.5	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1655.1	ZNF341	NRGAACAGCCNN	0.0933	7.2e-6	3.4e-9	23	164	1607	0.0423	-10	194	5.00e+0	-
jolma2013	GATA3_full		WGATAASV	0.0763	8.6e-6	4.2e-9	9	92	1864	0.0276	-4	196	5.00e+0	-
uniprobe mouse	UP00013_1	Gabpa_primary	MNWWACCGGAAGTDNNN	0.1083	8.7e-6	4.2e-9	14	63	674	0.0668	-6.5	191	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1944.1	ETV5::DRGX	RSMGGAAGYAATTA	0.1024	1.2e-5	5.6e-9	19	106	1087	0.0451	-9	193	5.00e+0	-
jolma2013	ELK1_DBD_2		ACCGGAAGTR	0.1293	1.7e-5	8.3e-9	15	47	410	0.0878	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0111.1	Spz1	AGGGTWWCAGC	0.0878	2.3e-5	1.1e-8	16	117	1486	0.0330	-7.5	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1713.1	ZNF610	SSCGCCGCTCCSSS	0.0593	1.1e-4	5.1e-8	43	123	624	0.0449	-12	193	5.00e+0	-
jolma2013	SPIB_DBD		RAAAAGMGGAAGTD	0.0958	1.2e-4	5.6e-8	13	57	668	0.0449	-5	193	5.00e+0	-
uniprobe mouse	UP00232_1	Dobox4_3956.2	HWAWTAGATACCCYWTD	0.0757	1.3e-4	6.2e-8	8	55	998	0.0200	-1.5	191	5.00e+0	-
jolma2013	HSFY2_DBD_3		TTCGAAHSRTTCGAA	0.0783	2.9e-4	1.4e-7	4	29	702	0.0128	-0.5	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1104.2	GATA6	HWWTCTTATCTNH	0.0719	1.1e-3	5.3e-7	10	84	1613	0.0242	-4.5	193	5.00e+0	-
jolma2013	GABPA_full		ACCGGAAGTR	0.1009	1.4e-3	6.5e-7	15	61	694	0.0533	-7	195	5.00e+0	-
jolma2013	ETV3_DBD		ACCGGAAGTR	0.0837	2.5e-3	1.2e-6	15	92	1248	0.0377	-7	195	5.00e+0	-
uniprobe mouse	UP00032_1	Gata3_primary	BDWDDAKAGATAAGARWTDARD	0.0802	2.6e-3	1.2e-6	9	57	1023	0.0362	-4	189	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0092.1	Hand1::Tcf3	BRTCTGGMWT	0.0659	2.6e-3	1.2e-6	1	22	1848	0.0119	0	195	5.00e+0	-
uniprobe mouse	UP00190_1	Nkx2-3_3435.1	YYTTAAGTACTTAAHR	0.0607	2.8e-3	1.3e-6	1	16	939	0.0181	17	192	5.00e+0	-
jolma2013	ERG_full_1		ACCGGAARTR	0.0920	3.0e-3	1.4e-6	15	73	908	0.0474	-7	195	5.00e+0	-
jolma2013	ETS1_full_1		ACCGGAARYV	0.0883	4.5e-3	2.2e-6	15	84	1116	0.0417	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0869.2	Sox11	NRGAACAAAGVV	0.0862	6.9e-3	3.3e-6	21	153	1772	0.0124	6	194	5.00e+0	-
uniprobe mouse	UP00153_1	Pitx1_2312.1	HKRRRGGGATTAAMDAN	0.0749	7.4e-3	3.5e-6	4	31	908	0.0220	-1.5	191	5.00e+0	-
uniprobe mouse	UP00089_2	Tcf1_secondary	NBRYCBGGATTADD	0.0686	7.7e-3	3.7e-6	3	36	1569	0.0128	1	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1953.1	FOXO1::ELF1	RTMAACAGGAAGTN	0.0783	1.1e-2	5.2e-6	11	79	1418	0.0303	-5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1936.1	ERF::FOXO1	RTMAACAGGAARBS	0.0821	1.8e-2	8.4e-6	13	84	1323	0.0166	-5	193	5.00e+0	-
uniprobe mouse	UP00088_1	Plagl1_primary	BNGGGGGSSCCCCNVN	0.0644	2.7e-2	1.3e-5	3	23	761	0.0131	1	192	5.00e+0	-
uniprobe mouse	UP00100_1	Gata6_primary	WHNVDWGATAAGADTHN	0.0713	3.3e-2	1.6e-5	8	55	1192	0.0268	-3.5	191	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0763.1	ETV3	ACCGGAAGTR	0.0812	5.8e-2	2.8e-5	15	84	1194	0.0352	-7	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0475.2	FLI1	ACCGGAARTR	0.0966	5.9e-2	2.8e-5	15	56	683	0.0556	-7	195	5.00e+0	-
jolma2013	FLI1_full_1		ACCGGAARTR	0.0949	6.6e-2	3.2e-5	15	59	738	0.0556	-7	195	5.00e+0	-
uniprobe mouse	UP00109_1	Obox6_3440.2	ANAADCGGATTAWHG	0.0737	7.7e-2	3.7e-5	4	25	706	0.0127	1.5	192	5.00e+0	-
uniprobe mouse	UP00021_2	Zfp281_secondary	DKKWDACCCCCAWTNDN	0.0544	1.2e-1	5.6e-5	4	32	1085	0.0175	40.5	191	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1524.2	Msgn1	VRRRACAAATGGTNNN	0.0768	1.3e-1	6.1e-5	11	55	911	0.0187	-2	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1116.1	RBPJ	BVTGGGAANN	0.0763	1.5e-1	7.3e-5	13	85	1430	0.0420	-6	195	5.00e+0	-
jolma2013	RHOXF1_full_1		GGMTWATCC	0.0831	1.6e-1	7.9e-5	20	125	1538	0.0224	-9.5	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1652.1	ZKSCAN5	NNRGGARGTGAGRR	0.0766	1.7e-1	8.3e-5	103	436	1331	0.0180	-9	193	5.00e+0	-
uniprobe mouse	UP00074_2	Isgf3g_secondary	GVAAAACABDACYD	0.0679	1.8e-1	8.5e-5	3	38	1958	0.0097	1	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0035.4	GATA1	NHCTAATCTDH	0.0725	2.0e-1	9.7e-5	8	68	1722	0.0151	-3.5	194	5.00e+0	-
jolma2013	Spic_DBD		AAAAAGMGGAAGTA	0.0930	2.0e-1	9.7e-5	13	37	441	0.0431	-5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0482.2	GATA4	NNCCTTATCTNH	0.0693	2.2e-1	1.0e-4	9	74	1717	0.0221	-4	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1991.1	Hnf1A	NBCCTTTGATSTBN	0.0777	2.2e-1	1.0e-4	13	100	1764	0.0125	-5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1483.2	ELF2	AAMCCGGAAGTR	0.0901	2.4e-1	1.1e-4	15	66	882	0.0346	-6	194	5.00e+0	-
jolma2013	Otx1_DBD_1		NHTAATCCGATTADN	0.0844	4.0e-1	1.9e-4	2	11	312	0.0256	0.5	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0036.3	GATA2	NBCTTATCTNH	0.0753	5.5e-1	2.6e-4	8	59	1461	0.0212	-3.5	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1933.1	ELK1::SREBF2	DCCGGAAGTSRCGTGA	0.1339	6.9e-1	3.3e-4	23	33	224	0.0670	-10	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0694.1	ZBTB7B	RCGACCACCGAA	0.0521	1.0	5.0e-4	1	10	557	0.0180	0	194	5.00e+0	-
streme	5-GGTTAGTTCATAWT	STREME-5	GGTTAGTTCATAWT	0.0417	1.2	5.9e-4	91	35	72	0.2639	-34	193	5.00e+0	-
jolma2013	RHOXF1_full_2		HTRATCCM	0.0658	1.3	6.2e-4	7	64	1919	0.0083	-2	196	5.00e+0	-
jolma2013	OTX2_DBD_1		DHTAATCCGATTADN	0.0824	1.4	6.8e-4	2	11	356	0.0253	0.5	192	5.00e+0	-
uniprobe mouse	UP00086_1	Irf3_primary	NARAAHSGAAACYR	0.0712	1.6	7.6e-4	13	95	1721	0.0134	1	193	5.00e+0	-
uniprobe mouse	UP00013_2	Gabpa_secondary	CYNKYWTCCSMYBNVN	0.0783	1.6	7.6e-4	13	101	1867	0.0230	-6	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1117.1	RELB	RDATTCCCCNN	0.0615	2.0	9.7e-4	4	36	1508	0.0106	-155.5	194	5.00e+0	-
jolma2013	ETV2_DBD		AACCGGAAATR	0.0822	2.4	1.1e-3	14	45	608	0.0428	-6.5	194	5.00e+0	-
jolma2013	ZBTB7B_full		RCGACCACCGAA	0.0473	2.4	1.2e-3	1	10	613	0.0163	0	194	5.00e+0	-
uniprobe mouse	UP00003_2	E2F3_secondary	NNYWYGGCGCCAMDVBN	0.0894	2.9	1.4e-3	16	56	716	0.0307	7.5	191	5.00e+0	-
jolma2013	ELF1_full		AACCCGGAAGTR	0.1157	3.1	1.5e-3	21	39	337	0.0475	-6	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0139.1	CTCF	YGRCCASYAGRKGGCRSYR	0.0773	3.1	1.5e-3	120	247	621	0.0209	-9.5	190	5.00e+0	-
jolma2013	HSFY2_DBD_1		TTCGAAHVRTTCGAA	0.0760	3.3	1.6e-3	4	23	763	0.0105	-0.5	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0508.3	PRDM1	YNCTTTCTCTH	0.0700	3.3	1.6e-3	62	344	1737	0.0135	-2.5	194	5.00e+0	-
jolma2013	GRHL1_full		AAAACCGGTTTD	0.0759	3.4	1.6e-3	1	9	503	0.0179	0	194	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1123.2	TWIST1	NNDCCAGATGTBN	0.0519	3.9	1.9e-3	4	37	1623	0.0191	-1.5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0149.1	EWSR1-FLI1	GGAAGGAAGGAAGGAAGG	0.0607	4.9	2.4e-3	79	31	70	0.1133	-14	191	5.00e+0	-
jolma2013	NKX3-1_full		VCCACTTAA	0.0653	5.1	2.5e-3	6	51	1707	0.0126	-1.5	195	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0122.3	Nkx3-2	WWAAMCACTTAAN	0.0564	5.8	2.8e-3	2	21	1365	0.0147	79.5	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1723.1	PRDM9	RGDGGGVAGGGRGGVRRMARVARR	0.0674	6.0	2.9e-3	123	414	1069	0.0187	-5	188	5.00e+0	-
uniprobe mouse	UP00086_2	Irf3_secondary	RKAGAAWGGDSCDN	0.0727	6.3	3.0e-3	11	87	1898	0.0111	-5	193	5.00e+0	-
uniprobe mouse	UP00007_2	Egr1_secondary	NKNBGAGTGGGAYWNN	0.0561	7.3	3.5e-3	3	31	1692	0.0089	-1	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1999.1	Prdm5	CBGTTCTCCATCTNN	0.0622	8.2	4.0e-3	2	24	1725	0.0122	0.5	192	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA0768.2	Lef1	NNBCCTTTGATSTN	0.0775	8.5	4.1e-3	23	151	1801	0.0117	48	193	5.00e+0	-
JASPAR2022 CORE vertebrates non-redundant v2	MA1949.1	FLI1::DRGX	ACCGGAAGTAATTAT	0.0917	8.6	4.1e-3	20	71	818	0.0330	-9.5	192	5.00e+0	-
meme	CCTBCCYCYSYCYCY	MEME-2	CCTBCCYCYSYCYCY	0.0765	8.9	4.3e-3	112	518	1529	0.0170	-3.5	192	5.00e+0	-
uniprobe mouse	UP00389_1	Nkx3-1_2923.2	DWHDAAGTACTTAAAWN	0.0665	9.6	4.6e-3	2	18	1082	0.0194	16.5	191	5.00e+0	-

Input Files

Alphabet

Background source: the file 'GSM4160260-co-BTZ--ETO_WO_2h_meme-chip/background'

Name	Bg.				Bg.	Name
Adenine	0.2922	A	~	T	0.2922	Thymine
Cytosine	0.2078	C	~	G	0.2078	Guanine

Sequences

Database	Source	Sequence Count
GSM4160260-co-BTZ--ETO WO 2h.mm10plusrDNA.summits 200	GSM4160260-co-BTZ--ETO_WO_2h_meme-chip/GSM4160260-co-BTZ--ETO_WO_2h.mm10plusrDNA.summits_200.fa	2000

Motifs

Database	Source	Motif Count
meme	GSM4160260-co-BTZ--ETO_WO_2h_meme-chip/meme_out/meme.xml	3
streme	GSM4160260-co-BTZ--ETO_WO_2h_meme-chip/streme_out/streme.xml	8
uniprobe mouse	/sibcb1/wuweilab1/liangyu/meme/motif_databases/MOUSE/uniprobe_mouse.meme	386
JASPAR2022 CORE vertebrates non-redundant v2	/sibcb1/wuweilab1/liangyu/meme/motif_databases/JASPAR/JASPAR2022_CORE_vertebrates_non-redundant_v2.meme	841
jolma2013	/sibcb1/wuweilab1/liangyu/meme/motif_databases/EUKARYOTE/jolma2013.meme	843

Other Settings

Objective Function	central region enrichment (CE)
Convert Motifs to Different Alphabet?	No
Motif Pseudo-Counts	0.1
Required sequence length	400
Site Scoring Method	log-odds scores
Score Threshold	5 (bits)
Optimize Score Threshold?	No
Minimum Region Size	0
Maximum Region Size	0
Strand Handling	scan both strands if alphabet is complementable
Plotting of Matches on Negative Strand	same as for positive strand
Sequence IDs Included in Output?	Yes

CentriMo

Local Motif Enrichment Analysis

Your browser does not support canvas!

Results

Motif Probability Graph (motif score ≥ 5)

Options

Plotting

Unused Colors

Graph

Enriched motifs (E-value ≤ 10 using the binomial test )

Matching sequences (out of 2000)

Union: 122 sequences (6%).

Intersection: 122 sequences (6%).

Filter & Sort

Filters

Sort

Columns to display

Input Files

Alphabet

Sequences

Negative sequences

Motifs

Other Settings

CentriMo version

Reference

Command line summary