PLACE-FAQs (1 January 2006)

Q1:
Updated frequently? How often do you update the PLACE database content?

A1:
We try to update the database every three or four months. But we cannot promise it. Sometimes we cannot find any new cis-element motif to add after surveying papers published during two- to three-month period. In such case, we await one or two more months.

Q2:
Current best knowledge? Can I be confident that I am always looking at the current best knowledge about plant cis-acting regulatory DNA element?

A2:
As far as I know, not only we, but also any other cis-element databases similar to ours, can NOT guarantee that. We survey many journals by ourselves, and read many papers. You need to realize that, since human being, not the computer, survey publications and read them, there would be some errors, and some motifs may be inadvertently missed. No authors have ever deposited the motif sequence to us prior to or after the publication of their papers. Many journal publishers require payment for downloading pdf files, thus it is getting more difficult for us to obtain a copy of important papers soon after publication.

>Q3:
Similar on-line resources?
Are there any similar on-line resources like PLACE for me to search such cis-acting motif (not necessarily limited to plant)? I always like to compare results from different web servers.

A3:
Yes, there are many. TRANSFAC is the oldest of its kind, and includes animal, yeast, plant motifs and transcription factors data. Some web sites providing the service similar to ours have their own database, at least they claims so, and others use PLACE and TRANSFAC database. You can obtain information by Internet search.

>Q4:
Analysis of a large set of promoter sequences?
I am interested in identifying the number of occurrences of certain cis-elements in large sets of promoter sequences. I would like to compare the occurrence of the cis-element in for example pathogen induced set of promoters vs. the entire rice genome.

A4:
In order to compare the occurrence of some cis-elements between two populations of promoters, and to make sure that the difference or preference observed are really significant, you will need to carry out a number of statistical analyses of the results of PLACE analyses.
Let me refer you to the following interesting publication.
"Interdependency of brasssinosteroid and auxin signaling in Arabidopsis" by Nemhauser, Mockler, Chory, which appeared in PLoS Biology September 2004/Volume 2/ Issue 9/e258. This can be found at www.plosbiology.org.

>Q5:
Run this query (large sets of promoter sequences) at PLACE site? Would it be possible to run this query (large sets of promoter sequences) on PLACE if I provided the promoter set of interest from rice?

A5:
We can not provide that kind of service now, mainly due to the lack of manpower.

>Q6:
A bug in the programming setting? I found that I got exactly the same results for ANY group of promoters in terms of two motifs MYCATERD1 and MYCATRD22. From the PLACE documentations, these two motifs are different, one is CATGTG (MYCATERD1) and the other is CACATG (MYCATRD22). I believe that this is probably due to a bug in the programming setting, i.e., the search patterns for these two motifs were set as the same in the PLACE scan program. Could you please check this?

A6:
These two entries are:
MYCATERD1 (S000413) CATGTG
MYCATRD22 (S000174) CACATG
SignalScan program search the query promoter sequence for the presence of motif sequences in the SENSE and ANTISENSE directions.
If CACATG (MYCATRD22) is found in the sense strand of the query, the result, in the case of table format, is shown as, e.g., 219 (+) CACATG. If CATGTG (MYCATERD1) is found in the antisense strand of the query, it is shown as, e.g., 219 (-) CATGTG..So, two "different" motifs are shown at the same site. But, if you look carefully at the motif sequences, you will notice that these two motifs are "complementary" to each other.
MYCATRD22 (S000174) 5'-CACATG-3'
Antisense of this sequence is 3'-GTGTAC-5'
If you read the bottom sequence form the tail, it is 5'-CATGTG-3'. It is the same sequence as MYCATERD1 (S000413) CATGTG. These entries are based on the original reports as they appeared in Journals. MYCATRD22 and MYCATERD1, both, have been reported by the same research group in two different genes (RD22 and ERD1) as the MYC-like transcription factor binding sites, as I recollect, from Arabidopsis thaliana (AT). If we are sure that these two motifs are exactly identical in function, then we will combine these two entries.

>Q7:
What is the difference? I was using your PLACE site. I don't see what the difference between your place site and TRES Transcription Regulatory Element Search a tool for Comparative Promoter Analysis)
What is the difference in output of these two programs? What makes the one program more selective, and what would be the advantages (or disadvantages) of such selectivity?

A7:
Please compare these two systems by yourself.

>Q8:
Security? I could not find any disclaimer about the security in regards to sequence queries sent; specifically, if the sequence is stored and if it's accessible by others. I would appreciate any information you could share with me regarding this topic.

A8:
The sequences are hidden from others. We are not collecting query sequences. We have two types of signal scan query to our PLACE database.
(1)
http://www.dna.affrc.go.jp/htdocs/PLACE/signalscan.html
In this type of service, each query sequences are purged each time when the result is sent.
(2) http://www.dna.affrc.go.jp/htdocs/PLACE/signalup.html
In this type of service, the query sequences are kept one week for future result browsing by the user (submitter). The results are guarded from others view by generating random ID. After one week passed, all the results are purged.
We are collecting information of user's top level and second level domain name for our statistics.
http://www.dna.affrc.go.jp/htdocs/stats/place_ss.cnt.html
There are possibilities that some one is watching query sequences by using network sniffing tools. The tools are used to rob a credit card ID in non secure online shopping web site. This can be prevented by using secure layer services. But we are not providing secure layer services now. This secure level is as same as submission in web keyword search in Google, Yahoo etc.

>Q9:
Transcription factors database? We are looking for a database that lists transcription factors in plants and the genes that they regulate. Does you database include this information in downloadable form? If not, do you know of a way to obtain such data?

A9:
Our database, PLACE, contains information about TF in DE section of the cis-element entry. But we do not have a separate downloadable form for TFs. If you would like to obtain such data, try RegSite Database of Plant Regulatory Elements, for example.
http://www.softberry.com/berry.phtml?topic=regsite/

>Q10:
Date of update and number of entries? Could you please tell me when the site was last updated and how many entries are present in the database now?

A10:
You will be able to find the answers to the most of your questions at
http://www.dna.affrc.go.jp/htdocs/PLACE/info.html
For example, the release note shows that:
===PLACE 13.0, 380 entries, Jun.03, 2003===
Fifteen new motifs, all published in 2003, were added.
"Update (almost) every three months" is our plan, but sometimes we miss the update due to circumstance beyond our control.

>Q11:
Only 100% homology detected? As I understand it, PLACE only recognizes sequences with 100% homology to a consensus sequence. Is this correct?

A11:
We compile the motifs or the consensus motifs reported in the papers. Thus, some motifs are registered in the PLACE database as ACGTGC, or GTGCAC, etc. Some papers report the consensus sequence of the motifs, e.g., ACWTGC, etc., where W=A/T. Some consensus sequence look like CWWWWWWWG, or ANNNTGC. In the latter case, Signal Scan program allow any nucleotides between A and T. So yes, the Signal Scan result report shows only 100% identity, but that allows some flexibility if the registered sequences contains W or Y or R or K, etc.

>Q12:
Matrix created? Has a matrix been created to search for sequences with degrees of homology to consensus sequences, as in databases such as TransFac?

A12:
We do not create that. I do not think we have sufficient number of motifs to create reliable matrix for plant cis-elements. Our policy* is to collect motifs which have experimental evidence. But since the genome sequences of Arabidopsis and rice, and also microarray data became available, in silico search of cis-regulatory DNA elements become feasible, and we will soon get a flood of such motifs. Then, we may be able to create matrix. (*Note that we have changed this policy in late 2005. See the release notes.)

>Q13:
How to download?
How can I download promoter sequences from PLACE? I can't find a download link. I am interested in all your plant promoter sequences for my research.

A13:
For downloading the data, please look at the following page.
http://www.dna.affrc.go.jp/htdocs/PLACE/info.html

>Q14:
Lot of similar motifs? I have found that in your data base there are lot of similar motifs with the same functionality (i.e., there are many elements that belongs to MYB). Is there any method or rule of thumb to group this functionality together? In this way it will reduce the features section from the data point of view. Can you advice me on this issue?

A14:
Yes, I noticed that there are so many similar or identical motifs in this database. In some entries, probably I can simply summing up these similar motifs and replace them by "super-consensus motifs," or something like that. Many MYB-related motifs may be the case.
In other cases, the motifs reported in several publications are slightly different so that it is difficult for us to choose which motif should be entered in to the database. But, can we exclude a possibility that the slightly different motif sequence might be due to species-specific binding site? So we entered all these motifs in to the database as they were reported.
In other cases, there is evolution (!) of the motif sequences in publications. Usually, longer sequences were reported in earlier publications, and we found shorter sequences in later publications.
If you have any suggestion, please let me know.

>Q15:
Is my motif previously identified? I have identified some putative promoter motifs (6-10 bases in length) in my own data and now want to check whether these motifs have been previously identified and/or whether they form part of known promoters/transcription factors. Is there any way I can do this in the PLACE (or other) database??

A15:
Please try Place Keyword Search. The "Keyword" windows actually accept any word, i.e., word/author name/nucleotide sequence. Use the same symbols for nucleotide sequence as in the PLACEdb entries.
The symbols used in addition to A, G, C, or T in PLACE Database are:
B: C, G or T
D: A, G or T
H: A, C or T
K: G or T
M: A or C
N: A, C, G or T
R: A or G
S: C or G
V: A, C or G
W: A or T
Y: C or T
If your putative motif has been previously registered to the PLACEdb, or a part of the previously registered motif, you will get a hit(s). But the sequence must be exactly same in order to be detected by this procedure. So, you might also better try PLACE Homology Search.
http://www.dna.affrc.go.jp/htdocs/PLACE/fasta.html
This may show you if there is any similar sequence in the PLACEdb. But I guess the result report may look quite messy. You might need to read it carefully.

>Q16:
I really like your signalscan program. Is it possible to download it and run it as a command-line on my local machine?

A16:
We are sorry that we do not have the source code of the program. The PLACE web signalscan is provided by Ms Meena K Sakharkar at BioInformatics Centre, National University of Singapore. We add file upload interface to it. The source code of signalscan was developped by Dr. Dan Prestridge who unfortunately expired in 2000. Information on signalscan was maintained at the Advanced Biosciences Computing Center, University of Minnesota. You can find it in the Web Archive:
http://web.archive.org/web/20030416012916/ http://biosci.cbs.umn.edu/software/sigscan.html