WWW Browser Requirements: Because the Advanced Query Form uses modern Javascript features for updating the contents of the selector boxes, a modern WWW browser is needed. The form has been tested on Netscape 4.05 and Internet Explorer 4.0. However, it will not work on version 3.x of these browsers.
For example, if the class of small molecules (called Compounds) is selected, each clause is evaluated against every small molecule in the database for the selected dataset. Every class has a different set of slots; each clause refers to one slot, and is selected using a box within the clause. Compounds, for example, contain slots such as Molecular-Weight and Names whose values are a numerical molecular weight and one or more names for the chemical compound, respectively. A clause can be used to compare the values of a slot for each compound against a value supplied by the user.
The main part of the form contains 5 identical sections for formulating clauses of the query. At least one clause should be filled in for each query -- the remaining sections should be left blank.
The selector boxes within a clause are used to build up the statement of the condition to form a readable phrase such as, "The value of slot molecular-weight is < 54". The meaning of those boxes is as follows.
If in any of the slot queries the choice of slot name or operator is not made, or no query value is entered in the text box, that particular slot query is ignored from the final results.
Additionally, a check box allows inclusion of the frame ID (the internal database unique identifier) in the results table. The frame ID is used in URL links to EcoCyc objects.
Finally, at the bottom of the form, clicking on the Submit button will send the query to the server, and will return the results in a table. Each result row corresponds to one frame that satisfied the query. The columns of the table list the values of the slots that were chosen by the user. The first column shows the "common name" of the frame if one is available; the second column optionally shows the frame ID, if this choice was made above.
Additionally, the result page includes the original query.
In dataset [E. coli], in class [Compounds] :
AND
The [value] of slot [NAMES] [is] [superstring of] ["tryp"]
In the early 1999 version of EcoCyc, this query will return 11 compound frames.
We can further narrow down the results by additionally asking for only those compounds that are larger than 200Da. To achieve this, we can query an additional slot that stores the MOLECULAR-WEIGHT. We will need to connect the two slot queries with AND, like this:
In dataset [E. coli], in class [Compounds] :
AND
The [value] of slot [NAMES] [is] [superstring of] ["tryp"]
The [value] of slot [MOLECULAR-WEIGHT] [is] [larger than] ["200"]
This query reduces the number of returned frames to 8.
2. As a more complicated example, let us assume we would like to investigate the usage of the term "synthase" and "synthetase". Do they tend to be used interchangeably, or is there a difference in meaning and usage ? The class of frames we need to query is Enzymatic-Reactions, because this class encodes enzyme names.
First, we can just query for all the frames that contain either or both of these terms, and so our connective would be OR:
In dataset [E. coli], in class [Enzymatic Reactions] :
OR
The [value] of slot [NAMES] [is] [superstring of] ["synthase"]
The [value] of slot [NAMES] [is] [superstring of] ["synthetase"]
This query returns 177 frames. This query, however, will not tell us whether some of these frames use both terms at the same time. If we change the connective to AND, we will be able to see that 42 frames are returned.
To find the frames which use these terms unambiguously, i.e. uniquely, we can change the connective to EXCLUSIVE-OR. This will return 135 frames. Looking at the frames returned in these two sets could tell us something about whether the uses of these terms are very strict or rather loose.
To find all frames that only use one term, but not the other, we can modify the query as follows, using negation of one term:
In dataset [E. coli], in class [Enzymatic Reactions] :
AND
The [value] of slot [NAMES] [is] [superstring of] ["synthase"]
The [value] of slot [NAMES] [is not] [superstring of] ["synthetase"]
We find that 71 frames only use "synthase", and 64 use "synthetase". These two numbers add up the what the EXCLUSIVE-OR query returned, as we expected. It appears that the usage of these two terms is approximately balanced, and used relatively interchangeably.
Return to SGD |