|
Standardization of information inputTo input the information into the database, experts in biology annotate publications on the relevant experimental data.
To provide the standardization of information in TRRD, the program TRRD-INPUT was developed [Ananko E.A. et al., 1998]. This program produces an entry corresponding to an individual gene in a flat file form. It enables both editing and creating of new entries. Individual lines of an entry are checked for their compliance with the vocabularies supported within the TRRD database. Totally, TRRD contains 22 vocabularies comprising about 10829 words: six vocabularies are stable, the rest are being expanded. The contents of the controlled vocabularies used by the program TRRD-INPUT for the standartization of data input in the TRRD is shown in the table.
At present the vocabularies describing organs, tissues, and cells where the genes described in TRRD are expressed were essentially developed. Now these vocabularies are united and organized hierarchically. The highest level of vocabulary structure corresponds to organs and parts of organs. Organs are constituted out of tissues, which in turn contain different types of cells. This hierarchy may be used for modification of the queries (generalization or specification) and for realization of associated search in TRRD.
|