A database is a repository of sequences (DNA or amino acid) that provides a centralised and homogeneous view of its contents. The repository is created and modified through a database management system (DBMS). Every data item in the database is structured according to a schema, defined as a set of pre-specified rules written in the data definition language. The contents of the database can be accessed through a graphical user interface (GUI) that allows browsing through the repository, much as one may browse through the books in a library. Most databases also allow their contents to be queried through a specialized query language. The data definition language and the query language together form the data model.
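The roles of the data definition language and the query language can be sketched with a small example. This is only an illustration using Python's built-in sqlite3 module, not the way any particular sequence database is implemented. The table name `sequence_record`, its columns, the `DEMO…` accessions and the sequences are all made up for the example.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical sequence schema."""
    conn = sqlite3.connect(":memory:")
    # Data definition language: the schema is a set of pre-specified rules
    # that every stored record must satisfy.
    conn.execute(
        """
        CREATE TABLE sequence_record (
            accession TEXT PRIMARY KEY,
            molecule  TEXT NOT NULL CHECK (molecule IN ('DNA', 'protein')),
            organism  TEXT NOT NULL,
            residues  TEXT NOT NULL
        )
        """
    )
    # Made-up records, for illustration only.
    conn.executemany(
        "INSERT INTO sequence_record VALUES (?, ?, ?, ?)",
        [
            ("DEMO0001", "DNA", "Escherichia coli", "ATGGCGTTAGC"),
            ("DEMO0002", "protein", "Homo sapiens", "MKTAYIAKQR"),
            ("DEMO0003", "DNA", "Homo sapiens", "ATGCCCGGGTAA"),
        ],
    )
    conn.commit()
    return conn


def accessions_for(conn, molecule, organism):
    """Query language: select the records matching the given criteria."""
    rows = conn.execute(
        "SELECT accession FROM sequence_record "
        "WHERE molecule = ? AND organism = ? ORDER BY accession",
        (molecule, organism),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = build_demo_db()
    print(accessions_for(conn, "DNA", "Homo sapiens"))
```

The CREATE TABLE statement plays the part of the data definition language, and the SELECT statement plays the part of the query language. A record that breaks the schema (for example a molecule type other than DNA or protein) is rejected by the DBMS.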
Classification of databases: Databases are broadly classified into two categories:
(1) sequence databases (which hold the sequences of both proteins and nucleic acids),
(2) structural databases (which hold only protein structures).
In addition, they are also classified into three categories: (a) primary
database, (b) secondary database, (c) composite database.
A primary database contains information on the sequence or structure alone of either a protein or a nucleic acid, e.g. PIR for protein sequences, and GenBank and DDBJ for nucleotide sequences. Primary database tools are effective for identifying sequence similarities, but interpreting their output is sometimes difficult, and they cannot always answer the more sophisticated questions of sequence analysis. In 1998, GenBank contained millions of sequences from more than 18,000 organisms.
A secondary database contains information derived from primary databases, for example conserved sequences, signature sequences and active-site residues of protein families, as found in SCOP, eMOTIF, etc. For this reason it is often more useful than a primary database. Orthology provides an important layer of information when considering phylogenetic relationships between genes. Depending on the type of analysis method used, relationships may be elucidated in considerable detail, including the superfamily, family, sub-family and species-specific sequence levels.
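How a secondary resource might derive signature information from primary sequences can be sketched as follows. This is only a rough illustration, not the method used by SCOP or eMOTIF. The pattern is the well-known Walker A (P-loop) consensus G-x(4)-G-K-[ST] written as a regular expression, while the function name `derive_signature_hits`, the `DEMO_…` accessions and the sequences are invented for the example.

```python
import re

# Walker A (P-loop) consensus, G-x(4)-G-K-[ST], as a regular expression.
WALKER_A = re.compile(r"G.{4}GK[ST]")


def derive_signature_hits(primary_records, pattern=WALKER_A):
    """Return {accession: [1-based start positions]} for matching sequences.

    Matches are non-overlapping, as produced by re.finditer.
    """
    hits = {}
    for accession, sequence in primary_records.items():
        positions = [match.start() + 1 for match in pattern.finditer(sequence)]
        if positions:
            hits[accession] = positions
    return hits


# Made-up "primary" protein records, for illustration only.
primary = {
    "DEMO_P1": "MAAGPSGSGKSTLLR",
    "DEMO_P2": "MKTAYIAKQRQISFV",
    "DEMO_P3": "GAAAAGKTMMGLLLLGKS",
}

if __name__ == "__main__":
    print(derive_signature_hits(primary))
    # → {'DEMO_P1': [4], 'DEMO_P3': [1, 11]}
```

The resulting table of accessions and motif positions is derived information: it does not exist in the primary records themselves, but is computed from them.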
A composite database combines several primary sources, obviating the need to search multiple resources. SCOP, mentioned above, is the Structural Classification of Proteins, in which proteins are classified into hierarchical levels such as classes, folds and superfamilies.
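The composite idea, one merged resource searched in place of several, can be sketched like this. It is a minimal illustration that assumes records are merged on identical sequence, and it is not the procedure of any real composite database. The function name `build_composite`, the source names and all records are hypothetical.

```python
def build_composite(*sources):
    """Merge (source_name, {accession: sequence}) pairs into one collection.

    Identical sequences are stored once, with a list of every
    "source:accession" label under which they were found.
    """
    composite = {}
    for source_name, records in sources:
        for accession, sequence in records.items():
            labels = composite.setdefault(sequence, [])
            labels.append(f"{source_name}:{accession}")
    return composite


# Two made-up primary sources that share one sequence.
source_a = ("SourceA", {"A1": "MKTAYIAK", "A2": "MSTNPKPQ"})
source_b = ("SourceB", {"B7": "MKTAYIAK", "B9": "MAAGPSGS"})

if __name__ == "__main__":
    merged = build_composite(source_a, source_b)
    for sequence, labels in merged.items():
        print(sequence, labels)
```

Four input records become three entries, because the sequence shared by both sources is kept only once. A single search over the merged collection then covers both sources.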