What is database ? / classification of database

 A database is a repository of sequence (DNA or amino acid) which provides a centralised and homogeneous view of its content. The repository is created and modified through a database management system (DBMS). Every data item in the database is structured according to scheme, defined as a set of pre-specified rules through the data definition language. The contents of database can be access through a graphical user interfase (GUI) that allows browsing through the contents of repository very much similar as one may browse through the books in library.

Most database also allow querying of its content through a specialized query language. The data definition language and the query language from the data modal.


Classification of database: The database are broadly classified into two broad categories:

 (1) sequence databases (that involves the sequence of both proteins and nucleic acids),

 (2) structural database (that involves only protein database). 

In addition, it is also classified into three categories (a) primary database, (b) secondary database, (3) composite database.

Primary database contain information of the sequence or structure alone of either protein or nucleic acid e.g. PIR or protein sequence, gene bank and DDBJ for genome sequences. Primary database tools are effective for identifying the sequence similarities, but analytics of output is sometimes difficult and cannot always answer some of the more sophisticated questions of sequence analysis. In 1998, gene bank obtained a millions of sequence from more than 18,000 organisms.

Secondary database contains derived information from primary database, for example, information on conserved sequence, signature sequence and active sites residues for protein families by using SCOP, eMOTUIF, etc. It is more useful than primary databases. Orthology provides an important layer of information when considering phylogenetic relationship between the genes. Depending on the type of analysis method used, relationship may be elucidated into considerable detail including superfamily, family, sub-family and species-specific sequence levels.

 Composite database is obviating the need to search multiple resources. The scope is structural classification of protein in which the proteins are classified into hierarchial levels such as classes, folds, super-families.


What is Gene therapy ?

What is Gene library ?

What is Bioinformatics ?

what is DNA and Chemical nature of DNA ?

What are Cosmids ?

Units of Gene and How it was discovered ?

What are types of Gene therapy ?

DNA replication or Blotting techniques ?

 What is Overlapping Genes or Genes within Genes ?

How infertility in Haman occur ( male and female ) ?



Comments