SlideShare a Scribd company logo
1 of 21
Submitted by: SOMBIR SINGH
Research Scholar
Center for Biotechnology
MD University Rohtak Haryana
sombirkumar9195@gmail.com
BIOLOGICAL DATABASE
 A collection of data that is structured, searchable,
updated periodically and cross-referenced.
 Store biological data in electronic form
 Purpose-
systemization of database
availability of biological data
analysis of computed biological data
FEATURES OF BIOLOGICAL
DATABASE
 Heterogeneity
 High volume data
 Uncertainity
 Data curation
 Data integration
 Data sharing
 Dynamics
1. Data heterogeneity
Availabilty of diverse and complex data types.
Data types:
 sequence- nucleotide, protein
 graph- data indicating relationship among
themselves can be captured as graph. It include
pathway data, genetic maps and structural texonomy.
 high dimensional data- data generated from micro-
array experiments that involve thousands of genes and
hundreds of experimental conditions.
 shapes- consists of 3D molecular structural data.
 temporal data- for studing dynamics of any
biological system. Example- development biology.
 Patterns- there are patterns lying within the genome
that characterize biologically entities. example-
regulatory sequence
 Scalar and vector fields
 Extracted features data- numerical data obtained
from combination of one of the above mentioned data
types.
2. High volume data- in addition to being highly
heterogeneous, biological data are voluminous to
support comprehensive investigations in various field
and directions.
3. Uncertainity- biological data have great deal of
uncertainity as they represent biological phenomena
that are observed and assumed.
4. Data curation- biological data collected from various
sources across different and functional boundaries.
 There are always chances of missing links.
 To fill these, the data is analyzed and curated via
automated methods.
5. Data integration- after years of research, across
different structural and functional scales, data is
collected from laboratories through a database and
made available for use.
6. Data sharing- biological data is share via databases.
 Purpose:
for scientific community’s inspection
for cross verification
to prevent repetition and validation of data
 Dynamics-
new data is generated every day in laboratories.
sometimes this new data contradicts with the old data
so, it is necessary to develop new organizational
database schemes to incorporate new data.
Classification of biological
databases
 Data type
 Maintainer status
 data access
 Data source
 Database design
 Organism
1. Data type
 Sequence database
a. nucleotide database: GenBank, EMBL BANK
b. Protein database: Swiss-Prot, PIR
 Structure database: PDB, NDB, DALI, MSD
 Microarray database: ArrayExpress
 Chemical database: PubChem
 Pathway database: KEGG, BioSilico
 Enzyme database: REBASE, ExPASy
 Disease database: OMIM
 Literature database: PubMed
2. Maintainer status
 NCBI, EMBL
 Academic group or scientist
 Commercial comany
3. Data access
 Publicly available
 Available with copyright
 Browsing only, accessible but not downloadable
 Academic but not freely available
 Restricted
4. DATA SOURCE
a) primary database
original data submission by researcher occurs.
Examples:
Nucleotide - genBank, EMBL, DDBJ
Protein - UniProt
Structure – PDB
Literature - PubMed
b) Secondary database
- results of analysis of primary databases.
- either manually curated or by automated methods
examples: Prosite, Pfam, RefSeq
5. Database design
 Flat files
 relational datase (SQL)
 Object oriented database
 Exchange/ publication technologies (FTP, HTML,
SOAP, COBRA, XML)
6. Organism
 Bacteria
 Virus
 Human
Biological Database

More Related Content

What's hot (20)

Scop database
Scop databaseScop database
Scop database
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Introduction to Biological databases
Introduction to Biological databasesIntroduction to Biological databases
Introduction to Biological databases
 
EMBL-EBI
EMBL-EBIEMBL-EBI
EMBL-EBI
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Ddbj
DdbjDdbj
Ddbj
 
Protein databases
Protein databasesProtein databases
Protein databases
 
UniProt
UniProtUniProt
UniProt
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological database
 
Data retrieval
Data retrievalData retrieval
Data retrieval
 
Databases pathways of genomics and proteomics
Databases pathways of genomics and proteomics Databases pathways of genomics and proteomics
Databases pathways of genomics and proteomics
 
EMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology LaboratoryEMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology Laboratory
 
History and scope in bioinformatics
History and scope in bioinformaticsHistory and scope in bioinformatics
History and scope in bioinformatics
 
DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)
 
String.pptx
String.pptxString.pptx
String.pptx
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES
 
Kegg
KeggKegg
Kegg
 

Similar to Biological Database

Features of biological databases
Features of biological databasesFeatures of biological databases
Features of biological databasesCharu Sharma
 
Biological data bioinformatics
Biological data bioinformatics Biological data bioinformatics
Biological data bioinformatics AakifahAmreen
 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBioinformaticsCentre
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
 
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdfBIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdfPravanjanDash
 
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptxCOMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptxPravanjanDash
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptxscience lover
 
Bioinformatics مي.pdf
Bioinformatics  مي.pdfBioinformatics  مي.pdf
Bioinformatics مي.pdfnedalalazzwy
 
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MININGANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MININGijbbjournal
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databasesSangeeta Das
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesUniversity of Malaya
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein databasechinmayeec
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Amit Sheth
 
Protein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyProtein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyZac Darcy
 
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYPROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYZac Darcy
 

Similar to Biological Database (20)

Features of biological databases
Features of biological databasesFeatures of biological databases
Features of biological databases
 
Biological data bioinformatics
Biological data bioinformatics Biological data bioinformatics
Biological data bioinformatics
 
Biological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdfBiological Database (1)pptxpdfpdfpdf.pdf
Biological Database (1)pptxpdfpdfpdf.pdf
 
Biological data base
Biological data baseBiological data base
Biological data base
 
Bioinformatics principles and applications
Bioinformatics principles and applicationsBioinformatics principles and applications
Bioinformatics principles and applications
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdfBIOINFORMATICS  AND  DATABASES IN BIOINFORMATICS.pdf
BIOINFORMATICS AND DATABASES IN BIOINFORMATICS.pdf
 
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptxCOMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
COMPUNATIONAL BIOLOGY AND DATABASES IN BIOINFORMATICS.pptx
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptx
 
Biological database
Biological databaseBiological database
Biological database
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Bioinformatics مي.pdf
Bioinformatics  مي.pdfBioinformatics  مي.pdf
Bioinformatics مي.pdf
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MININGANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databases
 
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future PerspectivesBioinformatics databases: Current Trends and Future Perspectives
Bioinformatics databases: Current Trends and Future Perspectives
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein database
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
 
Protein sequence classification in data mining– a study
Protein sequence classification in data mining– a studyProtein sequence classification in data mining– a study
Protein sequence classification in data mining– a study
 
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDYPROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
PROTEIN SEQUENCE CLASSIFICATION IN DATA MINING– A STUDY
 

Recently uploaded

ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 

Recently uploaded (20)

ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 

Biological Database

  • 1. Submitted by: SOMBIR SINGH Research Scholar Center for Biotechnology MD University Rohtak Haryana sombirkumar9195@gmail.com
  • 2. BIOLOGICAL DATABASE  A collection of data that is structured, searchable, updated periodically and cross-referenced.  Store biological data in electronic form  Purpose- systemization of database availability of biological data analysis of computed biological data
  • 3. FEATURES OF BIOLOGICAL DATABASE  Heterogeneity  High volume data  Uncertainity  Data curation  Data integration  Data sharing  Dynamics
  • 4. 1. Data heterogeneity Availabilty of diverse and complex data types. Data types:  sequence- nucleotide, protein  graph- data indicating relationship among themselves can be captured as graph. It include pathway data, genetic maps and structural texonomy.
  • 5.  high dimensional data- data generated from micro- array experiments that involve thousands of genes and hundreds of experimental conditions.  shapes- consists of 3D molecular structural data.  temporal data- for studing dynamics of any biological system. Example- development biology.
  • 6.  Patterns- there are patterns lying within the genome that characterize biologically entities. example- regulatory sequence  Scalar and vector fields  Extracted features data- numerical data obtained from combination of one of the above mentioned data types.
  • 7. 2. High volume data- in addition to being highly heterogeneous, biological data are voluminous to support comprehensive investigations in various field and directions. 3. Uncertainity- biological data have great deal of uncertainity as they represent biological phenomena that are observed and assumed.
  • 8. 4. Data curation- biological data collected from various sources across different and functional boundaries.  There are always chances of missing links.  To fill these, the data is analyzed and curated via automated methods.
  • 9. 5. Data integration- after years of research, across different structural and functional scales, data is collected from laboratories through a database and made available for use.
  • 10. 6. Data sharing- biological data is share via databases.  Purpose: for scientific community’s inspection for cross verification to prevent repetition and validation of data
  • 11.  Dynamics- new data is generated every day in laboratories. sometimes this new data contradicts with the old data so, it is necessary to develop new organizational database schemes to incorporate new data.
  • 12.
  • 13. Classification of biological databases  Data type  Maintainer status  data access  Data source  Database design  Organism
  • 14. 1. Data type  Sequence database a. nucleotide database: GenBank, EMBL BANK b. Protein database: Swiss-Prot, PIR  Structure database: PDB, NDB, DALI, MSD  Microarray database: ArrayExpress  Chemical database: PubChem  Pathway database: KEGG, BioSilico  Enzyme database: REBASE, ExPASy  Disease database: OMIM  Literature database: PubMed
  • 15.
  • 16. 2. Maintainer status  NCBI, EMBL  Academic group or scientist  Commercial comany
  • 17. 3. Data access  Publicly available  Available with copyright  Browsing only, accessible but not downloadable  Academic but not freely available  Restricted
  • 18. 4. DATA SOURCE a) primary database original data submission by researcher occurs. Examples: Nucleotide - genBank, EMBL, DDBJ Protein - UniProt Structure – PDB Literature - PubMed b) Secondary database - results of analysis of primary databases. - either manually curated or by automated methods examples: Prosite, Pfam, RefSeq
  • 19. 5. Database design  Flat files  relational datase (SQL)  Object oriented database  Exchange/ publication technologies (FTP, HTML, SOAP, COBRA, XML)
  • 20. 6. Organism  Bacteria  Virus  Human