SlideShare a Scribd company logo
1 of 28
Download to read offline
STRUCTURAL
DATABASES
PDB , CSD , CATH
INTRODUCTION:
• Structural databases are the essential tools for all
crystallographic works.
• They are used in the process of producing, solving
,refining and publishing the structure of a new material.
THE COMMON INFORMATION FOUND IN THE
STRUCTURAL DATABASE INCLUDE:
• Bibliographic information- author name, journal reference.
• The chemical compound name, formula and oxidation states
of the element present.
• Number of formula units per unit cell(contents)
• Dimension and symmetry of the unit cell.
• symmetry of the structure.
• Atomic coordinates, occupancies and thermal parameters.
• Any special features of the experiment to collect the
diffraction data.
• The structures in the database have been solved using X-ray,
neutron and electron diffraction techniques on sample,
computational modelling or by using NMR.
PDB:(PROTEIN DATABASES)
• Protein database contains the information about 3D structures of
the proteins.
• The structural information of the protein can be determined by
X-ray crystallography or Nuclear magnetic resonance(NMR)
spectroscopy methods.
• The PDB is overseen by an organisation called World Wide
Protein Data Bank,wwPDB.
• It is available at
• www.wwpdb.org
• www.pdbe.org
• www.pdbj.org
• Each entry in the PDB is provided with a unique identification
number called PDB ID.It is a 4 letter identification number which
consists of both alpha numeric characters.
PDB FILE FORMAT:
The PDB file format is the standard file format for protein
structure file. It describes how molecules are held together in
3-D Structure of a protein.
• The file contain hundreds or thousands of lines called
records. Each record provides a different set of information
like
• HEADER: This reocord contains file name, date of submission
and the PDB ID of the molecule.
• TITLE: This record contains the title of the PDB entry.
• COMPND: This record includes the protein name.
• SOURCE: This record contains the name of the organism in
which the particular protein is obtained.
• KEYWDS: This record contains the keywords that describes
about the protein.
PDB FILE FORMAT:
• EXPDTA: This record contains the method used for the
protein structure experiment.
• AUTHOR: This record contains the name of the
contributors who put the data into the database.
• REVDATA: This record contains the revision date of the
data related to protein.(Date of modification)
• JRNL: This record contains the journal details of the
literature about the protein
• REMARK: This record contains the remarks about the
protein structure.
• DBREF: This record contains the reference to the protein
in the sequence databases.
PDB FILE FORMAT:
• SEQRES: This record contains information about the
amino acid sequence of protein.
• HET: This record contains details about the non protein
substances in the protein.
• HETNAM: This record contain the compound name of
the non protein substances.
• HETSYN: This record contains the identical compound
name for the non protein substances.
• FORMUL: This record contain the chemical formula of
the non protein substances.
• HELIX: This record holds the recognition of helical
substructures.
PDB FILE FORMAT:
• LINK: This record holds the recognition of inter-residue bonds.
• ATOM: This record contains the atomic coordinates for the
structure.
• HEATM: This record contains the atomic coordinate record for
non protein substances.
• CONECT: This record contains the details about the bonds
involved in non protein atoms.
• MASTER: This record contains the details about the number of
REMARK records, HET records, HELIX records, CONECT records
and SEQRES records, etc.
• END: This record represent the end of the file.
•
THE PDB FORMAT
• 123456789+123456789+123456789+123456789+123456789+123456789+123456789+123456789+
• HEADER RETINOIC-ACID TRANSPORT 28-SEP-94 1CBS 1CBS 2
• COMPND CELLULAR RETINOIC-ACID-BINDING PROTEIN TYPE II COMPLEXED 1CBS 3
• COMPND 2 WITH ALL-TRANS-RETINOIC ACID (THE PRESUMED PHYSIOLOGICAL 1CBS 4
• COMPND 3 LIGAND) 1CBS 5
• SOURCE HUMAN (HOMO SAPIENS) 1CBS 6
• SOURCE 2 EXPRESSION SYSTEM: (ESCHERICHIA COLI) BL21 (DE3) 1CBS 7
• SOURCE 3 PLASMID: PET-3A 1CBS 8
• SOURCE 4 GENE: HUMAN CRABP-II 1CBS 9
• AUTHOR G.J.KLEYWEGT,T.BERGFORS,T.A.JONES 1CBS 10
• REVDAT 1 26-JAN-95 1CBS 0 1CBS 11
• -------------------------------------------------------------------------------------------------------------------------------------------
CATH:
• The CATH means Class, Architecture,Topology and
homologouus super family database for proteins
• It was created by Janet Thornton and colleagues at the
university college London.
• It is available at
http://www.biochem.ucl.ac.uk/bsm/cath
• http://www.cathdb.info
• It is a protein classification tool
IT CONSISTS OF FOUR LEVELS
• Class: It includes structural conformations of proteins
and their contents(alpha, beta, alpha/beta, etc.)
• Architecture: It describes the gross orientation of
secondary structures. It also gives information about
folding of polypeptide chains.
• Topology: It deals with the structures formed due to
different topological arrangement of secondary
structures. It explains the super families of the proteins.
• Homologous super family: It compares the sequence
and structure of various proteins. It helps to trace the
evolutionary relationship among the proteins.
CATH
• The CATH aims to provide official releases of protein
structures every 12 months
• It is a free publicly available online resource.
• The latest version of CATH contains 1,14,215
domains,2178 homologous superfamilies,1110 fold
groups.
THE CATH SERVER
• The CATH have recently set up a server which allows
the user to submit the co-ordinates of the newly
determined structure for automatic classification in
CATH.
• DOMAIN BOUNDARIES AND SEQUENCE COMPARISON
• CATH contains a detective program which is good for
identifying multidomain proteins.
• The results from the detective are returned to the user in
less than a minutes.
• Identified domains are scanned against non identical
representatives from CATH using a global sequence
alignment method
CATH SERVER
• If a sequence match 95% then the domain is identical
to one in CATH.
• If a sequence match less than 30% then the structures
are compared with all the sequence families (s-level).
• ASSESING STRUCTURAL SIMILARITY:
• TOPSCAN compares the secondary strucutres in each
fold family to identify the possible fold families to which
the new structures belong.
• Subsequently the fast version of structure comparison
SSAP scans represetatives from all the families
• Structural pairs having a ssap score more than 80 are
possible homologues while the score with 70-80 don’t
have no sequence or functional similiarity.
• Finally the SSAP structural alignment is displayed using a
graphical display package.
CSD
• The cambridge structural Database is both a repository
and a validated resource for 3-D structural data of
molecules containing carbon and hydrogen.
• It is used to know about the structures of organic,
metal-organic and organometallic molecules
• The specific entries in the CSD are complementary to
PDB and Inorganic crystal structure database.
• The data in the CSD is typically obtained by X-ray
crystallography and less frequently by neutron
diffraction
CSD
• The data in the CSD is submitted by crystallographers and
chemists from all over the world.
• The CSD is maintained by an incorporated company called
Cambridge Crystallographic Data centre, CCDC
• The CCDC are publicly available for download at the point of
publication.
• The CSD is updated with about 50,000 new structures each
year and are freely available to support teaching and other
activities
• The CSD is available at
• www.ccdc.cam.ac.uk
• webcsd.ccdc.cam.ac.uk
Structural
Database
Applications
Prediction
Analysis
Mining
Compariso
n
Classificatio
n
Structure
Refinement
Databases
Annotation
Structural databases

More Related Content

What's hot (20)

DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
protein data bank
protein data bankprotein data bank
protein data bank
 
Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Prosite
PrositeProsite
Prosite
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Protein database
Protein databaseProtein database
Protein database
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
 
EMBL
EMBLEMBL
EMBL
 

Similar to Structural databases

Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptxSilpa87
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2Razzaqe
 
Bioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.pptBioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.pptNaglaaFathy42
 
Bioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzcBioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzcAdiM27
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2Razzaqe
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiiiMuhammad Younis
 
Lecture 9 molecular descriptors
Lecture 9  molecular descriptorsLecture 9  molecular descriptors
Lecture 9 molecular descriptorsRAJAN ROLTA
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database bhargvi sharma
 
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...geraintduck
 

Similar to Structural databases (20)

Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
 
PDF文档.pdf
PDF文档.pdfPDF文档.pdf
PDF文档.pdf
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
 
Bioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.pptBioinformatic_Databases_2.ppt
Bioinformatic_Databases_2.ppt
 
Bioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzcBioinformatic_Databases_2xcxzczxcxzxcxzc
Bioinformatic_Databases_2xcxzczxcxzxcxzc
 
Bioinformatic databases 2
Bioinformatic databases 2Bioinformatic databases 2
Bioinformatic databases 2
 
Biological databases
Biological databases Biological databases
Biological databases
 
Analisis 16S dan 18S rRNA.ppt
Analisis 16S dan 18S rRNA.pptAnalisis 16S dan 18S rRNA.ppt
Analisis 16S dan 18S rRNA.ppt
 
Bioinformatics lecture xxiii
Bioinformatics lecture xxiiiBioinformatics lecture xxiii
Bioinformatics lecture xxiii
 
Lecture 9 molecular descriptors
Lecture 9  molecular descriptorsLecture 9  molecular descriptors
Lecture 9 molecular descriptors
 
Introduction to pdb
Introduction to pdbIntroduction to pdb
Introduction to pdb
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Enfin, DAS and BioMart
Enfin, DAS and BioMartEnfin, DAS and BioMart
Enfin, DAS and BioMart
 
Intro to databases
Intro to databasesIntro to databases
Intro to databases
 
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...ECCB 2014: Extracting patterns of database and software usage from the bioinf...
ECCB 2014: Extracting patterns of database and software usage from the bioinf...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Biological data base
Biological data baseBiological data base
Biological data base
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 

More from Priyadharshana

Advantages of herbal medicine.
Advantages of herbal medicine.Advantages of herbal medicine.
Advantages of herbal medicine.Priyadharshana
 
National research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).editedNational research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).editedPriyadharshana
 
History of herbal medicine.
History of herbal medicine.History of herbal medicine.
History of herbal medicine.Priyadharshana
 
Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.Priyadharshana
 
Climate change causes, effects and prevention
Climate change  causes, effects and preventionClimate change  causes, effects and prevention
Climate change causes, effects and preventionPriyadharshana
 
Radio immuno assay (priya)
Radio immuno assay (priya)Radio immuno assay (priya)
Radio immuno assay (priya)Priyadharshana
 

More from Priyadharshana (12)

Advantages of herbal medicine.
Advantages of herbal medicine.Advantages of herbal medicine.
Advantages of herbal medicine.
 
National research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).editedNational research laboratories in medicinal plant research (autosaved).edited
National research laboratories in medicinal plant research (autosaved).edited
 
History of herbal medicine.
History of herbal medicine.History of herbal medicine.
History of herbal medicine.
 
Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.Cultivation of Asparagus racemosus.
Cultivation of Asparagus racemosus.
 
Pickling
PicklingPickling
Pickling
 
Canning
CanningCanning
Canning
 
Chromatography
ChromatographyChromatography
Chromatography
 
Climate change causes, effects and prevention
Climate change  causes, effects and preventionClimate change  causes, effects and prevention
Climate change causes, effects and prevention
 
Radio immuno assay (priya)
Radio immuno assay (priya)Radio immuno assay (priya)
Radio immuno assay (priya)
 
Vaccines
Vaccines Vaccines
Vaccines
 
Anorexia
Anorexia Anorexia
Anorexia
 
Landslides
LandslidesLandslides
Landslides
 

Recently uploaded

Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 

Recently uploaded (20)

Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 

Structural databases

  • 2. INTRODUCTION: • Structural databases are the essential tools for all crystallographic works. • They are used in the process of producing, solving ,refining and publishing the structure of a new material.
  • 3. THE COMMON INFORMATION FOUND IN THE STRUCTURAL DATABASE INCLUDE: • Bibliographic information- author name, journal reference. • The chemical compound name, formula and oxidation states of the element present. • Number of formula units per unit cell(contents) • Dimension and symmetry of the unit cell. • symmetry of the structure. • Atomic coordinates, occupancies and thermal parameters. • Any special features of the experiment to collect the diffraction data. • The structures in the database have been solved using X-ray, neutron and electron diffraction techniques on sample, computational modelling or by using NMR.
  • 4. PDB:(PROTEIN DATABASES) • Protein database contains the information about 3D structures of the proteins. • The structural information of the protein can be determined by X-ray crystallography or Nuclear magnetic resonance(NMR) spectroscopy methods. • The PDB is overseen by an organisation called World Wide Protein Data Bank,wwPDB. • It is available at • www.wwpdb.org • www.pdbe.org • www.pdbj.org • Each entry in the PDB is provided with a unique identification number called PDB ID.It is a 4 letter identification number which consists of both alpha numeric characters.
  • 5.
  • 6.
  • 7. PDB FILE FORMAT: The PDB file format is the standard file format for protein structure file. It describes how molecules are held together in 3-D Structure of a protein. • The file contain hundreds or thousands of lines called records. Each record provides a different set of information like • HEADER: This reocord contains file name, date of submission and the PDB ID of the molecule. • TITLE: This record contains the title of the PDB entry. • COMPND: This record includes the protein name. • SOURCE: This record contains the name of the organism in which the particular protein is obtained. • KEYWDS: This record contains the keywords that describes about the protein.
  • 8. PDB FILE FORMAT: • EXPDTA: This record contains the method used for the protein structure experiment. • AUTHOR: This record contains the name of the contributors who put the data into the database. • REVDATA: This record contains the revision date of the data related to protein.(Date of modification) • JRNL: This record contains the journal details of the literature about the protein • REMARK: This record contains the remarks about the protein structure. • DBREF: This record contains the reference to the protein in the sequence databases.
  • 9. PDB FILE FORMAT: • SEQRES: This record contains information about the amino acid sequence of protein. • HET: This record contains details about the non protein substances in the protein. • HETNAM: This record contain the compound name of the non protein substances. • HETSYN: This record contains the identical compound name for the non protein substances. • FORMUL: This record contain the chemical formula of the non protein substances. • HELIX: This record holds the recognition of helical substructures.
  • 10. PDB FILE FORMAT: • LINK: This record holds the recognition of inter-residue bonds. • ATOM: This record contains the atomic coordinates for the structure. • HEATM: This record contains the atomic coordinate record for non protein substances. • CONECT: This record contains the details about the bonds involved in non protein atoms. • MASTER: This record contains the details about the number of REMARK records, HET records, HELIX records, CONECT records and SEQRES records, etc. • END: This record represent the end of the file. •
  • 11.
  • 12. THE PDB FORMAT • 123456789+123456789+123456789+123456789+123456789+123456789+123456789+123456789+ • HEADER RETINOIC-ACID TRANSPORT 28-SEP-94 1CBS 1CBS 2 • COMPND CELLULAR RETINOIC-ACID-BINDING PROTEIN TYPE II COMPLEXED 1CBS 3 • COMPND 2 WITH ALL-TRANS-RETINOIC ACID (THE PRESUMED PHYSIOLOGICAL 1CBS 4 • COMPND 3 LIGAND) 1CBS 5 • SOURCE HUMAN (HOMO SAPIENS) 1CBS 6 • SOURCE 2 EXPRESSION SYSTEM: (ESCHERICHIA COLI) BL21 (DE3) 1CBS 7 • SOURCE 3 PLASMID: PET-3A 1CBS 8 • SOURCE 4 GENE: HUMAN CRABP-II 1CBS 9 • AUTHOR G.J.KLEYWEGT,T.BERGFORS,T.A.JONES 1CBS 10 • REVDAT 1 26-JAN-95 1CBS 0 1CBS 11 • -------------------------------------------------------------------------------------------------------------------------------------------
  • 13. CATH: • The CATH means Class, Architecture,Topology and homologouus super family database for proteins • It was created by Janet Thornton and colleagues at the university college London. • It is available at http://www.biochem.ucl.ac.uk/bsm/cath • http://www.cathdb.info • It is a protein classification tool
  • 14. IT CONSISTS OF FOUR LEVELS • Class: It includes structural conformations of proteins and their contents(alpha, beta, alpha/beta, etc.) • Architecture: It describes the gross orientation of secondary structures. It also gives information about folding of polypeptide chains. • Topology: It deals with the structures formed due to different topological arrangement of secondary structures. It explains the super families of the proteins. • Homologous super family: It compares the sequence and structure of various proteins. It helps to trace the evolutionary relationship among the proteins.
  • 15.
  • 16. CATH • The CATH aims to provide official releases of protein structures every 12 months • It is a free publicly available online resource. • The latest version of CATH contains 1,14,215 domains,2178 homologous superfamilies,1110 fold groups.
  • 17.
  • 18. THE CATH SERVER • The CATH have recently set up a server which allows the user to submit the co-ordinates of the newly determined structure for automatic classification in CATH. • DOMAIN BOUNDARIES AND SEQUENCE COMPARISON • CATH contains a detective program which is good for identifying multidomain proteins. • The results from the detective are returned to the user in less than a minutes. • Identified domains are scanned against non identical representatives from CATH using a global sequence alignment method
  • 19. CATH SERVER • If a sequence match 95% then the domain is identical to one in CATH. • If a sequence match less than 30% then the structures are compared with all the sequence families (s-level). • ASSESING STRUCTURAL SIMILARITY: • TOPSCAN compares the secondary strucutres in each fold family to identify the possible fold families to which the new structures belong. • Subsequently the fast version of structure comparison SSAP scans represetatives from all the families • Structural pairs having a ssap score more than 80 are possible homologues while the score with 70-80 don’t have no sequence or functional similiarity. • Finally the SSAP structural alignment is displayed using a graphical display package.
  • 20.
  • 21.
  • 22. CSD • The cambridge structural Database is both a repository and a validated resource for 3-D structural data of molecules containing carbon and hydrogen. • It is used to know about the structures of organic, metal-organic and organometallic molecules • The specific entries in the CSD are complementary to PDB and Inorganic crystal structure database. • The data in the CSD is typically obtained by X-ray crystallography and less frequently by neutron diffraction
  • 23. CSD • The data in the CSD is submitted by crystallographers and chemists from all over the world. • The CSD is maintained by an incorporated company called Cambridge Crystallographic Data centre, CCDC • The CCDC are publicly available for download at the point of publication. • The CSD is updated with about 50,000 new structures each year and are freely available to support teaching and other activities • The CSD is available at • www.ccdc.cam.ac.uk • webcsd.ccdc.cam.ac.uk
  • 24.
  • 25.
  • 26.