• Biological databases store data either as sequences or as molecular structures
• Each database represents its data in its own file format
• Categories of file formats:
    • Sequence database formats
    • Molecular database formats

• Sequence file formats:
    • GenBank flat-file format
    • FASTA format
    • Multi-FASTA format
    • GCG format
    • GCG-MSF format
    • EMBL format
    • Clustal format
    • Swiss-Prot format

• GenBank flat-file format: used by NCBI
• Each record is divided into three parts:
    • Header: a brief, precise introduction to the record
    • Features: the genes in the sequence, their locations in the genome, protein products, coding regions, etc.
    • Sequence: begins with ORIGIN and ends with //, e.g. ORIGIN atcgatcgatgcgctat //

• HEADER
    • Locus: name, length, molecule type and date of last modification
    • Definition
    • Accession
    • Version
    • Dbsource: the source database record (in protein records)
    • Keywords
    • Source
        • Organism
    • References
        • Authors
        • Title
        • Journal
        • Medline ID: identifier of the cited publication
    • Comment
• FEATURES
• SEQUENCE

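The three-part layout of a GenBank record can be sketched in a few lines of Python. This is a minimal illustration, not NCBI's parser: the function name `split_genbank` and the demo record (accession `DEMO0001`) are hypothetical, and only the FEATURES, ORIGIN and // markers are used to find the section boundaries.

```python
def split_genbank(text):
    """Split one GenBank flat-file record into header, features and sequence."""
    header, features, seq_lines = [], [], []
    section = "header"
    for line in text.splitlines():
        if line.startswith("//"):        # record terminator
            break
        if line.startswith("FEATURES"):  # start of the feature table
            section = "features"
            continue
        if line.startswith("ORIGIN"):    # start of the sequence
            section = "sequence"
            continue
        if section == "header":
            header.append(line)
        elif section == "features":
            features.append(line)
        else:
            seq_lines.append(line)
    # Sequence lines carry position numbers and spaces; keep letters only.
    sequence = "".join(ch for ch in "".join(seq_lines) if ch.isalpha())
    return header, features, sequence


# Hypothetical record built around the sequence shown on the slide.
record = """LOCUS       DEMO0001                  17 bp    DNA     linear   SYN 01-JAN-2000
DEFINITION  Hypothetical demo record.
ACCESSION   DEMO0001
FEATURES             Location/Qualifiers
     source          1..17
ORIGIN
        1 atcgatcgat gcgctat
//
"""

header, features, sequence = split_genbank(record)
print(len(header), "header lines;", len(features), "feature lines;", sequence)
```
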
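• FASTA format: a one-line header followed by the sequence
• The header starts with > followed by the name of the gene or protein
• The sequence of the gene or protein follows on the next lines
• Blank spaces, paragraph marks (line breaks) and numerals in the sequence are all ignored
• An asterisk (*) may mark the end of the sequence
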
>p53
ctcgaggggc ctagacattg ccctccagag agagcaccca acaccctcca ggcttgaccg
61 gccagggtgt ccccttccta ccttggagag agcagcccca gggcatcctg cagggggtgc
121 tgggacacca gctggccttc aaggtctctg cctccctcca gccaccccac tacacgctgc
181 tgggatcctg gatctcagct ccctggccga caacactggc aaactcctac tcatccacga
241 aggccctcct gggcatggtg gtccttccca gcctggcagt ctgttcctca cacaccttgt
301 tagtgcccag cccctgaggt tgcagctggg ggtgtctctg aagggctgtg agcccccagg
361 aagccctggg gaagtgcctg ccttgcctcc ccccggccct

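The FASTA rules above (a > header, then a sequence in which spaces, line breaks, numerals and a trailing asterisk are ignored) can be sketched as follows. The function name `read_fasta_record` is a hypothetical helper, and keeping only letters is a simplification: it would also drop gap characters such as "-" if a file used them.

```python
def read_fasta_record(text):
    """Return (name, sequence) for a single FASTA record."""
    lines = text.strip().splitlines()
    if not lines or not lines[0].startswith(">"):
        raise ValueError("a FASTA record must start with '>'")
    name = lines[0][1:].strip()
    body = "".join(lines[1:])
    # Keep letters only: this drops spaces, numerals and a trailing '*'.
    sequence = "".join(ch for ch in body if ch.isalpha())
    return name, sequence


# Shortened excerpt of the p53 example, with an asterisk added at the end.
example = """>p53
ctcgaggggc ctagacattg
61 gccagggtgt ccccttccta*
"""

name, seq = read_fasta_record(example)
print(name, len(seq), seq)
```
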
• Multi-FASTA format: an aggregation of FASTA records like the one above
• Multiple sequences follow one after the other in a single file
• Accepted by several databases and programs, e.g.:
    • Clustal W
    • Multalin

>jhuma
gccagggtgt ccccttccta ccttggagag agcagcccca gggcatcctg cagggggtgc
>bhuma
gccagggtgt ccccttccta ccttggagag agcagcccca gggcatcctg cagggggtgc
>puma
gccagggtgt ccccttccta ccttggagag agcagcccca gggcatcctg cagggggtgc
>zuma
gccagggtgt ccccttccta ccttggagag agcagcccca gggcatcctg cagggggtgc

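Reading a multi-FASTA file comes down to starting a new record at every > line. The sketch below assumes that rule only; `read_multi_fasta` is a hypothetical name, and the two records are shortened from the example above.

```python
def read_multi_fasta(text):
    """Return a list of (name, sequence) pairs from multi-FASTA text."""
    records = []
    name, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                records.append((name, "".join(chunks)))
            name, chunks = line[1:].strip(), []
        elif name is not None:
            chunks.append("".join(ch for ch in line if ch.isalpha()))
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


multi = """>jhuma
gccagggtgt ccccttccta
>bhuma
ccttggagag agcagcccca
"""

records = read_multi_fasta(multi)
for rec_name, rec_seq in records:
    print(rec_name, len(rec_seq))
```
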
• GCG format: named after the Genetics Computer Group
• The first line declares the sequence type:
    • !!NA_SEQUENCE 1.0 for nucleic acids
    • !!AA_SEQUENCE 1.0 for amino acids (proteins)
• A simple format that gives little more than the sequence of the gene or protein

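A short sketch of how a program might use that first line. It assumes, beyond what the slide states, that the sequence starts after the header line ending in two dots (".."), which is the usual GCG layout. The function name `read_gcg` and the demo file are hypothetical, and the Check value of 0 is a placeholder, not a real checksum.

```python
def read_gcg(text):
    """Return (sequence type, sequence) for a single-sequence GCG file."""
    lines = text.strip().splitlines()
    first = lines[0].split() if lines else []
    kinds = {"!!NA_SEQUENCE": "nucleic acid", "!!AA_SEQUENCE": "protein"}
    kind = kinds.get(first[0], "unknown") if first else "unknown"

    seq_chars = []
    in_sequence = False
    for line in lines:
        if in_sequence:
            # Drop position numbers and spaces, keep the letters.
            seq_chars.extend(ch for ch in line if ch.isalpha())
        elif line.rstrip().endswith(".."):
            in_sequence = True  # assumed divider between header and sequence
    return kind, "".join(seq_chars)


gcg_text = """!!NA_SEQUENCE 1.0
Hypothetical demo sequence
demo.seq  Length: 17  Type: N  Check: 0  ..

       1  atcgatcgat gcgctat
"""

kind, gcg_seq = read_gcg(gcg_text)
print(kind, gcg_seq)
```
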
• GCG-MSF format: holds multiple sequences in one file
• For each sequence it gives:
    • Sequence name
    • Sequence
    • Alignment
• The word PileUp indicates that the file contains multiple sequences
• The mandatory keyword MSF identifies the file as GCG-MSF rather than plain GCG
• The header comments are terminated with //
• Two consecutive blank lines follow
• The multiple aligned sequences come last

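The MSF layout described above can be sketched as a small reader. It assumes the common layout: a header line containing "MSF:", one "Name:" line per sequence, a // divider, then blocks of name and aligned chunk in which dots mark gaps. `read_msf` and the two-sequence demo are hypothetical, and the Check values are placeholders.

```python
def read_msf(text):
    """Return {name: aligned sequence} from GCG-MSF text."""
    names, aligned = [], {}
    is_msf = False
    in_alignment = False
    for line in text.splitlines():
        stripped = line.strip()
        if not in_alignment:
            if "MSF:" in stripped:            # mandatory MSF keyword
                is_msf = True
            elif stripped.startswith("Name:"):
                seq_name = stripped.split()[1]
                names.append(seq_name)
                aligned[seq_name] = []
            elif stripped == "//":            # end of the header
                in_alignment = True
            continue
        parts = stripped.split()
        if parts and parts[0] in aligned:
            aligned[parts[0]].append("".join(parts[1:]))
    if not is_msf:
        raise ValueError("no 'MSF:' header line found")
    return {n: "".join(aligned[n]) for n in names}


msf_text = """PileUp

 MSF: 12  Type: N  Check: 0  ..

 Name: seqA  Len: 12  Check: 0  Weight: 1.00
 Name: seqB  Len: 12  Check: 0  Weight: 1.00

//

seqA  atcgatcgat
seqB  atcg..cgat

seqA  gc
seqB  gc
"""

alignment = read_msf(msf_text)
print(alignment)
```
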
• EMBL format: the sequence format of the European Molecular Biology Laboratory database
• Each entry starts with an ID (identification) line
• Each entry ends with // as a terminator
• Each line type has its own two-letter code and layout
• Used to record various kinds of data, e.g. DNA, RNA, genes and proteins

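Because every EMBL line begins with a two-letter code and every entry ends with //, a reader can group lines by code and close the entry at the terminator. The sketch below assumes the usual layout in which data starts in column 6, XX lines are spacers, and the sequence follows the SQ line. `read_embl` and the demo entry are hypothetical.

```python
def read_embl(text):
    """Return a list of entries; each maps a line code to its list of values."""
    entries, fields, seq = [], {}, []
    in_sequence = False
    for line in text.splitlines():
        if line.startswith("//"):             # entry terminator
            fields["sequence"] = "".join(seq)
            entries.append(fields)
            fields, seq, in_sequence = {}, [], False
        elif in_sequence:
            # Sequence lines end with a position count; keep letters only.
            seq.extend(ch for ch in line if ch.isalpha())
        elif line.startswith("SQ"):
            in_sequence = True
        elif len(line) >= 2 and line[:2] != "XX":
            fields.setdefault(line[:2], []).append(line[5:].strip())
    return entries


embl_text = """ID   DEMO0001; SV 1; linear; genomic DNA; STD; SYN; 17 BP.
XX
AC   DEMO0001;
XX
DE   Hypothetical demo record.
XX
SQ   Sequence 17 BP; 4 A; 4 C; 4 G; 5 T; 0 other;
     atcgatcgat gcgctat                                                       17
//
"""

entries = read_embl(embl_text)
print(entries[0]["AC"], entries[0]["sequence"])
```
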
• Clustal format: the output of Clustal, a widely used multiple sequence alignment tool
    • CLUSTAL W
    • CLUSTAL X
• Holds aligned protein or gene sequences

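A Clustal alignment file can be read block by block. The sketch assumes the usual layout: a first line starting with CLUSTAL, then blocks of name and aligned chunk, each followed by a conservation line that starts with spaces. `read_clustal` and the two demo sequences are hypothetical.

```python
def read_clustal(text):
    """Return {name: aligned sequence} from Clustal alignment text."""
    lines = text.splitlines()
    if not lines or not lines[0].startswith("CLUSTAL"):
        raise ValueError("a Clustal alignment starts with a 'CLUSTAL' line")
    aligned = {}
    for line in lines[1:]:
        # Skip blank lines and conservation lines (they start with a space).
        if not line.strip() or line[0].isspace():
            continue
        parts = line.split()
        if len(parts) >= 2:  # name, aligned chunk, optional residue count
            aligned.setdefault(parts[0], []).append(parts[1])
    return {n: "".join(chunks) for n, chunks in aligned.items()}


clustal_text = """CLUSTAL W (1.83) multiple sequence alignment

seqA            atcgatcgat
seqB            atcg--cgat
                ****  ****

seqA            gc
seqB            gc
                **
"""

clustal_alignment = read_clustal(clustal_text)
print(clustal_alignment)
```
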
• Swiss-Prot format: used by the Swiss-Prot protein sequence database
• Each line begins with a two-letter code:
    • ID: identification
    • AC: accession number
    • DE: description
    • GN: gene name
    • OS: organism species
    • OG: organelle
    • OC: organism classification
    • OX: organism taxonomy cross-reference
    • RN: reference number
    • RP: reference position
    • RC: reference comment
    • RX: reference cross-reference
    • RA: reference author
    • RT: reference title
    • RL: reference location
    • CC: comments
    • DR: database cross-reference
    • KW: keywords
    • FT: feature table
    • SQ: sequence

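The line codes listed above work as a lookup table: a program reads the first two characters of each line and files the rest under the matching label. This is a minimal sketch; `LINE_CODES`, `group_by_code` and the demo entry (accession `X00000`, with placeholder MW and CRC64 values) are hypothetical.

```python
LINE_CODES = {
    "ID": "identification", "AC": "accession number", "DE": "description",
    "GN": "gene name", "OS": "organism species", "OG": "organelle",
    "OC": "organism classification", "OX": "organism taxonomy cross-reference",
    "RN": "reference number", "RP": "reference position",
    "RC": "reference comment", "RX": "reference cross-reference",
    "RA": "reference author", "RT": "reference title",
    "RL": "reference location", "CC": "comments",
    "DR": "database cross-reference", "KW": "keywords",
    "FT": "feature table", "SQ": "sequence",
}


def group_by_code(text):
    """Group the lines of one entry under the meaning of their line code."""
    grouped = {}
    for line in text.splitlines():
        if line.startswith("//"):   # end of the entry
            break
        code = line[:2]
        if code in LINE_CODES:      # sequence data lines have no code
            grouped.setdefault(LINE_CODES[code], []).append(line[5:].strip())
    return grouped


entry = """ID   DEMO_HUMAN              Reviewed;          5 AA.
AC   X00000;
DE   RecName: Full=Hypothetical demo protein;
OS   Homo sapiens (Human).
CC   -!- FUNCTION: Placeholder comment for illustration.
KW   Demo.
SQ   SEQUENCE   5 AA;  0 MW;  0 CRC64;
     MKTAY
//
"""

grouped = group_by_code(entry)
for label, values in grouped.items():
    print(label, "->", values)
```
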
• Several software tools have been designed by … ?
• Their aim is to convert one sequence format into another
• Widely used tools for sequence format interconversion include:
    • ReadSeq
    • GCG
    • SeqVerter
    • Seqret

• ReadSeq: developed by Dr. D. G. Gilbert
• Automated conversion
• Supports 18 file formats, which can be interconverted

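To show what such a conversion involves, here is a plain-Python sketch that turns a GenBank flat-file record into FASTA. It is not how ReadSeq or Seqret work internally; `genbank_to_fasta` and the demo record are hypothetical, and using the accession as the FASTA name is an assumption made for the example.

```python
def genbank_to_fasta(text, width=60):
    """Convert one GenBank flat-file record to FASTA text."""
    name, seq, in_sequence = "unnamed", [], False
    for line in text.splitlines():
        if line.startswith("//"):            # record terminator
            break
        if line.startswith("ACCESSION"):
            parts = line.split()
            if len(parts) > 1:
                name = parts[1]              # accession becomes the FASTA name
        elif line.startswith("ORIGIN"):
            in_sequence = True
        elif in_sequence:
            seq.extend(ch for ch in line if ch.isalpha())
    sequence = "".join(seq)
    # Wrap the sequence to a fixed line width.
    rows = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + name + "\n" + "\n".join(rows) + "\n"


gb_record = """LOCUS       DEMO0001                  17 bp    DNA     linear   SYN 01-JAN-2000
ACCESSION   DEMO0001
ORIGIN
        1 atcgatcgat gcgctat
//
"""

fasta_text = genbank_to_fasta(gb_record)
print(fasta_text)
```
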
• FASTA
• Multi-FASTA
• Flat file
• GCG format
• EMBL
• Clustal
• Swiss-Prot
Make a file in each format by this Friday and send them as email attachments.

sequence of file formats in bioinformatics
