SlideShare a Scribd company logo
1 of 13
PRINCIPLES OF COMPILER DESIGN
 REGULAR EXPRESSION
Concepts :
• Regular Expression
• Strings
• Languages
Regular Expressions

• A regular expression is a pattern that defines a
  string or portion thereof. When comparing this
  pattern against a string, it'll either be true or
  false. If true, it'll return something.
• The return value will depend on the specific
  function used and its attributes.
Strings And Languages
• Strings:
  A string is a data type used in programming, such as
    an integer and floating point unit, but is used to
    represent text rather than numbers. It is comprised
    of a set of characters that can also contain spaces and
    numbers. For example, the word “hamburger”. Even
    "12345" could be considered a string, if specified
    correctly.
   Two important examples of programming language
    alphabets are ASCII and EBCDIC character sets.
Strings (contd…):
• A string is a finite sequence of symbols such as 001.
  The length of a string x, usually denoted |x|, is the total
  number of symbols in x.
• For eg.: 110001 is a string of length 6. A special string
  is a empty string which is denoted by ε. This string is
  of length 0(zero).
• If x and y are strings, then the concatenation of x and y,
  written as x.y or just xy, is the string formed by
  following the symbols of x by the symbols of y.
• For eg.:abd.ce = abdce i.e. if x= abd & y=ce , then
   xy = abdce.
Strings (contd…)
• The concatenation of the empty string with any
  string is that string i.e. εx = xε = x.
• Concatenation is not any sort of product, thus it is
  an iterated product in form of exponential.
• E.g.: X1 = x, X2 = XX, X3 =XXX
• In general xi is the string x repeated i times. We
  take x0 to be ε for any string x. Thus, ε plays the
  role of 1, the multiplicative identity.
Strings (contd…)
• If x is some string, then any string formed by discarding
  zero or more trailing symbols of x is called a prefix of x.
• For e.g.: abc is a prefix of abcde.
• A suffix of x is a string formed by deleting zero or more of
  the leading symbols of x. cde is a suffix of abcde. A
  substring of x is any string obtained by deleting a prefix
  and a suffix from x.
• For any string x, both x and ε are prefixes, suffixes and
  substrings of x.
• Any prefix or suffix of x is a substring of x, but a
  substring need not be a prefix or suffix.
• For e.g..: cd is a substring of abcde but not a prefix or
  suffix .
Languages
• The term language means any set of strings formed
  from some specific alphabets.
• Simple set such as Φ, the empty set {ε} having no
  members or the set containing only the empty
  string, are languages.
• The notation of concatenation can also be applied to
  languages.
• For e.g.: If L and M are languages, then L.M, or just
  LM
• LM is language consisting of all strings xy which can
  be formed by selecting a string x from L, a string y
  from M, and concatenating them in that order.
   LM= {xy| x is in L and y is in M}
Languages (contd…)
• E.g.: If L={0, 01, 110} and M= {10, 110}. Then
  LM={010, 0110, 01110, 11010, 110110}
• 11010 can be written as the concatenation of of
  110 from L and 10 from M.
• 0110 can be written as either 0.110 or 01.10 i.e. it
  is a string from L followed by one from M.
• In analogy with strings, we use Li to stand for
  LL….L (i times). It is logicat to define L0 to be {ε},
  since {ε} is the identity under concatenation of
  languages. i.e. {ε}L = L{ε} = L
• The union of languages L & M is given by
  L ∪ M ={x | x is in L or x is in M}
Languages (contd…)
• If concatenation is analogous to multiplication.
  Then ø, the empty set is the identity under union
  (analogous to zero)
                 øUL=LUø=L
                       &
                 ø L=Lø=ø
• Any string in the concatenation of ø with L
 must be formed from x in ø and y in L.
• There is another operation in specifying
  tokens, that is closure or “any number of”
  operator. We use L* to denote the concatenation of
  language L with itself any number of times.
       ∞
• L* = U Li
      i=0


• Consider D be the language consisting of the
  strings 0,1,……9 i.e. each string is a single decimal
  digit. Then D* is all strings of digits including
  empty string.
• If L ={aa}, the L* is all strings of an even number of
  a’s.
• LO = {ε}
Regular expression (compiler)

More Related Content

What's hot (20)

Lexical analyzer
Lexical analyzerLexical analyzer
Lexical analyzer
 
Operator precedence
Operator precedenceOperator precedence
Operator precedence
 
Data Structures - Lecture 10 [Graphs]
Data Structures - Lecture 10 [Graphs]Data Structures - Lecture 10 [Graphs]
Data Structures - Lecture 10 [Graphs]
 
Flat unit 2
Flat unit 2Flat unit 2
Flat unit 2
 
Role-of-lexical-analysis
Role-of-lexical-analysisRole-of-lexical-analysis
Role-of-lexical-analysis
 
Parsing LL(1), SLR, LR(1)
Parsing LL(1), SLR, LR(1)Parsing LL(1), SLR, LR(1)
Parsing LL(1), SLR, LR(1)
 
Top down parsing
Top down parsingTop down parsing
Top down parsing
 
Lex (lexical analyzer)
Lex (lexical analyzer)Lex (lexical analyzer)
Lex (lexical analyzer)
 
Specification-of-tokens
Specification-of-tokensSpecification-of-tokens
Specification-of-tokens
 
Flat unit 1
Flat unit 1Flat unit 1
Flat unit 1
 
Regular expressions-Theory of computation
Regular expressions-Theory of computationRegular expressions-Theory of computation
Regular expressions-Theory of computation
 
Post Machine
Post MachinePost Machine
Post Machine
 
Regular Expression
Regular ExpressionRegular Expression
Regular Expression
 
NFA & DFA
NFA & DFANFA & DFA
NFA & DFA
 
Regular expressions
Regular expressionsRegular expressions
Regular expressions
 
Type Checking(Compiler Design) #ShareThisIfYouLike
Type Checking(Compiler Design) #ShareThisIfYouLikeType Checking(Compiler Design) #ShareThisIfYouLike
Type Checking(Compiler Design) #ShareThisIfYouLike
 
Java package
Java packageJava package
Java package
 
Recognition-of-tokens
Recognition-of-tokensRecognition-of-tokens
Recognition-of-tokens
 
Lexical analyzer generator lex
Lexical analyzer generator lexLexical analyzer generator lex
Lexical analyzer generator lex
 
Syntax directed translation
Syntax directed translationSyntax directed translation
Syntax directed translation
 

Similar to Regular expression (compiler)

1 introduction
1 introduction1 introduction
1 introductionparmeet834
 
Chapter2CDpdf__2021_11_26_09_19_08.pdf
Chapter2CDpdf__2021_11_26_09_19_08.pdfChapter2CDpdf__2021_11_26_09_19_08.pdf
Chapter2CDpdf__2021_11_26_09_19_08.pdfDrIsikoIsaac
 
01-Introduction&Languages.pdf
01-Introduction&Languages.pdf01-Introduction&Languages.pdf
01-Introduction&Languages.pdfTariqSaeed80
 
theory of computation lecture 02
theory of computation lecture 02theory of computation lecture 02
theory of computation lecture 028threspecter
 
2_2Specification of Tokens.ppt
2_2Specification of Tokens.ppt2_2Specification of Tokens.ppt
2_2Specification of Tokens.pptRatnakar Mikkili
 
Presentation (5).pdf
Presentation (5).pdfPresentation (5).pdf
Presentation (5).pdfGaurav447273
 
Lecture: Regular Expressions and Regular Languages
Lecture: Regular Expressions and Regular LanguagesLecture: Regular Expressions and Regular Languages
Lecture: Regular Expressions and Regular LanguagesMarina Santini
 
Automata
AutomataAutomata
AutomataGaditek
 
Automata
AutomataAutomata
AutomataGaditek
 
9781284077247_PPTx_CH01.pptx
9781284077247_PPTx_CH01.pptx9781284077247_PPTx_CH01.pptx
9781284077247_PPTx_CH01.pptxmainakmail2585
 
Syntax Analyzer.pdf
Syntax Analyzer.pdfSyntax Analyzer.pdf
Syntax Analyzer.pdfkenilpatel65
 
Lecture 1,2
Lecture 1,2Lecture 1,2
Lecture 1,2shah zeb
 
Regular expressions h1
Regular expressions h1Regular expressions h1
Regular expressions h1Rajendran
 
Specification of Token
Specification of TokenSpecification of Token
Specification of TokenA. S. M. Shafi
 

Similar to Regular expression (compiler) (20)

Regular expression (compiler)
Regular expression (compiler)Regular expression (compiler)
Regular expression (compiler)
 
1 introduction
1 introduction1 introduction
1 introduction
 
Module 1 TOC.pptx
Module 1 TOC.pptxModule 1 TOC.pptx
Module 1 TOC.pptx
 
Chapter2CDpdf__2021_11_26_09_19_08.pdf
Chapter2CDpdf__2021_11_26_09_19_08.pdfChapter2CDpdf__2021_11_26_09_19_08.pdf
Chapter2CDpdf__2021_11_26_09_19_08.pdf
 
01-Introduction&Languages.pdf
01-Introduction&Languages.pdf01-Introduction&Languages.pdf
01-Introduction&Languages.pdf
 
Unit-1-part-1.pptx
Unit-1-part-1.pptxUnit-1-part-1.pptx
Unit-1-part-1.pptx
 
theory of computation lecture 02
theory of computation lecture 02theory of computation lecture 02
theory of computation lecture 02
 
2_2Specification of Tokens.ppt
2_2Specification of Tokens.ppt2_2Specification of Tokens.ppt
2_2Specification of Tokens.ppt
 
Presentation (5).pdf
Presentation (5).pdfPresentation (5).pdf
Presentation (5).pdf
 
Regular Expression
Regular ExpressionRegular Expression
Regular Expression
 
Theory of computation
Theory of computationTheory of computation
Theory of computation
 
Lecture: Regular Expressions and Regular Languages
Lecture: Regular Expressions and Regular LanguagesLecture: Regular Expressions and Regular Languages
Lecture: Regular Expressions and Regular Languages
 
Automata
AutomataAutomata
Automata
 
Automata
AutomataAutomata
Automata
 
9781284077247_PPTx_CH01.pptx
9781284077247_PPTx_CH01.pptx9781284077247_PPTx_CH01.pptx
9781284077247_PPTx_CH01.pptx
 
Syntax
SyntaxSyntax
Syntax
 
Syntax Analyzer.pdf
Syntax Analyzer.pdfSyntax Analyzer.pdf
Syntax Analyzer.pdf
 
Lecture 1,2
Lecture 1,2Lecture 1,2
Lecture 1,2
 
Regular expressions h1
Regular expressions h1Regular expressions h1
Regular expressions h1
 
Specification of Token
Specification of TokenSpecification of Token
Specification of Token
 

More from Jagjit Wilku

More from Jagjit Wilku (7)

Health insurance
Health insuranceHealth insurance
Health insurance
 
Auto insurance
Auto insuranceAuto insurance
Auto insurance
 
Mobile communication
Mobile communicationMobile communication
Mobile communication
 
Neural networks
Neural networksNeural networks
Neural networks
 
Decision trees
Decision treesDecision trees
Decision trees
 
Complier designer
Complier designerComplier designer
Complier designer
 
Mc wireless lan
Mc wireless lanMc wireless lan
Mc wireless lan
 

Recently uploaded

Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 

Recently uploaded (20)

Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 

Regular expression (compiler)

  • 1. PRINCIPLES OF COMPILER DESIGN REGULAR EXPRESSION
  • 2. Concepts : • Regular Expression • Strings • Languages
  • 3. Regular Expressions • A regular expression is a pattern that defines a string or portion thereof. When comparing this pattern against a string, it'll either be true or false. If true, it'll return something. • The return value will depend on the specific function used and its attributes.
  • 4. Strings And Languages • Strings: A string is a data type used in programming, such as an integer and floating point unit, but is used to represent text rather than numbers. It is comprised of a set of characters that can also contain spaces and numbers. For example, the word “hamburger”. Even "12345" could be considered a string, if specified correctly.  Two important examples of programming language alphabets are ASCII and EBCDIC character sets.
  • 5. Strings (contd…): • A string is a finite sequence of symbols such as 001. The length of a string x, usually denoted |x|, is the total number of symbols in x. • For eg.: 110001 is a string of length 6. A special string is a empty string which is denoted by ε. This string is of length 0(zero). • If x and y are strings, then the concatenation of x and y, written as x.y or just xy, is the string formed by following the symbols of x by the symbols of y. • For eg.:abd.ce = abdce i.e. if x= abd & y=ce , then xy = abdce.
  • 6. Strings (contd…) • The concatenation of the empty string with any string is that string i.e. εx = xε = x. • Concatenation is not any sort of product, thus it is an iterated product in form of exponential. • E.g.: X1 = x, X2 = XX, X3 =XXX • In general xi is the string x repeated i times. We take x0 to be ε for any string x. Thus, ε plays the role of 1, the multiplicative identity.
  • 7. Strings (contd…) • If x is some string, then any string formed by discarding zero or more trailing symbols of x is called a prefix of x. • For e.g.: abc is a prefix of abcde. • A suffix of x is a string formed by deleting zero or more of the leading symbols of x. cde is a suffix of abcde. A substring of x is any string obtained by deleting a prefix and a suffix from x. • For any string x, both x and ε are prefixes, suffixes and substrings of x. • Any prefix or suffix of x is a substring of x, but a substring need not be a prefix or suffix. • For e.g..: cd is a substring of abcde but not a prefix or suffix .
  • 8. Languages • The term language means any set of strings formed from some specific alphabets. • Simple set such as Φ, the empty set {ε} having no members or the set containing only the empty string, are languages. • The notation of concatenation can also be applied to languages. • For e.g.: If L and M are languages, then L.M, or just LM • LM is language consisting of all strings xy which can be formed by selecting a string x from L, a string y from M, and concatenating them in that order. LM= {xy| x is in L and y is in M}
  • 9. Languages (contd…) • E.g.: If L={0, 01, 110} and M= {10, 110}. Then LM={010, 0110, 01110, 11010, 110110} • 11010 can be written as the concatenation of of 110 from L and 10 from M. • 0110 can be written as either 0.110 or 01.10 i.e. it is a string from L followed by one from M. • In analogy with strings, we use Li to stand for LL….L (i times). It is logicat to define L0 to be {ε}, since {ε} is the identity under concatenation of languages. i.e. {ε}L = L{ε} = L • The union of languages L & M is given by L ∪ M ={x | x is in L or x is in M}
  • 10. Languages (contd…) • If concatenation is analogous to multiplication. Then ø, the empty set is the identity under union (analogous to zero) øUL=LUø=L & ø L=Lø=ø • Any string in the concatenation of ø with L must be formed from x in ø and y in L.
  • 11. • There is another operation in specifying tokens, that is closure or “any number of” operator. We use L* to denote the concatenation of language L with itself any number of times. ∞ • L* = U Li i=0 • Consider D be the language consisting of the strings 0,1,……9 i.e. each string is a single decimal digit. Then D* is all strings of digits including empty string. • If L ={aa}, the L* is all strings of an even number of a’s.
  • 12. • LO = {ε}