
Compiler Design: 2-Mark and 16-Mark Answers

The document provides detailed answers to various questions related to compilers, including definitions of key concepts such as compilers, interpreters, and context-free grammar. It outlines the phases of a compiler, differentiates between parsing techniques, and explains regular expressions and finite automata. Additionally, it includes examples and comparisons of different concepts in compiler design and lexical analysis.


Here are the answers to the short and long questions.

Unit-1

Short Answer Questions (2 marks)


●​ Define compiler.​
A compiler is a program that translates source code written in a high-level
programming language into low-level code (e.g., assembly language or machine
code) that a computer can execute.
●​ What is Context-Free Grammar?​
Context-Free Grammar (CFG) is a formal grammar where production rules are of
the form A → α, where A is a non-terminal symbol, and α is a string of terminals
and/or non-terminals. The derivation of a string depends only on the non-terminal
being replaced, not the surrounding symbols (the context).
●​ Define pre-processor. What are the functions of the pre-processor?​
A preprocessor is a program that processes the source code before the
compilation stage. Its functions include:
○​ Macro substitution: Replacing symbolic names with their values.
○​ File inclusion: Inserting the contents of other files into the source code.
○​ Conditional compilation: Including or excluding code sections based on
specified conditions.
●​ What is an input buffer?​
An input buffer is a memory area used to store the source program text as it is
read by the compiler. It helps in efficient reading of the source code, especially
when dealing with lookahead in lexical analysis.
●​ Differentiate between compiler and interpreter.

| Feature | Compiler | Interpreter |
|---|---|---|
| Translation | Translates the entire source code into machine code at once. | Translates and executes the source code line by line. |
| Execution | Executes the generated machine code. | Executes the translated code directly. |
| Speed | Generally faster execution. | Generally slower execution. |
| Error Handling | Reports errors after compiling the entire code. | Reports errors during execution, line by line. |
| Output | Creates an executable file. | Does not create a separate executable file. |

●​ What is input buffering?​


Input buffering is the technique of reading large chunks of source program input
into a buffer to speed up the lexical analysis process. Instead of reading one
character at a time, the lexical analyzer reads from the buffer.
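A rough sketch of the idea (illustrative, in Python): the analyzer refills a fixed-size buffer with one bulk read and then hands out characters from it, rather than performing one read per character:

```python
import io

def chars(stream, buf_size=4096):
    """Yield characters one at a time, but read the stream in large blocks."""
    while True:
        block = stream.read(buf_size)  # one bulk read fills the buffer
        if not block:                  # end of input
            return
        yield from block               # the lexer consumes characters from the buffer

source = io.StringIO("position = initial + rate * 60;")
print("".join(chars(source)))
```

Real lexical analyzers often refine this into a two-buffer scheme with sentinels so that lookahead across a buffer boundary stays cheap.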
●​ Define the following terms: a) Lexeme b) Token
○​ Lexeme: A lexeme is the actual sequence of characters in the source code
that matches the pattern for some token class (e.g., an identifier, a number,
an operator). It is the "text" of a language element.
○​ Token: A token is a pair consisting of a token name (abstract symbol
representing the kind of lexical unit) and an optional attribute value (additional
information about the specific instance of the token). For example, <id, "x">,
<num, 10>.
●​ Define interpreter.​
An interpreter is a program that reads and executes source code instructions one
line at a time, without first translating it into machine code.
●​ What are the differences between NFA and DFA?

| Feature | NFA | DFA |
|---|---|---|
| Number of next states | Can have zero, one, or multiple next states for a given input symbol | Has exactly one next state for each input symbol |
| Epsilon transitions | Allows epsilon transitions (transitions without consuming input) | Does not allow epsilon transitions |
| Construction | Easier to construct (e.g., directly from a regular expression) | Harder to construct, but simpler and faster to simulate |
| Space | Can be more compact (fewer states) | Can require exponentially more states |
| Acceptance | Accepts a string if any path leads to a final state | Accepts a string if the unique path leads to a final state |

Long Answer Questions (16 marks)


●​ Explain the various phases of a compiler with an illustrative example.​
A compiler operates in several phases, each transforming the source code into a
different representation. Here's a breakdown with an example using the C
statement: position = initial + rate * 60;
1.​ Lexical Analysis (Scanner):
■​ Breaks the source code into a stream of tokens, which are the basic
building blocks of the language.
■​ The lexical analyzer reads the source program, character by character,
and groups the characters into meaningful sequences called lexemes. For
each lexeme, the lexical analyzer produces a token of the form
<token-name, attribute-value>.
■​ Example:
■​ position -> <id, 1> (identifier, attribute value 1)
■​ = -> <assign_op> (assignment operator)
■​ initial -> <id, 2>
■​ + -> <add_op>
■​ rate -> <id, 3>
■​ * -> <mul_op>
■​ 60 -> <num, 60> (number)
■​ ; -> <semicolon>
■​ The attribute value is a pointer to an entry in the symbol table. The symbol
table stores additional information about the identifier.
2.​ Syntax Analysis (Parser):
■​ Constructs a tree-like representation of the program's structure, typically
a parse tree or an abstract syntax tree (AST). This verifies that the
sequence of tokens follows the grammar of the programming language.
■​ The parser uses the tokens produced by the lexical analyzer to create a
hierarchical representation of the source code's syntactic structure. This
structure represents how the different parts of the code relate to each
other.
■​ Example (AST):​
=​
/ \​
position +​
/ \​
initial *​
/ \​
rate 60​

■​ The AST represents the expression position = initial + rate * 60 in a tree


format, showing the order of operations.
3.​ Semantic Analysis:
■​ Checks the program for consistency and meaning. It verifies type
compatibility, ensures that variables are declared before use, and
performs other static checks.
■​ This phase adds information to the AST by performing type checking and
other semantic analysis. It uses the symbol table to get information about
the types and properties of variables and functions.
■​ Example:
■​ Checks if position, initial, and rate have been declared with
appropriate data types.
■​ If rate and 60 are of different types (e.g., float and int), it may perform
type conversion (coercion) or report an error.
■​ Checks if the assignment is valid (e.g., assigning a value to a variable).
4.​ Intermediate Code Generation:
■​ Translates the source program into an intermediate representation (IR),
which is a machine-independent code. Common IRs include
three-address code and quadruples.
■​ The intermediate code should be easy to produce from the syntax tree
and easy to translate into the target machine code.
■​ Example (Three-Address Code):​
temp1 = rate * 60​
temp2 = initial + temp1​
position = temp2​

■​ Each instruction in the three-address code has at most three operands.


temp1 and temp2 are temporary variables generated by the compiler.
5.​ Code Optimization:
■​ Improves the intermediate code by eliminating redundancies and
improving efficiency. This phase aims to produce faster-executing or
smaller code.
■​ This phase is optional, but it can significantly improve the performance of
the generated code. Optimizations can be machine-independent or
machine-dependent.
■​ Example:
■​ If rate is a constant, the multiplication rate * 60 could be performed at
compile time (constant folding).
■​ If there are multiple occurrences of the expression rate * 60, it can be
computed only once and the result stored in a temporary variable
(common subexpression elimination).
6.​ Code Generation:
■​ Generates the target machine code (e.g., assembly language). This
involves selecting registers, allocating memory, and generating machine
instructions.
■​ The code generator takes the intermediate representation and produces
the final machine code that can be executed by the computer.
■​ Example (Assembly Code - simplified):​
MOV R1, rate ; Load 'rate' into register R1​
MUL R1, #60 ; Multiply R1 by 60​
MOV R2, initial ; Load 'initial' into R2​
ADD R2, R1 ; Add R1 to R2​
MOV position, R2 ; Store the result in 'position'​

■​ This is a simplified example. The actual assembly code would depend on


the specific target machine architecture.
7.​ Symbol Table Management:
■​ The symbol table is a data structure that stores information about
identifiers (variables, functions, etc.) used in the program. Each phase
uses and updates the symbol table.
■​ The symbol table stores information such as the name, type, scope, and
memory location of each identifier.
■​ Example:
■​ The symbol table would contain entries for position, initial, and rate,
including their data types, scope, and memory locations. For example:
■​ position: type = float, scope = global, memory_location = 0x1000
■​ initial: type = float, scope = global, memory_location = 0x1004
■​ rate: type = float, scope = global, memory_location = 0x1008
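As a rough sketch of what the scanner phase does (the token names and patterns below are illustrative, not from any particular compiler), the example statement can be tokenized with a few regular expressions:

```python
import re

# Illustrative token specification: (token name, pattern)
TOKEN_SPEC = [
    ("num",    r"\d+"),
    ("id",     r"[A-Za-z_]\w*"),
    ("add_op", r"\+"),
    ("mul_op", r"\*"),
    ("assign", r"="),
    ("semi",   r";"),
    ("skip",   r"\s+"),
]
MASTER = re.compile("|".join(f"(?P<{name}>{pat})" for name, pat in TOKEN_SPEC))

def tokenize(source):
    """Group characters into lexemes and emit <token-name, lexeme> pairs."""
    return [(m.lastgroup, m.group())
            for m in MASTER.finditer(source)
            if m.lastgroup != "skip"]        # whitespace is discarded

print(tokenize("position = initial + rate * 60;"))
```

The <id, n> attribute values shown in the answer would come from a symbol table lookup, which is omitted here for brevity.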
●​ Define Regular Expression. Explain the properties of Regular Expressions.​
A Regular Expression (RE) is a string of characters that defines a search pattern. It
is used to describe a set of strings that match the pattern. Regular expressions
are a powerful tool for specifying patterns in text.​
Properties/Rules of Regular Expressions:
1.​ Basic Symbols:
■​ a (an input symbol): For any symbol 'a' of the input alphabet, the regular
expression a represents the literal character 'a'.
■​ ε (epsilon): Represents the empty string (a string with no characters). It is
a string of zero length.
2.​ Operators:
■​ | (Alternation): Indicates a choice between two regular expressions. If r
and s are REs, then r|s denotes the set of strings that are either in the set
denoted by r or the set denoted by s.
■​ Example: a|b matches either "a" or "b".
■​ . (Concatenation): Indicates the concatenation of two regular expressions.
If r and s are REs, then rs denotes the set of strings that are the
concatenation of a string in r and a string in s.
■​ Example: ab matches "ab".
■​ * (Kleene Star): Indicates zero or more occurrences of a regular
expression. If r is an RE, then r* denotes the set of strings obtained by
concatenating zero or more strings from the set denoted by r.
■​ Example: a* matches ε (the empty string), "a", "aa", "aaa", and so on.
■​ + (Positive Closure): Indicates one or more occurrences of a regular
expression.
■​ Example: a+ matches "a", "aa", "aaa", and so on.
■​ ? (Zero or one occurrence): Indicates zero or one occurrence of a regular
expression.
■​ Example: a? matches ε or "a".
3.​ Precedence:
■​ Parentheses () can be used to group regular expressions and override the
default precedence.
■​ The order of precedence is usually: * (highest), concatenation, | (lowest).
4.​ Identity and Annihilator:
■​ ε is the identity element for concatenation: rε = εr = r. Concatenating any
regular expression with epsilon results in the same regular expression.
5.​ Examples:
■​ a|b: Matches either "a" or "b".
■​ ab: Matches "ab".
■​ a*: Matches ε, "a", "aa", "aaa", and so on.
■​ a+: Matches "a", "aa", "aaa", and so on.
■​ a?: Matches ε or "a".
■​ (a|b)*: Matches any string of "a"s and "b"s, including the empty string.
■​ (a|b)(a|b): Matches "aa", "ab", "ba", "bb".
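These operators behave the same way in practical regex engines; a quick check using Python's `re` module (which also writes alternation as `|`):

```python
import re

def matches(pattern, s):
    """Full-string match, mirroring the formal 'string is in L(r)' test."""
    return re.fullmatch(pattern, s) is not None

assert matches(r"a|b", "a") and matches(r"a|b", "b")       # alternation
assert matches(r"a*", "") and matches(r"a*", "aaa")        # Kleene star: zero or more
assert matches(r"a+", "a") and not matches(r"a+", "")      # positive closure: one or more
assert matches(r"a?", "") and matches(r"a?", "a")          # zero or one occurrence
assert matches(r"(a|b)*", "abba")                          # any string over {a, b}
assert not matches(r"(a|b)(a|b)", "a")                     # exactly two symbols required
```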
●​ Differentiate between top-down and bottom-up parsing techniques.

| Feature | Top-Down Parsing | Bottom-Up Parsing |
|---|---|---|
| Starting Point | Starts with the start symbol of the grammar. | Starts with the input string (tokens). |
| Direction | Tries to derive the input string from the start symbol. | Tries to reduce the input string to the start symbol. |
| Approach | Expands non-terminals to match the input. | Recognizes patterns in the input and reduces them to non-terminals. |
| Tree Construction | Builds the parse tree from the root to the leaves (pre-order). | Builds the parse tree from the leaves to the root. |
| Key Operations | Prediction (choosing the correct production) and Matching. | Shift (moving tokens onto the stack) and Reduce (replacing a handle with a non-terminal). |
| Grammar Handling | Works well when it is easy to predict the production to use; has difficulty with left-recursive grammars and grammars requiring backtracking. | Can handle a wider range of grammars, including those with left recursion. |
| Implementation | Can be implemented using recursive functions (recursive descent parsing). | Typically implemented using a table-driven approach. |
| Examples | Recursive Descent Parsing, LL Parsers (LL(1), LL(k)). | LR Parsers (SLR, LR(1), LALR), Operator Precedence Parsing. |

●​ Construct an FA equivalent to the regular expression (0+1)*(00+11)(0+1)*​
Here is a Finite Automaton that recognizes the language of (0+1)*(00+11)(0+1)*, i.e. all strings over {0, 1} that contain "00" or "11" as a substring. Since some transitions have more than one possible next state, the machine is an NFA:​
States: Q = {q0, q1, q2, q3}​
Input Alphabet: Σ = {0, 1}​
Start State: q0​
Final State: q3​
Transitions: δ​

δ(q0, 0) = {q0, q1}​
δ(q0, 1) = {q0, q2}​
δ(q1, 0) = {q3}​
δ(q1, 1) = ∅​
δ(q2, 0) = ∅​
δ(q2, 1) = {q3}​
δ(q3, 0) = {q3}​
δ(q3, 1) = {q3}​

Explanation:
○​ q0: Start state. It loops on both 0 and 1, which corresponds to the leading (0+1)*.
○​ q1: Entered by nondeterministically guessing that the current '0' is the first '0' of "00".
○​ q2: Entered by guessing that the current '1' is the first '1' of "11".
○​ q3: Reached after reading "00" (via q1) or "11" (via q2). It loops on both 0 and 1, corresponding to the trailing (0+1)*, and it is the final state.
The FA works as follows:
○​ From the start state q0, the automaton can read any sequence of 0s and 1s and stay in q0.
○​ On a '0' it may also move to q1 (the start of "00"); on a '1' it may also move to q2 (the start of "11").
○​ From q1, another '0' reaches q3 (it has read "00"); from q2, another '1' reaches q3 (it has read "11").
○​ Once in q3 the automaton stays in q3 on any further input, so every string containing "00" or "11" has some path to the final state q3 and is accepted.
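Such a machine can be validated by direct simulation; the sketch below encodes an equivalent four-state NFA (the transition function maps a state and symbol to a set of successor states) and tracks the set of currently reachable states:

```python
# NFA for (0+1)*(00+11)(0+1)*: accept strings containing "00" or "11".
# delta maps (state, symbol) -> set of next states (nondeterministic).
delta = {
    ("q0", "0"): {"q0", "q1"},   # loop on anything, or guess first 0 of "00"
    ("q0", "1"): {"q0", "q2"},   # loop on anything, or guess first 1 of "11"
    ("q1", "0"): {"q3"},         # completed "00"
    ("q2", "1"): {"q3"},         # completed "11"
    ("q3", "0"): {"q3"},         # trailing (0+1)*
    ("q3", "1"): {"q3"},
}
START, FINAL = "q0", {"q3"}

def accepts(s):
    current = {START}
    for ch in s:
        current = set().union(*(delta.get((q, ch), set()) for q in current))
    return bool(current & FINAL)

assert accepts("1001") and accepts("11") and accepts("0110")
assert not accepts("0101") and not accepts("")
```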
●​ Explain the various phases of a compiler in detail. Also, write down the
output for the following expression: position = initial + rate * 60​
(See the detailed explanation of compiler phases in the first long answer. The
example output is included there as well.)
●​ Construct an FA equivalent to the regular expression 10+(0+11)0*1​
Here is a Finite Automaton for the regular expression 10+(0+11)0*1, i.e. either the string "10", or "0" or "11" followed by any number of 0s and a closing '1'. The start state has two possible moves on '1', so the machine is an NFA:​
States: Q = {q0, q1, q2, q3, q4, q5}​
Input Alphabet: Σ = {0, 1}​
Start State: q0​
Final State: q5​
Transitions: δ​

δ(q0, 1) = {q1, q3}​
δ(q0, 0) = {q2}​
δ(q1, 0) = {q5}​
δ(q2, 0) = {q4}​
δ(q2, 1) = {q5}​
δ(q3, 1) = {q4}​
δ(q4, 0) = {q4}​
δ(q4, 1) = {q5}​

Explanation
○​ q0: Start state
○​ q1: After reading the '1' of the "10" alternative
○​ q2: After reading '0' from (0+11)
○​ q3: After reading the first '1' of "11" from (0+11)
○​ q4: Inside the 0* loop, after (0+11) is complete
○​ q5: Final state

The FA works as follows:
○​ From the start state q0, the automaton can read "10" (via q1) and reach the final state q5 directly.
○​ Alternatively, it can read a '0' and go to q2, or a '1' and go to q3; these are the two alternatives of (0+11).
○​ From q3, a second '1' completes "11" and moves to q4; from q2, any further 0s move into q4, which loops on '0' (the 0* part).
○​ Finally, reading the closing '1' from q2 (when 0* is empty) or from q4 reaches the final state q5.
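The construction can be sanity-checked against the regular expression itself; writing '+' as alternation '|' in Python's regex syntax:

```python
import re

# 10 + (0 + 11)0*1, with '+' written as '|' in Python regex notation
pattern = re.compile(r"10|(0|11)0*1")

def in_language(s):
    return pattern.fullmatch(s) is not None

assert in_language("10")        # the '10' alternative
assert in_language("01")        # '0', empty 0*, closing '1'
assert in_language("0001")      # '0', then '00', then '1'
assert in_language("111")       # '11', empty 0*, closing '1'
assert in_language("1101")      # '11', '0', '1'
assert not in_language("11")    # missing the closing '1'
assert not in_language("0")
```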
Unit-2

Short Answer Questions (2 marks)


●​ Define augmented grammar.​
An augmented grammar is a modified version of a grammar G, created by adding
a new start symbol S' and a production S' → S, where S is the original start
symbol of G. This is primarily used in LR parsing to indicate when the parser
should stop and accept the input. The purpose of the augmented grammar is to
have a unique start symbol.
●​ Compare the LR Parsers.​
LR parsers are bottom-up parsers that handle a large class of context-free
grammars. The main types are:
○​ SLR (Simple LR): The simplest LR parser, uses FOLLOW sets for reduction. It
has the smallest parsing table size but is the least powerful.
○​ LR(1): The most powerful LR parser, uses lookahead of 1 symbol. It can handle
the largest class of grammars but has the largest parsing table size.
○​ LALR (LookAhead LR): Intermediate in power and complexity, merges states
from LR(1) to reduce table size. It is more powerful than SLR and has a smaller
table size than LR(1), making it a good compromise.
●​ Compare and contrast LR and LL Parsers.

| Feature | LL Parsers | LR Parsers |
|---|---|---|
| Direction | Top-down | Bottom-up |
| Starting Point | Start symbol | Input string |
| Derivation | Leftmost derivation | Rightmost derivation in reverse |
| Grammar Handling | Handles grammars that are LL(k) (left-to-right scan, leftmost derivation, k-symbol lookahead). | Handles a larger class of grammars, including LR(k) grammars. |
| Complexity | Generally simpler to implement (e.g., recursive descent). | More complex to implement (table-driven). |
| Lookahead | Uses k symbols of lookahead to predict the production to use. | Uses information from the entire right-hand side of a production, plus k lookahead symbols. |
| Error Handling | Errors are detected early in the parsing process. | Errors are detected later in the parsing process. |
| Table Size | Smaller parsing tables. | Larger parsing tables (especially LR(1)). |
| Common Use | Used in some simple parsers, but less common for complex languages. | Widely used in compiler construction for most programming languages. |

●​ Differentiate between top-down parsers.​


Top-down parsers start with the start symbol and try to derive the input string.
They differ in how they choose which production to apply:
○​ Recursive Descent Parsing: Uses a set of recursive procedures (one for
each non-terminal) to parse the input. May involve backtracking, where the
parser tries a production and if it doesn't work out, it backtracks to try
another production.
○​ LL Parsers: Use a parsing table to determine which production to apply
based on the current input symbol (lookahead). LL(k) parsers use k symbols
of lookahead. No backtracking. LL(1) parsers are commonly used.
●​ Define Dead Code Elimination?​
Dead code elimination is a compiler optimization technique that removes code
that will never be executed. This includes statements that compute a result that is
never used. Removing dead code can make the program smaller and faster.
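A small, hypothetical illustration in Python of what a dead-code-eliminating optimizer removes:

```python
def area(radius):
    unused = radius ** 3      # dead: the result is never used -> can be removed
    if False:
        print("unreachable")  # dead: this branch can never execute
    return 3.14159 * radius * radius

# After dead code elimination, only the live computation remains:
def area_optimized(radius):
    return 3.14159 * radius * radius

assert area(2.0) == area_optimized(2.0)
```

Both functions return the same value for every input, which is exactly the guarantee dead code elimination must preserve.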
●​ Eliminate immediate left recursion for the following grammar:​
E -> E + T | T​
T -> T * F | F​
F -> (E) | id​

Solution:
1.​ For E -> E + T | T:
■​ Rewrite as:​
E -> T E'​
E' -> + T E' | ε​

■​ This transformation replaces the left recursion with right recursion. E'
represents the part of the expression that can be repeated (i.e., "+ T").
2.​ For T -> T * F | F:
■​ Rewrite as:​
T -> F T'​
T' -> * F T' | ε​
■​ Similar to the previous case, this eliminates the left recursion in the T
production.
3.​ F -> (E) | id: No left recursion.

Resulting Grammar:
E -> T E'​


E' -> + T E' | ε​
T -> F T'​
T' -> * F T' | ε​
F -> (E) | id​

●​ Mention the types of LR parser.​


The types of LR parsers are:
○​ SLR (Simple LR) Parser
○​ LR(1) Parser
○​ LALR (Look-Ahead LR) Parser
●​ Explain bottom-up parsing method.​
Bottom-up parsing is a parsing technique that starts with the input string of
tokens and attempts to reduce it step-by-step to the start symbol of the grammar.
It identifies patterns in the input that match the right-hand side of grammar rules
and replaces them with the corresponding non-terminal on the left-hand side.
The main operations are "shift" (reading input tokens onto a stack) and "reduce"
(replacing a handle on the stack with a non-terminal). LR parsing is a common
type of bottom-up parsing. It is also known as shift-reduce parsing.
Long Answer Questions (16 marks)
●​ Discuss left recursion and left factoring with examples.​
Left Recursion:
○​ Definition: A grammar is left recursive if it has a non-terminal A such that
there is a derivation A => Aα for some string α. In immediate left recursion, the
production is of the form A -> Aα | β.
○​ Problem: Top-down parsers (like recursive descent) cannot handle
left-recursive grammars because they can enter an infinite loop.
○​ Example:​
E -> E + T | T​
T -> T * F | F​
F -> id​

E -> E + T is left recursive
○​ Elimination: Left recursion can be eliminated by rewriting the grammar. For a
general production A -> Aα | β, it can be rewritten as:​
A -> β A'​
A' -> α A' | ε​

Applying this to the example:​
E -> T E'​
E' -> + T E' | ε​
T -> F T'​
T' -> * F T' | ε​
F -> id​

Left Factoring:
○​ Definition: Left factoring is a grammar transformation technique that is useful
for top-down parsers. If a non-terminal has multiple productions with a
common prefix, the prefix is factored out.
○​ Problem: Top-down parsers have difficulty when a non-terminal has multiple
productions with the same starting symbols because it is not clear which
production to choose.
○​ Example:​
S -> i E t S | i E t S e S | a​

Both productions for S start with "i E t S".
○​ Solution: Factor out the common prefix. For productions A -> αβ1 | αβ2,
rewrite them as:​
A -> α A'​
A' -> β1 | β2​

Applying this to the example:​
S -> i E t S S' | a​
S' -> e S | ε​
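To see why eliminating left recursion matters for top-down parsing, here is a minimal recursive-descent recognizer (a sketch, not production code) for the transformed grammar E -> T E', E' -> + T E' | ε, T -> F T', T' -> * F T' | ε, F -> id. With the original left-recursive production E -> E + T, the procedure E() would call itself without consuming input and never terminate:

```python
class Parser:
    """Recursive descent over the left-recursion-free grammar."""
    def __init__(self, tokens):
        self.tokens, self.pos = tokens, 0

    def peek(self):
        return self.tokens[self.pos] if self.pos < len(self.tokens) else "$"

    def eat(self, tok):
        if self.peek() != tok:
            raise SyntaxError(f"expected {tok}, got {self.peek()}")
        self.pos += 1

    def E(self):              # E  -> T E'
        self.T(); self.Ep()

    def Ep(self):             # E' -> + T E' | epsilon
        if self.peek() == "+":
            self.eat("+"); self.T(); self.Ep()

    def T(self):              # T  -> F T'
        self.F(); self.Tp()

    def Tp(self):             # T' -> * F T' | epsilon
        if self.peek() == "*":
            self.eat("*"); self.F(); self.Tp()

    def F(self):              # F  -> id
        self.eat("id")

def parses(tokens):
    p = Parser(tokens)
    try:
        p.E()
        return p.peek() == "$"   # succeed only if all input was consumed
    except SyntaxError:
        return False

assert parses(["id", "+", "id", "*", "id"])
assert not parses(["id", "+", "*"])
```

Each non-terminal of the transformed grammar becomes one procedure, and the ε-alternatives become the "do nothing" fall-through cases.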

●​ Construct the predictive parser for the following grammar:​


S -> ( L ) | a​
L -> L , S | S​

Solution:​
First, eliminate left recursion from L:​
S -> ( L ) | a​
L -> S L'​
L'-> , S L' | ε​

Now, compute FIRST and FOLLOW sets:​
FIRST(S) = { '(', 'a' }​
FIRST(L) = { '(', 'a' }​
FIRST(L') = { ',', ε }​
FOLLOW(S) = { ')', ',', '$' }​
FOLLOW(L) = { ')' }​
FOLLOW(L') = { ')' }​

Predictive Parsing Table:

| Non-terminal | Input Symbol | Production |
|---|---|---|
| S | ( | S -> ( L ) |
| S | a | S -> a |
| L | ( | L -> S L' |
| L | a | L -> S L' |
| L' | , | L' -> , S L' |
| L' | ) | L' -> ε |
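The table can be driven by a generic predictive-parsing loop; a minimal sketch (the token and table encodings here are my own):

```python
# Table-driven LL(1) parser for S -> ( L ) | a, L -> S L', L' -> , S L' | epsilon.
# TABLE[(nonterminal, lookahead)] -> right-hand side (empty tuple = epsilon).
TABLE = {
    ("S", "("): ("(", "L", ")"),
    ("S", "a"): ("a",),
    ("L", "("): ("S", "L'"),
    ("L", "a"): ("S", "L'"),
    ("L'", ","): (",", "S", "L'"),
    ("L'", ")"): (),                  # L' -> epsilon
}
NONTERMINALS = {"S", "L", "L'"}

def parse(tokens):
    stack = ["$", "S"]                # start symbol on top of end marker
    tokens = list(tokens) + ["$"]
    i = 0
    while stack:
        top = stack.pop()
        if top in NONTERMINALS:
            rhs = TABLE.get((top, tokens[i]))
            if rhs is None:           # no table entry: syntax error
                return False
            stack.extend(reversed(rhs))   # push RHS, leftmost symbol on top
        elif top == tokens[i]:
            i += 1                    # match terminal (or the '$' marker)
        else:
            return False
    return i == len(tokens)

assert parse(["(", "a", ",", "a", ")"])   # ( a , a )
assert parse(["a"])
assert not parse(["(", "a", ","])
```

Each step either expands the non-terminal on top of the stack using the table entry selected by the lookahead, or matches a terminal against the input.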

●​ Check whether the following grammar is SLR (1) or not. Explain your answer
with reasons.​
S -> L = R​
S -> R​
L -> * R​
L -> id​
R -> L​

Solution:
1.​ Augment the grammar:​
S' -> S​
S -> L = R​
S -> R​
L -> * R​
L -> id​
R -> L​

2.​ Compute FIRST and FOLLOW sets:​


FIRST(S) = { '*', 'id' }​
FIRST(L) = { '*', 'id' }​
FIRST(R) = { '*', 'id' }​
FOLLOW(S) = { '$' }​
FOLLOW(S') = { '$' }​
FOLLOW(L) = { '=', '$' }​
FOLLOW(R) = { '$', '=' }​

3.​ Construct the SLR(1) parsing table (states and transitions omitted for
brevity).
4.​ Check for conflicts:​
In the construction of the SLR(1) parsing table, a shift-reduce conflict arises in
a state when the lookahead symbol is '='. Specifically, there's a conflict
between:
■​ Shifting '=' onto the stack (from the production S -> L = R).
■​ Reducing by the production R -> L (since '=' is in FOLLOW(R)).

Conclusion: The grammar is not SLR(1) because there is a shift-reduce conflict


in the SLR(1) parsing table.
●​ Construct SLR parse table for​
S -> L = R | R​
R -> L​
L -> * R | id​

Solution:
1.​ Augment the grammar:​
S' -> S​
S -> L = R​
S -> R​
R -> L​
L -> * R​
L -> id​

2.​ Compute FIRST and FOLLOW sets:​


FIRST(S) = { '*', 'id' }​
FIRST(L) = { '*', 'id' }​
FIRST(R) = { '*', 'id' }​
FOLLOW(S) = { '$' }​
FOLLOW(S') = { '$' }​
FOLLOW(L) = { '=', '$' }​
FOLLOW(R) = { '$', '=' }​

3.​ Construct the SLR(1) parsing table. (Table construction is lengthy and omitted
here, but it would involve creating the LR(0) item sets and the GOTO and
ACTION tables. Note that this is the same grammar as in the previous
question, so the shift-reduce conflict on '=' appears again: the grammar is
not SLR(1).)
●​ State and explain the rules to compute FIRST and FOLLOW functions​
Given a grammar G, FIRST and FOLLOW are two functions used in constructing
predictive parsing tables.
FIRST(α)
○​ FIRST(α), where α is any string of grammar symbols, is the set of terminal
symbols that begin the strings derived from α.
○​ Rules for computing FIRST(X), where X is a grammar symbol (terminal or
non-terminal):
1.​ If X is a terminal, then FIRST(X) = { X }.
2.​ If X is a non-terminal and X -> Y1 Y2 ... Yn is a production, then:
■​ Add FIRST(Y1) - { ε } to FIRST(X).
■​ If Y1 can derive ε (i.e., ε is in FIRST(Y1)), then also add FIRST(Y2) - { ε }
to FIRST(X), and so on, up to Yn or the first symbol that cannot derive ε.
■​ If all of Y1 Y2 ... Yn can derive ε, then add ε to FIRST(X).
3.​ If X -> ε is a production, then add ε to FIRST(X).
FOLLOW(A)
○​ FOLLOW(A), where A is a non-terminal, is the set of terminals that can
appear immediately to the right of A in some sentential form.
○​ Rules for computing FOLLOW(A) for a non-terminal A:
1.​ If A is the start symbol, then add '$' (the end-of-input marker) to FOLLOW(A).
2.​ If there is a production B -> αAβ, then add FIRST(β) - { ε } to FOLLOW(A).
3.​ If there is a production B -> αA, or a production B -> αAβ where ε is in
FIRST(β), then add FOLLOW(B) to FOLLOW(A).
Example:​
E -> T E'​
E' -> + T E' | ε​
T -> F T'​
T' -> * F T' | ε​
F -> ( E ) | id​

Computing FIRST and FOLLOW gives:​
FIRST(E) = { '(', 'id' }​
FIRST(E') = { '+', ε }​
FIRST(T) = { '(', 'id' }​
FIRST(T') = { '*', ε }​
FIRST(F) = { '(', 'id' }​

FOLLOW(E) = { ')', '$' }​
FOLLOW(E') = { ')', '$' }​
FOLLOW(T) = { '+', ')', '$' }​
FOLLOW(T') = { '+', ')', '$' }​
FOLLOW(F) = { '*', '+', ')', '$' }​
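The FIRST rules can be implemented directly as a fixed-point iteration; a compact sketch over the example grammar (with ε written out explicitly):

```python
EPS = "ε"

# Example expression grammar; productions are lists of symbols.
GRAMMAR = {
    "E":  [["T", "E'"]],
    "E'": [["+", "T", "E'"], [EPS]],
    "T":  [["F", "T'"]],
    "T'": [["*", "F", "T'"], [EPS]],
    "F":  [["(", "E", ")"], ["id"]],
}

def compute_first(grammar):
    first = {nt: set() for nt in grammar}

    def first_of(sym):
        # Rule 1: FIRST of a terminal is the terminal itself
        return first[sym] if sym in grammar else {sym}

    changed = True
    while changed:                        # iterate until a fixed point
        changed = False
        for nt, productions in grammar.items():
            for prod in productions:
                for sym in prod:
                    add = {EPS} if sym == EPS else first_of(sym) - {EPS}
                    if not add <= first[nt]:
                        first[nt] |= add
                        changed = True
                    # Rule 2: move past a symbol only if it can derive ε
                    if sym == EPS or EPS not in first_of(sym):
                        break
                else:                     # every Yi derived ε
                    if EPS not in first[nt]:
                        first[nt].add(EPS)
                        changed = True
    return first

first = compute_first(GRAMMAR)
assert first["E"] == {"(", "id"}
assert first["E'"] == {"+", EPS}
assert first["F"] == {"(", "id"}
```

FOLLOW follows the same fixed-point pattern, propagating FIRST(β) - { ε } and FOLLOW(B) sets until nothing changes.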

●​ Construct CLR parse table for​


S -> L = R | R​
R -> L​
L -> * R | id​

Constructing a CLR (Canonical LR) parsing table is a complex process involving
the following steps:
1.​ Augment the grammar: Add a new start symbol S' and a production S' -> S.
2.​ Construct the LR(1) items: An LR(1) item is a production with a dot (.) at
some position in the right-hand side, along with a lookahead symbol.
3.​ Create the canonical collection of sets of LR(1) items: This involves
starting with the initial item and repeatedly applying the closure and goto
operations to generate all sets of items.
4.​ Construct the parsing table: The states of the parser correspond to the sets
of LR(1) items. The actions (shift, reduce, accept) and gotos are determined
based on the items in each state.
The CLR parsing table will have a larger number of states compared to SLR or
LALR parsing tables for the same grammar. Due to the complexity and length of
this process, the actual table construction is omitted here.
●​ Construct the LR Parsing table for the following grammar:​
E -> E + T | T​
T -> T * F | F​
F -> ( E ) | id​

Constructing a full LR parsing table (Canonical LR or LR(1)) is quite lengthy. Here's
an outline of the process:
1.​ Augment the grammar: Add E' -> E
2.​ Compute FIRST and FOLLOW sets (see previous example).
3.​ Construct the sets of LR(1) items: This involves finding the closure and goto
of the items. An LR(1) item is of the form [A -> α.β, a] where 'a' is the
lookahead.
4.​ Create the parsing table: The table has two parts: ACTION and GOTO.
■​ ACTION[i, a] specifies the action (shift, reduce, accept, error) for state 'i'
and input symbol 'a'.
■​ GOTO[i, A] specifies the next state when the parser is in state 'i' and
encounters non-terminal A.
Because of the number of states and table entries, the complete table is not
provided here. A parser generator tool would typically be used to create this
table.
●​ Construct an LALR Parsing table for the following grammar:​
E -> E + T | T​
T -> T * F | F​
F -> id​

LALR parsing is similar to CLR parsing, but it merges states with the same core
(productions before the lookahead) to reduce the table size. Here's the process:
1.​ Construct the LR(1) item sets (as in CLR parsing).
2.​ Merge states with the same core: Identify states that have the same
productions with the dot in the same position, but possibly different
lookahead sets. Combine these states.
3.​ Create the LALR parsing table: The ACTION and GOTO tables are
constructed as in CLR parsing, but with the merged states.
LALR tables are smaller than CLR tables, making them more practical for
implementation. The actual table construction is complex and omitted here.
●​ Find the SLR parsing table for the given grammar: E -> E + E | E * E | ( E ) | id.
And parse the sentence (a + b) * c
1.​ Augment the grammar:​
E' -> E​
E -> E + E​
E -> E * E​
E -> ( E )​
E -> id​

2.​ Compute FIRST and FOLLOW sets:​


FIRST(E) = { '(', 'id' }​
FOLLOW(E) = { '+', '*', ')', '$' }​

3.​ Construct the SLR parsing table: (Table construction is lengthy and omitted
here. Note that this grammar is ambiguous, so the table contains
shift-reduce conflicts; these are conventionally resolved by giving '*' higher
precedence than '+' and making both operators left-associative.)
4.​ Parse the sentence (a + b) * c:​
Here's a sketch of the parsing process (using a stack and input):

| Stack | Input | Action |
|---|---|---|
| $ | (a+b)*c$ | Shift '(' |
| $( | a+b)*c$ | Shift 'a' |
| $(a | +b)*c$ | Reduce E -> id |
| $(E | +b)*c$ | Shift '+' |
| $(E+ | b)*c$ | Shift 'b' |
| $(E+b | )*c$ | Reduce E -> id |
| $(E+E | )*c$ | Reduce E -> E + E |
| $(E | )*c$ | Shift ')' |
| $(E) | *c$ | Reduce E -> ( E ) |
| $E | *c$ | Shift '*' |
| $E* | c$ | Shift 'c' |
| $E*c | $ | Reduce E -> id |
| $E*E | $ | Reduce E -> E * E |
| $E | $ | Accept |

(Here the identifiers a, b, and c are each treated as id tokens.)
