0% found this document useful (0 votes)

137 views

Decaf

This document describes a compiler design project for a language called Decaf. Decaf is a strongly typed object-oriented language similar to C/C++/Java but with a smaller feature set, making it suitable for learning compiler design concepts. The document outlines the lexical components, grammar, and implementation of a lexical analyzer using Flex and parser using Bison for the Decaf language.

Uploaded by

Shounak Dey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

137 views

Decaf

Uploaded by

Shounak Dey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 31

Decaf

Compiler Design Project

Shivika Singh (14)

Dynelle Fernandes(15)
Shounak Dey (32)
Objective

Decaf is a strongly-typed, object-oriented language with support for inheritance and

encapsulation. By design, it has many similarities with C/C++/Java. The feature set of this
language is smaller and more straightforward compared to C/C++/Java and hence makes the
programming projects manageable, yet it’s expressive enough to support object-oriented
programming. Thus it makes for an ideal language to study and learn the concepts of compiler
design. This document gives the implementation for the lexical analyzer and parser for this
language.

Lexical Components

 The following are keywords. They are all reserved, which means they cannot be used as
identifiers or redefined.

void int double bool string class interface null this extends implements
for while if else return break New NewArray Print ReadInteger
ReadLine

 An identifier is a sequence of letters, digits, and underscores, starting with a letter. Decaf
is case-sensitive.
 Whitespace (i.e. spaces, tabs, and newlines) serves to separate tokens, but is otherwise
ignored.
 A boolean constant is either true or false. Like keywords, these words are reserved.
 An integer constant can either be specified in decimal (base 10) or hexadecimal (base
16). A decimal integer is a sequence of decimal digits (0-9). A hexadecimal integer must
begin with 0X or 0x.
 A double constant is a sequence of digits, a period, followed by any sequence of digits,
maybe none.
 A string constant is a sequence of characters enclosed in double quotes. Strings can
contain any character except a newline or double quote.
 Operators and punctuation characters used by the language includes:

+ - * / % < <= > >= = == != && || ! ; , . [ ] ( ) { }

 A single-line comment is started by // and extends to the end of the line. Multi-line
comments start with /* and end with the first subsequent */.
Grammar
All the Decaf programs should conform to the following language.

 Notations:
The terminal symbols used in this description of the Decaf grammar are:

Category Symbols

Identifiers identifier
Literals intLiteral charLiteral booleanLiteral
Keywords if while else . . .
Primitive Types boolean char int void
Punctuation (){}[];,.
Operators +-*/=...

(Lexical Component section contains the whole list)

 Decaf Grammar Productions:

The productions in the Decaf program are as follows:

Program ➝ Decl+

Decl ➝ VariableDecl | FunctionDecl | ClassDecl | InterfaceDecl

VariableDecl ➝ Variable ;

Variable ➝ Type indent

Type ➝ int | double | bool | string | indent | Type[ ]

FunctionDecl ➝ Type indent ( Formals ) StmtBlock | void indent ( Formals) StmtBlock

Formals ➝ Variable+, | ∈

ClassDecl ➝ class indent <extends indent> <implements indent+,> { Field* }

Field ➝ VariableDecl | FunctionDecl

InterfaceDecl ➝ interface indent { Prototype* }

Prototype ➝ Type indent ( Formals ) ; | void indent ( Formals ) ;

StmtBlock ➝ { VariableDecl* Stmt* }

Stmt ➝ <Expr> ; | IfStmt | WhileStmt | ForStmt | BreakStmt | ReturnStmt | PrintStmt |

StmtBlock

IfStmt ➝ if ( Expr ) Stmt <else Stmt>

WhileStmt ➝ while ( Expr ) Stmt

ForStmt ➝ for ( <Expr> ; Expr ; <Expr> ) Stmt

ReturnStmt ➝ return <Expr> ;

BreakStmt ➝ break ;

PrintStmt ➝ Print ( Expr+, );

Expr ➝ Lvalue = Expr | Constant | Lvalue | this | Call | ( Expr ) | Expr + Expr | Expr -
Expr | Expr * Expr | Expr / Expr | Expr % Expr | - Expr | Expr < Expr | Expr <= Expr |
Expr > Expr | Expr >= Expr | Expr == Expr | Expr != Expr | Expr && Expr | Expr || Expr
| !Expr | ReadInteger( ) | ReadLine() | New ( Indent ) | New Array ( Expr, Type )

Lvalue ➝ indent | Expr . indent | Expr [ Expr ]

Call ➝ indent ( Actuals ) | Expr . indent ( Actuals )

Actuals ➝ Expr+, | ∈

Constant ➝ intConstant | doubleConstant | boolConstant | stringConstant | null

Implementation
The parser has been implemented using Bison while the lexical analysis is done using Flex. C
Language has been used for the implementation of the symbol table.
Type of Parser
Bottom up parser as available in Flex and Bison.

Methodology

Using Flex and Bison, we built the lexical and syntax phases of the compiler of the language
Decaf. Lexical analyzer / scanning phase will scan each lexeme and classify them on the basis of
the type of tokens generated by them. Syntax analyzer will check if the tokens form the proper
grammar as required by the language definition. A symbol table will be maintained in order to
keep track of variables defined in a Decaf program. In case of any error in these two stages, a
message will be displayed indicated in which row and column the error has occurred.

User Documentation / Readme Text

The nomenclature of the flex and bison files should follow Decaf.l and
Decaf.y .

To build the parser, run

make build
To specify input file to the parser, change the filename in the Decaf.y file.
To run the parser, run
make run
To remove the built files, run
make clean

Code

Lexical Analyzer (Flex)

//Decaf.l file

%{
#include"Decaf.tab.h"
%}
%option yylineno
%x C_COMMENT

%%
"/*" { BEGIN(C_COMMENT); }
<C_COMMENT>"*/" { BEGIN(INITIAL); }
<C_COMMENT>\n { }
<C_COMMENT>. {}
"//".*
\n {printf("%s\n",yytext);}
(" "|\t) {printf("%s\n",yytext);}
(";") {printf("%s\n",yytext);return END;}
(",") {printf("%s\n",yytext);return COMMA;}
"." {printf("%s\n",yytext);return FS;}
"[" {printf("%s\n",yytext);return SQBO;}
"]" {printf("%s\n",yytext);return SQBC;}
("(") {printf("%s\n",yytext);return OB;}
(")") {printf("%s\n",yytext);return CB;}
("{") {printf("%s\n",yytext);return OCB;}
("}") {printf("%s\n",yytext);return CCB;}
("0x"|"0X")[0-9|A-F|a-f]+ {printf("%s\n",yytext);return HEXCONST;}
[+|-]?[0-9]+[\.][0-9]*([E][+|-]?[0-9]+)? {printf("%s\n",yytext);return FLOAT;}
[+|-]?[0-9]+ {printf("%s\n",yytext);return DECCONST;}
"null" {printf("%s\n",yytext);return NULLCONST;}
"true"|"false" {printf("%s\n",yytext);return BOOLCONST;}
"void" {printf("%s\n",yytext);return VOID;}
"class" {printf("%s\n",yytext);return CLASS;}
"extends" {printf("%s\n",yytext);return EXTENDS;}
"implements" {printf("%s\n",yytext);return IMPLEMENTS;}
"interface" {printf("%s\n",yytext);return INTERFACE;}
"int"|"double"|"bool"|"string" {printf("%s\n",yytext);return DT;}
"if" {printf("%s\n",yytext);return IF;}
"else" {printf("%s\n",yytext);return ELSE;}
"for" {printf("%s\n",yytext);return FOR;}
"while" {printf("%s\n",yytext);return WHILE;}
"return" {printf("%s\n",yytext);return RETURN;}
"break" {printf("%s\n",yytext);return BREAK;}
"Print" {printf("%s\n",yytext);return PRINT;}
"this" {printf("%s\n",yytext);return THIS;}
"ReadInteger"|"ReadLine" {printf("%s\n",yytext);return READ;}
"New" {printf("%s\n",yytext);return NEW;}
"NewArray" {printf("%s\n",yytext);return NEWARR;}
[A-Za-z_][0-9|A-Za-z_]* {printf("%s\n",yytext);return ID;}
("-") {printf("%s\n",yytext);return MINUS;}
("+") {printf("%s\n",yytext);return PLUS;}
("*") {printf("%s\n",yytext);return MULT;}
("/") {printf("%s\n",yytext);return DIVIDE;}
("%") {printf("%s\n",yytext);return MOD;}
("!") {printf("%s\n",yytext);return NOT;}
("&&") {printf("%s\n",yytext);return AND;}
("||") {printf("%s\n",yytext);return OR;}
("<") {printf("%s\n",yytext);return GT;}
(">") {printf("%s\n",yytext);return LT;}
("!=") {printf("%s\n",yytext);return NE;}
("==") {printf("%s\n",yytext);return EQQ;}
("<=") {printf("%s\n",yytext);return LTE;}
(">=") {printf("%s\n",yytext);return GTE;}
("=") {printf("%s\n",yytext);return EQ;}
(\"(\\.|[^"\\])*\") {printf("%s\n",yytext);return STRCONST;}
%%

int yywrap()
{
return 1;
}

Parser (Bison)
//dec.y file
%{
#include<stdio.h>
#include<stdlib.h>
#include "scoper.h"
int yylex();
int yyerror();
extern FILE* yyin;
extern int yylineno;
%}
%token COMM TS NL HEXCONST FLOAT DECCONST BOOLCONST KEY ID
STRCONST END DT COMMA FS NULLCONST SQBO SQBC OB CB VOID CLASS
OCB CCB EXTENDS IMPLEMENTS INTERFACE FOR WHILE IF ELSE RETURN
BREAK EQ THIS MINUS NOT READ NEW NEWARR PRINT SP PLUS MULT
DIVIDE MOD AND OR NE EQQ LT GT LTE GTE U_MINUS
%locations
//setting precedences and associativity in order to avoid shift-reduce conflicts
%left EQ
%left OR
%left AND
%nonassoc EQQ NE
%nonassoc LT GT LTE GTE
%left PLUS MINUS
%left MULT DIVIDE MOD
%nonassoc T_UnaryMinus NOT
%nonassoc FS SQBO
%nonassoc T_Lower_Than_Else
%nonassoc ELSE

%%
start : declList{printf("Success \n"); exit(0);}
;
declList : declList decl
| decl
;
decl : classDecl
| fnDecl
| varDecl
| intDecl
;
varDecl : var END
;
var : type ID
;
type : DT
| ID
| type SQBO SQBC
;
intDecl : INTERFACE ID OCB intfList CCB
;
intfList : intfList fnHeader END
|
;
classDecl : CLASS ID optExt optImpl OCB fieldList CCB
;
optExt : EXTENDS ID
|
;
optImpl : IMPLEMENTS impList
|
;
impList : impList COMMA ID
| ID
;
fieldList : fieldList field
|
;
field : varDecl
| fnDecl
;
fnHeader : type ID OB formals CB
| VOID ID OB formals CB
;
formals : formalList
|
;
formalList : formalList COMMA var
| var
;
fnDecl : fnHeader stmtBlock
;
stmtBlock : OCB varDecls stmtList CCB
;
varDecls : varDecls varDecl
|
;
stmtList : stmt stmtList
|
;
stmt : optExpr END
| stmtBlock
| IF OB expr CB stmt optElse
| WHILE OB expr CB stmt
| FOR OB optExpr END expr END optExpr CB stmt
| RETURN expr END
| RETURN END
| PRINT OB exprList CB END
| BREAK END
;
lvalue : ID
| expr FS ID
| expr SQBO expr SQBC
;
call : ID OB actuals CB
| expr FS ID OB actuals CB
;
optExpr : expr
|
;
expr : lvalue
| call
| constant
| lvalue EQ expr
| expr PLUS expr
| expr MINUS expr
| expr DIVIDE expr
| expr MULT expr
| expr MOD expr
| expr EQQ expr
| expr NE expr
| expr LT expr
| expr GT expr
| expr LTE expr
| expr GTE expr
| expr AND expr
| expr OR expr
| OB expr CB
| '-' expr %prec T_UnaryMinus
| NOT expr
| READ OB CB
| NEW OB ID CB
| NEWARR OB expr COMMA type CB
| THIS
;
constant : DECCONST
| FLOAT
| BOOLCONST
| STRCONST
| NULLCONST
| HEXCONST
;
actuals : exprList
|
;
exprList : exprList COMMA expr
| expr
;
optElse : ELSE stmt
| %prec T_Lower_Than_Else
;

%%
int yyerror(char *msg)
{
printf("Invalid expression at line number: %d %s\n",yylineno,msg);
return 1;
}
void main()
{
printf("Enter expression: ");
yyin=fopen("ex.txt","r");
generateSymbolTable();
do{
if(yyparse()){
printf("Error\n");exit(0);
}
}while(feof(yyin)!=0);
printf("Success\n");
}

Symbol Table header file ( C language)

//scoper.h

#include<stdio.h>

#include<string.h>

#include<stdlib.h>

#include<math.h>

/*Symbol table has the following fields

1. Token ID

2. Variable Name

3. Data Type

4. Scope
5. Scope ID

5. Arguments

6. Argument count

7. Return type

8. Lifetime of the construct

// Define structures and constants

typedef struct line{

char content[100];

int lineno;

}line;

typedef struct var{

int id;

char name[100];

char type[100];

int size;

int entrypoint;

}var;

typedef struct funcinfo{

int id;

char return_type[100];

char func_name[100];

var args[100];

int argc;

int entrypoint;
int exitpoint;

}function;

typedef struct token{

//Visible fields

int id;

char name[100];

char type[100];

int size;

char scope;

int scopeID;

struct token *args[10];

int argc;

char ret_type[100];

int lifetime;

//Hidden fields

int entrypoint;

int isFunction;

int exitpoint;

}token;

char keyword[][1000] =
{"void","int","double","bool","string","class","interface","null","this","extends","implements","for","whi
le","if","else","return","break","New","NewArray","Print","ReadInteger","ReadLine"};

char datatype[][1000] = {"int","double","bool","string"};

//Define general methods

int isWhitespace(char s){

return (s==32)||(s==9); //Checking if s is SPACE or \t

int isKeyword(char *s){

int size = 22,i;

for(i=0;i<size;i++){

if(strcmp(s,keyword[i])==0)

return 1;

return 0;

int isDataType(char *s){ // Check if the parameter is a datatype

int size = 5,i;

for(i=0;i<size;i++){

if(strcmp(s,datatype[i])==0)

return 1;

return 0;

int isIdentifier(char *s){

int l = strlen(s);

int i;

int flag = (s[0]>='a' && s[0]<='z') || (s[0]>='A' && s[0]<='Z') || (s[0]=='_');

for(i=1;i<l;i++)

flag &= (s[0]>='a' && s[0]<='z') || (s[0]>='A' && s[0]<='Z') || (s[0]=='_') || (s[i]>='0' &&
s[i]<='9');
return flag;

void findFirstLexeme(char *s,char *lex){ // Finding the first word of the line.

int pos=0;

while(s[pos]!='\0' && s[pos]!=' '){

lex[pos] = s[pos];

pos++;

lex[pos]=='\0';

void getFirstWord(char *pcontent,char *word){ // Find the first word in a string when leading
whitespace characters are possible

int state=0,l=strlen(pcontent),i=0,pos=0;

while(i<l){

if(state && isWhitespace(pcontent[i]))

break;

if(!state && !isWhitespace(pcontent[i])){

state = 1;

word[pos] = pcontent[i];

pos++;

else{

if(!isWhitespace(pcontent[i]))

word[pos] = pcontent[i];

else

break;

pos++;
}

i++;

word[pos]='\0';

return;

// Define methods corresponding to function data type

void getFuncName(char pcontent,char name){ // get function name

int l = strlen(pcontent),pos=0,i=0;

while(!isWhitespace(pcontent[i]))

i++;

while(pcontent[i]!='('){

name[pos] = pcontent[i];

pos++;

i++;

name[pos]='\0';

return;

int isFunc(line tmp){ // Check if the parameter line is a function

int start=0,brpos=0,end=0,l=strlen(tmp.content);

// printf("%s %d\n",tmp.content,l);

while(brpos<l && tmp.content[brpos]!='(')

brpos++;
start = brpos;

while(brpos<l && tmp.content[brpos]!=')')

brpos++;

end = brpos;

if(end==l || start==l)

return 0;

start++;end--;

if(start > end)

return 1;

int pos=0;

char pcontent[100];

while(start<=end){

pcontent[pos] = tmp.content[start];

pos++;

start++;

pcontent[pos]='\0';

char word[100];

getFirstWord(pcontent,word);

return isDataType(word) || (!isKeyword(word) && isIdentifier(word));

void parse(line inp,function *f){ // parse

char tmp[100];

getFirstWord(inp.content,tmp);

strcpy(f->return_type,tmp);

f->entrypoint = inp.lineno;

getFuncName(inp.content,tmp);

strcpy(f->func_name,tmp);
return;

int getExitpoint(line *inp,int start){ // find line of death

int balance=1;

start++;

while(balance>0){

if(inp[start].content[0]=='{')

balance++;

else if(inp[start].content[0]=='}')

balance--;

start++;

return start-1;

void getFargs(char decl,char arglist){ // generate argument string from func

declaration

int start=0;

while(decl[start]!='(')

start++;

int end=start;

while(decl[end]!=')')

end++;

end--;

if(start>end){

arglist[0]='\0';

return;
}

int pos=0;

while(start<=end){

arglist[pos] = decl[start];

pos++;

start++;

arglist[pos] = '\0';

return;

void getArg(char *decl,int typelen,char *arglist){ // Get arg name when there's only one variable

int start=typelen+1,l=strlen(decl),pos=0;

while(start<l){

arglist[pos] = decl[start];

start++;

pos++;

arglist[pos] = '\0';

return;

// Define functions for the var data type

void generateArgs(char *decl,int typelen,char *arglist){ // the 3rd function that generates arguments.
Consider modularization

int start=typelen+1,l=strlen(decl),pos=0;

while(start<l){

arglist[pos] = decl[start];

start++;
pos++;

arglist[pos-1] = '\0';

return;

void create(var* tmp,char name,char type,int id,int lineno){ // create a variable

strcpy(tmp->name,name);

strcpy(tmp->type,type);

tmp->id = id;

tmp->entrypoint = lineno;

return;

void strip(char *tmp){

int pos=0;

char ret[100];

while(tmp[pos]!='\0' && tmp[pos]!='=' && tmp[pos]!=' '){

ret[pos] = tmp[pos];

pos++;

ret[pos] = '\0';

strcpy(tmp,ret);

return;

int getSize(char *s){

if(strcmp(s,"char")==0) return 1;

if(strcmp(s,"int")==0 || strcmp(s,"float")==0) return 2;

return 4;

// Define functions for the token data type

void printToken(token* tmp){

// printf("Name-12Type-12Size-12Scope-12Return Type-12Number of Arguments-12Arguments-

12\n");

if(!tmp->isFunction)

printf("%-12d%-12s%-12s%-12d%-12c%-12d%-12s%-12s%-12d%-12s\n",tmp->id,tmp-
>name,tmp->type,tmp->size,tmp->scope,tmp->scopeID,"NA","NA",tmp->lifetime,"NA");

else{

printf("%-12d%-12s%-12s%-12s%-12c%-12d%-12s%-12d%-12d",tmp->id,tmp-
>name,tmp->type,"NA",tmp->scope,tmp->scopeID,tmp->ret_type,tmp->argc,tmp->lifetime);

int i;

if(tmp->argc == 0){

printf("%-12s\n","None");

return;

for(i=0;i<(tmp->argc - 1);i++)

printf("%s,",tmp->args[i]->name);

printf("%s\n",tmp->args[tmp->argc - 1]->name);

return;

int cmp(const void A,const void B){

token a = (token )A;

token b = (token )B;

if(a->entrypoint == b->entrypoint)
return a->isFunction < b->isFunction;

return a->entrypoint > b->entrypoint;

void printSymbolTable(token *tlist,int tcount){

printf("%-12s%-12s%-12s%-12s%-12s%-12s%-12s%-12s%-12s%-12s\n\n","Token
ID","Name","Type","Size","Scope","ScopeID","Ret Type","Argc","Lifetime","Arguments");

// Sort all tokens by entry time.

int i;

for(i=0;i<tcount;i++)

printToken(&(tlist[i]));

// Main function

void generateSymbolTable(){

//Definitions

line inp[100];

char c;

int lines=0;

int i,j;

line loi[100];

function flist[100];

int loicount=0;

int fcount=0;

int linescope[100];

memset(linescope,-1,sizeof(linescope));

var vlist[100];
int vcount=0;

token tlist[100];

int tcount=0;

// Take input

while(scanf("%[^\n]",inp[lines].content)!=EOF){

// printf("%s\n",inp[lines]);

inp[lines].lineno = lines;

lines++;

c = getc(stdin);

// Remove indentations blocks - Tested only with tabspaces

for(i=0;i<lines;i++){

int state=0,pos=0; //State variable is 1 if it has passed

indentation stage.

char tmp[100];

for(j=0;j<strlen(inp[i].content);j++){

if(!isWhitespace(inp[i].content[j]) || state){

state = 1;

tmp[pos++] = inp[i].content[j];

tmp[pos] = '\0';

strcpy(inp[i].content,tmp);

// Find lines of interest.

i=0;

while(i<lines){

char str[100];

memset(str,0,sizeof(str));

findFirstLexeme(inp[i].content,str);

// printf("%s %s\n",str,inp[i].content);

if(isDataType(str) || (!isKeyword(str) && isIdentifier(str))){

strcpy(loi[loicount].content,inp[i].content);

loi[loicount].lineno = inp[i].lineno;

loicount++;

i++;

// Identify functions first.

printf("Generating Functions...\n");

i=0;

while(i<loicount){

if(isFunc(loi[i])){

parse(loi[i],&flist[fcount]);

flist[fcount].id = fcount;

flist[fcount].exitpoint = getExitpoint(inp,loi[i].lineno+1);

flist[fcount].argc = 0;

printf("ID:%-12dReturn Type:%-12s\tFunction Name:%-12s\tEntry Point:%-

12d\tExit Point:%-
12d\n",flist[fcount].id,flist[fcount].return_type,flist[fcount].func_name,flist[fcount].entrypoint,flist[fcou
nt].exitpoint);

for(j=flist[fcount].entrypoint;j<=flist[fcount].exitpoint;j++)
linescope[j]=flist[fcount].id;

fcount++;
}

i++;

printf("\n");

//Identify variables now.

printf("Identifying Variables...\n");

i=0;

while(i<loicount){

char arglist[100];

if(isFunc(loi[i])){

// printf("Function Variables : %s\n",loi[i].content);

getFargs(loi[i].content,arglist);

char *v,type[100],name[100];

v = strtok(arglist,",");

while(v != NULL){

char type[100];

getFirstWord(v,type);

getArg(v,strlen(type),name);

printf("%s %s\n",type,name);

create(&(vlist[vcount]),name,type,vcount,loi[i].lineno);

vcount++;

v = strtok(NULL,",");

else{

// printf("Variables : %s\n",loi[i].content);

char type[100];

getFirstWord(loi[i].content,type);
//Parsing the variables from the arglist.

generateArgs(loi[i].content,strlen(type),arglist);

char *v;

v = strtok(arglist,",");

while(v != NULL){

strip(v);

create(&(vlist[vcount]),v,type,vcount,loi[i].lineno);

vcount++;

v = strtok(NULL,",");

i++;

//Build Symbol table

printf("Building Symbol Table...\n\n");

//Tokenize the variables first.

for(i=0;i<vcount;i++){

strcpy(tlist[tcount].name,vlist[i].name);

strcpy(tlist[tcount].type,vlist[i].type);

tlist[tcount].size = getSize(tlist[tcount].type);

tlist[tcount].scope = linescope[vlist[i].entrypoint]>=0?'L':'G';

tlist[tcount].entrypoint = vlist[i].entrypoint;

tlist[tcount].isFunction = 0;

tlist[tcount].scopeID = -1;

tlist[tcount].lifetime = -1;

tcount++;

}
// Tokenize the functions now.

for(i=0;i<fcount;i++){

strcpy(tlist[tcount].name,flist[i].func_name);

strcpy(tlist[tcount].ret_type,flist[i].return_type);

strcpy(tlist[tcount].type,"FUNC");

tlist[tcount].size = -1;

if(i==0 || (strcmp(tlist[tcount].name,"main")==0))

tlist[tcount].scope = 'G';

else

tlist[tcount].scope = 'L';

tlist[tcount].entrypoint = flist[i].entrypoint;

tlist[tcount].exitpoint = flist[i].exitpoint;

tlist[tcount].isFunction = 1;

tlist[tcount].scopeID = linescope[tlist[tcount].entrypoint];

tlist[tcount].lifetime = tlist[tcount].exitpoint - tlist[tcount].entrypoint;

tcount++;

// Sort all tokens by entry time.

qsort(tlist,tcount,sizeof(token),cmp);

for(i=0;i<tcount;i++){

tlist[i].id = i+1;

if(!tlist[i].isFunction)

continue;

tlist[i].argc = 0;

for(j=0;j<tcount;j++){

if(tlist[j].entrypoint==tlist[i].entrypoint && !tlist[j].isFunction){

tlist[i].args[tlist[i].argc] = &(tlist[j]);

tlist[i].argc++;
}

if(!tlist[j].isFunction && tlist[j].entrypoint>=tlist[i].entrypoint &&

tlist[j].entrypoint<=tlist[i].exitpoint){

tlist[j].scopeID = tlist[i].scopeID;

tlist[j].lifetime = tlist[i].exitpoint - tlist[j].entrypoint;

printSymbolTable(tlist,tcount);

Sample Input - Output

The Ultimate Python Beginner's Handbook
No ratings yet
The Ultimate Python Beginner's Handbook
119 pages
The Decaf Language: 1 Lexical Considerations
No ratings yet
The Decaf Language: 1 Lexical Considerations
13 pages
Lexical Considerations: Handout - Decaf Language
No ratings yet
Lexical Considerations: Handout - Decaf Language
9 pages
c-grammar
No ratings yet
c-grammar
8 pages
Compiler Design Lab
No ratings yet
Compiler Design Lab
26 pages
Compiler Design 3170701 LabManual 2022
No ratings yet
Compiler Design 3170701 LabManual 2022
85 pages
Compiler Design Lab
No ratings yet
Compiler Design Lab
29 pages
CD Lab Manual
No ratings yet
CD Lab Manual
83 pages
CD Lab Manual PDF
No ratings yet
CD Lab Manual PDF
83 pages
21bai1724 Lab-01
No ratings yet
21bai1724 Lab-01
11 pages
CD Manual
No ratings yet
CD Manual
58 pages
CD Lab Manual
No ratings yet
CD Lab Manual
48 pages
2775
No ratings yet
2775
65 pages
Report
No ratings yet
Report
20 pages
CD Lab Manual
No ratings yet
CD Lab Manual
48 pages
Compiler Isha
No ratings yet
Compiler Isha
30 pages
Compiler Design Lab
No ratings yet
Compiler Design Lab
68 pages
Compiler Design Practical File PDF
No ratings yet
Compiler Design Practical File PDF
33 pages
Welcome To CS4212: Compiler Design
No ratings yet
Welcome To CS4212: Compiler Design
31 pages
Compiler Design Record Old
No ratings yet
Compiler Design Record Old
43 pages
Lexical Analyzer Generator Lex (Flex in Recent Implementation)
No ratings yet
Lexical Analyzer Generator Lex (Flex in Recent Implementation)
14 pages
Regular Expressions To Finite Automata: - High-Level Sketch
No ratings yet
Regular Expressions To Finite Automata: - High-Level Sketch
32 pages
EXP - 4 - To - 6CD - Lab Manual - ODD - 2024 - Removed
No ratings yet
EXP - 4 - To - 6CD - Lab Manual - ODD - 2024 - Removed
16 pages
309-PCD REC - Removed
No ratings yet
309-PCD REC - Removed
46 pages
Stu
No ratings yet
Stu
6 pages
Compiler Design Lab
100% (1)
Compiler Design Lab
15 pages
Compiler Lab
No ratings yet
Compiler Lab
63 pages
CS 105 Project Requirements
No ratings yet
CS 105 Project Requirements
14 pages
EX 8 - 14 47 ACD - Merged
No ratings yet
EX 8 - 14 47 ACD - Merged
30 pages
CD LAB MANUAL
No ratings yet
CD LAB MANUAL
68 pages
CD final Lab manual
No ratings yet
CD final Lab manual
44 pages
Compiler Record
No ratings yet
Compiler Record
42 pages
CD Lab-1
No ratings yet
CD Lab-1
34 pages
CD Lab Manual
No ratings yet
CD Lab Manual
40 pages
Compiler Compiler: Flex and Bison: 1 Today's Goal
No ratings yet
Compiler Compiler: Flex and Bison: 1 Today's Goal
7 pages
CD Lab Manual
No ratings yet
CD Lab Manual
28 pages
Lang Spec
No ratings yet
Lang Spec
5 pages
Compiler For Flat Tiny C
No ratings yet
Compiler For Flat Tiny C
24 pages
CD Lab1
No ratings yet
CD Lab1
68 pages
2021UCS1618 Compiler
No ratings yet
2021UCS1618 Compiler
31 pages
Compiler Design Pur Vi
No ratings yet
Compiler Design Pur Vi
39 pages
Specman Cheat Book
0% (1)
Specman Cheat Book
15 pages
Tiny C Grammar
No ratings yet
Tiny C Grammar
1 page
V02 Parsing, Conditionals, Names
No ratings yet
V02 Parsing, Conditionals, Names
64 pages
Compiler Design Lab File
No ratings yet
Compiler Design Lab File
46 pages
Forth 79 Handy Reference PDF
No ratings yet
Forth 79 Handy Reference PDF
2 pages
CD Lab Manual File
No ratings yet
CD Lab Manual File
27 pages
Lecture 10
No ratings yet
Lecture 10
33 pages
COMPILER DESIGN LAB MANUAL
No ratings yet
COMPILER DESIGN LAB MANUAL
28 pages
Compiler Design Lab
No ratings yet
Compiler Design Lab
18 pages
cdmk
No ratings yet
cdmk
32 pages
Cdrec 1
No ratings yet
Cdrec 1
29 pages
BCSE307P - Compiler Lab Manual
No ratings yet
BCSE307P - Compiler Lab Manual
48 pages
Compiler_Lab_Experiments[1]
No ratings yet
Compiler_Lab_Experiments[1]
24 pages
Screenshot 2025-01-14 at 4.08.29 PM
No ratings yet
Screenshot 2025-01-14 at 4.08.29 PM
59 pages
C in Two Pages
No ratings yet
C in Two Pages
2 pages
Ra-CD Manual
No ratings yet
Ra-CD Manual
70 pages
Pcc
No ratings yet
Pcc
33 pages
Cs6612 Compiler Laboratory (1)
No ratings yet
Cs6612 Compiler Laboratory (1)
42 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
C Programming
From Everand
C Programming
Netra
No ratings yet
Multiple Choice Question Bank (MCQ) Term - I
No ratings yet
Multiple Choice Question Bank (MCQ) Term - I
86 pages
Java Questions5
No ratings yet
Java Questions5
5 pages
Database Interview Question
No ratings yet
Database Interview Question
27 pages
Major Project REPORT
No ratings yet
Major Project REPORT
42 pages
SQL MCQ
No ratings yet
SQL MCQ
11 pages
Set 2 PDF
No ratings yet
Set 2 PDF
15 pages
4 Python Regex Match Function
No ratings yet
4 Python Regex Match Function
4 pages
Lab 5,6,7
No ratings yet
Lab 5,6,7
18 pages
212 IMS DB Interview Questions Answers Guide
No ratings yet
212 IMS DB Interview Questions Answers Guide
9 pages
Lecture 23: Memory Representation of Trees, Traversal Algorithms
No ratings yet
Lecture 23: Memory Representation of Trees, Traversal Algorithms
4 pages
Homework Set 6 Solution
100% (1)
Homework Set 6 Solution
10 pages
Savitribai Phule Pune University B.C.A. (Sem-V) Practical Examination Oct / Apr 2019-2020
No ratings yet
Savitribai Phule Pune University B.C.A. (Sem-V) Practical Examination Oct / Apr 2019-2020
36 pages
Monkey and Banana
50% (2)
Monkey and Banana
22 pages
PHP Laravel
100% (1)
PHP Laravel
15 pages
DirectCertify 1Z0-448 Practice Questions
No ratings yet
DirectCertify 1Z0-448 Practice Questions
7 pages
Major Project 2 PDF
No ratings yet
Major Project 2 PDF
22 pages
Billing Project Complete C++
No ratings yet
Billing Project Complete C++
8 pages
Muhammad Sohail
No ratings yet
Muhammad Sohail
4 pages
Project 1: Threads: 2.1 Background
No ratings yet
Project 1: Threads: 2.1 Background
14 pages
Industrial Training Nikhil
No ratings yet
Industrial Training Nikhil
13 pages
Frontend Developer Manual, CoreMedia
No ratings yet
Frontend Developer Manual, CoreMedia
229 pages
Advanced Troubleshooting Guide 2010 - tcm121-69551
100% (2)
Advanced Troubleshooting Guide 2010 - tcm121-69551
162 pages
5 Dynamic Programming
No ratings yet
5 Dynamic Programming
16 pages
01 First Steps
No ratings yet
01 First Steps
37 pages
The CSS Box Model
100% (1)
The CSS Box Model
4 pages
Owl - Tutorial - Todoapp - MD at Master Odoo - Owl
No ratings yet
Owl - Tutorial - Todoapp - MD at Master Odoo - Owl
20 pages
Sap Abap Bapi 1
No ratings yet
Sap Abap Bapi 1
27 pages
SQL PL SQL Answers
No ratings yet
SQL PL SQL Answers
4 pages
Functions: Csc128 Mahfudzah Othman Uitm Perlis
No ratings yet
Functions: Csc128 Mahfudzah Othman Uitm Perlis
15 pages

Decaf

Uploaded by

Decaf

Uploaded by

Decaf

Compiler Design Project

Shivika Singh (14)

Decaf is a strongly-typed, object-oriented language with support for inheritance and

+ - * / % < <= > >= = == != && || ! ; , . [ ] ( ) { }

(Lexical Component section contains the whole list)

 Decaf Grammar Productions:

Decl ➝ VariableDecl | FunctionDecl | ClassDecl | InterfaceDecl

Variable ➝ Type indent

Type ➝ int | double | bool | string | indent | Type[ ]

FunctionDecl ➝ Type indent ( Formals ) StmtBlock | void indent ( Formals) StmtBlock

ClassDecl ➝ class indent <extends indent> <implements indent+,> { Field* }

Field ➝ VariableDecl | FunctionDecl

InterfaceDecl ➝ interface indent { Prototype* }

StmtBlock ➝ { VariableDecl* Stmt* }

Stmt ➝ <Expr> ; | IfStmt | WhileStmt | ForStmt | BreakStmt | ReturnStmt | PrintStmt |

IfStmt ➝ if ( Expr ) Stmt <else Stmt>

WhileStmt ➝ while ( Expr ) Stmt

ForStmt ➝ for ( <Expr> ; Expr ; <Expr> ) Stmt

ReturnStmt ➝ return <Expr> ;

PrintStmt ➝ Print ( Expr+, );

Lvalue ➝ indent | Expr . indent | Expr [ Expr ]

Call ➝ indent ( Actuals ) | Expr . indent ( Actuals )

Constant ➝ intConstant | doubleConstant | boolConstant | stringConstant | null

User Documentation / Readme Text

To build the parser, run

Lexical Analyzer (Flex)

Symbol Table header file ( C language)

/*Symbol table has the following fields

8. Lifetime of the construct

// Define structures and constants

typedef struct line{

typedef struct var{

typedef struct funcinfo{

typedef struct token{

struct token *args[10];

char datatype[][1000] = {"int","double","bool","string"};

//Define general methods

int isWhitespace(char s){

int isKeyword(char *s){

int size = 22,i;

int isDataType(char *s){ // Check if the parameter is a datatype

int size = 5,i;

int isIdentifier(char *s){

int flag = (s[0]>='a' && s[0]<='z') || (s[0]>='A' && s[0]<='Z') || (s[0]=='_');

while(s[pos]!='\0' && s[pos]!=' '){

if(state && isWhitespace(pcontent[i]))

if(!state && !isWhitespace(pcontent[i])){

// Define methods corresponding to function data type

void getFuncName(char *pcontent,char *name){ // get function name

int isFunc(line tmp){ // Check if the parameter line is a function

while(brpos<l && tmp.content[brpos]!='(')

while(brpos<l && tmp.content[brpos]!=')')

if(start > end)

return isDataType(word) || (!isKeyword(word) && isIdentifier(word));

void parse(line inp,function *f){ // parse

int getExitpoint(line *inp,int start){ // find line of death

void getFargs(char *decl,char *arglist){ // generate argument string from func

// Define functions for the var data type

void create(var* tmp,char *name,char *type,int id,int lineno){ // create a variable

void strip(char *tmp){

while(tmp[pos]!='\0' && tmp[pos]!='=' && tmp[pos]!=' '){

int getSize(char *s){

if(strcmp(s,"int")==0 || strcmp(s,"float")==0) return 2;

// Define functions for the token data type

void printToken(token* tmp){

// printf("Name-12Type-12Size-12Scope-12Return Type-12Number of Arguments-12Arguments-

int cmp(const void *A,const void *B){

token *a = (token *)A;

token *b = (token *)B;

return a->entrypoint > b->entrypoint;

void printSymbolTable(token *tlist,int tcount){

// Sort all tokens by entry time.

// Remove indentations blocks - Tested only with tabspaces

int state=0,pos=0; //State variable is 1 if it has passed

// Find lines of interest.

void getFuncName(char pcontent,char name){ // get function name

void getFargs(char decl,char arglist){ // generate argument string from func

void create(var* tmp,char name,char type,int id,int lineno){ // create a variable

int cmp(const void A,const void B){

token a = (token )A;

token b = (token )B;