Looking at the grammar for a function in the Parser, it is observed that the only valid variable type is an integer (INT):
function:
INT ID OP_PAR CL_PAR OP_BRACE statements CL_BRACE {
$$ = new_ast_func_node($6);
}
;
The compiler must be able to handle more variable types such as characters or doubles. The first step to adding new variable types is to create an enum used to distinguish between the different variable types:
typedef enum Variable_t
{
CHARtype,
DOUBLEtype,
FLOATtype,
INTtype,
LONGtype,
SHORTtype,
VOIDtype
} VariableType;
Then, the Lexer is updated so that it can return tokens for the different variable types as well as their corresponding enum values:
"char" {
yylval.intval = CHARtype;
return CHAR;
}
"double" {
yylval.intval = DOUBLEtype;
return DOUBLE;
}
"float" {
yylval.intval = FLOATtype;
return FLOAT;
}
"int" {
yylval.intval = INTtype;
return INT;
}
"long" {
yylval.intval = LONGtype;
return LONG;
}
"short" {
yylval.intval = SHORTtype;
return SHORT;
}
"void" {
yylval.intval = VOIDtype;
return VOID;
}
Next, the Parser must be updated. First, the tokens must be added to the Parser (must be <intval> tokens as they are also passing the enum values through the yylval union):
%token <intval> CHAR DOUBLE FLOAT INT LONG SHORT VOID /*Variable Types*/
Now, the compiler is able to pass different variable types to the Parser; however, the grammar rules within the Parser still need to be updated. One solution would to be have 7 rules for function where the variable type changes each time. This is inefficient as every time the compiler has a grammar rule with a variable type in it, multiple lines would be required. Therefore, a new %type <intval> called “var_type” will be created:
%type <intval> var_type
The grammar for var_type is set so that it can be any variable type ($1 in this case is the enum value passed through yylval):
var_type:
CHAR { $$ = $1; }
| DOUBLE { $$ = $1; }
| FLOAT { $$ = $1; }
| INT { $$ = $1; }
| LONG { $$ = $1; }
| SHORT { $$ = $1; }
| VOID { $$ = $1; }
;
With var_type introduced, the grammar rule for function can be updated to account for multiple variable types without needing to add more lines:
function:
var_type ID OP_PAR CL_PAR OP_BRACE statements CL_BRACE {
$$ = new_ast_func_node($6);
}
;
The addition of the var_type will not only help with the existing grammar for functions but will also prove useful for other grammar rules with variable types that are to be added in the future (i.e., declarations).