Scanner generation using JLex (6 Points)

Due date: Midnight Feb 8, 2017 Extended to 11:59pm Feb 9 due to high volume of submission.

Where to submit

You must submit your assignment at the web submission site. You are enrounraged to submit many times until you obtain the full marks. We will use your last submission for your final marks for this assignment. For each submission you will get your marks instantly on the web and by email.

For questions regarding the marking program, please contact our GA Yaxin Li at


Understand the lexical definition of a language. Generate a scanner using JLex. 

Assignment Specification

The task is to write a JLex specification for Tiny language.  Please note that in this assignment  we don't need to use all the grammar definitions there.  Only the lexical part is needed.

You will write a JLex specification named "A2.lex". We will run the following commands to generate the scanner, compile it, and run it. 

  > java JLex.Main A2.lex
  > javac
  > java A2

You should take extra care on the file names. Make sure all the three commands can run without problem, and the A2.output file is generated. If any of the 3 commands fails, you will receive very low marks, even 0, no matter how good the other part of your program is.

The A2.class program will read a text file named "A2.input", and produce a file named "A2.output" which contains following five lines:

     identifiers: NumberOfIdentifiers
     keywords: NumberOfKeyowrds
     numbers: NumberOfIntergersOrRealNumbers
     comments: NumberOfComments
     quotedString: NumberOfQuotedStrings
Here are the sample A2.input and the corresponding output file A2.output. Note that this time you only need to count the occurrences of the identifiers, keywords, etc. You do not need to remove the duplciates as in last assignment.

Note that you don't need to write any Java programs. The scanner is generated from your lex specification. 

Marking scheme

This assignment will give you bonus up to 1 point depending on the length of your program. If your program counts everything correct, you shall receive 5. On top of that, there is a bonus calculated using your program length.
	if ( A2.lex file is not sent properly or file name is incorrect) return; 
	if (java program named is generated from your lex file) 
	if ( generated program is compiled correctly && A2.class is generated) {
            if (your java program reads A2.input && generates result file A2.output){
                     for (each of the 5 counts in A2.output) {
                           if (count is correct) {  
	yourMark += (lengthOfYourProgram>140)?0:96/lengthOfYourProgram;  // length is counted by PHP in terms of words; Note that different programs count the words in different ways. 
	for (each day of your late submission)  yourMark=yourMark*0.7;

What to submit

You only need to submit A2.lex     

How to get started

Before starting this assignment, you must run our examples on the slides, especially the simple.lex example and the one immideatly after this example.