Flex-and-Bison-Parser

This parser performs lexicological and syntactic analysis in JSON files that contain tweets.

Flex is used as a lexicological analyser. Bison is used for syntactic analysis.

There can be 3 types of input separated with comma: 1)Tweet 2)Retweet (contains less fields than tweet) 3) Extended Tweet (contains more than 140 characters in text element).

In folder tests there are tests for testing the parser.

The restrictions for a normal tweet were these:

text element must be up to 140 characters.
user element must contain ONLY 1 id as unique positive integer and name, screen_name and location as strings.
created_at must be "Day_name MMM DD XX:XX:XX +NNNN YYYY" for example: 'Thu Apr 06 15:24:15 +0000 2017' Of course every field must be checked if is in a valid value or not.
id_str must be string and unique and the value must be a number
hashtag field can be optional but if exists then entities field must exist too.

Also I added some restrictrions:

id_str, user, text and created_at must exist for every tweet in the JSON file. All other fields are optional. -Everything was created according to the JSON described in this link: https://developer.twitter.com/en/docs/twitter-api/v1/data-dictionary/overview/geo-objects
Retweet cannot contain created_at, text or id_str fields because it is a retweet and not a posted tweet. -Extended tweets must have truncated field as true and normal tweets must have truncated field as false.

Every file that is given as input can contain one or multiple tweets, retweets and extended_tweets in any order.

The challenging part with this project was to find a way to pass information from flex file to bison file. That's why flex file has all these yylval.... and in Bison I used union command (no joke).

HOW TO RUN

You must install flex and bison in your ubuntu if (unlikely) you don't have them installed. Then compile the files b.y(bison) and b.l (flex) with exactly the same order: bison -d b.y flex b.l gcc lex.yy.c b.tab.c -o myparser ./myparser myfile.txt

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
Models		Models
tests		tests
LICENSE		LICENSE
README.md		README.md
b.l		b.l
b.tab.c		b.tab.c
b.tab.h		b.tab.h
b.y		b.y
flex-bison-exer.rar		flex-bison-exer.rar
lex.yy.c		lex.yy.c
myparser		myparser

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Flex-and-Bison-Parser

HOW TO RUN

About

Releases

Packages

Languages

License

Aristidis13/Flex-and-Bison-Parser

Folders and files

Latest commit

History

Repository files navigation

Flex-and-Bison-Parser

HOW TO RUN

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages