The project consists in the textual analysis, carried out using Python
language and NLTK
library, of the speeches made by the two candidates in the US presidential elections on November 2, 2020.
-
This project was carried out in relation to the course of Computational Linguistics of the degree course in Digital humanities (Informatica umanistica), at the University of Pisa.
-
More information about the course of Computational linguistics are avaiable here.
- Type: university project
- Supervisor: dr. Felice Dell'Orletta
- Year: academic year 2020/2021
- Language:
- Corpora: english
- Output: italian
- Project inctructions: italian
Python 3
NLTK
Regex
Inside the repository you will find:
- The two Corpora of speeches:
file1-joeBiden.txt
file2-donaldTrump.txt
- The two Python programs (two different linguistic analysis):
programma1.py
programma2.py
- The two output file:
outputProgramma1.txt
outputProgramma2.txt
- The project instructions:
Progetto Finale.pdf
To start the two programs, you have to write in your Python console the following script:
#launch of the program 1
>>> python3 programma1.py file1-joeBiden.txt file2-donaldTrump.txt
#launch of the program 2
>>> python3 programma2.py file1-joeBiden.txt file2-donaldTrump.txt