Skip to content

Text analysis project materials for the computational linguistics course at the BA Digital Humanities, University of Pisa (a.y. 2020/2021)

Notifications You must be signed in to change notification settings

danieleborghe/computational-linguistics-project-UniPi

Repository files navigation

2020USAElectionsSpeechesAnalysis_UniPi

The project consists in the textual analysis, carried out using Python language and NLTK library, of the speeches made by the two candidates in the US presidential elections on November 2, 2020.

  • This project was carried out in relation to the course of Computational Linguistics of the degree course in Digital humanities (Informatica umanistica), at the University of Pisa.

  • More information about the course of Computational linguistics are avaiable here.

General information

  • Type: university project
  • Supervisor: dr. Felice Dell'Orletta
  • Year: academic year 2020/2021
  • Language:
    • Corpora: english
    • Output: italian
    • Project inctructions: italian

Languages and technologies

  • Python 3
    • NLTK
    • Regex

The repository

Inside the repository you will find:

  • The two Corpora of speeches:
    • file1-joeBiden.txt
    • file2-donaldTrump.txt
  • The two Python programs (two different linguistic analysis):
    • programma1.py
    • programma2.py
  • The two output file:
    • outputProgramma1.txt
    • outputProgramma2.txt
  • The project instructions:
    • Progetto Finale.pdf

Launch

To start the two programs, you have to write in your Python console the following script:

#launch of the program 1
>>> python3 programma1.py file1-joeBiden.txt file2-donaldTrump.txt
#launch of the program 2
>>> python3 programma2.py file1-joeBiden.txt file2-donaldTrump.txt

About

Text analysis project materials for the computational linguistics course at the BA Digital Humanities, University of Pisa (a.y. 2020/2021)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages