Skip to content
This repository has been archived by the owner on Aug 24, 2021. It is now read-only.

DavideViolante/DataExtractionProject

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 

Repository files navigation

##Lackbase (AGIW project) This repository contains a project made for the Web Informations Management course at Roma Tre University.

###Brief Freebase.com is a huge knowledge base, but is far from complete. We decided to extend a particular part of it:

  • Nationality of the relation: Soccer Player, people.person.nationality, Nationality
  • Stadium of the relation: Soccer Team, sports.sports_team.arena_stadium, Stadium

To make this happen we crawled a famous and big website about this topic: Transfermarkt.com. Using XPath queries we iteratively and automatically selected the data we wanted and we used it to extend the Freebase relations!

###Results

  • 11% of new Players -> Nationality relations
  • 13% of new Team -> Stadium relations

###Authors

  • Davide Violante
  • Michele D'Antimi
  • Edoardo Carra

About

Information extraction university project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published