Code for web scraping http://gencat.cat (Catalan Government's website), uploading the obtained data to a personally-owned database and using it for a social network style website.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
SCREENSHOTS
SEGURETAT DE LA BD
SINTESI
.gitattributes
LLEGEIX-ME.txt
Memòria_escrita.docx
Memòria_escrita.pdf
README.md

README.md

Mercat Catala

Set of scripts for parsing, saving (web scraping) and uploading to a personally-owned database all the cultural events listed in the Catalan Government's website http://agenda.cultura.gencat.cat/ Also, once we have all that data, use it in a social network style website (intended to be named mercatcatala.cat) for editing, commenting, liking/disliking, etc. the events among the website users. The project was submitted as an ending to a set of technical courses that occur in Spain between High School and College.

  • Languages and techniques used: PHP, Javascript, Ajax, SQL, HTML, CSS
  • Libraries used: script.aculo.us, Snoopy

A fairly detailed description of the code structure can be found at Memòria_escrita.pdf, although it's in Catalan.

Also, a document (in Catalan) showing the necessary steps to follow in order to set up the project into your own web server, can be found at LLEGEIX-ME.txt

Screenshot of the source website from which I'm web scraping the content (http://agenda.cultura.gencat.cat/):

List of events Gencat

Some screenshots of my website:

  • Home page Home page

  • Showing list of events Showing list of events

  • Sign up page Sign up page

  • Modifying an event Modifying an event