Skip to content

Releases: L-Dot/Letterboxd-list-scraper

v2.2.0 - Role scraping and Descriptions

03 Jun 14:18
f94c29f
Compare
Choose a tag to compare

Hi all!

This release brings the addition of two new features:

Role scraping

The first feature is a great contribution from @jonathanhouge in #13: the ability to scrape all films from any {role} in the Cast or Crew section of film pages. In practical terms:

A list of all scrapable roles can be found in the changelog. We'd like to have an exhaustive list, so please let me know if you find roles that are not included in there!

Film description

The second feature is the addition of scraping the description/synopsis of films to the new Description column. This addition was initiated from a request by @meanjoep92 in #2. Limited testing has been done here, so let me know if you find any bugs.

An example of both of the new features can be seen in /example_outputs/actor-anne-hathaway-films.csv. I hope you can make good use of them! 😃

Again thanks to everyone that uses the project and special thanks to the people that are contributing and asking for new features! ❤️

Cheers,
Arno

Full Changelog: https://github.com/L-Dot/Letterboxd-list-scraper/blob/master/CHANGELOG.md.

v2.1.0 - JSON export + bugfix

23 Apr 21:43
91df35d
Compare
Choose a tag to compare

A small update containing added functionality for JSON export and a fix for a writeout bug that occurred when the first film in the list has no rating histogram.

What's Changed

New Contributors

Full Changelog: https://github.com/L-Dot/Letterboxd-list-scraper/blob/master/CHANGELOG.md.

v2.0.0 - Big update

07 Feb 23:32
6595635
Compare
Choose a tag to compare

This update comes with a full overhaul of the Letterboxd-list-scraper into a more interactive and versatile Command Line Interface (CLI). This was done by implementing argparse together with a load of optional flags that can be provided along with the URL.

This overhaul has resulted in the code becoming a lot longer and more complicated in some parts. I have tried to keep a good readability by providing logical function and variable names and a lot of comments and docstrings. While I have done a lot of manual testing, I still expect a lot of new bugs to occur. Please do inform me on these if you find them. A near-future goal of mine is to add test functions and structure the code in a way more in line to the general rules of software applications.

In the meantime I hope you can utilize the new 2.0.0 version of this program! I think it is a lot more convenient to use, especially when you want to scrape large amount of list URLs, so enjoy.

The ideas and motivation for this update mainly came from #2. Thank you for the comments and suggestions! :)

Full Changelog: https://github.com/L-Dot/Letterboxd-list-scraper/blob/master/CHANGELOG.md.

New Contributors

v1.1.0 - First update

02 Dec 14:20
02cd0a5
Compare
Choose a tag to compare

The first update of this app after its 'official' release back in 2021. Thanks to everyone that still uses or has used this application. It is heartwarming and encouraging that people actually use and benefit from this piece of code I created :). The main work of this update comes from @DenJackson42, which #3 was merged to the main branch. I added some additional features and tested the code's performance against several lists. Please do let me know if you encounter any problems!

Again, I'm very grateful to the people that have come with suggestions, provided code, or even just commented on my project. Your investment is appreciated.

What's Changed

Added

  • Adding support for scraping Genres, Runtime, Watches, and Likes by @DenJackson42 in #3
  • Added new data columns for each film's:
    • Number of watches.
    • Number of appearances on a list.
    • Number of likes.
    • Genres.
    • Studios.
    • Countries.
    • Original language.
    • Spoken languages.
    • Owner rating (i.e. the rating that the owner of the list gave the film).

These additions were mostly inspired by users in issue #2.

Changed

  • Updated the requirements.txt file.

Fixed

  • Some changes suggested in issue #1:
    • UnicodeEncodeError was resolved by using utf-8 encoding when writing the CSV.

Full Changelog: https://github.com/L-Dot/Letterboxd-list-scraper/blob/master/CHANGELOG.md

New Contributors