Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ideas for the API #2

Open
prism0x opened this issue Sep 18, 2022 · 0 comments
Open

Ideas for the API #2

prism0x opened this issue Sep 18, 2022 · 0 comments

Comments

@prism0x
Copy link
Contributor

prism0x commented Sep 18, 2022

We are about to deploy the initial version of the API which currently provides data for the awesome-manim feed on the website (corresponding PR for the website ManimCommunity/manim-website#73)

The scraper currently:

  • fetches the list of all YouTube channel links from the README file
  • scrapes the publicly available RSS feeds that are available from https://www.youtube.com/feeds/videos.xml?channel_id=xxxxxxx.
  • searches for the substrings Manim (case insensitive), #some (case insensitive), SoME (case sensitive) in the video title or description. When found, the videos are marked as "being manim videos".
  • The scraper then puts them on a MySQL database and serves videos chronologically on a paginated endpoint /videos/n, 30 videos at a time.

This issue records some ideas we could implement in the future based on feedback.

  • A deeper scrape of all the channels (RSS feeds just return the latest 15 videos)
  • An algorithmic feed that prioritizes videos with higher engagement, but still retains the chronological ordering to some degree
  • ...

Feel free to discuss these and propose any other ideas.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant