Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compiling YouTube video playlists #10

Closed
annakrystalli opened this issue May 31, 2017 · 4 comments
Closed

Compiling YouTube video playlists #10

annakrystalli opened this issue May 31, 2017 · 4 comments

Comments

@annakrystalli
Copy link
Contributor

We're looking to extend data collection from the captions of YouTube videos.

As a start, it would be useful to get playlists of the different topics gathered together. Currently, the most effective approach would be to curate playlists that are consistent on both topic and position ie a separate playlist for climate change vs climate change denying videos.

We are mainly interested in videos in which the caption are NOT autogenerated. However, because further down the line we might look into extracting useful data from autogenerated captions, it would also be useful to compile videos with autogenerated captions separately. So if you do come across them just add them to a separate list (no need to thematically separate that at this point)

We're open to suggestions of what the most effective approach to centralise resulting playlists. Let us know what you think. Otherwise just drop a link to any playlists you create here for the time being.

@TyJK
Copy link
Owner

TyJK commented May 31, 2017

I've tested this now and this app is super easy to use:

YoutubeExtractor

Then what you can do is make a playlist and title it appropriately, such as Drug Policy - Decriminalize, then add suitable videos to it. Once you're done adding to the playlist, you browse to the URL that holds your playlist and open of Youtube Extractor. Paste the url into the field and export it as a text file. Then upload the text file to the Youtube Captions folder. The file you want will take on the name of the playlist.

Also, you can tell if they're autogenerate because if they are, when you turn them on, an overlay will display that says "English (auto generated). As Anna said, try to avoid these if possible, but if you think you have a really good video, put these in a separate playlist (no need to separate by sentiment, as someone will have to go through them manually anyways).

@annakrystalli
Copy link
Contributor Author

Adding the links to the two super useful blogposts for extracting YouTube data through ❤️ R ❤️.

@annakrystalli
Copy link
Contributor Author

I'm creating collaborative playlists that people can directly add videos to. You can add to the playlists by following the appropriate links. Details in the /Youtube Caption Dump/instructions.md

@TyJK
Copy link
Owner

TyJK commented Jun 2, 2017

I was thinking, this whole YouTube thing has REALLY opened up some doors. Basically, anything we can get a transcript of, we can use. Down the line this could include podcasts as we discussed, or even documentaries or episodes of tv shows. These might already be on youtube, or we might be able to get them directly online if they're more well known.

@TyJK TyJK closed this as completed Mar 16, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants