paginate through latest talks? #25
Comments
I agree. This really needs to be considered again. I stopped work on it a while back. I think the best way to do this would be to add an arbitrary id field to each video. This ID field would be unique for each video (some videos already have this ID). This ID would increment as new videos are added and would create a natural sorting field for "newest" videos. This field has been requested by others for other uses so we wouldn't be adding data to the pyvideo/data repo for use solely in pyvideo/pyvideo. @zerok @Daniel-at-github @codersquid @lgh2 Thoughts? |
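As a rough illustration of the idea above, here is a minimal Python sketch of how an incrementing `id` could act as a "newest first" sort key. The record layout, the `assign_ids` helper, and the sample titles are hypothetical and are not the actual pyvideo/data schema.

```python
from typing import Dict, List


def assign_ids(videos: List[Dict], start: int = 1) -> List[Dict]:
    """Give every video without an id the next free integer (hypothetical helper)."""
    # Continue counting from the highest id already present, or from `start`.
    next_id = max([v["id"] for v in videos if "id" in v] + [start - 1]) + 1
    result = []
    for video in videos:
        video = dict(video)  # copy so the input records are left untouched
        if "id" not in video:
            video["id"] = next_id
            next_id += 1
        result.append(video)
    return result


def newest_first(videos: List[Dict]) -> List[Dict]:
    """A higher id means the video was added later, so sort descending by id."""
    return sorted(videos, key=lambda v: v["id"], reverse=True)


# Made-up sample data: one video already has an id, two do not.
videos = assign_ids([
    {"title": "Old talk", "id": 7},
    {"title": "Newer talk"},
    {"title": "Newest talk"},
])
latest = [v["title"] for v in newest_first(videos)]
```

The sort only reflects "newest" if ids are always handed out in the order videos are added, which is the assumption the proposal relies on.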
|
Thank you for your suggestion. Did you look at the "Latest Events" heading On Sun, Oct 23, 2016 at 11:36 AM, Al Johri notifications@github.com wrote:
|
If it's not too hard to add an ID number field to each video then it sounds On Sun, Oct 23, 2016 at 3:36 PM, Paul notifications@github.com wrote:
|
Sorry for the late response :) Personally, I'd prefer if we could somehow get all the dates and times of each talk and event. Having a global number in each category.json and video file sounds slightly off-topic for me inside the data repository and ideally unnecessary if we can get the necessary datetime :) |
Can we just grab it from YouTube (when it's a ur video) for the purposes of the database or does it need to be in the exact time on the events schedule? |
YT* |
The YouTube dates are probably not sufficient, as some conferences upload their content weeks after the actual event. YouTube is also not the only supported source (albeit a major one), so we probably won't get around some manual work :) |
@lgh2 The events page is good, but an RSS feed of the events would make pyvideo more useful to end users: they would know when a new event is added. @zerok A date alone could be confusing if two events overlap in time. I think event + date is better. What do you think? |
With the "old pyvideo", we had two fields. One was the date the talk was recorded. This was sometimes wrong when we added the videos to the corpus initially, but would get fixed later on. The other was the date the talk was added to the corpus. The "latest videos" list and feed on "old pyvideo" was sorted by the date the talk was added to the corpus figuring that these were "new to pyvideo". "old pyvideo" was a Django app, so it wasn't hard to maintain a "date the talk was added" field. It's harder with a git-based corpus, though we could organize things by the date of the git commit the file was added. Pretty sure that's not hard to compute generally speaking. I don't know if that's hard to do with the static site generator being used. Would that sort of thing help here? |
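For what it's worth, the "date of the git commit the file was added" idea could look roughly like the Python sketch below. It shells out to `git log` with `--diff-filter=A` to find the commit that added a file, then sorts paths by that date. The function names, the default repo path, and the sample paths and dates are assumptions made for illustration; this is not code from pyvideo.

```python
import subprocess
from datetime import datetime
from typing import Dict, List


def added_date(path: str, repo: str = ".") -> str:
    """Return the strict ISO 8601 author date of the commit that added `path`."""
    out = subprocess.run(
        ["git", "log", "--diff-filter=A", "--follow",
         "--format=%aI", "-1", "--", path],
        cwd=repo, capture_output=True, text=True, check=True,
    )
    return out.stdout.strip()


def newest_first(added: Dict[str, str]) -> List[str]:
    """Sort paths by their added date, most recent first.

    Dates are parsed rather than compared as strings, because commits
    may carry different UTC offsets.
    """
    return sorted(added, key=lambda p: datetime.fromisoformat(added[p]),
                  reverse=True)


# Made-up dates standing in for real `added_date()` results.
sample = {
    "event-a/videos/x.json": "2016-10-01T10:00:00+00:00",
    "event-b/videos/y.json": "2016-10-23T15:36:00+02:00",
    "event-c/videos/z.json": "2016-10-23T14:00:00+00:00",
}
ordered = newest_first(sample)
```

In a real run, `sample` would be built by calling `added_date()` for each file in a checkout of the data repo; running one `git log` per file may be slow on a large corpus, so caching the result is probably worthwhile.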
We could compute the event dates from git and save it outside the files, in this way pyvideo-only data don’t mix with conference data. Outside the files can be:
TinyDB is more versatile (and shiny ✨) and it takes less than half an hour to learn (I think). |
The focus here would be an "added" timestamp field, right? Personally I don't mind how it is cached either way 🙂
|
I vote for whatever is simplest, and adding something to the video json doesn't seem bad to me despite mixing up pyvideo data and conference data. Then whatever tool someone uses to scrape data could simply use a timestamp to get a 'good enough' estimate of what was recently added to pyvideo, even if it isn't technically accurate with respect to merge time. |
Thanks everybody! Lots of great feedback here. Keeping in-line with our goal of keeping things simple, here's what I think our course should be: I think sorting these videos by their "added to the data repo" date based on commit SHA is probably the easiest way to go. It sounds a bit more complicated but I think it will cut out tons of manual work in the long run. Plus, computers are good at doing this mindless sorting :) Next, how to store that sorting. This sort is only needed in the pyvideo/pyvideo repo. Thus, I think it would be best to place it in a new JSON/CSV/etc file that lived in the pyvideo repo. I think a new If anyone wants to work on this, feel free to assign yourself. Otherwise, I will start work on it after I finish writing my PyCon Canada talks :) |
@logston +1 on adding that kind of information into a metadata folder within the pyvideo website project 😄 @Daniel-at-github Would you mind if I still also extend the recorded field within the video.json to support a time component and the category.json with something like start and end fields? |
@zerok I'm a bit lost with the time component in the video.json. Can you give an example or elaborate? (Maybe I'm sleepy or lost in translation.) Start and end fields in category.json seem great to me. It's more work, but not too much. Adding related_urls (webpage and schedule are usual; slides and repository are less usual, ...) or tags to category.json is something to think about too. start_date is already created; I have seen it somewhere: tools/video_statistics.py .
Category statistics
Video statistics
|
@Daniel-at-github I mean moving "recorded" from being date-only to a datetime field so that we can do a more natural ordering on the event page itself, for instance.
vs.
using ISO8601 but trying to keep everything in UTC. |
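To make the date-only versus datetime distinction concrete, here is a small Python sketch. It assumes `recorded` is a string that is either a plain date or a full ISO 8601 datetime, and it treats values without an offset as UTC; the `parse_recorded` helper and the sample talks are hypothetical.

```python
from datetime import datetime, timezone


def parse_recorded(value: str) -> datetime:
    """Parse a date-only or full ISO 8601 value into an aware UTC datetime."""
    parsed = datetime.fromisoformat(value)
    if parsed.tzinfo is None:
        # Assumption: values without an offset (including date-only ones,
        # which parse as midnight) are taken to be UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


# Made-up talks from the same day; only two carry a time component.
talks = [
    ("Lightning talks", "2016-10-23T17:30:00+00:00"),
    ("Keynote", "2016-10-23T09:00:00+00:00"),
    ("Date-only talk", "2016-10-23"),
]
in_order = [title for title, recorded in
            sorted(talks, key=lambda t: parse_recorded(t[1]))]
```

With date-only values every talk of a day collapses to midnight, so their relative order is arbitrary; the time component is what allows a natural ordering within one event day.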
@zerok In YouTube-scraped videos we often have to settle for a good-enough date and take the upload date, so the time there will be T00:00:00+00:00. |
Metadata file generated with:
for EVENT_FILE in */category.json
do
echo -n "$EVENT_FILE "
git log --diff-filter=A --follow --format=%ai -1 -- "$EVENT_FILE"
done \
| sort -k2,2r -k1,1r \
| sed -e 's?/category.json??; s/ .*$//; s/^/- /' > metadata.yml
vim metadata.yml # Add a parent node and comments.
Looks like:
# Pyvideo metadata file
# List of pyvideo data events, ordered from recent to older
# Useful for:
# Events RSS
# Event pagination
event:
- pycon-se-2016
- pydata-dc-2016
- pydata-chicago-2016
- pycon-de-2016
- pygotham-2016
- data-school
- ndc-oslo-2016
- pybay-2016
- pycon-jp-2016
- pycon-italia-2016
- pycon-apac-2016
- pycon-uk-2016
- pycon-israel-2016
- kiwi-pycon-2016
- pydata-san-francisco-2016
- pydata-berlin-2016
- pydata-amsterdam-2016
- writethedocs-na-2016
- swiss-python-summit-2016
- pydx-2015
- pyday-galicia-2016
... |
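Given an ordered list like the one above, event pagination could be sketched in Python as follows. The `paginate` and `page_count` helpers and the page size are assumptions for illustration; the event names are the first few entries of the list above, hard-coded here rather than read from metadata.yml.

```python
from typing import List, Sequence


def paginate(items: Sequence[str], page: int, per_page: int = 5) -> List[str]:
    """Return the items on a 1-based page; an empty list past the last page."""
    if page < 1 or per_page < 1:
        raise ValueError("page and per_page must be positive")
    start = (page - 1) * per_page
    return list(items[start:start + per_page])


def page_count(items: Sequence[str], per_page: int = 5) -> int:
    """Number of pages needed to show all items (ceiling division)."""
    return -(-len(items) // per_page)


# First entries of the metadata list, most recent event first.
events = [
    "pycon-se-2016",
    "pydata-dc-2016",
    "pydata-chicago-2016",
    "pycon-de-2016",
    "pygotham-2016",
    "data-school",
    "ndc-oslo-2016",
]
first_page = paginate(events, page=1, per_page=3)
last_page = paginate(events, page=3, per_page=3)
```

Because the list is already ordered from recent to older, page 1 is always the newest events and no further sorting is needed at render time.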
Generated using the instructions in: pyvideo#25 (comment)
I've been using pyvideo quite a bit over the last few days, and sometimes I'm just looking to watch something new that catches my eye, as opposed to going to a particular conference, tag, or speaker. I would love to be able to just paginate through some of the most recent videos.