Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

API gallery page for Paper List Widgets #72

Closed
4 tasks
wammar opened this issue May 3, 2023 · 14 comments
Closed
4 tasks

API gallery page for Paper List Widgets #72

wammar opened this issue May 3, 2023 · 14 comments
Assignees
Labels
documentation Improvements or additions to documentation Triage Copied to S2 internal for triage

Comments

@wammar
Copy link
Contributor

wammar commented May 3, 2023

Next steps:

  • WIP: Iterate on design.
  • Decide if we're going to include photos for this quarter based on the private APIs.
  • Design a thumbnail for the top-level gallery page.
  • Launch.
@wammar wammar added documentation Improvements or additions to documentation status:open/WIP labels May 3, 2023
@nikett
Copy link

nikett commented May 3, 2023

Hi Waleed, paperlist is pretty much done (thanks to Matt for helping with the looks, and me and intern already added caching as well to limit number of api requests). I have just been struggling with the availability of the image server that Luca set up. The server does the following: given a list of paper ids, it returns the image url. Is there a way to make that server a little more reliable (currently it fails on about 99% of the requests). I asked Luca who told me that it can be fixed once Luca is back.

@nikett
Copy link

nikett commented May 29, 2023

I added a branch with paperlist. This is work that I started and then together with my intern Yash Kumar Lal for javascript and with Matt L for a nice UI. @MLatzke you can look at the branch https://github.com/allenai/s2-folks/tree/paperlist

@MLatzke
Copy link
Member

MLatzke commented May 30, 2023

Thanks, @nikett, I'll take a look!

@nikett
Copy link

nikett commented Jul 12, 2023

Do you know if the thumbnails links from a paper will be made available via s2 api?

@cfiorelli
Copy link
Collaborator

@MLatzke / @nikett ,
I understand this ticket represents work to publish "getting started" code for different use cases?
I only briefly poked around the paper list repo - looks cool. Any news on status?

@nikett
Copy link

nikett commented Sep 11, 2023

I am waiting on this TODO:
"Decide if we're going to include photos for this quarter based on the private APIs."

Everything else is good (and this is in use on my own homepage)

@cfiorelli cfiorelli added the Triage Copied to S2 internal for triage label Oct 10, 2023
@cfiorelli cfiorelli self-assigned this Oct 22, 2023
@cfiorelli
Copy link
Collaborator

@nikett The link to paperlist seems to just go to our s2folks repo. I was hoping if work on this is done we can add it to upcoming December API newsletter and gallery page, if that seems appropriate to everyone here?

@nikett
Copy link

nikett commented Dec 9, 2023 via email

@cfiorelli
Copy link
Collaborator

Image thumbnails in the API -as in will we add a field to the api to allow for url to an image thumbnail, or? Sorry I do suspect I am not understanding. If you like we can hop on a call

@cfiorelli
Copy link
Collaborator

@rodneykinney - Can you give a sense of how much effort it would be to provide a field in the public API which returns links to paper thumbnails? This feature would Niket with the final piece he needs to build a code example which users would place on their website to show a feed of papers they're on.

cc @Jgordo72 - I found the below comment from you regarding licensing. Can you confirm that if a Springer abstract were somehow visible in the thumbnail via public API we would be OK?

If the image is available via the S2 page, then I have no issue with the redistribution via the API. We cannot pull images from some PDFs due to licensing restrictions from our publishing partners.

@rodneykinney
Copy link
Member

By "thumbnail" do mean URLs for images of the paper's figures? That's relatively easy to implement. I'd call it a Medium.

@cfiorelli
Copy link
Collaborator

@rodneykinney - I understand we need to get a sense of priorities as we move into start of 24' but for the moment here's some clarifying notes from Niket. (same notes from our call today.)

https://www.semanticscholar.org/paper/RL4F%3A-Generating-Natural-Language-Feedback-with-for-Akyurek-Aky%C3%BCrek/ebf35cef5c249d90b40043fffa41f8802c27f132
For instance in this paper, figures and tables that are displayed. And I pick the first thumbnail

I just want to show the first for now because typically the first image is a representative
I just need urls and not the images as such
And I don’t need the table
Essentially first image is all I need for now. Maybe in the long term it will be a good machine learning problem to decide a representative image given a list of images and tables from a paper.
If it is easier to simply return all the tables and figures urls from the paper then that’s also fine
as for the resolution - it could be a thumbnail or a figure. Thumbnail urls are actually even better for my use case

@Jgordo72
Copy link

@cfiorelli - There should not be an abstract visible in any image within the PDF. If it is visible in a thumbnail, then we would need to remove it to adhere to the Springer contract terms.

@cfiorelli
Copy link
Collaborator

Moving this to 2024 prioritization review !

thanks everyone

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation Triage Copied to S2 internal for triage
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

6 participants