Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

argazki.irekia euskadi.eus #1430

Open
1 task
javiercasares opened this issue Sep 10, 2022 · 2 comments
Open
1 task

argazki.irekia euskadi.eus #1430

javiercasares opened this issue Sep 10, 2022 · 2 comments
Labels
🌟 goal: addition Addition of new feature 🟩 priority: low Low priority and doesn't need to be rushed 🧱 stack: catalog Related to the catalog and Airflow DAGs ⛔ status: blocked Blocked & therefore, not ready for work 💬 talk: discussion Open for discussions and feedback

Comments

@javiercasares
Copy link

Source Site

https://argazki.irekia.euskadi.eus/es/photos?page=1

Value Provided

Images from Euskadi (Spain)

Licenses Provided

https://creativecommons.org/licenses/by/4.0/

Implementation

  • 🙋 I would be interested in implementing this feature.
@javiercasares javiercasares added 🚦 status: awaiting triage Has not been triaged & therefore, not ready for work 🧹 status: ticket work required Needs more details before it can be worked on labels Sep 10, 2022
@dhruvkb dhruvkb changed the title <Source name here> argazki.irekia euskadi.eus Sep 10, 2022
@dhruvkb dhruvkb added 🟩 priority: low Low priority and doesn't need to be rushed 🌟 goal: addition Addition of new feature and removed 🚦 status: awaiting triage Has not been triaged & therefore, not ready for work 🧹 status: ticket work required Needs more details before it can be worked on labels Sep 12, 2022
@dhruvkb
Copy link
Member

dhruvkb commented Sep 12, 2022

Could not find an API for the site. However, the linked source site is paginated and the HTML is scrapable (is structured well, contains lots of metadata and includes pagination boundaries).

<div class='photos with_pagination' id='photos_jglance'>                            
    <div class='box' style='display: none'>            
      <span class='url'>//argazki.irekia.euskadi.eus/photos/p200/20190309_09_1239.jpg</span>
      <span class='title'>20190309_09_1239</span>
      <span class='link'>/es/photos/24060</span>
      <span class='width'>2832</span>
      <span class='height'>4256</span>                                   
      <span class='contributed'>false</span>                                         
    </div>
    <!-- ... more like the above -->
</div>

The language barrier prevents me from being sure but there are some Euskadi OpenData datasets that seem related:

@dhruvkb dhruvkb added 💬 talk: discussion Open for discussions and feedback ⛔ status: blocked Blocked & therefore, not ready for work labels Sep 12, 2022
@dhruvkb
Copy link
Member

dhruvkb commented Sep 12, 2022

Marking this as blocked for now because there needs to be a discussion about how to proceed with the source, considering that the information we have, both about the source as well as the images in it, is inadequate.

@obulat obulat added the 🧱 stack: catalog Related to the catalog and Airflow DAGs label Feb 24, 2023
@obulat obulat transferred this issue from WordPress/openverse-catalog Apr 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🌟 goal: addition Addition of new feature 🟩 priority: low Low priority and doesn't need to be rushed 🧱 stack: catalog Related to the catalog and Airflow DAGs ⛔ status: blocked Blocked & therefore, not ready for work 💬 talk: discussion Open for discussions and feedback
Projects
Status: 📋 Backlog
Development

No branches or pull requests

3 participants