python clip_search.py --query "kids"
Work very much in progress
Aggregating Open Access datasets from the world's top art museums into a unified repository.
I encourage you to build something cool with this data! If you do, please let me know; I'd love to see it!
- Time range filter is not working properly
- See how the backend naturally handles much larger datasets.
- Speak with my supervisor about the best way forward for the project. I'd like to look into robust unstructured-data search to make the best use of the artwork metadata; I think this is the best way to aggregate the data.
- Also mention that I would like to finish this thesis as soon as I can, since I now have considerably more work (mention the extra workload).
- Cleveland Museum of Art
- CMOA
- However, we need to manually adjust/update the URLs!
- The MoMA
- National Gallery of Art (DC)
- Note: this dataset doesn't include artist birth and death dates. I also automatically generate the artwork URL and add it to the dataset.
- [ ] Penn Museum
- [ ] QAGOMA
- [ ] Rijksmuseum
- [ ] Tate
[x] mark if they're processed
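
The note on the National Gallery of Art (DC) dataset mentions that artwork URLs are generated automatically. A minimal sketch of how that step could look is shown here. The URL template, the `objectid` column name, the `artwork_url` field, and the `add_artwork_urls` helper are all assumptions made for illustration; they are not taken from this repository or from the museum's actual URL scheme.

```python
import csv
import io

# Hypothetical URL template; the real pattern used for the dataset is not stated here.
URL_TEMPLATE = "https://example.org/collection/object/{object_id}"


def add_artwork_urls(rows, id_field="objectid", url_field="artwork_url"):
    """Return copies of the rows with a URL derived from each row's object ID.

    Rows without an ID get an empty URL rather than a malformed one.
    """
    enriched_rows = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are left untouched
        object_id = str(row.get(id_field, "")).strip()
        new_row[url_field] = (
            URL_TEMPLATE.format(object_id=object_id) if object_id else ""
        )
        enriched_rows.append(new_row)
    return enriched_rows


# Tiny in-memory stand-in for a museum CSV export (made-up sample data).
sample_csv = "objectid,title\n123,Example Painting\n,Untitled\n"
rows = list(csv.DictReader(io.StringIO(sample_csv)))
enriched = add_artwork_urls(rows)
```

Keeping the template in a single constant means that only one line needs to change if a museum restructures its website, which is relevant given that some sources already need manual URL adjustments.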
- The Metropolitan Museum of Art
- The MoMA
- National Gallery of Ireland
- Cleveland Museum of Art
- The National Gallery of Art (DC)
- Tate Gallery
- Art Institute of Chicago
- Harvard Art Museums
- Minneapolis Institute of Art
- Cooper Hewitt, Smithsonian Design Museum
The code in this repository (e.g., scripts, workflows) is licensed under the MIT License. See the LICENSE file for details.
The datasets in this repository are aggregated from other open CC0 datasets provided by top museums around the world. They are dedicated to the public domain under the CC0 1.0 Universal license. See the DATA_LICENSE file for more details.