100 000 Bildminnen

A collection of scripts and data connected to the 2022-2023 project 100 000 Bildminnen. A project done together with, and with funding from, Nordiska museet.

Not included in this repo is commons-diff, the tool developed for extracting changes to file pages on Wikimedia Commons in support of roundtripping. This tool is instead provided in its own repo where it has been developed with an aim to be useful beyond this project.

For the uploaded images see Commons:Nordiska museet/100 000 Bildminnen.

The final report for the project can be found at 100 000 Bildminnen - Slutrapport.pdf.

Structure

The repo is structured so that each script lives in a different directory. The directory contains the code of the script as well as a requirements.txt file to be installed via pip.

The directory may also contain a output_data subdirectory containing the final outputs from this scripts when run right after the end of the project. If this subdirectory exists it will also contain an _inputs.md file documenting the inputs used to produce each output.

Scripts

Below is a very brief description of each script.

mediaviews: Get all media-views/mediarequests for files in a Commons category during a provided time span.
wp_captions: Get all images in a Commons category and for each get all global usages and associated captions.
deriv_detector: Attempt to identify derivative files by analysing which other files link to files in a Commons category.
file_count: Show the growth of a Commons category of files by analysing the upload dates of its category.
diff_stats: Extract some statistics from the commons-diff output files. The output_data directory also contains the final commons-diff output.

Disclaimer

While many of the scripts have been generalised to be of use outside the context of this project they are primarilly provided here for convenience and documentation purposes. They are unlikely to be maintained and further development may take place in other repos.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
deriv_detector		deriv_detector
diff_stats		diff_stats
file_count		file_count
mediaviews		mediaviews
wp_captions		wp_captions
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

100 000 Bildminnen

Structure

Scripts

Disclaimer

About

Releases

Packages

Languages

License

Wikimedia-Sverige/nordiska-2022

Folders and files

Latest commit

History

Repository files navigation

100 000 Bildminnen

Structure

Scripts

Disclaimer

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages