This repository - https://github.com/greyscalepress/catalogue-lumiere - contains a list of the 1428 identified films produced by the Lumière company between 1895-1905.
The list comes as a simple CSV file, containing a set of metadata for each title.
The data is also available through the website http://catalogue-lumiere.com/ which offers an interface for browsing / searching through specific parameters.
Understanding the data
Here is the key to the table:
- "Numero-original": the original catalog number of the film, as it appeared in the Lumiere catalog.
- "Numero-livre": The item number that was used by the researchers (Michelle Aubert and Jean-Claude Seguin) in their book "La production cinématographique des Frères Lumière" in 1996.
- "titre": The title of the film.
- "id-operateur": numerical ID of the camera operator.
- "operateur": name of the operator.
- "remarque": comments and notes.
- "description": the original item description, from the Lumière catalogue.
- "projection": information about screenings (when, where...)
- "lieu": human-readable location names (where it was filmed).
- "id-pays": country-ID.
- "id-ville": city-ID.
- "id-lieux": location-ID (mountain, seaside, train station, zoo...). 33 elements.
- "date": human readable date of production.
- "timestamp": machine-readable date stamp (approximate date of production).
- "info-1": mysterious metadata.
- "info-2": mysterious metadata.
- "info-3": mysterious metadata.
- "info-4": mysterious metadata.
- "info-5": mysterious metadata.
- "info-6": mysterious metadata.
- "info-7": mysterious metadata.
- "id-events": mysterious metadata.
- "id-genres": mysterious metadata. 14 elements.
- "id-sujet": types of people (acrobats, peasants, children...).
- "id-identity": numerical ID of persons appearing in the film.
- "sequence-1": mysterious metadata.
- "sequence-2": mysterious metadata.
- "personnes": notes about persons appearing in the film.
- "technique": technical details about the filming process (camera motions, etc).
- "id-objet": motives (train, boat, machines...). 19 elements.
- "id-mouv": type of action (arrival, departure...). 9 elements.
- "support": technical details about the film negative.
Source of the data
The data has been extracted from the CD-Rom that accompanies the book "La production cinématographique des Frères Lumière", published 1996 by the Centre national de la Cinématographie and Université Lumière-Lyon 2.
This catalogue is out-of-print, and will be hard to get, unless you have access to a good academic library.
The CD-Rom contains a bibliographic application, made in Macromedia Director. It was targeted for MacOS System 7 and Windows 95, and is unreadable on current operating systems.
However, the raw binary files of the CD-Rom contain some readable text data, which I was able to retrieve.
With some GREP magic, I produced this CSV table, that holds the essential metadata for the 1428 identified movies from the Lumiere catalogue.
What's the copyright of that data? Is it public domain?
The research team that produced this list gathered all the information from the original trade catalogues of the Lumière company (published 1897-1907), from newspapers published in that time period, and from an inventory established by the Cinémathèque française in 1948.
It's pretty much impossible to establish a clear copyright situation for this type of metadata.
Personally, I place all my contributions to this set of metadata under the Creative Commons Zero Public Domain Dedication (CC0), in order to facilitate research.