### Data from TUD Library

__From Manuel:__

- I got the metadata from their OAI site: http://oai.tudelft.nl/ir/

- Sickle Documentation - http://sickle.readthedocs.io/en/latest/

- Another OAI link - https://books.bk.tudelft.nl/index.php/press/oai
- All OAI repositories: http://www.openarchives.org/Register/BrowseSites
- MERLOT: https://www.merlot.org/merlot/index.htm



### Theory: OAI‐PMH (Open Archives Initiative Protocol for Metadata Harvesting)

OAI‐PMH is a protocol for transferring metadata about any electronically stored material over the web, and it is currently the basis of most interoperability efforts in the learning object repository field. Compared with similar initiatives such as Z39.50 (which addresses session management, management of result sets, and specification of predicates to filter results), OAI‐PMH was intentionally designed to be simple, which reduces implementation complexity and cost and so facilitates adoption. Nonetheless, the protocol was designed for transferring large amounts of metadata, so it is a reasonable solution for clients that need to aggregate or index metadata.

Paper **Interlinking educational resources and the web of data : A survey of challenges and approaches**
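
To make the "simple by design" point concrete: an OAI-PMH request is an ordinary HTTP GET with a `verb` parameter plus a few arguments. The sketch below only builds such a URL and does not contact any server. `build_oai_url` is a hypothetical helper name, not part of Sickle or pyoai.

```python
from urllib.parse import urlencode

# The six request verbs defined by OAI-PMH 2.0.
OAI_VERBS = {
    "Identify", "ListMetadataFormats", "ListSets",
    "ListIdentifiers", "ListRecords", "GetRecord",
}


def build_oai_url(base_url, verb, **params):
    """Return the GET URL for one OAI-PMH request (hypothetical helper)."""
    if verb not in OAI_VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    # 'verb' goes first, followed by the verb-specific arguments.
    query = urlencode({"verb": verb, **params})
    return f"{base_url}?{query}"


url = build_oai_url("http://oai.tudelft.nl/ir", "ListRecords",
                    metadataPrefix="oai_dc")
print(url)
```

Libraries such as Sickle issue requests of this shape and additionally follow the resumption tokens that a repository returns when a result list is split over several responses.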

Alternatives to Sickle:
- https://gist.github.com/lawlesst/1323535
- https://pypi.org/project/oaiharvest/
- https://github.com/infrae/pyoai

Set the IOPub data rate limit when large cell outputs are cut off (see https://github.com/jupyter/notebook/issues/2287):

Run `jupyter notebook --NotebookApp.iopub_data_rate_limit=10000000000`

The notebook's message about this limit reads:

To change this limit, set the config variable
`--NotebookApp.iopub_data_rate_limit`.

Current values:

NotebookApp.iopub_data_rate_limit=1000000.0 (bytes/sec)

NotebookApp.rate_limit_window=3.0 (secs)

In [1]:
from sickle import Sickle

In [3]:
sickle = Sickle('https://books.bk.tudelft.nl/index.php/press/oai')
# ignore_deleted must be the boolean True, not the string 'True'
records = sickle.ListRecords(metadataPrefix='oai_dc', ignore_deleted=True)

oFilePath = r"C:\Users\ani\Desktop\Course data Thesis\TUDelftBooks.xml"

with open(oFilePath, 'w', encoding="utf-8") as oFile:
    for rec in records:
        # To sample the data when the full dump is too big (about 150 MB),
        # enumerate the records and write only every n-th one.
        oFile.write(str(rec))
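
The loop above writes the records back to back, so the resulting file has no single root element and will not parse as one XML document. A minimal sketch of one way around that is shown below, assuming each record converts to an XML string via `str()` as in the cell above. `write_records` and the `<records>` wrapper element are hypothetical names chosen for illustration, and the demo uses made-up placeholder strings instead of harvested records.

```python
import os
import tempfile
import xml.etree.ElementTree as ET


def write_records(records, path):
    """Write records to one XML file under a single <records> root.

    `records` is any iterable whose items convert to an XML string
    with str(). Returns the number of records written.
    """
    count = 0
    with open(path, "w", encoding="utf-8") as out:
        out.write('<?xml version="1.0" encoding="UTF-8"?>\n<records>\n')
        for rec in records:
            out.write(str(rec))
            out.write("\n")  # one record per line keeps the file greppable
            count += 1
        out.write("</records>\n")
    return count


# Demo with placeholder strings standing in for harvested records.
fake_records = ["<record><id>1</id></record>", "<record><id>2</id></record>"]
demo_path = os.path.join(tempfile.mkdtemp(), "sample.xml")
written = write_records(fake_records, demo_path)

# The file now parses as a single document.
root = ET.parse(demo_path).getroot()
print(written, root.tag, len(root))
```

With a real harvest, the same function could be called as `write_records(records, oFilePath)`.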

In [None]:
items = {}

sickle = Sickle('http://oai.tudelft.nl/ir')
# ignore_deleted must be the boolean True, not the string 'True'
records = sickle.ListRecords(metadataPrefix='oai_dc', ignore_deleted=True)

print('Processing metadata')

for r in records:
    # Drop the 32-character prefix 'http://resolver.tudelft.nl/uuid:' to keep only the UUID
    uuid = r.metadata['identifier'][0][32:]
    items[uuid] = r.metadata

print(len(items))
print(items)

Processing metadata
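
The slice `[32:]` in the cell above depends on the exact length of the `http://resolver.tudelft.nl/uuid:` prefix, and `print(items)` on the full harvest is the kind of output that can hit the IOPub data rate limit. The sketch below shows a prefix-independent way to extract the UUID and a small preview helper. `uuid_from_identifier` and `preview` are hypothetical helper names, and the demo identifier is the one from the sample record in these notes.

```python
from itertools import islice


def uuid_from_identifier(identifier):
    """Return the text after 'uuid:' in a resolver URL, or None if absent."""
    _, found, tail = identifier.partition("uuid:")
    return tail if found else None


def preview(items, n=3):
    """Return the first n (key, value) pairs of a dict without printing it all."""
    return list(islice(items.items(), n))


identifier = "http://resolver.tudelft.nl/uuid:00036dbe-54d3-432a-a78f-83cd66211b5e"
uuid = uuid_from_identifier(identifier)
print(uuid)

# Small stand-in dict; with real data this would be the harvested `items`.
demo_items = {uuid: {"identifier": [identifier]}}
print(preview(demo_items, n=1))
```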


In [None]:
from oaipmh.client import Client
from oaipmh.metadata import MetadataRegistry, oai_dc_reader

URL = 'http://oai.tudelft.nl/ir'

registry = MetadataRegistry()
registry.registerReader('oai_dc', oai_dc_reader)
client = Client(URL, registry)

for record in client.listRecords(metadataPrefix='oai_dc'):
    print(record)

## TU Delft Library Metadata 1 Record

` <record xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
	<header>
	<identifier>oai:tudelft.nl:uuid:00036dbe-54d3-432a-a78f-83cd66211b5e</identifier>
	<datestamp>2017-10-18T22:57:59Z</datestamp>
	<setSpec>A-set</setSpec>
	</header>

	<metadata>
		<oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
		<dc:title>Een exploratief onderzoek naar arbeidsmigranten en veiligheid in drie Europese landen</dc:title><dc:creator>Guldenmund, F.W. (author)
		</dc:creator><dc:creator>Cleal, B. (author)</dc:creator>
		<dc:creator>Mearns, K. (author)</dc:creator>
		<dc:description>Het aantal ongevallen met verzuim van arbeidsmigranten of ‘nieuwkomers’ stijgt de laatste jaren. Onder andere naar aanleiding hiervan is er in opdracht van het Ministerie van Sociale Zaken en Werkgelegenheid een studie uitgevoerd naar de arbeidsveiligheid van deze groep. Drie Europese landen (Denemarken, Verenigd Koninkrijk, Nederland) namen deel aan de studie. Speciale aandacht ging uit naar de relatie tussen veiligheid en de nationaliteit van de migranten. Met behulp van vierenveertig interviews, een vragenlijstonderzoek in elf landen en Deense ongevalsdata wordt een beeld geschetst van de situatie van arbeidsmigranten in de drie landen. Hoewel nog veel materiaal anekdotisch is, kan geconcludeerd worden dat het om een kwetsbare groep gaat. Zo lang veel migranten ongeregistreerd aan het werk gaan in het buitenland, en de nationale ongevallen databases niet uitgebreider en systematischer gecodeerd worden, blijft echter de omvang van het probleem vooralsnog onduidelijk.</dc:description>
		<dc:description>Values and Technology</dc:description>
		<dc:description>Technology, Policy and Management</dc:description>
		<dc:publisher>Nederlandse Vereniging voor Arbeidshygiene</dc:publisher>
		<dc:date>2012-06-01</dc:date>
		<dc:type>journal article</dc:type>
		<dc:type>Text</dc:type>
		<dc:identifier>http://resolver.tudelft.nl/uuid:00036dbe-54d3-432a-a78f-83cd66211b5e</dc:identifier>
		<dc:language>nl</dc:language>
		<dc:relation>Tijdschrift voor Toegepaste Arbowetenschappen, 26 (3/4), 2012--0923-618X</dc:relation>
		<dc:rights>(c) 2012 Auteurs</dc:rights></oai_dc:dc>
	</metadata>
</record> `

## DC Elements

- title
- location
- creator
- description
- publisher
- date
- type
- identifier
- language
- relation
- rights
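
A list like this can also be derived from a record instead of being read off by hand. The following sketch uses only the standard library to collect every Dublin Core element in a record's XML. `dc_fields` is a hypothetical helper name, and the sample is a shortened version of the record shown above.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Namespace URI of the Dublin Core elements, as declared in the record.
DC_NS = "http://purl.org/dc/elements/1.1/"


def dc_fields(record_xml):
    """Map each Dublin Core element name to the list of its text values."""
    root = ET.fromstring(record_xml)
    fields = defaultdict(list)
    for elem in root.iter():
        # ElementTree spells qualified tags as '{namespace}localname'.
        if elem.tag.startswith("{" + DC_NS + "}"):
            name = elem.tag.split("}", 1)[1]
            fields[name].append((elem.text or "").strip())
    return dict(fields)


sample = """<record xmlns="http://www.openarchives.org/OAI/2.0/">
  <metadata>
    <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
               xmlns:dc="http://purl.org/dc/elements/1.1/">
      <dc:creator>Guldenmund, F.W. (author)
      </dc:creator><dc:creator>Cleal, B. (author)</dc:creator>
      <dc:date>2012-06-01</dc:date>
      <dc:language>nl</dc:language>
    </oai_dc:dc>
  </metadata>
</record>"""

fields = dc_fields(sample)
print(sorted(fields))
print(fields["creator"])
```

Repeated elements such as `dc:creator` end up as lists, which matches how Sickle exposes `record.metadata` values in the harvesting cell.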

### TUDelft Books 1 record

`<record xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
    <header>
        <identifier>oai:books.bk.tudelft.nl:publicationFormat/4</identifier>
        <datestamp>2013-03-02T12:44:41Z</datestamp>
        <setSpec>press:DASH</setSpec>
    </header>
    <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/  http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
            <dc:title xml:lang="en-US">DASH 07 - The Eco House: Typologies of Space, Production and Lifestyles</dc:title>
            <dc:creator>van Gameren, Dick; TU Delft, Architecture and the Built Environment</dc:creator>
            <dc:creator>van den Heuvel, Dirk; TU Delft, Architecture and the Built Environment</dc:creator>
            <dc:description xml:lang="en-US">Construction is responsible for about 20 per cent of total carbon dioxide emissions and 30 per cent of energy demand; this ranks it alongside the chemical and transport industries among the biggest polluters. Sustainability is therefore one of the most significant issues for designers and architects, as well as a challenging field for innovation and research.DASHgoes in search of the ideal eco house: from solar houses to Superuse, from Cradle-to- Cradle to support-infill systems and precise prefab-technology.
DASH The Eco House examines the history and the future of the sustainable home. This includes attention to basic principles (passive solar energy, semi-climate and ventilation) and material-use concepts. Essays by Daniel A. Barber, Machiel van Dorst, Jacques Vink, and Piet Vollaard provide not only a critical analysis of the current state of affairs, but also an inspirational exploration of possibilities for the future. Drop City pioneer Steve Baer looks back on the Construction of hisZome House and Jean-Philippe Vassal talks about his vision on architecture and sustainability. With extensive documentation of trendsetting homes by such architects as Frank Lloyd Wright, Ralph Erskine, Frei Otto, Lacaton &amp;amp; Vassal, Carlos Weeber, DAAD Architecten and 2012Architecten.
&amp;nbsp;</dc:description>
            <dc:publisher xml:lang="en-US">BK BOOKS</dc:publisher>
            <dc:date>2017-08-10</dc:date>
            <dc:type xml:lang="en-US">Book</dc:type>
            <dc:type xml:lang="en-US">text</dc:type>
            <dc:format>Paperback / softback (BC)
			</dc:format>
            <dc:identifier>https://books.bk.tudelft.nl/index.php/press/catalog/book/4</dc:identifier>
            <dc:identifier>9789056628536</dc:identifier>
            <dc:source xml:lang="en-US">BK BOOKS; </dc:source>
            <dc:rights xml:lang="en-US">For sale with exclusive rights in the specified countries or territories (sales restriction applies) (07)
			</dc:rights>
        </oai_dc:dc>
    </metadata>
</record>`
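
Two details of this record matter for later processing: the description still carries escaped HTML entities (`&amp;amp;`, `&amp;nbsp;`) that survive XML parsing as `&amp;` and `&nbsp;`, and `dc:identifier` appears twice, once as a catalog URL and once as an ISBN. A minimal cleanup sketch follows, assuming the values have already been extracted as plain strings. `clean_text` and `split_identifiers` are hypothetical helper names, and the 13-digit check is only a rough ISBN-13 heuristic, not a checksum validation.

```python
import html


def clean_text(text):
    """Decode leftover HTML entities and collapse runs of whitespace."""
    decoded = html.unescape(text)
    # str.split() with no arguments also treats the non-breaking space
    # produced by '&nbsp;' as whitespace.
    return " ".join(decoded.split())


def split_identifiers(identifiers):
    """Separate URL identifiers from 13-digit ISBN candidates."""
    urls = [i for i in identifiers if i.startswith(("http://", "https://"))]
    isbns = [i for i in identifiers if i.isdigit() and len(i) == 13]
    return urls, isbns


# Text as it looks after XML parsing has undone one level of escaping.
parsed_description = "Lacaton &amp; Vassal, Carlos Weeber\n&nbsp;"
print(clean_text(parsed_description))

urls, isbns = split_identifiers([
    "https://books.bk.tudelft.nl/index.php/press/catalog/book/4",
    "9789056628536",
])
print(urls, isbns)
```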

### MERLOT

Material Types

Learning materials in MERLOT can be categorized into 22 different material types.

- Animation: Visually and dynamically presents concepts, models, processes, and/or phenomena in space or time. Users can control their pace and movement through the material typically, but they cannot determine and/or influence the initial conditions or their outcomes/results.
- Assessment Tool: Forms, templates, and technologies for measuring teaching and/or learning performance.
- Assignment: Activities or lesson plans designed to enable students to learn skills and knowledge.
- Case Study: A narrative resource describing a complex interaction of real life factors to help illustrate the impact and/or interactions of concepts and factors in depth.
- Collection: A meaningful organization of learning resources such as web sites, documents, apps, etc. that provides users an easier way to discover the materials.
- Development Tool: Software development applications and platforms for authoring technology-based resources (e.g. web sites, learning objects, apps).
- Drill and Practice: Requires users to respond repeatedly to questions or stimuli presented in a variety of sequences. Users practice on their own, at their own pace, to develop their ability to reliably perform and demonstrate the target knowledge and skills.
- ePortfolio: A collection of electronic materials assembled and managed by a user. These may include text, electronic files, images, multimedia, blog entries, and links. E-portfolios are both demonstrations of the user’s abilities and platforms for self-expression, and, if they are online, they can be maintained dynamically over time. An e-portfolio can be seen as a type of learning record that provides actual evidence of achievement.
- Hybrid/Blended Course: The organization and presentation of course curriculum required to deliver a complete course that blends online and face-to-face teaching and learning activities.
- Learning Object Repository: A searchable database of at least 100 online resources that is available on the Internet and whose search results display an ordered list of items with descriptive metadata. A webpage with a list of links is not a learning object repository.
- Online Course: The organization and presentation of course curriculum required to deliver a complete course fully online.
- Online Course Module: A component or section of a course curriculum that can be presented fully online and independent from the complete course.
- Open (Access) Journal – Article: A journal or article in a journal that is free of cost to the end user and has a Creative Commons, public domain, or other public use license agreement.
- Open (Access) Textbook: An online textbook offered by its author(s) with Creative Commons, public domain, or other public use license agreement allowing use of the ebook at no additional cost.
- Presentation: Teaching materials (text and multimedia) used to present curriculum and concepts.
- Quiz/Test: Any assessment device intended to evaluate the knowledge and/or skills of learners.
- Reference Material: Material with no specific instructional objectives, similar to that found in the reference area of a library. Subject specific directories to other sites, texts, or general information are examples.
- Simulation: Approximates a real or virtual experience in which users’ actions affect the outcomes of tasks they have to complete. Learners determine and input initial conditions that generate output that is different from and changed by the initial conditions.
- Social Networking Tool: Websites and apps that allow users to communicate with others connected in an electronic network of self-identified user groups for the purpose of sharing information, calls for actions, reactions, etc.
- Syllabus: A document or website that outlines the requirements and expectations for completing a course of study.
- Tutorial: Users navigate through a set of scaffolded learning activities designed to meet stated learning objectives, structured to impart specific concepts or skills, and organized sequentially to integrate conceptual presentation, demonstration, practice and testing. Feedback on learner performance is an essential component of a tutorial.
- Workshop and Training Material: Materials best used in a workshop setting for the purpose of professional development.

### MERLOT metadata (partial)

- Material Type
- Author
- Date Created
- Date Modified
- Date added to MERLOT
- Title
- ID
- Material type
- Submitter
- Primary Audience
    - Role (educational, student etc)
- Technical format
    - Applet / Video / Wiki / Image / HTML / Text / PDF etc
- Language
- Cost
- Disciplines
- Keywords
- Material_description
- Rate
- Quality
    - Peer review rating
        - ratingCount
        - ratingValue
        - bestRating
        - worstRating
    - User rating
        - ratingCount
        - ratingValue
        - bestRating
        - worstRating
- URL (ID)
- Media (URL)
- Accessibility
- Mobile Compatibility
- Creative Commons
    - useRightsUrl
    - license
- Similar material disciplines
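
The nested part of this list (two ratings with the same four fields) maps naturally onto a small data structure. The sketch below is one possible in-memory representation and covers only a few of the fields. The class names, the snake_case attribute names, and the demo values are all illustrative assumptions, not MERLOT's schema or real MERLOT data.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Rating:
    """The four rating fields listed under 'Quality' above."""
    ratingCount: int
    ratingValue: float
    bestRating: float
    worstRating: float

    def normalised(self) -> Optional[float]:
        """Scale ratingValue to the 0..1 range between worst and best."""
        span = self.bestRating - self.worstRating
        if span == 0:
            return None
        return (self.ratingValue - self.worstRating) / span


@dataclass
class MerlotMaterial:
    """A partial record; attribute names are hypothetical."""
    id: str
    title: str
    material_type: str
    language: Optional[str] = None
    keywords: List[str] = field(default_factory=list)
    peer_review: Optional[Rating] = None
    user_rating: Optional[Rating] = None


# Placeholder values for illustration only.
demo = MerlotMaterial(
    id="example-id",
    title="Example material",
    material_type="Tutorial",
    user_rating=Rating(ratingCount=12, ratingValue=4.0,
                       bestRating=5.0, worstRating=1.0),
)
print(demo.user_rating.normalised())
```

Normalising both ratings to a common range makes peer review and user ratings comparable even if their scales differ.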