Find file
Fetching contributors…
Cannot retrieve contributors at this time
113 lines (70 sloc) 7.3 KB
%META:TOPICINFO{author="RogerHyam" date="1225725093" format="1.1" reprev="1.7" version="1.7"}%
%RED% These ideas are still under discussion. %BLACK%
---+ <nop>%TOPIC%
This page describes how you can improve the metatags in a species page so that it works well with species page indexers (see SpeciesPages). It will probably improve performance of any indexer.
---++ Introduction
HTML pages have a &lt;head&gt; section that contains information about the page that isn't normally displayed. Two of the elements that can occur in the head section are the [[][&lt;meta&gt;]] element and the [[][&lt;link&gt;]] element.
Dublin Core (DC) is the most common metadata vocabulary for describing all sorts of objects. The Dublin Core Metadata Initiative [[][describes how to express DC in HTML]] using these two elements. Using DC and one metadata term from the TDWG ontology it is possible to provide rich information in the page header that enable indexers to bind species pages to nomenclatural and other kinds of data thus making your species page part of a global system.
This page provides simple information on what to include in the &lt;head&gt; section of your species pages. You should not need to read other documents to get your pages marked up correctly but you can if you want to of course.
---++ Complete Example
The content to include is simple so it is best to start with a complete example and then explain the different parts.
<link rel="schema.DCTERMS" href="" >
<meta name="DCTERMS.title" content="Lythrum salicaria L." >
<meta name="DCTERMS.subject" content="SpeciesPage Plantae Lythrum salicaria Purple-loosestrife" >
<meta name="DCTERMS.coverage" content="GB IE 155 British Isles and Western Europe" >
<link rel="schema.TDWG" href="" >
<link rel="TDWG.hasName" href="" >
---++ schema.DCTERMS
The first line is not specific to SpeciesPages. It just defines the namespace for the Dublin Core metadata vocabulary that is used in the next three lines. Further explanation is given [[][here]].
---++ DCTERMS.title
The second line gives the 'title' of the resource. The content attribute of this line should contain the full name of the species including any authorship string and dates. Here it is used in an equivalent sense to the Darwin Core [[][ScientificName]] field.
---++ DCTERMS.subject
The third line gives the 'subject' of the resource. The Dublin Core Metadata Initiative makes the following comment "Typically, the subject will be represented using keywords, key phrases, or classification codes. Recommended best practice is to use a controlled vocabulary."
The content attribute of this line should start with the key word 'SpeciesPage' followed by the Kingdom of the species, followed by the genus of the species followed by the specific epithet of the species with a space between each word. Further key words can be added after the specific epithet.
The Kingdom should be one of: Eukaryota, Animalia, Plantae, Fungi, Chromista or Protozoa.
---++ DCTERMS.coverage
The fourth line gives the 'coverage' of the resource. The Dublin Core Metadata Initiative suggests the use of a controlled vocabulary for this field. Because species accounts are typically applicable at quite a high geographic level two controlled vocabularies are suggested.
The content attribute of this element should contain a combination of two letter alphabetic codes for countries from the [[][ISO-3166-1]] standard and/or three digit numeric United Nations [[][geographical region codes]].
A SpeciesPage indexer can digest the contents of this attribute and interpret two letter and three digit codes appropriately but ignore any other words that may be useful for other indexers.
---++ schema.TDWG
The fifth line is like the first line in that it defines a namespace. The namespace defined is for the TDWG TaxonConcept vocabulary that is used in the next line. Both this and the next line give a degree of precision that is desirable but optional.
---++ TDWG.hasName
The sixth and final line enables indexers to unambiguously link the species page with a name as defined by one of the major nomenclators; IPNI, IndexFungorum or ZooBank.
The href attribute of this line should contain the LSID of a name from one of these recognized nomenclators.
---++ Expressing Taxonomic Relationships
You can express valuable taxonomic relationships by adding other lines. If the page represents a taxon that you do not think should be used (e.g. you consider it a synonymous taxon) then you should state specifically in the text of the page that another taxon should be used and you should add this line in the header (below the schema.DCTERMS line).
<link rel="DCTERMS.isReplacedBy" href="http://link_to_other_species_page" >
In the accepted taxon page you could include this line in the header
<link rel="DCTERMS.replaces" href="http://link_to_other_species_page" >
You can indicate that this page represents a taxon that is a child of another taxon (e.g. a species of a genus) by including a line like this:
<link rel="DCTERMS.isPartOf" href="http://link_to_other_species_page" >
In the enclosing taxon page (e.g. the genus of a species) you can include a link like this:
<link rel="DCTERMS.hasPart" href="http://link_to_other_species_page" >
---++ Other Dublin Core Terms
The above recommendation is only concerned with the Dublin Core metadata terms to include for the specific benefit of Species Page indexes. You are encourage to follow best practice in including as many other Dublin Core terms as are relevant to your situation. Terms of particular importance are: created, modified, publisher, creator and license. The terms are documented [[][here]] and examples of expressing them are documented [[][here]].
---++ HTML or XHTML?
There are broadly two types of HTML used today (though several subtypes). If HTML documents are valid XML documents they are referred to as XHTML and begin with an declaration that looks like this:
&lt;?xml version="1.0" encoding="utf-8" ?&gt;
they also have XHTML mentioned in their DOCTYPE declaration. Regular HTML documents are not valid XML they don't start with a XML declaration or mention XHTML in their DOCTYPE declaration. Regular HTML pages are far more common that XHTML pages.
The examples given above are for regular HTML pages. They can be adapted for XHTML pages simply by adding a / (forward slash) before the closing &gt; of the link or meta elements for example this:
&lt;link rel="schema.DCTERMS" href="" &gt;
would become this:
&lt;link rel="schema.DCTERMS" href="" /&gt;
%SEARCH{"%TOPIC%" excludetopic="%TOPIC%" header="*Linking Topics*" format=" * $topic" nosearch="on" nototal="on" }%