Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'master' into ClassPropertyUsageAnalyzer
- Loading branch information
Showing
73 changed files
with
15,242 additions
and
12,505 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,20 +1,27 @@ | ||
Wikidata Toolkit | ||
================ | ||
|
||
[![Build Status](https://travis-ci.org/Wikidata/Wikidata-Toolkit.png?branch=master)](https://travis-ci.org/Wikidata/Wikidata-Toolkit) | ||
[![Coverage Status](https://coveralls.io/repos/Wikidata/Wikidata-Toolkit/badge.png?branch=master)](https://coveralls.io/r/Wikidata/Wikidata-Toolkit?branch=master) | ||
[![Maven Central](https://maven-badges.herokuapp.com/maven-central/org.wikidata.wdtk/wdtk-parent/badge.svg)](http://search.maven.org/#search|ga|1|g%3A%22org.wikidata.wdtk%22) | ||
[![Project Stats](https://www.openhub.net/p/Wikidata-Toolkit/widgets/project_thin_badge.gif)](https://www.openhub.net/p/Wikidata-Toolkit) | ||
|
||
This is the Java implementation of the Wikidata Toolkit, | ||
following the original [Wikibase Toolkit IEG proposal](https://meta.wikimedia.org/wiki/Grants:IEG/Wikidata_Toolkit). | ||
|
||
Documentation: [Wikidata Toolkit homepage](https://www.mediawiki.org/wiki/Wikidata_Toolkit) | ||
|
||
API documentation: [Wikidata Toolkit Javadocs](http://wikidata.github.io/Wikidata-Toolkit/) | ||
|
||
Authors: [Markus Kroetzsch](http://korrekt.org), [Julian Mendez](http://lat.inf.tu-dresden.de/~mendez/), [Fredo Erxleben](https://github.com/fer-rum), [Michael Guenther](https://github.com/guenthermi) | ||
|
||
License: [Apache 2.0](LICENSE.txt) | ||
|
||
|
||
Wikidata Toolkit | ||
================ | ||
|
||
[![Build Status](https://travis-ci.org/Wikidata/Wikidata-Toolkit.png?branch=master)](https://travis-ci.org/Wikidata/Wikidata-Toolkit) | ||
[![Coverage Status](https://coveralls.io/repos/Wikidata/Wikidata-Toolkit/badge.png?branch=master)](https://coveralls.io/r/Wikidata/Wikidata-Toolkit?branch=master) | ||
[![Maven Central](https://maven-badges.herokuapp.com/maven-central/org.wikidata.wdtk/wdtk-parent/badge.svg)](http://search.maven.org/#search|ga|1|g%3A%22org.wikidata.wdtk%22) | ||
[![Project Stats](https://www.openhub.net/p/Wikidata-Toolkit/widgets/project_thin_badge.gif)](https://www.openhub.net/p/Wikidata-Toolkit) | ||
|
||
Wikidata Toolkit is a Java library for accessing Wikidata and other Wikibase installations. It can be used to create bots, to perform data extraction tasks (e.g., convert all data in Wikidata to a new format), and to do large-scale analyses that are too complex for using a simple SPARQL query service. | ||
|
||
Documentation | ||
------------- | ||
|
||
* [Wikidata Toolkit homepage](https://www.mediawiki.org/wiki/Wikidata_Toolkit): project homepage with basic user documentation, including guidelines on how to setup your Java IDE for using Maven and git. | ||
* [Wikidata Toolkit examples](https://github.com/Wikidata/Wikidata-Toolkit-Examples): stand-alone Java project that shows how to use Wikidata Toolkit as a library for your own code. | ||
* [Wikidata Toolkit Javadocs](http://wikidata.github.io/Wikidata-Toolkit/): API documentation | ||
|
||
License and Credits | ||
------------------- | ||
|
||
Authors: [Markus Kroetzsch](http://korrekt.org), [Julian Mendez](http://lat.inf.tu-dresden.de/~mendez/), [Fredo Erxleben](https://github.com/fer-rum), [Michael Guenther](https://github.com/guenthermi), [Markus Damm](https://github.com/mardam), and [other contributors](https://github.com/Wikidata/Wikidata-Toolkit/graphs/contributors) | ||
|
||
License: [Apache 2.0](LICENSE.txt) | ||
|
||
The development of Wikidata Toolkit has been partially funded by the Wikimedia Foundation under the [Wikibase Toolkit Individual Engagement Grant](https://meta.wikimedia.org/wiki/Grants:IEG/Wikidata_Toolkit), and by the German Research Foundation (DFG) under [Emmy Noether grant KR 4381/1-1 "DIAMOND"](https://ddll.inf.tu-dresden.de/web/DIAMOND/en). | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,126 +1,146 @@ | ||
Wikidata Toolkit Release Notes | ||
============================== | ||
|
||
Version 0.5.0 | ||
------------- | ||
|
||
New features: | ||
* Support for reading and writing live entity data from wikidata.org or any other Wikibase site (issue #162) | ||
* New examples for illustrating read/write API support | ||
* Support for quantities with units of measurement (new feature in Wikibase; still beta) | ||
* New builder classes to simplify construction of EntityDocuments, Statements, and References | ||
* Support processing of local dump files by file name in code and command-line client (issue #136) | ||
* New example WorldMapProcessor that shows the generation of maps from geographic data | ||
* Improved output file naming for examples, taking dump date into account | ||
* RDF export uses property register for fewer Web requests during export | ||
* RDF export supports P1921 URI patterns to create links to external RDF datasets | ||
|
||
Bug fixes: | ||
* JSON conversion action of the command-line client was forgetting start of entity list. | ||
* Update URLs to use https instead of http | ||
* Support URLs in sites table that are not protocol-relative (issue #163) | ||
|
||
Incompatible changes: | ||
* EntityDocumentProcessorFilter has a modified constructor that requires a filter object | ||
to be given. The direct set methods to define the filter are no longer available. | ||
|
||
|
||
Version 0.4.0 | ||
------------- | ||
|
||
New features: | ||
* Support statements on property documents | ||
* More robust JSON parsing: recover after errors to process remaining file | ||
* Improved JSON serialization + an example program showing how to do it | ||
* Standard (POJO) datamodel implementation now is Serializable | ||
* Deep copy functionality for changing between datamodel implementations (DatamodelConverter) | ||
* Support for filtering data during copying (e.g., to keep only some languages/properties/sites). | ||
* Support arbitrary precision values in globe coordinates | ||
* Dependency on JSON.org has been removed to use the faster Jackson library everywhere | ||
|
||
Bug fixes: | ||
* Support RDF export of Monolingual Text Value data in statements. | ||
* Significant performance improvements in RDF export of taxonomy data. | ||
* Support new Wikimedia Foundation dump file index HTML format (Issue #114) | ||
|
||
Incompatible changes: | ||
* The datatype of all values in GlobeCoordinateValue (latitude, longitude, precision) has | ||
changed from long (fixed precision number) to double (floating point number) to match the JSON. | ||
* The JSON serializer class org.wikidata.wdtk.datamodel.json.JsonSerializer has vanished. It is | ||
replaced by the org.wikidata.wdtk.datamodel.json.jackson.JsonSerializer (almost same interface). | ||
|
||
|
||
Version 0.3.0 | ||
------------- | ||
|
||
New features: | ||
* Added full support for reading data from the API JSON format (now used in all dumps); | ||
reading JSON dumps also became much faster with this change | ||
* Improved examples (more, faster, easier-to-read programs); documentation on each | ||
example is now found in the Readme.md file in the example package | ||
* Added iterator access to all statements of an item document, all statements in a statement | ||
group, all qualifiers in a claim, all snaks in a snak group, and all snaks in a reference | ||
* Dump files are downloaded to temporary files first to prevent incomplete downloads | ||
from causing errors | ||
* Datamodel objects can now be constructed using the static methods of Datamodel. This makes | ||
object creation more convenient. | ||
|
||
Minor changes: | ||
* ItemIdValue and PropertyIdValue objects now have a "site IRI" that can be retrieved. | ||
This was called "base IRI" in earlier releases and was only used to construct the full | ||
IRI. The new concept is that this IRI is actually the identifier for the site that the | ||
entity comes from. It is important to make it retrievable since it is needed (like in | ||
previous versions) to construct the object using the factory. | ||
* A new helper package in the datamodel module contains common hashCode(), equals(), and | ||
toString() methods that can be used by any datamodel implementation. | ||
|
||
Bug fixes: | ||
* Fix grouping of Statements when reading data from dumps (Issue #78) | ||
|
||
|
||
Version 0.2.0 | ||
------------- | ||
|
||
New features: | ||
* Support for serializing Wikibase data in RDF (as illustrated in new example); | ||
see http://korrekt.org/page/Introducing_Wikidata_to_the_Linked_Data_Web for details | ||
* Simplified code for dump file processing: new helper class DumpProcessingController | ||
* Support for resolving site links, based on information from the sites table dump | ||
(as demonstrated in a new example program) | ||
* Support for SnakGroups (data model updated to group Snaks by property in all lists) | ||
* Support for serializing Wikibase data in JSON (as illustrated in new example) | ||
|
||
Bug fixes: | ||
* Support changed Wikimedia dump HTML page format, which caused download to fail (Issue #70) | ||
* Support processing of property documents when parsing dumps (Issue #67) | ||
* Support SomeValueSnak and NoValueSnak in references (Issue #44) | ||
* Use correct site links when importing data from dumps (Issue #37) | ||
* Do not attempt to download unfinished dump files (Issue #63) | ||
|
||
Incompatible changes: | ||
* The processing of dumpfiles was simplified, using a new class DumpProcessingController. | ||
The former method WmfDumpFileManager#processRecentRevisionDumps() was replaced by | ||
DumpProcessingController#processAllRecentRevisionDumps(). See the examples for example | ||
code. | ||
* Dump files no longer support the retrieval of the maximal revision id, since this | ||
information is no longer published for the main dumps on the Wikimedia site. | ||
|
||
|
||
Version 0.1.0 | ||
------------- | ||
|
||
New features: | ||
* Initial Java implementation of Wikibase datamodel | ||
* Support for downloading Wikimedia dumpfiles | ||
* Support for parsing MediaWiki XML dumps | ||
* Support for parsing Wikibase dump contents to get entity data | ||
* Example Java program shows how to process Wikidata dump files | ||
|
||
Bug fixes: | ||
* not applicable; this is the very first release | ||
|
||
Know issues: | ||
* Entities loaded from dump get wrong base IRI (issue #43) | ||
* URLs for sitelinks are missing (issue #37) | ||
|
||
|
||
Wikidata Toolkit Release Notes | ||
============================== | ||
|
||
Version 0.6.0 | ||
------------- | ||
|
||
A new stand-alone example project is now showing how to use WDTK as a library: | ||
https://github.com/Wikidata/Wikidata-Toolkit-Examples | ||
|
||
New features: | ||
* Support for new Wikidata property type "external identifier" | ||
* Support for new Wikidata property type "math" | ||
* Bots: support maxlag parameter and edit-rate throttling | ||
* Bots: better Wikidata API error handling | ||
* Bots: several real-world bot examples | ||
* New convenience methods for accessing Wikidata Java objects, for simpler code | ||
* full compatibility with Java 8 | ||
|
||
Bug fixes: | ||
* Fix NullPointerException when trying to establish API connection (issue #217) | ||
* Avoid test failures on some platforms (based on too strict assumptions) | ||
|
||
|
||
Version 0.5.0 | ||
------------- | ||
|
||
New features: | ||
* Support for reading and writing live entity data from wikidata.org or any other Wikibase site (issue #162) | ||
* New examples for illustrating read/write API support | ||
* Support for quantities with units of measurement (new feature in Wikibase; still beta) | ||
* New builder classes to simplify construction of EntityDocuments, Statements, and References | ||
* Support processing of local dump files by file name in code and command-line client (issue #136) | ||
* New example WorldMapProcessor that shows the generation of maps from geographic data | ||
* Improved output file naming for examples, taking dump date into account | ||
* RDF export uses property register for fewer Web requests during export | ||
* RDF export supports P1921 URI patterns to create links to external RDF datasets | ||
|
||
Bug fixes: | ||
* JSON conversion action of the command-line client was forgetting start of entity list. | ||
* Update URLs to use https instead of http | ||
* Support URLs in sites table that are not protocol-relative (issue #163) | ||
|
||
Incompatible changes: | ||
* EntityDocumentProcessorFilter has a modified constructor that requires a filter object | ||
to be given. The direct set methods to define the filter are no longer available. | ||
|
||
|
||
Version 0.4.0 | ||
------------- | ||
|
||
New features: | ||
* Support statements on property documents | ||
* More robust JSON parsing: recover after errors to process remaining file | ||
* Improved JSON serialization + an example program showing how to do it | ||
* Standard (POJO) datamodel implementation now is Serializable | ||
* Deep copy functionality for changing between datamodel implementations (DatamodelConverter) | ||
* Support for filtering data during copying (e.g., to keep only some languages/properties/sites). | ||
* Support arbitrary precision values in globe coordinates | ||
* Dependency on JSON.org has been removed to use the faster Jackson library everywhere | ||
|
||
Bug fixes: | ||
* Support RDF export of Monolingual Text Value data in statements. | ||
* Significant performance improvements in RDF export of taxonomy data. | ||
* Support new Wikimedia Foundation dump file index HTML format (Issue #114) | ||
|
||
Incompatible changes: | ||
* The datatype of all values in GlobeCoordinateValue (latitude, longitude, precision) has | ||
changed from long (fixed precision number) to double (floating point number) to match the JSON. | ||
* The JSON serializer class org.wikidata.wdtk.datamodel.json.JsonSerializer has vanished. It is | ||
replaced by the org.wikidata.wdtk.datamodel.json.jackson.JsonSerializer (almost same interface). | ||
|
||
|
||
Version 0.3.0 | ||
------------- | ||
|
||
New features: | ||
* Added full support for reading data from the API JSON format (now used in all dumps); | ||
reading JSON dumps also became much faster with this change | ||
* Improved examples (more, faster, easier-to-read programs); documentation on each | ||
example is now found in the Readme.md file in the example package | ||
* Added iterator access to all statements of an item document, all statements in a statement | ||
group, all qualifiers in a claim, all snaks in a snak group, and all snaks in a reference | ||
* Dump files are downloaded to temporary files first to prevent incomplete downloads | ||
from causing errors | ||
* Datamodel objects can now be constructed using the static methods of Datamodel. This makes | ||
object creation more convenient. | ||
|
||
Minor changes: | ||
* ItemIdValue and PropertyIdValue objects now have a "site IRI" that can be retrieved. | ||
This was called "base IRI" in earlier releases and was only used to construct the full | ||
IRI. The new concept is that this IRI is actually the identifier for the site that the | ||
entity comes from. It is important to make it retrievable since it is needed (like in | ||
previous versions) to construct the object using the factory. | ||
* A new helper package in the datamodel module contains common hashCode(), equals(), and | ||
toString() methods that can be used by any datamodel implementation. | ||
|
||
Bug fixes: | ||
* Fix grouping of Statements when reading data from dumps (Issue #78) | ||
|
||
|
||
Version 0.2.0 | ||
------------- | ||
|
||
New features: | ||
* Support for serializing Wikibase data in RDF (as illustrated in new example); | ||
see http://korrekt.org/page/Introducing_Wikidata_to_the_Linked_Data_Web for details | ||
* Simplified code for dump file processing: new helper class DumpProcessingController | ||
* Support for resolving site links, based on information from the sites table dump | ||
(as demonstrated in a new example program) | ||
* Support for SnakGroups (data model updated to group Snaks by property in all lists) | ||
* Support for serializing Wikibase data in JSON (as illustrated in new example) | ||
|
||
Bug fixes: | ||
* Support changed Wikimedia dump HTML page format, which caused download to fail (Issue #70) | ||
* Support processing of property documents when parsing dumps (Issue #67) | ||
* Support SomeValueSnak and NoValueSnak in references (Issue #44) | ||
* Use correct site links when importing data from dumps (Issue #37) | ||
* Do not attempt to download unfinished dump files (Issue #63) | ||
|
||
Incompatible changes: | ||
* The processing of dumpfiles was simplified, using a new class DumpProcessingController. | ||
The former method WmfDumpFileManager#processRecentRevisionDumps() was replaced by | ||
DumpProcessingController#processAllRecentRevisionDumps(). See the examples for example | ||
code. | ||
* Dump files no longer support the retrieval of the maximal revision id, since this | ||
information is no longer published for the main dumps on the Wikimedia site. | ||
|
||
|
||
Version 0.1.0 | ||
------------- | ||
|
||
New features: | ||
* Initial Java implementation of Wikibase datamodel | ||
* Support for downloading Wikimedia dumpfiles | ||
* Support for parsing MediaWiki XML dumps | ||
* Support for parsing Wikibase dump contents to get entity data | ||
* Example Java program shows how to process Wikidata dump files | ||
|
||
Bug fixes: | ||
* not applicable; this is the very first release | ||
|
||
Know issues: | ||
* Entities loaded from dump get wrong base IRI (issue #43) | ||
* URLs for sitelinks are missing (issue #37) | ||
|
||
|
Oops, something went wrong.