-
Notifications
You must be signed in to change notification settings - Fork 46
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
22 changed files
with
233 additions
and
180 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,13 +1,21 @@ | ||
Partial indexing documents (AtomicUpdates) | ||
****************************************** | ||
|
||
This means whenever possible, only the necessary / specified attributes get updated in Solr, and more importantly, re-indexed by Plone's indexers. | ||
This means whenever possible, | ||
only the necessary/specified attributes get updated in Solr, | ||
and more importantly, | ||
re-indexed by Plone's indexers. | ||
|
||
With collective.recipe.solr a new configuration is introduced, called `updateLog`. ``updateLog`` is enabled by default and allows atomic updates. In detail it adds a new field ``_version_`` to the schema and also adds "<updateLog />" to your solr config. | ||
With collective.recipe.solr a new configuration is introduced, | ||
called ``updateLog``. | ||
``updateLog`` is enabled by default and allows atomic updates. | ||
In detail it adds a new field ``_version_`` to the schema and also adds "<updateLog />" to your solr config. | ||
|
||
Further all your indexes configured in solr.cfg needs the stored:true attribute (Except the ``default`` field). | ||
Further all your indexes configured in solr.cfg needs the ``stored:true`` attribute (except the ``default`` field). | ||
|
||
See http://wiki.apache.org/solr/Atomic_Updates for details. | ||
|
||
|
||
Also note, that the AtomicUpdate feature is no compatible with the "Index time boost" feature. You have to decide, whether using atomic updates, or boosting on index time. You can enable/disable atomic updates thru the collective.solr control panel. Atomic updates are enabled by default. | ||
Also note, that the AtomicUpdate feature is no compatible with the "Index time boost" feature. | ||
You have to decide, whether using atomic updates, or boosting on index time. | ||
You can enable/disable atomic updates through the collective.solr control panel. | ||
Atomic updates are enabled by default. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,14 +1,18 @@ | ||
Indexing binary documents | ||
************************* | ||
|
||
At this point collective.solr uses Plone's default capabilities to index binary documents via `portal_transforms` and installing command line tools like `wv2` or `pdftotext`. | ||
At this point collective.solr uses Plone's default capabilities to index binary documents. | ||
It does so via `portal_transforms` and installing command line tools like `wv2` or `pdftotext`. | ||
Work is under way to expose and use the `Apache Tika`_ Solr integration available via the `update/extract` handler. | ||
|
||
Once finished this will speed up indexing of binary documents considerably, as the extraction will happen out-of-process on the Solr server side. | ||
Once finished this will speed up indexing of binary documents considerably, | ||
as the extraction will happen out-of-process on the Solr server side. | ||
`Apache Tika`_ also supports a much larger list of formats than can be supported by adding external command line tools. | ||
|
||
There is room for more improvements in this area, as c.solr will still send the binary data to Solr as part of the end-user request/transaction. | ||
To further optimize this, Solr index operations can be stored in a task queue as provided by `plone.app.async` or solutions build on top of `Celery`. | ||
There is room for more improvements in this area, | ||
as collective.solr will still send the binary data to Solr as part of the end-user request/transaction. | ||
To further optimize this, | ||
Solr index operations can be stored in a task queue as provided by `plone.app.async` or solutions build on top of `Celery`. | ||
This is currently outside the scope of `collective.solr`. | ||
|
||
.. _`Apache Tika`: http://tika.apache.org/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.