-
Notifications
You must be signed in to change notification settings - Fork 10
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
10 changed files
with
56 additions
and
94 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,56 +1,29 @@ | ||
# Content in the search index and where it comes from | ||
|
||
This list documents the kinds of things included in Rummager's search indexes, | ||
and the apps currently responsible for publishing them as of June 2017. | ||
|
||
![Government content and HMRC manuals make up most of the content we index](rough_content_breakdown.png) | ||
|
||
For a broader view of the content that is available, see [Document types on GOV.UK](https://docs.publishing.service.gov.uk/document-types.html). | ||
For an overview view of the sorts of content that are available, see [Document types on GOV.UK](https://docs.publishing.service.gov.uk/document-types.html). | ||
|
||
## Whitehall | ||
|
||
This is what most publishers use to publish. Content appears on the ["inside government" part of GOV.UK](https://www.gov.uk/government/publications). There are 200,000 documents. | ||
|
||
Implemented in [searchable.rb](https://github.com/alphagov/whitehall/blob/master/app/models/searchable.rb). | ||
|
||
- 96460 publications | ||
- 53678 news articles | ||
- 11052 world location news articles | ||
- 8112 speeches | ||
- 4012 detailed guidance | ||
- 3771 document collections | ||
- 3766 consultations | ||
- 3684 statistics announcements | ||
- 2729 people | ||
- 1579 case study | ||
- 1109 corporate information pages | ||
- 1017 organisations | ||
- 677 policy groups | ||
- 567 statistical data sets | ||
- 501 fatality notices | ||
- 455 worldwide organisations | ||
- 318 ministers | ||
- 234 world locations | ||
- 63 topical events | ||
- 47 “topics” | ||
- 19 inside-government-links (DEPRECATED) | ||
- 18 take parts | ||
- 7 finders | ||
- 5 operational fields | ||
|
||
## Other publishing apps | ||
|
||
Most publishing apps, such as publisher and specialist-publisher, do not send | ||
content to Rummager directly. Instead, they publish content to the | ||
content to Search API directly. Instead, they publish content to the | ||
[publishing-api][publishing_api] which adds the content to a notifications queue | ||
to be ingested by rummager. | ||
to be ingested by search-api. | ||
|
||
See [ADR 001][adr_001] for more details on this approach. | ||
|
||
[publishing_api]: https://github.com/alphagov/publishing-api | ||
[adr_001]: https://github.com/alphagov/rummager/blob/master/doc/arch/adr-001-use-of-both-rabbitmq-and-sidekiq-queues.md | ||
[adr_001]: https://github.com/alphagov/search-api/blob/master/doc/arch/adr-001-use-of-both-rabbitmq-and-sidekiq-queues.md | ||
|
||
## Search admin | ||
Admin for GOV.UK search. Sends 506 "recommended links" to Rummager, so we can | ||
show external links in search results. | ||
Admin for GOV.UK search. Publishes "recommended links" to Search API, | ||
so we can show external links in search results; and "best bets", so | ||
selected search results can be artificially boosted to the top of the | ||
list. | ||
|
||
Implemented in [elastic_search_recommended_link.rb](https://github.com/alphagov/search-admin/blob/master/app/models/elastic_search_recommended_link.rb). | ||
Implemented in [elastic_search_recommended_link.rb](https://github.com/alphagov/search-admin/blob/master/app/models/elastic_search_recommended_link.rb) and [rummager_saver.rb](https://github.com/alphagov/search-admin/blob/master/app/services/rummager_saver.rb). |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,4 @@ | ||
# Rummager Documents API | ||
# Documents API | ||
|
||
### `POST /:index/documents` | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters