Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SOLR-14726: Initial draft of a new quickstart guide #2594

Draft
wants to merge 2 commits into
base: branch_8x
Choose a base branch
from

Conversation

chatman
Copy link
Contributor

@chatman chatman commented Oct 26, 2021

A new quickstart guide that can potentially replace (or live side by side with) the Solr tutorial.
This is WIP at the moment, but would appreciate early feedback and thoughts.

@chatman chatman marked this pull request as draft October 26, 2021 18:04
A single document can be indexed as:
[source,subs="verbatim,attributes+"]
----
$ curl --request POST \
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A nit pick is that the collection is tech productions, and we have books..... Maybe we should think (separately) renaming techproducts to just products.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, good idea. I just took those docs off the Solr tutorial (which indexes books into techproducts). But, clearly, it is time for a better example.

----

== Commit
After documents are indexed into a collection, they are not immediately available for searching. In order to have them searchable, a commit operation (also called `refresh` in other search engines like OpenSearch etc.) is needed. Commits can be scheduled at periodic intervals using auto-commits as follows.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i don't know if introducing terms used by other search engines is useful... though maybe we want to build up a gloassary that would list "equivalent" terms from other engines?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A glossary sounds like a very good idea, for people coming from different systems.

Copy link
Contributor Author

@chatman chatman Oct 26, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i don't know if introducing terms used by other search engines is useful

I feel that those coming from ES / OpenSearch backgrounds might be able to relate better. My main motivation with this document is to cut down on paragraphs of text and have more copy-paste-able snippets, esp. using JSON/V2 apis, to make Solr more appealing to those who find ES easy to use (mainly due to their superior beginner documentation).

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That makes sense...

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"solr for ES/OS refugees"

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Makes sense to me as a point of reference. It might be more economical to say "(also called refresh in ElasticSearch/OpenSearch)" ... unless there are other search engines that refer to this concept as "refresh"?

"create": {
"name": "techproducts",
"numShards": 1,
"replicationFactor": 1
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why no config attribute ?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thought the same. If the consensus is that we're going away from field guessing, then we should not promote the current _default config, but rather be explicit and reference the bundled techproducts configset. Or better, show them how to use Schema Designer to setup a configset for a certain dataset?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For quickstart examples, we don't need the user to use their own configsets. They can start with the default configset, add fields (schema API) and their indexing/searching.

If the consensus is that we're going away from field guessing, then we should not promote the current _default config, but rather be explicit and reference the bundled techproducts configset.

I'm more inclined to remove the techproducts configset. They can be downloaded from some web resource for those who need it.

Or better, show them how to use Schema Designer to setup a configset for a certain dataset?

+1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
5 participants