Merge remote-tracking branch 'upstream/0.11.x'
newgene committed Aug 31, 2022
2 parents fe98613 + a1f4823 commit 234d4fe
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions docs/tutorial/studio_tutorial.rst
@@ -224,9 +224,10 @@ in a dictionary indexed by gene ID. The final documents are assembled in the las
yield doc
-.. note:: The `_id` key is mandatory and represents a unique identifier for this document. The type must be a string. The _id key is
+.. note:: The `_id` key is mandatory and represents a unique identifier for this document. The type must be a string. The `_id` key is
used when data from multiple datasources are merged together, that process is done according to its value
-(all documents sharing the same _id from different datasources will be merged together).
+(all documents sharing the same `_id` from different datasources will be merged together). Due to the `indexing limitation <https://www.elastic.co/guide/en/elasticsearch/reference/8.4/mapping-id-field.html>`_, the length of
+the `_id` key should be kept no more than 512.

.. note:: In this specific example, we read the whole content of this input file in memory, then store annotations per gene. The data itself
is small enough to do this, but memory usage always needs to be cautiously considered when we write a parser.
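
The `_id` constraints described in the note (mandatory, string-typed, at most 512 in length) could be checked in a parser before each document is yielded. The following is only a minimal sketch: the helper name `check_doc_id` and the constant `MAX_ID_BYTES` are hypothetical and not part of the tutorial or of any library, and it assumes the limit is counted in UTF-8 bytes, as the linked Elasticsearch page describes.

```python
# Hypothetical helper, not part of the tutorial's code base.
# Assumption: the 512 limit is measured in UTF-8 bytes.
MAX_ID_BYTES = 512


def check_doc_id(doc):
    """Return doc unchanged if its `_id` is usable, otherwise raise."""
    _id = doc.get("_id")
    if not isinstance(_id, str):
        # Covers both a missing key (None) and a non-string value.
        raise TypeError("`_id` must be a string, got %r" % type(_id).__name__)
    if not _id:
        raise ValueError("`_id` must not be empty")
    size = len(_id.encode("utf-8"))
    if size > MAX_ID_BYTES:
        raise ValueError("`_id` is %d bytes, limit is %d" % (size, MAX_ID_BYTES))
    return doc
```

In a parser this might be used as `yield check_doc_id(doc)` in place of the plain `yield doc`, so that an invalid identifier fails early instead of at indexing time.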