Import milli 🎉 #3346

Kerollmops · 2023-01-16T15:44:58Z

Main work

integrate the milli repository as an internal crate into this repo
Update the Cargo.toml accordingly
Ensure meilisearch-type now uses the internal milli crate and not the remote repository
Update the milli's version to follow the meilisearch one

Also

Removed the beta tests in test CI (will be re-integrated later if needed)
Move and modify milli's README into the milli folder
remove the script folder from milli
Removed useless CI (release-drafter and enforce-label)

⚠️ Also, import all the release-v1.0.0 until a5c4fb included (merged of the PR #3334)

671: Update version for the next release (v0.35.0) in Cargo.toml files r=Kerollmops a=meili-bot ⚠️ This PR is automatically generated. Check the new version is the expected one before merging. Co-authored-by: curquiza <curquiza@users.noreply.github.com>

... that were reintroduced after a rebase

668: Fix many Clippy errors part 2 r=ManyTheFish a=ehiggs This brings us a step closer to enforcing clippy on each build. # Pull Request ## Related issue This does not fix any issue outright, but it is a second round of fixes for clippy after meilisearch/milli#665. This should contribute to fixing meilisearch/milli#659. ## What does this PR do? Satisfies many issues for clippy. The complaints are mostly: * Passing reference where a variable is already a reference. * Using clone where a struct already implements `Copy` * Using `ok_or_else` when it is a closure that returns a value instead of using the closure to call function (hence we use `ok_or`) * Unambiguous lifetimes don't need names, so we can just use `'_` * Using `return` when it is not needed as we are on the last expression of a function. ## PR checklist Please check if your PR fulfills the following requirements: - [x] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)? - [x] Have you read the contributing guidelines? - [x] Have you made sure that the title is accurate and descriptive of the changes? Thank you so much for contributing to Meilisearch! Co-authored-by: Ewan Higgs <ewan.higgs@gmail.com>

Fixes #661 and #2905

Originally written by ManyTheFish here: https://gist.github.com/ManyTheFish/f840e37cb2d2e029ce05396b4d540762 Co-authored-by: ManyTheFish <many@meilisearch.com>

Moved the actual test into a separate function used by both the existing test and the new test.

619: Refactor the Facets databases to enable incremental indexing r=curquiza a=loiclec # Pull Request ## What does this PR do? Party fixes meilisearch/milli#605 by making the indexing of the facet databases (i.e. `facet_id_f64_docids` and `facet_id_string_docids`) incremental. It also closes #327 and #2820 . Two more untracked bugs were also fixed: 1. The facet distribution algorithm did not respect the `maxFacetValues` parameter when there were only a few candidate document ids. 2. The structure of the levels > 0 of the facet databases were not updated following the deletion of documents ## How to review this PR First, read this comment to get an overview of the changes. Then, based on this comment, raise any concerns you might have about: 1. the new structure of the databases 2. the algorithms for sort, facet distribution, and range search 3. the new/removed heed codecs Then, weigh in on the following concerns: 1. adding `fuzzcheck` as a fuzz-only dependency may add too much complexity for the benefits it provides 2. the `ByteSliceRef` and `StrRefCodec` are misnamed or should not exist 3. the new behaviour of facet distributions can be considered incorrect 4. incremental deletion is useless given that documents are always deleted in bulk ## What's left for me to do 1. Re-read everything once to make sure I haven't forgotten anything 2. Wait for the results of the benchmarks and see if (1) they provide enough information (2) there was any change in performance, especially for search queries. Then, maybe, spend some time optimising the code. 3. Test whether the `info`/`http-ui` crates survived the refactor ## Old structure of the `facet_id_f64_docids` and `facet_id_string_docids` databases Previously, these two databases had different but conceptually similar structures. For each field id, the facet number database had the following format: ``` ┌───────────────────────────────┬───────────────────────────────┬───────────────┐ ┌───────┐ │ 1.2 – 2 │ 3.4 – 100 │ 102 – 104 │ │Level 2│ │ │ │ │ └───────┘ │ a, b, d, f, z │ c, d, e, f, g │ u, y │ ├───────────────┬───────────────┼───────────────┬───────────────┼───────────────┤ ┌───────┐ │ 1.2 – 1.3 │ 1.6 – 2 │ 3.4 – 12 │ 12.3 – 100 │ 102 – 104 │ │Level 1│ │ │ │ │ │ │ └───────┘ │ a, b, d, z │ a, b, f │ c, d, g │ e, f │ u, y │ ├───────┬───────┼───────┬───────┼───────┬───────┼───────┬───────┼───────┬───────┤ ┌───────┐ │ 1.2 │ 1.3 │ 1.6 │ 2 │ 3.4 │ 12 │ 12.3 │ 100 │ 102 │ 104 │ │Level 0│ │ │ │ │ │ │ │ │ │ │ │ └───────┘ │ a, b │ d, z │ b, f │ a, f │ c, d │ g │ e │ e, f │ y │ u │ └───────┴───────┴───────┴───────┴───────┴───────┴───────┴───────┴───────┴───────┘ ``` where the first line is the key of the database, consisting of : - the field id - the level height - the left and right bound of the group and the second line is the value of the database, consisting of: - a bitmap of all the docids that have a facet value within the bounds The `facet_id_string_docids` had a similar structure: ``` ┌───────────────────────────────┬───────────────────────────────┬───────────────┐ ┌───────┐ │ 0 – 3 │ 4 – 7 │ 8 – 9 │ │Level 2│ │ │ │ │ └───────┘ │ a, b, d, f, z │ c, d, e, f, g │ u, y │ ├───────────────┬───────────────┼───────────────┬───────────────┼───────────────┤ ┌───────┐ │ 0 – 1 │ 2 – 3 │ 4 – 5 │ 6 – 7 │ 8 – 9 │ │Level 1│ │ "ab" – "ac" │ "ba" – "bac" │ "gaf" – "gal" │"form" – "wow" │ "woz" – "zz" │ └───────┘ │ a, b, d, z │ a, b, f │ c, d, g │ e, f │ u, y │ ├───────┬───────┼───────┬───────┼───────┬───────┼───────┬───────┼───────┬───────┤ ┌───────┐ │ "ab" │ "ac" │ "ba" │ "bac" │ "gaf" │ "gal" │ "form"│ "wow" │ "woz" │ "zz" │ │Level 0│ │ "AB" │ " Ac" │ "ba " │ "Bac" │ " GAF"│ "gal" │ "Form"│ " wow"│ "woz" │ "ZZ" │ └───────┘ │ a, b │ d, z │ b, f │ a, f │ c, d │ g │ e │ e, f │ y │ u │ └───────┴───────┴───────┴───────┴───────┴───────┴───────┴───────┴───────┴───────┘ ``` where, **at level 0**, the key is: * the normalised facet value (string) and the value is: * the original facet value (string) * a bitmap of all the docids that have this normalised string facet value **At level 1**, the key is: * the left bound of the range as an index in level 0 * the right bound of the range as an index in level 0 and the value is: * the left bound of the range as a normalised string * the right bound of the range as a normalised string * a bitmap of all the docids that have a string facet value within the bounds **At level > 1**, the key is: * the left bound of the range as an index in level 0 * the right bound of the range as an index in level 0 and the value is: * a bitmap of all the docids that have a string facet value within the bounds ## New structure of the `facet_id_f64_docids` and `facet_id_string_docids` databases Now both the `facet_id_f64_docids` and `facet_id_string_docids` databases have the exact same structure: ``` ┌───────────────────────────────┬───────────────────────────────┬───────────────┐ ┌───────┐ │ "ab" (2) │ "gaf" (2) │ "woz" (1) │ │Level 2│ │ │ │ │ └───────┘ │ [a, b, d, f, z] │ [c, d, e, f, g] │ [u, y] │ ├───────────────┬───────────────┼───────────────┬───────────────┼───────────────┤ ┌───────┐ │ "ab" (2) │ "ba" (2) │ "gaf" (2) │ "form" (2) │ "woz" (2) │ │Level 1│ │ │ │ │ │ │ └───────┘ │ [a, b, d, z] │ [a, b, f] │ [c, d, g] │ [e, f] │ [u, y] │ ├───────┬───────┼───────┬───────┼───────┬───────┼───────┬───────┼───────┬───────┤ ┌───────┐ │ "ab" │ "ac" │ "ba" │ "bac" │ "gaf" │ "gal" │ "form"│ "wow" │ "woz" │ "zz" │ │Level 0│ │ │ │ │ │ │ │ │ │ │ │ └───────┘ │ [a, b]│ [d, z]│ [b, f]│ [a, f]│ [c, d]│ [g] │ [e] │ [e, f]│ [y] │ [u] │ └───────┴───────┴───────┴───────┴───────┴───────┴───────┴───────┴───────┴───────┘ ``` where for all levels, the key is a `FacetGroupKey<T>` containing: * the field id (`u16`) * the level height (`u8`) * the left bound of the range (`T`) and the value is a `FacetGroupValue` containing: * the number of elements from the level below that are part of the range (`u8`, =0 for level 0) * a bitmap of all the docids that have a facet value within the bounds (`RoaringBitmap`) The right bound of the range is now implicit, it is equal to `Excluded(next_left_bound)`. In the code, the key is always encoded using `FacetGroupKeyCodec<C>` where `C` is the codec used to encode the facet value (either `OrderedF64Codec` or `StrRefCodec`) and the value is encoded with `FacetGroupValueCodec`. Since both databases share the same structure, we can implement almost all operations only once by treating the facet value as a byte slice (i.e. `FacetGroupKey<&[u8]>` encoded as `FacetGroupKeyCodec<ByteSliceRef>`). This is, in my opinion, a big simplification. The reason for changing the structure of the databases is to make it possible to incrementally add a facet value to an existing database. Since the `facet_id_string_docids` used to store indices to `level 0` in all levels > 0, adding an element to level 0 would potentially invalidate all the indices. Note that the original string value of a facet is no longer stored in this database. ## Incrementally adding a facet value Here I describe how we can add a facet value to the new database incrementally. If we want to add the document with id `z` and facet value `gap`., then we want to add/modify the elements highlighted below in pink: <img width="946" alt="Screenshot 2022-09-12 at 10 14 54" src="https://user-images.githubusercontent.com/6040237/189605532-fe4b0f52-e13d-4b3c-92d9-10c705953e3d.png"> which results in: <img width="662" alt="Screenshot 2022-09-12 at 10 23 29" src="https://user-images.githubusercontent.com/6040237/189607015-c3a37588-b825-43c2-878a-f8f85c000b94.png"> * one element was added in level 0 * one key/value was modified in level 1 * one value was modified in level 2 Adding this element was easy since we could simply add it to level 0 and then increase the `group_size` part of the value for the level above. However, in order to keep the structure balanced, we can't always do this. If the group size reaches a threshold (`max_group_size`), then we split the node into two. For example, let's imagine that `max_group_size` is `4` and we add the docid `y` with facet value `gas`. First, we add it in level 0: <img width="904" alt="Screenshot 2022-09-12 at 10 30 40" src="https://user-images.githubusercontent.com/6040237/189608391-531f9df1-3424-4f1f-8344-73eb194570e5.png"> Then, we realise that the group size of its parent is going to reach the maximum group size (=4) and thus we split the parent into two nodes: <img width="919" alt="Screenshot 2022-09-12 at 10 33 16" src="https://user-images.githubusercontent.com/6040237/189608884-66f87635-1fc6-41d2-a459-87c995491ac4.png"> and since we inserted an element in level 1, we also update level 2 accordingly, by increasing the group size of the parent: <img width="915" alt="Screenshot 2022-09-12 at 10 34 42" src="https://user-images.githubusercontent.com/6040237/189609233-d4a893ff-254a-48a7-a5ad-c0dc337f23ca.png"> We also have two other parameters: * `group_size` is the default group size when building the database from scratch * `min_level_size` is the minimum number of elements that a level should contain When the highest level size is greater than `group_size * min_level_size`, then we create an additional level above it. There is one more edge case for the insertion algorithm. While we normally don't modify the existing left bounds of a key, we have to do it if the facet value being inserted is smaller than the first left bound. For example, inserting `"aa"` with the docid `w` would change the database to: <img width="756" alt="Screenshot 2022-09-12 at 10 41 56" src="https://user-images.githubusercontent.com/6040237/189610637-a043ef71-7159-4bf1-b4fd-9903134fc095.png"> The root of the code for incremental indexing is the `FacetUpdateIncremental` builder. ## Incrementally removing a facet value TODO: the algorithm was implemented and works, but its current API is: `fn delete(self, facet_value, single_docid)`. It removes the given document id from all keys containing the given facet value. I don't think it is the right way to implement it anymore. Perhaps a bitmap of docids should be given instead. This is fairly easy to do. But since we batch document deletions together (because of soft deletion), it's not clear to me anymore that incremental deletion should be implemented at all. ## Bulk insertion While it's faster to incrementally add a single facet value to the database, it is sometimes **slower** to repeatedly add facet values one-by-one instead of doing it in bulk. For example, during initial indexing, we'd like to build the database from a list of facet values and associated document ids in one go. The `FacetUpdateBulk` builder provides a way to do so. It works by: 1. clearing all levels > 0 from the DB 2. adding all new elements in level 0 3. rebuilding the higher levels from scratch The algorithm for bulk insertion is the same as the previous one. ## Choosing between incremental and bulk insertion On my computer, I measured that is about 50x slower to add N facet values incrementally than it is to re-build a database with N facet values in level 0. Therefore, we dynamically choose to use either incremental insertion or bulk insertion based on (1) the number of existing elements in level 0 of the database and (2) the number of facet values from the new documents. This is imprecise but is mainly aimed at avoiding the worst-case scenario where the incremental insertion method is used repeatedly millions of times. ## Fuzz-testing **Potentially controversial:** I fuzz-tested incremental addition and deletion using fuzzcheck, which found many bugs. The fuzz-test consists of inserting/deleting facet values and docids in succession, each operation is processed with different parameters for `group_size`, `max_group_size`, and `min_level_size`. After all the operations are processed, the content of level 0 is compared to the content of an equivalent structure with a simple and easily-checked implementation. Furthermore, we check that the database has a correct structure (all groups from levels > 0 correctly combine the content of their children). I also visualised the code coverage found by the fuzz-test. It covered 100% of the relevant code except for `unreachable/panic` statements and errors returned by `heed`. The fuzz-test and the fuzzcheck dependency are only compiled when `cargo fuzzcheck` is used. For now, the dependency is from a local path on my computer, but it can be changed to a crate version if we decide to keep it. ## Algorithms operating on the facet databases There are four important algorithms making use of the facet databases: 1. Sort, ascending 2. Sort, descending 3. Facet distribution 4. Range search Previously, the implementation of all four algorithms was based on a number of iterators specific to each database kind (number or string): `FacetNumberRange`, `FacetNumberRevRange`, `FacetNumberIter` (with a reversed and reducing/non-reducing option), `FacetStringGroupRange`, `FacetStringGroupRevRange`, `FacetStringLevel0Range`, `FacetStringLevel0RevRange`, and `FacetStringIter` (reversed + reducing/non-reducing). Now, all four algorithms have a unique implementation shared by both the string and number databases. There are four functions: 1. `ascending_facet_sort` in `search/facet/facet_sort_ascending.rs` 2. `descending_facet_sort` in `search/facet/facet_sort_descending.rs` 3. `iterate_over_facet_distribution` in `search/facet/facet_distribution_iter.rs` 4. `find_docids_of_facet_within_bounds` in `search/facet/facet_range_search.rs` I have tried to test them with some snapshot tests but more testing could still be done. I don't *think* that the performance of these algorithms regressed, but that will need to be confirmed by benchmarks. ## Change of behaviour for facet distributions Previously, the original string value of a facet was stored in the level 0 of `facet_id_string_docids `. This is no longer the case. The original string value was used in the implementation of the facet distribution algorithm. Now, to recover it, we pick a random document id which contains the normalised string value and look up the original one in `field_id_docid_facet_strings`. As a consequence, it may be that the string value returned in the field distribution does not appear in any of the candidates. For example, ```json { "id": 0, "colour": "RED" } { "id": 1, "colour": "red" } ``` Facet distribution for the `colour` field among the candidates `[1]`: ``` { "RED": 1 } ``` Here, "RED" was given as the original facet value even though it does not appear in the document id `1`. ## Heed codecs A number of heed codecs related to the facet databases were removed: * `FacetLevelValueF64Codec` * `FacetLevelValueU32Codec` * `FacetStringLevelZeroCodec` * `StringValueCodec` * `FacetStringZeroBoundsValueCodec` * `FacetValueStringCodec` * `FieldDocIdFacetStringCodec` * `FieldDocIdFacetF64Codec` They were replaced by: * `FacetGroupKeyCodec<C>` (replaces all key codecs for the facet databases) * `FacetGroupValueCodec` (replaces all value codecs for the facet databases) * `FieldDocIdFacetCodec<C>` (replaces `FieldDocIdFacetStringCodec` and `FieldDocIdFacetF64Codec`) Since the associated encoded item of `FacetGroupKeyCodec<C>` is `FacetKey<T>` and we often work with `FacetKey<&[u8]>` and `FacetKey<&str>`, then we need to have codecs that encode values of type `&str` and `&[u8]`. The existing `ByteSlice` and `Str` codecs do not work for that purpose (their `EItem` are `[u8]` and `str`), I have also created two new codecs: * `ByteSliceRef` is a codec with a `EItem = DItem = &[u8]` * `StrRefCodec` is a codec with a `EItem = DItem = &str` I have also factored out the code used to encode an ordered f64 into its own `OrderedF64Codec`. Co-authored-by: Loïc Lecrenier <loic@meilisearch.com>

Add Run Clippy to bors.toml

Dont apply clippy for tests for now Fix clippy warnings of filter-parser package parent 8352febd646ec4bcf56a44161e5c4dce0e55111f author unvalley <38400669+unvalley@users.noreply.github.com> 1666325847 +0900 committer unvalley <kirohi.code@gmail.com> 1666791316 +0900 Update .github/workflows/rust.yml Co-authored-by: Clémentine Urquizar - curqui <clementine@meilisearch.com> Allow clippy lint too_many_argments Allow clippy lint needless_collect Allow clippy lint too_many_arguments and type_complexity Fix for clippy warnings comparison_chains Fix for clippy warnings vec_init_then_push Allow clippy lint should_implement_trait Allow clippy lint drop_non_drop Fix lifetime clipy warnings in filter-paprser Execute cargo fmt Fix clippy remaining warnings Fix clippy remaining warnings again and allow lint on each place

Co-authored-by: ManyTheFish <many@meilisearch.com>

Remove clippy job Fix clippy error type_complexity Restore ambiguous change

bors · 2023-01-16T17:47:36Z

try

Build failed:

Run Rustfmt

dureuill · 2023-01-16T18:40:12Z

I'm personally really afraid of having the internal crate out of sync. Will you have the rigor to update them each time you make a change? It makes no point to having un-synchronized versions in the crates if they make no sense because you forget to update them.

@curquiza I feel like this could be automated in CI, maybe with https://github.com/obi1kenobi/cargo-semver-checks, or with some custom logic based on cargo metadata.

Meanwhile, I see no real drawback to just using 0.0.0 in milli

curquiza · 2023-01-17T08:34:10Z

Good tool @dureuill 👍 But once we choose to publish milli right? I don't see any point in having this additional tool until we decide to publish milli

Meanwhile, I see no real drawback to just using 0.0.0 in milli

I just have to change this automation, because, so far, it updates every */src/Cargo.toml file.
Should be ok, I hope. I often struggle to anticipate what is easy or not to do in a CI.

dureuill · 2023-01-17T08:38:40Z

💪 good luck curquiza. In my experience, everything tends to be more complicated than it should be in CI.

But once we choose to publish milli right? I don't see any point in having this additional tool until we decide to publish milli

Agreed!

loiclec · 2023-01-17T08:50:52Z

I am afraid something in my original message was misunderstood. I was not talking about milli specifically, but about EVERY crate, that is:

meilisearch
meilisearch-types
meilisearch-auth
meili-snap
index-scheduler
dump
file-store
permissive-json-pointer
milli
filter-parser
flatten-serde-json
json-depth-checker
benchmarks

None of them (including meilisearch!) have a stable API, and therefore none of them should have the version 1.0.0. I like the idea of using 0.0.0 for all of them (again, including meilisearch!).

curquiza · 2023-01-17T08:53:16Z

Which cargo.toml version is used in the /version route?

loiclec · 2023-01-17T08:57:33Z

It is meilisearch's Cargo.toml if I am not mistaken, thus the idea of either:

splitting meilisearch into meilisearch-lib (0.0.0) and meilisearch-cli (1.0.0) (I am not 100% sure it would work correctly, but it's easy to verify); or:
defining the version manually in the code

dureuill · 2023-01-17T08:58:10Z

Which cargo.toml version is used in the /version route?

Excellent question! The version in the /version route uses the CARGO_PKG_VERSION of the meilisearch package, however there's a versioning.rs file in meilisearch-types that also uses this environment variable (in the context of meilisearch-types), notably to check the DB version

Looking for CARGO_PKG_VERSION reveals it is also being used in the dump crate.

Looks like the assumption that the version is the same for all the crates was made up to now.

(to clarify, each package uses its own version for the value of the compile-time CARGO_PKG_VERSION environment variable)

Kerollmops · 2023-01-17T15:18:04Z

One solution to the fact that the meilisearch crate version could be to simply delete the lib.rs file.
Wouldn't it fix the issue? This way nobody but the main.rs file can access the internal API.

About using the 0.0.0 version everywhere, it is a good idea for crates that are not published on crates.io. Maintaining a valid version of the milli crate is not our main concern, we can continue to ship it by using GitHub tags until it is published officially.

So, what should I do in this PR? At least put 0.0.0 for the milli crate.

dureuill · 2023-01-18T08:34:25Z

Discussed internally, consensus seems to be:

All unpublished crates, which currently is all crates, switch to v1.0.0
The README is updated to specify: the crates in this repository are not currently available on crates.io and do not follow semver conventions

curquiza · 2023-01-18T09:27:07Z

README.md

@@ -101,3 +101,7 @@ Meilisearch is a search engine created by [Meili](https://www.welcometothejungle
 - For everything else, please check [this page listing some of the other places where you can find us](https://docs.meilisearch.com/learn/what_is_meilisearch/contact.html)

 Thank you for your support!
+
+## 📦 Internal crates and their versioning


Hum, I don't really like this title, it's too technical compared to what we have on it. We are a famous project and now the README is almost like a landing page. However I don't want to block the PR, I will make a PR suggestion later
(I notice we don't talk about contribution in the README, it will be also an opportunity to add this 😄)

curquiza · 2023-01-18T09:27:54Z

bors try

bors · 2023-01-18T10:07:02Z

try

Build succeeded:

Kerollmops · 2023-01-18T10:19:24Z

And here we are! The day has come to import milli once and for all into the Meilisearch repository 🎉
bors merge

bors · 2023-01-18T11:01:41Z

Build succeeded:

3399: Rework technical information in the README r=Kerollmops a=curquiza Following this #3346 (comment) Co-authored-by: curquiza <clementine@meilisearch.com> Co-authored-by: Clémentine Urquizar - curqui <clementine@meilisearch.com>

loiclec and others added 30 commits October 26, 2022 13:49

Fix formatting and warning after rebasing from main

b7f2428

Merge #671

d3f95e6

671: Update version for the next release (v0.35.0) in Cargo.toml files r=Kerollmops a=meili-bot ⚠️ This PR is automatically generated. Check the new version is the expected one before merging. Co-authored-by: curquiza <curquiza@users.noreply.github.com>

Merge remote-tracking branch 'origin/main' into facet-levels-refactor

2741756

Depend on released version of fuzzcheck from crates.io

631e991

Remove outdated files from http-ui/ and infos/

2fa85a2

... that were reintroduced after a rebase

Merge remote-tracking branch 'origin/main' into facet-levels-refactor

54c0cf9

[WIP] Fix phrase search containing stop words

62816dd

Fixes #661 and #2905

Add test for phrase search with stop words

6a10b67

Originally written by ManyTheFish here: https://gist.github.com/ManyTheFish/f840e37cb2d2e029ce05396b4d540762 Co-authored-by: ManyTheFish <many@meilisearch.com>

Perform filter after enumerate to keep origin indices

ef13c6a

Increment position even when it's a stop word in exactness criteria

709ab3c

Search for closest non-stop words in proximity criteria

3e19050

Use resolve_phrase in exactness and typo criteria

c8c666c

Fix snapshots to use new phrase type

d187b32

Run cargo fmt

bb9ce3c

Fix panic when phrase contains only one stop word and nothing else

2aa11af

Simplify stop word checking in create_primitive_query

77f1ff0

Add test for phrase search with stop words and all criteria at once

f1da623

Moved the actual test into a separate function used by both the existing test and the new test.

Consecutive is false when at least 1 stop word is surrounded by words

af33d22

Run cargo fmt

488d31e

Add clippy job

d8fed1f

Add Run Clippy to bors.toml

Execute cargo clippy --fix

811f156

Update phrase search to use new execute method

752d031

Change consecutive phrase search grouping logic

d35afa0

Co-authored-by: ManyTheFish <many@meilisearch.com>

Fix all clippy error after conflicts

f4ec1ab

Fix rust fmt

f3c0b05

fix clippy error and remove clippy job from ci

a1d7ed1

Remove clippy job Fix clippy error type_complexity Restore ambiguous change

Only call plane_sweep on subgroups when 2 or more are present

03eb5d8

Kerollmops added 3 commits January 17, 2023 16:26

Fix the CI to ignore a missing file

2b1f6a7

Make clippy happy

1b78231

Fix the formatting

1d507c8

Kerollmops force-pushed the import-milli branch from 2104339 to 1d507c8 Compare January 17, 2023 17:26

Add a note in the README about the crates versionning

0769090

Kerollmops force-pushed the import-milli branch from 10ef564 to 0769090 Compare January 18, 2023 09:08

curquiza approved these changes Jan 18, 2023

View reviewed changes

bors bot added a commit that referenced this pull request Jan 18, 2023

Try #3346:

ea942af

Kerollmops marked this pull request as ready for review January 18, 2023 10:17

bors bot merged commit 6b3da8a into main Jan 18, 2023

bors bot deleted the import-milli branch January 18, 2023 11:01

This was referenced Jan 18, 2023

Benchmarks running on main are failing #3386

Closed

Rework technical information in the README #3399

Merged

curquiza mentioned this pull request Feb 8, 2023

Publish milli in crates.io #3367

Open

meili-bot added the v1.1.0 PRs/issues solved in v1.1.0 released on 2023-04-03 label Apr 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Import milli 🎉 #3346

Import milli 🎉 #3346

Kerollmops commented Jan 16, 2023 •

edited by curquiza

Loading

bors bot commented Jan 16, 2023

dureuill commented Jan 16, 2023

curquiza commented Jan 17, 2023 •

edited

Loading

dureuill commented Jan 17, 2023

loiclec commented Jan 17, 2023

curquiza commented Jan 17, 2023

loiclec commented Jan 17, 2023 •

edited

Loading

dureuill commented Jan 17, 2023 •

edited

Loading

Kerollmops commented Jan 17, 2023 •

edited

Loading

dureuill commented Jan 18, 2023

curquiza Jan 18, 2023

curquiza commented Jan 18, 2023

bors bot commented Jan 18, 2023

Kerollmops commented Jan 18, 2023

bors bot commented Jan 18, 2023

Import milli 🎉 #3346

Import milli 🎉 #3346

Conversation

Kerollmops commented Jan 16, 2023 • edited by curquiza Loading

bors bot commented Jan 16, 2023

try

dureuill commented Jan 16, 2023

curquiza commented Jan 17, 2023 • edited Loading

dureuill commented Jan 17, 2023

loiclec commented Jan 17, 2023

curquiza commented Jan 17, 2023

loiclec commented Jan 17, 2023 • edited Loading

dureuill commented Jan 17, 2023 • edited Loading

Kerollmops commented Jan 17, 2023 • edited Loading

dureuill commented Jan 18, 2023

curquiza Jan 18, 2023

Choose a reason for hiding this comment

curquiza commented Jan 18, 2023

bors bot commented Jan 18, 2023

try

Kerollmops commented Jan 18, 2023

bors bot commented Jan 18, 2023

Kerollmops commented Jan 16, 2023 •

edited by curquiza

Loading

curquiza commented Jan 17, 2023 •

edited

Loading

loiclec commented Jan 17, 2023 •

edited

Loading

dureuill commented Jan 17, 2023 •

edited

Loading

Kerollmops commented Jan 17, 2023 •

edited

Loading