Integrate Uffizzi #2

waveywaves · 2022-11-08T11:41:08Z

No description provided.

github-actions · 2022-12-06T18:13:34Z

Uffizzi Preview deployment-13042 was deleted.

3499: Use the workspace inheritance r=Kerollmops a=irevoire

Use the workspace inheritance [introduced in Rust 1.64](https://blog.rust-lang.org/2022/09/22/Rust-1.64.0.html#cargo-improvements-workspace-inheritance-and-multi-target-builds). It allows us to define the version of meilisearch once in the main `Cargo.toml` and lets all the other `Cargo.toml` files use this version.

`@curquiza`, I added you as a reviewer because I had to patch some CI scripts. And `@Kerollmops`, I had to bump the `cargo_toml` crate because our version was getting old and didn't support the feature yet.

Also, in another PR, I would like to unify some of our dependencies to ensure we always stay in sync between all our crates.

Co-authored-by: Tamo <tamo@meilisearch.com>

Metrics feature was relying on old references. Refactored with inspiration from the `get_stats` method in `meilisearch/src/routes/lib.rs`. `enable_metrics_routes` added to options in `segment_analytics`. Resolves: meilisearch#3469 See also: meilisearch#2763

3467: Identify builds git tagged with `prototype-...` in CLI and analytics r=curquiza a=dureuill

# Pull Request

## What does this PR do?

- Parses the last git tag to extract a prototype name if:
  - The current build sits exactly on the prototype tag (not on a commit after the tag)
  - The prototype tag name respects the following conditions:
    1. starts with `prototype-`
    2. ends with a number
    3. the hyphen-separated segment right before the number is not a number (required to reject commits after the tag).
- Displays the prototype name in the launch summary in the CLI
- Sends the prototype name to analytics, if any
- Updates the prototype instructions in CONTRIBUTING.md

| `VERGEN_GIT_SEMVER_LIGHTWEIGHT` value | Prototype |
|---|---|
| `Some("prototype-geo-bounding-box-0-139-gcde89018")` | `None` (does not end with a number) |
| `Some("prototype-geo-bounding-box-0-139-89018")` | `None` (the segment before the last one is a number) |
| `Some("prototype-geo-bounding-box-0")` | `Some("prototype-geo-bounding-box-0")` |
| `Some("prototype-geo-bounding-box")` | `None` (does not end with a number) |
| `Some("geo-bounding-box-0")` | `None` (does not start with "prototype") |
| `None` | `None` |

Co-authored-by: Louis Dureuil <louis@meilisearch.com>
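The three conditions and the table above can be sketched as a small function. This is only an illustration of the rules as stated in the description, not the code from the PR: the function name `prototype_name` is hypothetical, and the input simply stands in for the `VERGEN_GIT_SEMVER_LIGHTWEIGHT` value.

```rust
/// Returns the prototype name when `describe` is exactly a prototype tag.
/// Hypothetical helper: `describe` stands in for the value of
/// `VERGEN_GIT_SEMVER_LIGHTWEIGHT`.
fn prototype_name(describe: Option<&str>) -> Option<&str> {
    let tag = describe?;

    // Condition 1: the tag starts with `prototype-`.
    if !tag.starts_with("prototype-") {
        return None;
    }

    // Look at the hyphen-separated segments, starting from the right.
    let mut segments = tag.rsplit('-');
    let last = segments.next()?;
    let before_last = segments.next()?;

    // Condition 2: the tag ends with a number.
    if last.parse::<u64>().is_err() {
        return None;
    }

    // Condition 3: the segment right before the number is not a number.
    // A build made after the tag would look like `...-0-139-gcde89018`.
    if before_last.parse::<u64>().is_ok() {
        return None;
    }

    Some(tag)
}
```

Each row of the table maps to one call, for example `prototype_name(Some("prototype-geo-bounding-box-0"))` yields the tag itself, while `prototype_name(Some("geo-bounding-box-0"))` yields `None`.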

3514: Bump version of mini-dashboard to v0.2.6 r=irevoire a=bidoubiwa Update the version of the mini-dashboard to v0.2.6. See [release notes](https://github.com/meilisearch/mini-dashboard/releases/tag/v0.2.6). Co-authored-by: Charlotte Vermandel <charlottevermandel@gmail.com>

3515: Consider null as a valid geo field r=irevoire a=irevoire Fix meilisearch#3497 Associated spec; meilisearch/specifications#222 Co-authored-by: Tamo <tamo@meilisearch.com>

3319: Transparently resize indexes on MaxDatabaseSizeReached errors r=Kerollmops a=dureuill

# Pull Request

## Related issue

Related to meilisearch#3280, depends on meilisearch/milli#760

## What does this PR do?

### User standpoint

- Meilisearch no longer fails tasks that encounter the `milli::UserError(MaxDatabaseSizeReached)` error.
- Instead, these tasks are retried after increasing the maximum size allocated to the index where the failure occurred.

### Implementation standpoint

- Add `Batch::index_uid` to get the `index_uid` of a batch of tasks if there is one
- `IndexMapper::create_or_open_index` now takes an additional `size` argument that allows (re)opening indexes with a size different from the base `IndexScheduler::index_size` field
- `IndexScheduler::tick` now returns a `Result<TickOutcome>` instead of a `Result<usize>`. This offers more explicit control over what the behavior should be with respect to the next tick.
- Add `IndexStatus::BeingResized`, which contains a handle that a thread can use to wait for the resize operation to complete and the index to be available again.
- Add `IndexMapper::resize_index` to increase the size of an index.
- In `IndexScheduler::tick`, intercept task batches that failed due to `MaxDatabaseSizeReached` and resize the index that caused the error, then request a new tick that will eventually handle the still enqueued task.
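The resize-then-retry flow described above can be pictured with a toy model. Everything in this sketch is an assumption made for illustration: `ToyIndex`, `ToyScheduler`, the variants of `TickOutcome`, and the doubling step are stand-ins, not the real `IndexScheduler` or `IndexMapper` code.

```rust
use std::collections::VecDeque;

/// Hypothetical stand-in for the outcome of one scheduler tick.
#[derive(Debug, PartialEq)]
enum TickOutcome {
    /// Run another tick right away; the value is the number of tasks processed.
    TickAgain(u64),
    /// Nothing to do until new tasks are enqueued.
    WaitForSignal,
}

#[derive(Debug, PartialEq)]
enum TaskError {
    MaxDatabaseSizeReached,
}

/// Toy index: a fixed-size map that refuses writes that do not fit.
struct ToyIndex {
    map_size: u64,
    used: u64,
}

impl ToyIndex {
    fn write(&mut self, bytes: u64) -> Result<(), TaskError> {
        if self.used + bytes > self.map_size {
            return Err(TaskError::MaxDatabaseSizeReached);
        }
        self.used += bytes;
        Ok(())
    }
}

/// Toy scheduler: each queued task is just a number of bytes to write.
struct ToyScheduler {
    index: ToyIndex,
    queue: VecDeque<u64>,
}

impl ToyScheduler {
    fn tick(&mut self) -> TickOutcome {
        let bytes = match self.queue.front() {
            Some(&bytes) => bytes,
            None => return TickOutcome::WaitForSignal,
        };
        match self.index.write(bytes) {
            Ok(()) => {
                self.queue.pop_front();
                TickOutcome::TickAgain(1)
            }
            Err(TaskError::MaxDatabaseSizeReached) => {
                // Do not fail the task: grow the index, keep the task enqueued,
                // and ask for a new tick that will retry it.
                self.index.map_size = self.index.map_size.max(1) * 2;
                TickOutcome::TickAgain(0)
            }
        }
    }

    /// Ticks until the queue is drained; returns the number of processed tasks.
    fn run(&mut self) -> u64 {
        let mut processed = 0;
        while let TickOutcome::TickAgain(n) = self.tick() {
            processed += n;
        }
        processed
    }
}
```

The point of the richer return type is visible in `tick`: a failed batch can report "zero tasks processed, but tick again", which a plain task count could not distinguish from an empty queue.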
## Testing the PR

The following diff can be applied to this branch to make testing the PR easier:

<details>

```diff
diff --git a/index-scheduler/src/index_mapper.rs b/index-scheduler/src/index_mapper.rs
index 553ab45a..022b2f00 100644
--- a/index-scheduler/src/index_mapper.rs
+++ b/index-scheduler/src/index_mapper.rs
@@ -228,13 +228,15 @@ impl IndexMapper {

         drop(lock);

+        std::thread::sleep_ms(2000);
+
         let current_size = index.map_size()?;
         let closing_event = index.prepare_for_closing();
-        log::info!("Resizing index {} from {} to {} bytes", name, current_size, current_size * 2);
+        log::error!("Resizing index {} from {} to {} bytes", name, current_size, current_size * 2);

         closing_event.wait();

-        log::info!("Resized index {} from {} to {} bytes", name, current_size, current_size * 2);
+        log::error!("Resized index {} from {} to {} bytes", name, current_size, current_size * 2);

         let index_path = self.base_path.join(uuid.to_string());
         let index = self.create_or_open_index(&index_path, None, 2 * current_size)?;
@@ -268,8 +270,10 @@ impl IndexMapper {
                 match index {
                     Some(Available(index)) => break index,
                     Some(BeingResized(ref resize_operation)) => {
+                        log::error!("waiting for resize end");
                         // Deadlock: no lock taken while doing this operation.
                         resize_operation.wait();
+                        log::error!("trying our luck again!");
                         continue;
                     }
                     Some(BeingDeleted) => return Err(Error::IndexNotFound(name.to_string())),
diff --git a/index-scheduler/src/lib.rs b/index-scheduler/src/lib.rs
index 11b17d05..242dc095 100644
--- a/index-scheduler/src/lib.rs
+++ b/index-scheduler/src/lib.rs
@@ -908,6 +908,7 @@ impl IndexScheduler {
     ///
     /// Returns the number of processed tasks.
     fn tick(&self) -> Result<TickOutcome> {
+        log::error!("ticking!");
         #[cfg(test)]
         {
             *self.run_loop_iteration.write().unwrap() += 1;
diff --git a/meilisearch/src/main.rs b/meilisearch/src/main.rs
index 050c825a..63f312f6 100644
--- a/meilisearch/src/main.rs
+++ b/meilisearch/src/main.rs
@@ -25,7 +25,7 @@ fn setup(opt: &Opt) -> anyhow::Result<()> {

 #[actix_web::main]
 async fn main() -> anyhow::Result<()> {
-    let (opt, config_read_from) = Opt::try_build()?;
+    let (mut opt, config_read_from) = Opt::try_build()?;

     setup(&opt)?;

@@ -56,6 +56,8 @@ We generated a secure master key for you (you can safely copy this token):
         _ => (),
     }

+    opt.max_index_size = byte_unit::Byte::from_str("1MB").unwrap();
+
     let (index_scheduler, auth_controller) = setup_meilisearch(&opt)?;

     #[cfg(all(not(debug_assertions), feature = "analytics"))]
```

</details>

Mainly, these debug changes do the following:

- Set the default index size to 1MiB so that index resizes are initially frequent
- Turn some logs from info to error so that they can be displayed with `--log-level ERROR` (hiding the other infos)
- Add a long sleep between the beginning and the end of the resize so that we can observe the `BeingResized` index status (otherwise it would never come up in my tests)

## Open questions

- Is the growth factor of x2 the correct solution? For a `Vec` in memory it makes sense, but here we're manipulating quantities that are potentially on the order of 500GiB. For bigger indexes it may make more sense to add at most e.g. 100GiB on each resize operation, avoiding big steps like 500GiB -> 1TiB.

## PR checklist

Please check if your PR fulfills the following requirements:

- [ ] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)?
- [ ] Have you read the contributing guidelines?
- [ ] Have you made sure that the title is accurate and descriptive of the changes?

Thank you so much for contributing to Meilisearch!

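The alternative raised in the open question (double small indexes, but add at most a fixed amount to big ones) could look like the sketch below. The function name `next_map_size` and the `MAX_STEP` constant are hypothetical; the 100GiB figure is only the example value quoted in the question, not a decision taken in the PR.

```rust
const GIB: u64 = 1024 * 1024 * 1024;

/// Hypothetical upper bound on how much a single resize may add.
const MAX_STEP: u64 = 100 * GIB;

/// Doubles the size, but never grows by more than `MAX_STEP` in one resize.
fn next_map_size(current: u64) -> u64 {
    let doubled = current.saturating_mul(2);
    let capped = current.saturating_add(MAX_STEP);
    doubled.min(capped)
}
```

With these values, a 1GiB index still doubles to 2GiB, while a 500GiB index grows to 600GiB instead of jumping to 1TiB.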
3470: Autobatch addition and deletion r=irevoire a=irevoire

This PR adds the capability to meilisearch to batch document addition and deletion together.

Fix meilisearch#3440

--------------

Things to check before merging:

- [x] What happens if we delete the same documents multiple times -> add a test
- [x] What if a documentDeletion gets batched with a documentAddition but the index doesn't exist yet? It should not work

Co-authored-by: Louis Dureuil <louis@meilisearch.com>
Co-authored-by: Tamo <tamo@meilisearch.com>
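As a rough picture of what batching additions and deletions together means, the toy model below groups consecutive document operations and applies them in order. The `Task` enum, `batch_len`, and `apply_batch` are hypothetical names for this sketch and do not reflect the actual autobatcher; the missing-index case from the checklist is not modeled.

```rust
use std::collections::BTreeSet;

/// Hypothetical, simplified task kinds; document ids are plain integers here.
#[derive(Debug, Clone, PartialEq)]
enum Task {
    Add(Vec<u32>),
    Delete(Vec<u32>),
    /// Any task that is not a document addition or deletion.
    Other,
}

/// Number of tasks at the front of the queue that can go in one batch:
/// a run of additions and deletions, or a single other task.
fn batch_len(queue: &[Task]) -> usize {
    match queue.first() {
        None => 0,
        Some(Task::Other) => 1,
        Some(_) => queue
            .iter()
            .take_while(|task| !matches!(task, Task::Other))
            .count(),
    }
}

/// Applies a batch in order, so that the result matches running each task alone.
fn apply_batch(docs: &mut BTreeSet<u32>, batch: &[Task]) {
    for task in batch {
        match task {
            Task::Add(ids) => docs.extend(ids.iter().copied()),
            Task::Delete(ids) => {
                for id in ids {
                    // Deleting a document that is already gone is a no-op.
                    docs.remove(id);
                }
            }
            Task::Other => {}
        }
    }
}
```

In this model, deleting the same document several times within one batch leaves the same state as deleting it once, which is the property the first checklist item asks a test for.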

3505: Csv delimiter r=irevoire a=irevoire

Fixes meilisearch#3442
Closes meilisearch#2803
Specified in meilisearch/specifications#221

This PR is a reimplementation of meilisearch#2803, on the new engine. Thanks for your idea and initial PR `@MixusMinimax`; sorry I couldn’t update/merge your PR. Way too many changes happened on the engine in the meantime.

**Attention to reviewer**: I had to update deserr to implement the support of deserializing `char`s

-------

It introduces four new error messages:

- Invalid value in parameter csvDelimiter: expected a string of one character, but found an empty string
- Invalid value in parameter csvDelimiter: expected a string of one character, but found the following string of 5 characters: doggo
- csv delimiter must be an ascii character. Found: 🍰
- The Content-Type application/json does not support the use of a csv delimiter. The csv delimiter can only be used with the Content-Type text/csv.

And one error code:

- `invalid_index_csv_delimiter`

The `invalid_content_type` error code is now also used when we encounter the `csvDelimiter` query parameter with a non-csv content type.

Co-authored-by: Tamo <tamo@meilisearch.com>
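The first three error messages above suggest a validation along the following lines. This is a sketch under assumptions: the function name `parse_csv_delimiter`, the `Result<u8, String>` return type, and the way the checks are ordered are illustrative, and the real implementation goes through deserr rather than a hand-written function. The Content-Type check is left out.

```rust
/// Validates the `csvDelimiter` query parameter (hypothetical helper).
/// Returns the delimiter as a single byte, or one of the error messages
/// listed in the PR description.
fn parse_csv_delimiter(value: &str) -> Result<u8, String> {
    let mut chars = value.chars();
    match (chars.next(), chars.next()) {
        // No character at all.
        (None, _) => Err(String::from(
            "Invalid value in parameter csvDelimiter: expected a string of one character, but found an empty string",
        )),
        // Exactly one character, and it fits in one ASCII byte.
        (Some(c), None) if c.is_ascii() => Ok(c as u8),
        // Exactly one character, but not ASCII.
        (Some(c), None) => Err(format!(
            "csv delimiter must be an ascii character. Found: {}",
            c
        )),
        // More than one character.
        (Some(_), Some(_)) => Err(format!(
            "Invalid value in parameter csvDelimiter: expected a string of one character, but found the following string of {} characters: {}",
            value.chars().count(),
            value
        )),
    }
}
```

Counting with `chars()` rather than `len()` matters here: `🍰` is a single character spanning several bytes, so it must reach the ASCII check instead of being rejected as a multi-character string.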

Resolves: meilisearch#3469 See also: meilisearch#2763

tab in enable_metrics_route to fix cargo fmt issues Resolves: meilisearch#3469 See also: meilisearch#2763

3496: Fix metrics feature r=irevoire a=james-2001

# Pull Request

## Related issue

Resolves: meilisearch#3469
See also: meilisearch#2763

## What does this PR do?

As reported, the metrics feature was broken because it was still using an old reference to `meilisearch_auth::actions`. This commit switches to the new location, `meilisearch_types::keys::actions`.

The original issue was not *that* clear as to exactly what was broken, and the build logs have disappeared, but it seemed to just be this one-line fix. If this is not the case and I've missed the mark, let me know, and I'll head back to the drawing board.

## PR checklist

Please check if your PR fulfills the following requirements:

- [x] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)?
- [x] Have you read the contributing guidelines?
- [x] Have you made sure that the title is accurate and descriptive of the changes?

Co-authored-by: James <james.a.may.2001@gmail.com>

Even though the Docker cache was already being used for the Uffizzi builds, the cache layers did not seem to persist. This commit moves the Meilisearch build outside the Dockerfile so that we can use the Rust cache action. We also build for the musl target so that the resulting Meilisearch binary can be used in the Uffizzi ttyd image, which is based on Alpine.

waveywaves force-pushed the uffizzi branch 14 times, most recently from 17c6e89 to be6e4c8 · November 11, 2022 12:37

waveywaves mentioned this pull request Nov 11, 2022

feat: create a preview environment for every PR using Uffizzi meilisearch/meilisearch#3040

Merged

3 tasks

waveywaves force-pushed the uffizzi branch from be6e4c8 to 648be00 · November 11, 2022 12:51

waveywaves force-pushed the uffizzi branch from 648be00 to 654a5aa · December 6, 2022 17:28

Repository owner deleted a comment from github-actions bot Dec 6, 2022

waveywaves force-pushed the uffizzi branch from 654a5aa to 8e2a1ba · December 6, 2022 17:52

waveywaves force-pushed the uffizzi branch 2 times, most recently from beef775 to 37f07fb · December 23, 2022 12:28

waveywaves force-pushed the uffizzi branch 2 times, most recently from 6028d46 to dba5a4b · January 12, 2023 20:35

Kerollmops added 5 commits January 25, 2023 14:22

Add tests for the index patterns

474d4ec

Introduce the IndexUidPattern type

0b08413

Make it work with the dumps

29961b8

Make it work in the CreateApiKey struct

184b8af

Make it work for any all routes including stats and index swaps

ec7de4b

irevoire and others added 24 commits February 16, 2023 17:37

add two tests on the indexing of csvs

5367d8f

make cargo fmt&clippy happy

e79f6f8

cargo run execute meilisearch by default

f11c7d4

apply review suggestions

895ab29

Fix metrics feature

91048d2

Metrics feature was relying on old references. Refactored with inspiration from the `get_stats` method in `meilisearch/src/routes/lib.rs`. `enable_metrics_routes` added to options in `segment_analytics`. Resolves: meilisearch#3469 See also: meilisearch#2763

Consider null as a valid geo object

18796d6

Bump version of mini-dashboard to v0.2.6

dd120e0

Add Batch::index_uid

4c519c2

create_or_open_index takes a map_size argument

faf1e17

IndexScheduler::tick returns a TickOutcome

6cc3797

Add IndexStatus::BeingResized

1c670d7

Add IndexMapper::resize_index fn

951a5b5

Resize indexes when they're full

1116788

Make sure we don't leave the in memory hashmap in an inconsistent state

35f6c62

apply review suggestions

1479050

Merge meilisearch#3515

a8f6f10

3515: Consider null as a valid geo field r=irevoire a=irevoire Fix meilisearch#3497 Associated spec; meilisearch/specifications#222 Co-authored-by: Tamo <tamo@meilisearch.com>

Add feature flags to Opt structure

5cff435

Resolves: meilisearch#3469 See also: meilisearch#2763

Fix formatting issue on Opt struct

ce807d7

tab in enable_metrics_route to fix cargo fmt issues Resolves: meilisearch#3469 See also: meilisearch#2763

waveywaves force-pushed the uffizzi branch 3 times, most recently from e3471a8 to 44d5692 · February 21, 2023 11:48

waveywaves force-pushed the uffizzi branch from 44d5692 to 09a94e0 · February 21, 2023 11:55

waveywaves closed this Feb 21, 2023
