[API]: Endpoint for retrieving all policies #1698

Altair-Bueno · 2023-07-20T09:20:01Z

Signed-off-by: Altair-Bueno <altair.bueno@uma.es>

Altair-Bueno · 2023-07-20T09:25:56Z

.../src/main/java/org/eclipse/ditto/thingsearch/model/signals/commands/query/QueryPolicies.java

+ */
+@Immutable
+@JsonParsableCommand(typePrefix = ThingSearchCommand.TYPE_PREFIX, name = QueryPolicies.NAME)
+public final class QueryPolicies extends AbstractCommand<QueryPolicies> implements ThingSearchQueryCommand<QueryPolicies> {


It might be wise to use an abstract class to avoid code duplication, but:

I'm not sure what level of abstraction you are aiming for

Not even sure if it makes sense, just doing some rough changes until I learn the codebase

I would say to wait for abstracting it before it really provides a benefit, e.g. in reacting to the command in a similar way when applying the things/policies query.

Signed-off-by: Altair-Bueno <altair.bueno@uma.es>

Altair-Bueno · 2023-07-25T09:45:50Z

Hey @thjaeckle. I've been struggling to understand the control flow of the GET /search/things endpoint, and I'd like to summarize the steps to make sure I'm not missing anything important:

An incoming request to said endpoint is transformed into a Ditto command (QueryThings).
The SearchActor handles the command by validating it and retrieving all matching IDs from the persistence layer.
The retrieved IDs are then passed to QueryThingsPerRequestActor.
QueryThingsPerRequestActor instructs ThingsAggregatorProxyActor to retrieve all the corresponding things.
Finally, the ThingsAggregatorProxyActor gathers the requested things and responds back.

However, I'm curious as I don't understand the reasoning behind this approach. It appears quite complex and involves multiple messages between distributed services. Have you considered the possibility of using a more straightforward approach? On paper, a database query seems to be more performant and resource-efficient, as it would only require one microservice and the optimised database (in the end, Mongo is designed for high performance reads).

PS: This comment really helped me finally grasp where things come from

ditto/thingsearch/service/src/main/java/org/eclipse/ditto/thingsearch/service/starter/actors/SearchActor.java

Line 105 in 58ce86b

    
            * The ThingsSearchPersistence returns only Thing IDs. Thus, to provide complete Thing information to the requester,

thjaeckle · 2023-07-25T17:17:28Z

@Altair-Bueno this approach is basically done in order to not transfer 200 search results in a single message across the cluster (with a max. thing size of 100 KiB this would be a 20 MiB message).
We configured a max. message size in the cluster of 256 KiB in order to e.g. optimize memory consumption and avoid having to "reserve" a lot of otherwise unneeded memory for transferring huge messages.
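To make the numbers above concrete, here is a minimal sketch of the arithmetic. The class and method names are hypothetical (not Ditto APIs), the constants are the values quoted in this comment, and serialization overhead is ignored.

```java
// Hypothetical sketch: names are illustrative and not part of Ditto.
final class MessageSizeSketch {

    static final long KIB = 1024L;

    // Values quoted in the comment above.
    static final long MAX_THING_SIZE_BYTES = 100 * KIB;
    static final long MAX_CLUSTER_MESSAGE_BYTES = 256 * KIB;
    static final int MAX_PAGE_SIZE = 200;

    /** Worst-case payload of one message carrying a full page of results. */
    static long worstCasePageBytes(int pageSize, long maxEntityBytes) {
        return pageSize * maxEntityBytes;
    }

    /** Whether a payload of the given size stays within the cluster message limit. */
    static boolean fitsInOneMessage(long payloadBytes) {
        return payloadBytes <= MAX_CLUSTER_MESSAGE_BYTES;
    }

    public static void main(String[] args) {
        long page = worstCasePageBytes(MAX_PAGE_SIZE, MAX_THING_SIZE_BYTES);
        System.out.println("full page: " + (page / KIB) + " KiB");
        System.out.println("full page fits: " + fitsInOneMessage(page));
        System.out.println("single thing fits: " + fitsInOneMessage(MAX_THING_SIZE_BYTES));
    }
}
```

A full page of maximum-size things is 200 × 100 KiB = 20,000 KiB (roughly 20 MiB), far above the 256 KiB limit, while a single maximum-size thing stays below it.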

The solution to still be able to retrieve (max page size) 200 results in a single page at the search API is to

use the search index to find the "thingIds" which match the query
respond with the list of "thingIds"
then 1-by-1 ask for the data of the things from the "things" service
- this also will "filter" the responses based on the policy to not have data included which the user is not allowed to see
aggregate them back at the gateway into a single "QueryThingsResponse"
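The steps above could be sketched roughly as follows. This is a simplified, synchronous, in-memory illustration and not the actual actor-based implementation: `AggregationSketch`, `thingsService` and `aggregate` are hypothetical names, and policy enforcement is reduced to "readable or not", whereas the real filtering can also hide parts of a thing.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.Set;
import java.util.function.Function;

// Hypothetical sketch: names are illustrative and not part of Ditto.
final class AggregationSketch {

    /**
     * Stand-in for the "things" service: returns a thing only if it exists
     * and the requester is allowed to read it.
     */
    static Function<String, Optional<String>> thingsService(
            Map<String, String> store, Set<String> readableIds) {
        return id -> readableIds.contains(id)
                ? Optional.ofNullable(store.get(id))
                : Optional.empty();
    }

    /**
     * Asks for each thing individually (one small message per thing) and
     * collects the visible ones into a single page, preserving the ID order.
     */
    static List<String> aggregate(List<String> thingIds,
            Function<String, Optional<String>> retrieveThing) {
        List<String> page = new ArrayList<>();
        for (String id : thingIds) {
            retrieveThing.apply(id).ifPresent(page::add);
        }
        return page;
    }

    public static void main(String[] args) {
        // IDs as they would come back from the search index.
        List<String> matchingIds = List.of("ns:a", "ns:b");
        Map<String, String> store = Map.of(
                "ns:a", "{\"thingId\":\"ns:a\"}",
                "ns:b", "{\"thingId\":\"ns:b\"}");
        // The requester may only read ns:a.
        System.out.println(aggregate(matchingIds, thingsService(store, Set.of("ns:a"))));
    }
}
```

Each individual lookup stays small, and only the final aggregation step assembles the complete page.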

So yes, it is complex - but it is required to not exceed messaging limits.
Policies can also be 100 KiB big (default configuration) - so the same approach has to be applied here as well I fear.

Altair-Bueno · 2023-07-26T10:56:02Z

Thanks for your response, Thomas. I now understand the motivation behind performing multiple asks to the things service instead of a single big one. However, that doesn't explain why asking each actor was chosen over a database query from the gateway component itself.

The only reason I can think of is to avoid serving stale data (e.g. a thing with changes that haven't been persisted yet), but IMHO that seems acceptable given the drawbacks of the current approach: increased pressure on Akka, latency and software complexity. If one would like a more precise representation of the actual twin, a GET /things/{thingid} query could be performed to retrieve the latest changes.

Please keep in mind that I have little experience with Akka/actors and this might be common practice in actor systems. It just doesn't click with me to rely so heavily on actors when other approaches exist, especially when we are already relying on the database for the ID query.

thjaeckle · 2023-07-26T11:18:03Z

However, that doesn't explain why asking each actor was chosen over a database query from the gateway component itself.

As I wrote:

this also will "filter" the responses based on the policy to not have data included which the user is not allowed to see

The search index does not necessarily have all of the information which the thing contains - its purpose is "only" to return a list of IDs matching a query.
With this approach, it is also possible to e.g. limit the fields which the search index should index, as discussed in #1521

As things and policies apply the event sourcing pattern (e.g. in order to provide history and audit log capabilities), we cannot "simply" query their database from the gateway, as the actual state would have to be constructed from the persisted events. Using the database of another service without that service being aware of it is also something you should not do in service-oriented architectures anyway.
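As a rough illustration of why a plain database read does not yield the current state under event sourcing, consider this minimal sketch. The event type and the `replay` method are hypothetical and far simpler than Ditto's actual persistence model; they only show that the state is the result of applying all persisted events in order.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: names are illustrative and not part of Ditto.
final class EventSourcingSketch {

    /** Hypothetical event: sets an attribute, or deletes it when value is null. */
    static final class AttributeChanged {
        final String key;
        final String value;

        AttributeChanged(String key, String value) {
            this.key = key;
            this.value = value;
        }
    }

    /** Rebuilds the current state by applying every persisted event in order. */
    static Map<String, String> replay(List<AttributeChanged> journal) {
        Map<String, String> state = new LinkedHashMap<>();
        for (AttributeChanged event : journal) {
            if (event.value == null) {
                state.remove(event.key);
            } else {
                state.put(event.key, event.value);
            }
        }
        return state;
    }

    public static void main(String[] args) {
        List<AttributeChanged> journal = List.of(
                new AttributeChanged("color", "red"),
                new AttributeChanged("color", "blue"),
                new AttributeChanged("size", "L"),
                new AttributeChanged("size", null));
        // The database holds four events; the current state is derived from them.
        System.out.println(replay(journal));
    }
}
```

A component reading the journal directly would have to duplicate this replay logic, which is what the owning service already does.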

This is not going to change or be simplified as I don't see how this can work otherwise.

thjaeckle · 2024-01-17T08:28:52Z

@Altair-Bueno this is no longer in work, or is it?

Altair-Bueno · 2024-01-18T10:49:35Z

Hi Thomas. Indeed, we are no longer working on this. I've recently been reassigned, and unfortunately, this feature development slipped through the cracks. I apologize for any inconvenience this may have caused.

thjaeckle · 2024-01-18T11:13:18Z

@Altair-Bueno ok, no problem - thanks for letting us know 👍

Altair-Bueno and others added 3 commits July 20, 2023 09:12

new(thingssearch): QueryPolicies and QueryPoliciesResponse

7a710b4

Signed-off-by: Altair-Bueno <altair.bueno@uma.es>

Merge branch 'eclipse-ditto:master' into master

ce69efc

fix(thingssearch): Update license header

b27975b

Signed-off-by: Altair-Bueno <altair.bueno@uma.es>


Altair-Bueno added 2 commits July 21, 2023 11:17

new(thingssearch): Add /search/policies route

53c6fc4

Signed-off-by: Altair-Bueno <altair.bueno@uma.es>

new(thingssearch): PoliciesSearchCursor

4530b37

Signed-off-by: Altair-Bueno <altair.bueno@uma.es>

Merge branch 'master' into master

0b01452

Merge branch 'eclipse-ditto:master' into master

7d6685c

thjaeckle mentioned this pull request Aug 8, 2023

Show policy imports in Ditto explorer UI #1700

Closed

Altair-Bueno added 2 commits August 22, 2023 12:47

Merge branch 'eclipse-ditto:master' into master

bac179c

Merge branch 'eclipse-ditto:master' into master

f2c40a6

thjaeckle closed this Jan 18, 2024

