feat: add request.param guc #1710

Draft
wants to merge 1 commit into base: main

Conversation

steve-chavez
Member

As discussed on #915 (comment). This adds the request.param GUC.

This is the draft implementation. It's not efficient because it leads to a lot of new prepared statements (no reuse).
Each query param in ?select=id,name&id=eq.1&name=eq.project&order=id will get its own parameter:

select 
  set_config('request.param.id', $1, true),
  set_config('request.param.name', $2, true),
  set_config('request.param.select', $3, true),
  set_config('request.param.order', $4, true);

Ideas

  • Convert all the query parameters to a single JSON, like mentioned in Change SET LOCAL gucs to set_config #1600 (comment). This would lead to reusing prepared statements (a sketch follows this list).

  • How about having a single GUC for the filters only?

    • It can be like request.param.filters. This would be a JSON object: {"id": "eq.1", "name": "eq.project"}. There wouldn't be a request.param.id or request.param.name.
    • With this we could disable update/delete with no filters globally by using pre-request. It'd be easier to validate that filters are present by having them on a single json.
    create or replace function update_delete_restrictions() returns void as $$
    declare
      req_filters text = nullif(current_setting('request.param.filters', true), '{}');
      req_method  text = current_setting('request.method', true);
    begin
      if req_method similar to 'PATCH|DELETE' and req_filters is null then
        raise exception 'UPDATE or DELETE is not allowed without filters'
          using hint = 'Add filters to the request';
      end if;
    end $$ language plpgsql;

    (this would be more flexible than doing it as a config option, as proposed on Cancel query if user cancels request #699 (comment))
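
A minimal sketch of the single-JSON variant from the first idea above (the GUC name request.params is only illustrative):

-- one bind parameter no matter how many query params the request has,
-- so the prepared statement can be reused across requests
select set_config('request.params', $1, true);

-- with $1 bound to the whole query string as a JSON object, e.g.
-- '{"select": "id,name", "id": "eq.1", "name": "eq.project", "order": "id"}'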

@wolfgangwalther wolfgangwalther changed the base branch from master to main December 31, 2020 14:10
@wolfgangwalther
Member

Hm. I'm not entirely convinced that I would want to have all the request params set as GUCs. Every GUC has a performance penalty when setting it: I remember from the SQL tests we did with the SET LOCAL stuff that the number of GUCs had a linear impact on the query time. There is no way to opt out of this.

It'd be easier to validate that filters are present by having them on a single json.

I agree with that.


Taking a step back for a moment, I wonder whether we can design something that would allow me to opt in to all kinds of request GUCs, but have none as the default - for the best-performing case.

What if we did:

  • remove all set_config except role and search_path.
  • pass those settings in to the hook functions we have (currently only db-pre-request, but there might be more later; see Expose raw body in request #1661 (comment)).
  • make it optional by using named arguments on those hooks, similar to how fixtures work with pytest.

Example:

CREATE FUNCTION my_pre_request(method TEXT, filters JSON) RETURNS void
LANGUAGE plpgsql AS $$
  ... similar to the function in the opening post ...
$$; 

Here, we would parse the function definition together with the schema cache and would then know which arguments we need to pass in.
We could easily add an example pre-request function that uses set_config to restore current behaviour to postgrest-contrib.
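
For instance, a contrib function restoring the current behaviour could fan the passed-in JSON back out into per-header GUCs (a sketch; the function name and the request.header.* convention are assumptions):

CREATE FUNCTION contrib_pre_request(headers JSON) RETURNS void
LANGUAGE plpgsql AS $$
BEGIN
  -- one set_config per header, local to the transaction
  PERFORM set_config('request.header.' || key, value, true)
    FROM json_each_text(headers);
END
$$;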

This would have the advantage that we could choose which variables to pass / set and could leave a big chunk out in most cases - improving performance.

We would still only need one round-trip to the database, so it should not perform worse.


Thinking more about the naming of arguments, we could support something like a json path in those, to be able to select only a sub-key:

CREATE FUNCTION my_pre_request("headers.my_custom_header" TEXT, filters JSON)

This would provide all filters, but only one of the headers.
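
For illustration, the call PostgREST would build for that definition might look like this (hypothetical; deriving the argument list is the part that would come from the schema cache):

-- only the declared arguments get bound from the current request
SELECT my_pre_request(
  "headers.my_custom_header" := $1, -- e.g. 'some-value'
  filters                    := $2  -- e.g. '{"id": "eq.1"}'
);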

@steve-chavez
Member Author

pass those settings in to hook functions we have (currently only db-pre-request)
make it optional by using named arguments on those hooks, similar to how fixtures work with pytest.

Defining the GUCs in pre-request seems interesting.

A drawback is that if you only want to use restricted views or tables (#915 (comment)), you'd have to define the GUCs on pre-request as well, which makes the feature more complicated to use. Ideally one would only touch pre-request for the global cases.

An advantage is that it could allow more GUCs per endpoint, and it would not be a global config like it would be in the config file.

Not opposed to the idea; I need to give it more thought.


Other stuff that could be interesting for this feature:

@steve-chavez
Member Author

steve-chavez commented Mar 27, 2021

New idea for filter restrictions

How about this.

If we (ab)use our in-db config for defining restrictions, we could make them static checks that don't have to hit the database.

Example:

CREATE TABLE clients(
  id int
, name text
);

-- assuming our postgrest-contrib in place
SELECT pgrst.restrict_filter('clients', 'id');

Then a request that doesn't include a filter on id will get rejected as:

GET /clients

HTTP/1.1 403 Forbidden
{"hint":null,"details":null, "code": null, "message":"id filter must be present"}

To make that work, the pgrst.restrict_filter(..) would be translated to:

ALTER ROLE pgrst_authenticator SET pgrst.parameters_restrictions = $$
{
  "clients": "id"
}
$$;

-- or to a config file as
parameters-restrictions = '{"clients": "id"}'
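
A rough sketch of how the contrib function could do that translation, merging into any previously set value (the function body is hypothetical):

create or replace function pgrst.restrict_filter(route text, filter text)
returns void language plpgsql as $$
declare
  -- previously configured restrictions, if any
  restrictions jsonb = coalesce(
    current_setting('pgrst.parameters_restrictions', true), '{}'
  )::jsonb;
begin
  -- merge the new restriction and persist it on the authenticator role
  execute format(
    'alter role pgrst_authenticator set pgrst.parameters_restrictions = %L',
    restrictions || jsonb_build_object(route, filter)
  );
end $$;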

Advantages

  • We don't have to add GUCs to every request.
  • As I mentioned above, we can reject the request at the PostgREST level; it doesn't have to hit the db. It would be similar to how we reject non-existent embeds.
  • We could use this mechanism not only for tables/views but also for set returning functions.
SELECT pgrst.restrict_filter('rpc/get_clients', 'id');

Possibilities

  • Restricting PATCH/DELETE without filters could also be done in a similar manner.
SELECT pgrst.restrict_method('clients', '{PATCH,DELETE}', '{id,name}');
-- only allow patch/delete if id or name filters are present.
  • Though the functions interface is not yet defined, this could also aid in allowing them in this way:
SELECT pgrst.allow_function('clients', 'avg(salary)');
  • Allow/deny operators (someone asked for this before on gitter, only choice was proxy)
SELECT pgrst.allow_operators('clients', '{eq,gt,lt}');
SELECT pgrst.deny_operators('clients', '{fts,wfts,phfts,like}');
  • Enable group by/distinct globally or per route
SELECT pgrst.allow_group_by(global:=true);
SELECT pgrst.allow_group_by('clients');
SELECT pgrst.allow_distinct('clients');
SELECT pgrst.embedding_depth(5, global:=true);

(Just an idea; the form of these functions could use more work)


@wolfgangwalther What do you think? Do you see any drawbacks with this approach?

@wolfgangwalther
Member

If we (ab)use our in-db config for defining restrictions, we could make them static checks that don't have to hit the database.

Advantages

  • We don't have to add GUCs to every request.
  • As I mentioned above, we can reject the request at the PostgREST level; it doesn't have to hit the db. It would be similar to how we reject non-existent embeds.
  • We could use this mechanism not only for tables/views but also for set returning functions.
  • Restricting PATCH/DELETE without filters could also be done in a similar manner.
  • Allow/deny operators (someone asked for this before on gitter, only choice was proxy)
  • Enable group by/distinct globally or per route

@wolfgangwalther What do you think? Do you see any drawbacks with this approach?

Hm. My first reaction was: Why not do that at the proxy-level? This seems out of scope for PostgREST.

But giving it further thought... I actually like it. When discussing all the content negotiation stuff with Accept headers etc., I briefly had the idea of improving our current "router" situation. Right now we basically have a very basic router hardcoded: It just takes the route, maps it to a table or RPC name and then does a little bit of magic with the Accept header.

We could implement this in a much more generic and flexible way. And then we could load, as additional config per route, the kind of checks you mention. I'm not sure about using all kinds of GUCs / config options for those kinds of settings, though. Maybe we should just have a db-router-config setting that takes a table name. This table would have to follow a predefined schema and would hold all of those settings you mentioned above.

@steve-chavez
Member Author

Why not do that at the proxy-level? This seems out of scope for PostgREST.

Yeah, the line that separates proxy and PostgREST has blurred with time. Even the previous GUC request.param proposal or the whole discussion at #915 (comment) would be meaningless if we decided to do this in the proxy (which we can). But that would keep us from evolving and adding features that are valuable.

Also, we already act as a proxy for pg; after all, we reject some requests (invalid HTTP method, invalid JWT, etc.) without touching pg.

Additionally, doing REST restrictions with our functions seems superior to doing it inside RLS or a security-barrier-like condition on views. They're a PostgREST concern after all, so they should be stored in our config.

I briefly had the idea of improving our current "router" situation
Maybe we should just have a db-router-config setting that takes a table name.

Sounds interesting! But I think we should keep the design where the authenticator doesn't need a privilege on a table or usage on a schema. So perhaps we should keep doing it in our config (as a JSON field), or maybe later in a table inside our extension.

@wolfgangwalther
Member

I briefly had the idea of improving our current "router" situation
Maybe we should just have a db-router-config setting that takes a table name.

Sounds interesting! But I think we should keep the design where the authenticator doesn't need a privilege on a table or usage on a schema. So perhaps we should keep doing it in our config (as a JSON field), or maybe later in a table inside our extension.

Hm. The table approach is kind of a hack to achieve the same thing as schema variables would do. For schema variables, we'd need to have privileges, too. Yes, I was thinking about putting it in our extension. This idea was mainly targeted at cloud-based users, who can't use GUC-based configuration. Maybe we can even extend this idea:

How about making db-config take either a boolean or a string? The meaning of the boolean would stay the same. If it's set to a string, this is the schema-qualified name of a table that the authenticator user needs access to. This table can be a simple key/value store of all the config options we currently read from GUCs, basically replacing GUC loading with table loading. This would allow all cloud-based users to still maintain their config in the database.
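
A sketch of what such a table could look like (schema, table and role names are placeholders):

-- simple key/value store mirroring the config file options
CREATE TABLE postgrest.config (
  key   TEXT PRIMARY KEY, -- e.g. 'db-schemas', 'jwt-secret'
  value TEXT NOT NULL
);

-- the authenticator needs read access to load it
GRANT USAGE ON SCHEMA postgrest TO pgrst_authenticator;
GRANT SELECT ON postgrest.config TO pgrst_authenticator;

-- and the bootstrap config would point at it:
-- db-config = 'postgrest.config'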


In any case, I agree with keeping the filter settings you mentioned above in a single JSON map. This would allow the following generalization of our "routing".

We could create some types roughly along the lines of:

-- maps a list of path segments to a route configuration
type Router = M.Map [Text] Route

data Route = Route {
  rtTarget :: Target,
  -- ... other settings
  rtAllowedMethods :: [Method],
  rtAllowedOperators :: [Operator],
  rtAllowAggregation :: Bool,
  rtAllowedFunctions :: [Text],
  rtRequiredFilters :: [Text]
}

(Those types need to be adjusted to properly allow function overloading for RPCs, but the general idea would be the same.)

When parsing the request, we would match it to one of the routes. This implies that we can't respond to any unknown routes anymore. I remember we discussed this elsewhere already, though.

The default values for each route are generated from the schema cache. And then we add the json config option mentioned above (e.g. router-overlay) on top of the whole Router structure. This would allow us to override some of those values selectively. With some helper functions as mentioned above, this could be easily managed. In the future this could also allow much more customization, e.g. moving target tables/views/rpcs to completely different paths, etc.
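
As an illustration, such an overlay could live in the in-database config just like the restrictions above (the router-overlay option's GUC name and JSON shape are made up here):

ALTER ROLE pgrst_authenticator SET pgrst.router_overlay = $$
{
  "clients": {
    "required_filters": ["id"],
    "allowed_operators": ["eq", "gt", "lt"],
    "allow_aggregation": false
  }
}
$$;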

@wolfgangwalther
Member

Maybe wai-routing can be of help here, too. I haven't fully read it yet, but it sounds like those preconditions could be checked with it: https://hackage.haskell.org/package/wai-routing-0.3/docs/Network-Wai-Routing-Tutorial.html

@steve-chavez
Member Author

Every GUC has a performance penalty when setting it: I remember from the SQL tests we did with the SET LOCAL stuff that the number of GUCs had a linear impact on the query time

The above is still true. However, now that we use a single JSON for our GUCs, request.params would have a lesser impact.
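
For reference, reading one param back out of such a single-JSON GUC would look like this (assuming the name ends up being request.params):

-- NULL when the request carried no id filter
SELECT current_setting('request.params', true)::json ->> 'id';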

@steve-chavez
Member Author

The above ideas could also be implemented with a SECURITY LABEL extension, as discussed on #2442 (comment).

@rdlrt

rdlrt commented Oct 27, 2022

The option to access limit/offset - even if not other columns (controlled via config) - would also help us build RPCs that artificially apply custom pagination techniques. For my use case, modifying the RPC into a view currently works, combined with haproxy URL redirection for consistency. But a more consistent option would be nice, since we predominantly rely on an /rpc => /api map, which works for 80% of endpoints - except in cases where RPCs are not ideal for leveraging keyset pagination.
