Restart required for detecting table relations on schema change #475

sscarduzio · 2016-01-28T12:06:34Z

I have an event_group table with fields group_id and event_id referencing events.id and groups.idrespectively.

The weird thing is that even if hydrating the event IDs works perfectly:

GET /event_group?select=events{*}
[
  {
    "events": {
      "doc": {
     ...
       }
  }
]

When I try to do the same with groups , I receive this error:

GET  /event_group?select=groups{*}
 {
  "hint": null,
  "details": null,
  "code": "42703",
  "message": "column groups.group_id does not exist"
}

why does it expect groups.group_id and not groups.id? I didn't need a event.event_id for events!

Here's the SQL:

CREATE TABLE "1".event_group
(
  group_id character varying(32) NOT NULL,
  event_id character varying(36),
  CONSTRAINT fk_eid FOREIGN KEY (event_id) REFERENCES "1".events (id) MATCH SIMPLE ON UPDATE CASCADE ON DELETE NO ACTION,
  CONSTRAINT fk_groups FOREIGN KEY (group_id) REFERENCES "1".groups (id) MATCH SIMPLE ON UPDATE NO ACTION ON DELETE NO ACTION,
  CONSTRAINT uq_gid_eid UNIQUE (event_id, group_id)
)


CREATE TABLE "1".events
(
  doc jsonb,
  id character varying(36) NOT NULL DEFAULT uuid_generate_v4(),
  CONSTRAINT pkey_eid PRIMARY KEY (id)
)

CREATE TABLE "1".groups
(
  id character varying(32) NOT NULL DEFAULT uuid_generate_v4(),
  name character varying(128) NOT NULL,
  description character varying(800),
  CONSTRAINT group_pkey PRIMARY KEY (id)
)

The text was updated successfully, but these errors were encountered:

begriffs · 2016-01-29T00:11:59Z

This was fixed by restarting your postgrest server, correct? If so I'll rename this issue to discuss ways for PostgREST to detect schema changes.

sscarduzio · 2016-01-29T11:17:23Z

Restarting PostgREST has fixed my issue indeed. But honestly, before I consider this resolved, I'd love to see some notes on the documentation that makes this (probably temporary) side effect evident to the users.
Do you want to take care of this or shall I?

begriffs · 2016-01-29T17:08:21Z

I'd love your help with the docs if you could.

#475

begriffs · 2016-01-29T21:15:22Z

Here are the options I can see for handling schema change.

Don't cache knowledge of the schema
- When a request comes in that needs to know the db structure, query for the relevant parts of the schema then and there
- 👍 keeps postgrest stateless
- 👍 fast propagation of schema changes to the API
- 👎 may need to break up the consolidated schema detection code into little pieces for each type of check
- 👎 slow and wasteful when the schema doesn't change much (which is probably the common case)
Refresh schema structure cache on a sysadmin signal
- Typically in unix it is SIGHUP, but we could also listen for a database NOTIFY as well so that a stored proc could cause the schema reload
- 👍 Only makes the sql calls when it has to
- 👍 Probably not a whole lot to change about our code
- 👎 You have to remember to SIGHUP just like you currently have to remember to restart, so still potentially confusing
- 👎 Painstaking to do on a cluster of load-balanced postgrests via a unix signal, but maybe OK if they all listen for a db notify
Poll for changes
- 👍 nobody has to remember to restart or send a signal
- 👍 can be implemented in addition to the refresh signals
- 👎 most APIs probably run for a long time without schema change so polling is wasteful
Refresh on ddl_command_end event trigger
- 👍 happens for all and only the kind of schema-altering events we care about
- 👍 available in postgres 9.3+
- 👍 no manual signaling required, and schema changes propagate quickly to the API
- 👎 requires the db admin to create the trigger which translates ddl_command_end into a pg_notify for which postgrest can listen. Maybe there is a way that postgrest can create such a trigger for the duration of its run?
- 👍 supports manual refresh too by issuing a sql notify statement that mimics the one generated by our ddl event trigger

This last option is really nice. Ideally we can have postgrest set it up without any admin intervention. Any options or pros/cons I'm missing?

eric-brechemier · 2016-01-29T21:50:00Z

If you number the options:

No change, keep current behavior: schema knowledge is loaded when postgrest starts
Don't cache knowledge of the schema
Refresh schema structure cache on a sysadmin signal
Poll for changes
Refresh on ddl_command_end event trigger

The options 2 and 4 seem worse than current behavior, 3 and 5 look OK and could probably both be implemented.

If we go for option 5, just like for authentication, it would make sense to keep this feature out of postgrest and implement it as a separate postgrest-supervisor or something.

begriffs · 2016-01-30T06:12:50Z

How about this, on startup have postgrest run this sql:

CREATE OR REPLACE FUNCTION public.notify_ddl_postgrest()
  RETURNS event_trigger
 LANGUAGE plpgsql
  AS $$
BEGIN
  NOTIFY ddl_command_end;
END;
$$;

DROP EVENT TRIGGER IF EXISTS ddl_postgrest;
CREATE EVENT TRIGGER ddl_postgrest ON ddl_command_end
   EXECUTE PROCEDURE public.notify_ddl_postgrest();

Then any schema change will notify on channel ddl_command_end. Also an admin can call public.notify_ddl_postgrest to manually tell postgrest to reload the schema. If the function is available with the appropriate permissions then you could access it over http at /rpc/notify_ddl_postgrest.

The missing piece is that postgrest needs to be able to listen for changes. @nikita-volkov would it be feasible to add support for listen in hasql? Something similar to this?

nikita-volkov · 2016-01-30T09:04:50Z

@begriffs There actually is a PR from Leon Smith with that: https://github.com/nikita-volkov/hasql/pull/43/files . I just need to adapt it a little better.

sscarduzio · 2016-01-31T19:23:34Z

I think PostgREST produces enough value to be a tad more "invasive" in terms of integration.

I mean, when one wants to use PostgREST, he/she may be fine with the fact this component will require an "installation phase" in which it (automatically) creates triggers, tables and whatever else it needs to provide all these non trivial features.

I imagine the UX would be something along the lines of:

$ postgrest postgres://postgres@127.0.0.1/somedb -a postgres -s "1"
WARNING: PostgREST is not installed in "somedb". Some features might not work as expected.
SOLUTION: restart using the --install option

This way, we can let people opt out the custom triggers and tables if they don't use the advanced features (user management, schema auto-refresh, etc.). But we'd offer a turn-key setup process for the rest of the users (as opposed to "if you want user management, follow this howto..").

begriffs · 2016-01-31T21:33:02Z

@sscarduzio I agree, in fact for this feature we just need to create a proc and a trigger so maybe postgrest should just do it without asking. The trigger shouldn't add much of a burden on anyone's db because it triggers only on DDL changes.

ruslantalpa · 2016-02-01T05:55:07Z

I dont agree :) in order for this to work, the authenticator needs to be granted high privileges, and for what? Compromise security for a small convinience?

begriffs · 2016-02-01T06:51:09Z

Does it take high privileges to create a function and a trigger?

ruslantalpa · 2016-02-01T07:17:20Z

I would consider the ability to create procedures and triggers "high privileges".
Why not simply implement (3) and let some other script do the refresh in whatever way it chooses to.
In production there is no "random" schema change, usually it's done using CI so why not allow the ability to issue a HUP at the end of the deployment script?

eric-brechemier · 2016-02-01T08:43:41Z

In production there is no "random" schema change, usually it's done using CI so why not allow the ability to issue a HUP at the end of the deployment script?

Exactly my thought after I posted my previous comment. For example, I am using git to deploy new versions on a server, and the repository includes also the nginx configuration of the virtual host and the sqitch folder with SQL scripts.

I currently run nginx -s reload after the git push to reload nginx configuration, I would simply add service reload postgrest or something similar to reload postgrest configuration after deploying database changes with sqitch deploy.

sscarduzio · 2016-02-01T14:14:19Z

Well, plot twist: there's not only "production" that counts.
I'm creating tables as I progress with the user stories: postgres - the stateful layer of my architecture -keeps the pace with me (no need to restart), why should my development cycle include restarting a supposedly "stateless" layer? It's really awkward, looks like we're back in Tomcat times.

eric-brechemier · 2016-02-01T14:43:43Z

Well, plot twist: there's not only "production" that counts.

Also agreed, but development uses extra tools, not installed on the server.

I am still suggesting to have a separate, officially sanctioned executable to refresh postgrest config at regular intervals or using elevated privileges to detect schema changes during development.

sscarduzio · 2016-02-01T14:55:18Z

so you suggest I have a while true SIGHUP sleep 1 in my dev env? Is it what you'd propose for tomcat as well?

ruslantalpa · 2016-02-01T15:13:24Z

In your dev env you create those triggers then listen to notify events with a 10 line script and restart postgrest

eric-brechemier · 2016-02-01T15:13:50Z

My memories of Tomcat days are slowly fading away, but afaik using a separate process to monitor (e.g. file) changes to reload the server is still pretty much current state of the art:

node-supervisor for NodeJS
https://github.com/petruisfan/node-supervisor

or more generally

Rerun
https://github.com/alexch/rerun

sscarduzio · 2016-02-01T16:02:24Z

I still don't understand why it's ok for developers if the developer experience is crap. We have the solution in hand to make this a pleasant thing to use, why pulling back?

If you don't want postgrest to retain CREATE permissions, just don't give them. Postgrest will print a warning (as I showed) that schema won't be refreshed.

davidkuep · 2016-02-01T20:21:51Z

New to this - so not sure if this is a silly option- but why not make the notify/advanced features a separate extension for postgres itself?

It could be installed separately in its own schema-

the authenticator user wouldn't need any special permissions as there could still just be listen/notify sql code that resides in the postgrest-extension schema
Said code could then notify with the name of schemas that have been refreshed and the postgrest instance would simply have to know what schema it was running and only update if its name was called
If the postgrest extension isn't installed everything would still work, you just won't get the advanced features and dropping the postgrest extension would drop the advanced features etc. ]

Anyway, there are probably cons here that I'm not considering, but just wondering why it's not a part of the discussion.

begriffs · 2016-04-03T18:38:42Z

OK how about this: postgrest listens for both SIGHUP and a specially named SQL event and reloads the schema in either situation. In production you would send the server a signal, and in development you create an event trigger on ddl_command_end and have it notify postgrest through the sql event.

In development there would be a snippet of sql that the developer would have to run once in their database which would enable postgrest auto schema change detection.

begriffs · 2016-08-31T02:23:25Z

Look what's now available! https://github.com/cocreature/hasql-notifications

matthewmueller · 2016-11-12T07:32:51Z

Has this been resolved? Seems like if your DB changes, you'll end up with some failed API requests until postgrest has been restarted. Are there any plans to implement the auto-refresh via notifications?

begriffs · 2016-11-12T07:34:33Z

It hasn't been resolved in full generality, but postgrest does listen for SIGHUP nowadays which will cause it to reload the schema. It's less intrusive than having to restart the server.

matthewmueller · 2016-11-12T07:37:25Z

Ah gotcha, so the idea would be that you SSH into your container / dyno and trigger a SIGHUP on that process?

begriffs · 2016-11-12T07:38:52Z

For now yeah. Having postgrest LISTEN for a specially named event seems feasible for v0.4.1. Once that is implemented we can add a section in the docs about hooking ddl_command_end up to NOTIFY that event for immediate postgrest schema reloads on DDL changes.

matthewmueller · 2016-11-12T07:41:06Z

Got it, thanks for your help!

begriffs · 2017-07-13T05:07:53Z

I followed up on the hasql-notifications library which seems to have run into a race condition bug last year. We'll see if that gets resolved, and then it'll be pretty easy to finally get this issue closed.

jmealo · 2018-05-31T14:11:47Z

@begriffs: JarvusInnovations/lapidus#19

It seems as though it'd be trivial to create a tiny sidecar application that sends SIGHUP to PostgREST if the Haskell library isn't ready yet and just suggest its use in the documentation.

Go:
http://coussej.github.io/2015/09/15/Listening-to-generic-JSON-notifications-from-PostgreSQL-in-Go/

Rust:
sfackler/rust-postgres@bb837bd

Elixr/Erlang:
https://github.com/elixir-ecto/postgrex/blob/v0.13.3/lib/postgrex/notifications.ex#L78

I think it may be possible to do this with a shell script assuming the user is providing credentials using something libpq uses. I'm not seeing any examples though.

swuecho · 2019-06-15T18:30:16Z

what is about the part " a specially named SQL event and reloads the schema"? I saw #570 implement the SIGNUP part.

for my use case, postgrest run in docker. I run test in another docker. send a signal from one container to another is possible I guess, but not interesting.

sscarduzio added a commit to sscarduzio/postgrest that referenced this issue Jan 29, 2016

add some notes about PostgREST#475

acfa467

sscarduzio added a commit to sscarduzio/postgrest that referenced this issue Jan 29, 2016

PostgREST#475

b769633

sscarduzio added a commit to sscarduzio/postgrest that referenced this issue Jan 29, 2016

PostgREST#475

4e3a04e

begriffs added a commit that referenced this issue Jan 29, 2016

Merge pull request #477 from sscarduzio/patch-3

03468c8

#475

begriffs changed the title ~~Cannot hydrate external id column by foreign key only on one column~~ Restart required for detecting table relations on schema change Jan 29, 2016

begriffs mentioned this issue Jan 31, 2016

Implement Notification handling nikita-volkov/hasql#43

Open

begriffs mentioned this issue Jan 31, 2016

Detect db schema changes #479

Closed

begriffs added the enhancement a feature, ready for implementation label Apr 3, 2016

begriffs mentioned this issue Apr 23, 2016

Reload schema definition on SIGHUP #570

Merged

begriffs closed this as completed in #570 Apr 26, 2016

jmealo mentioned this issue May 31, 2018

Support schema change notifications and introspection reloading from PostgreSQL JarvusInnovations/lapidus#19

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restart required for detecting table relations on schema change #475

Restart required for detecting table relations on schema change #475

sscarduzio commented Jan 28, 2016

begriffs commented Jan 29, 2016

sscarduzio commented Jan 29, 2016

begriffs commented Jan 29, 2016

begriffs commented Jan 29, 2016

eric-brechemier commented Jan 29, 2016

begriffs commented Jan 30, 2016

nikita-volkov commented Jan 30, 2016

sscarduzio commented Jan 31, 2016

begriffs commented Jan 31, 2016

ruslantalpa commented Feb 1, 2016

begriffs commented Feb 1, 2016

ruslantalpa commented Feb 1, 2016

eric-brechemier commented Feb 1, 2016

sscarduzio commented Feb 1, 2016

eric-brechemier commented Feb 1, 2016

sscarduzio commented Feb 1, 2016

ruslantalpa commented Feb 1, 2016

eric-brechemier commented Feb 1, 2016

sscarduzio commented Feb 1, 2016

davidkuep commented Feb 1, 2016

begriffs commented Apr 3, 2016

begriffs commented Aug 31, 2016

matthewmueller commented Nov 12, 2016

begriffs commented Nov 12, 2016 •

edited

matthewmueller commented Nov 12, 2016

begriffs commented Nov 12, 2016

matthewmueller commented Nov 12, 2016

begriffs commented Jul 13, 2017

jmealo commented May 31, 2018 •

edited

swuecho commented Jun 15, 2019 •

edited

Restart required for detecting table relations on schema change #475

Restart required for detecting table relations on schema change #475

Comments

sscarduzio commented Jan 28, 2016

begriffs commented Jan 29, 2016

sscarduzio commented Jan 29, 2016

begriffs commented Jan 29, 2016

begriffs commented Jan 29, 2016

eric-brechemier commented Jan 29, 2016

begriffs commented Jan 30, 2016

nikita-volkov commented Jan 30, 2016

sscarduzio commented Jan 31, 2016

begriffs commented Jan 31, 2016

ruslantalpa commented Feb 1, 2016

begriffs commented Feb 1, 2016

ruslantalpa commented Feb 1, 2016

eric-brechemier commented Feb 1, 2016

sscarduzio commented Feb 1, 2016

eric-brechemier commented Feb 1, 2016

sscarduzio commented Feb 1, 2016

ruslantalpa commented Feb 1, 2016

eric-brechemier commented Feb 1, 2016

sscarduzio commented Feb 1, 2016

davidkuep commented Feb 1, 2016

begriffs commented Apr 3, 2016

begriffs commented Aug 31, 2016

matthewmueller commented Nov 12, 2016

begriffs commented Nov 12, 2016 • edited

matthewmueller commented Nov 12, 2016

begriffs commented Nov 12, 2016

matthewmueller commented Nov 12, 2016

begriffs commented Jul 13, 2017

jmealo commented May 31, 2018 • edited

swuecho commented Jun 15, 2019 • edited

begriffs commented Nov 12, 2016 •

edited

jmealo commented May 31, 2018 •

edited

swuecho commented Jun 15, 2019 •

edited