Upgraded DB schema doesn't match clean new schema (affecting full-text search) #3028

Jakdaw · 2018-10-30T11:00:07Z

This is a diff of the PostgreSQL schema of an instance that was upgraded from v3 to v5.0.2 against a clean, empty schema created with v5.0.2 using manage.py database create_tables:

--- schema1     2018-10-29 22:38:34.752535928 +0000
+++ schema2     2018-10-29 22:38:28.479611730 +0000
@@ -37,7 +37,7 @@
     LANGUAGE plpgsql
     AS $$
             BEGIN
-                NEW.search_vector = ((to_tsvector('pg_catalog.simple', regexp_replace(coalesce(CAST(NEW.id AS TEXT), ''), '[-@.]', ' ', 'g')) || to_tsvector('pg_catalog.simple', regexp_replace(coalesce(NEW.name, ''), '[-@.]', ' ', 'g'))) || to_tsvector('pg_catalog.simple', regexp_replace(coalesce(NEW.description, ''), '[-@.]', ' ', 'g'))) || to_tsvector('pg_catalog.simple', regexp_replace(coalesce(NEW.query, ''), '[-@.]', ' ', 'g'));
+                NEW.search_vector = ((setweight(to_tsvector('pg_catalog.simple', regexp_replace(coalesce(CAST(NEW.id AS TEXT), ''), '[-@.]', ' ', 'g')), 'B') || setweight(to_tsvector('pg_catalog.simple', regexp_replace(coalesce(NEW.name, ''), '[-@.]', ' ', 'g')), 'A')) || setweight(to_tsvector('pg_catalog.simple', regexp_replace(coalesce(NEW.description, ''), '[-@.]', ' ', 'g')), 'C')) || setweight(to_tsvector('pg_catalog.simple', regexp_replace(coalesce(NEW.query, ''), '[-@.]', ' ', 'g')), 'D');
                 RETURN NEW;
             END
             $$;
@@ -418,10 +418,10 @@
     updated_at timestamp with time zone NOT NULL,
     created_at timestamp with time zone NOT NULL,
     id integer NOT NULL,
+    org_id integer NOT NULL,
     object_type character varying(255) NOT NULL,
     object_id integer NOT NULL,
-    user_id integer NOT NULL,
-    org_id integer NOT NULL
+    user_id integer NOT NULL
 );
 
 
@@ -700,11 +700,11 @@
     id integer NOT NULL,
     org_id integer NOT NULL,
     name character varying(320) NOT NULL,
-    email character varying(320) NOT NULL,
+    email character varying(255) NOT NULL,
+    profile_image_url character varying(320),
     password_hash character varying(128),
     groups integer[],
     api_key character varying(40) NOT NULL,
-    profile_image_url character varying,
     disabled_at timestamp with time zone
 );

Most importantly, upgraded schemas lack the weighting of search terms in the Postgres full-text index. There are also a few minor differences in column definitions: the maximum lengths of email and profile_image_url don't match, and org_id sits at a different column position in one table.
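To make the difference between the two trigger bodies easier to see, here is a minimal Python sketch that rebuilds both forms of the search_vector expression from the diff above. The names WEIGHTED_COLUMNS, column_vector and search_vector_expression are hypothetical helpers written for this illustration; they are not part of Redash or of any library. The column-to-weight mapping is copied from the "+" line of the diff.

```python
# Column -> weight mapping as it appears on the "+" side of the diff.
# The third field marks columns that the diff wraps in CAST(... AS TEXT).
# (Hypothetical helper data, written only for this illustration.)
WEIGHTED_COLUMNS = [
    ("id", "B", True),
    ("name", "A", False),
    ("description", "C", False),
    ("query", "D", False),
]


def column_vector(column, weight=None, cast_to_text=False):
    """Return the tsvector expression for one column, optionally weighted."""
    ref = f"NEW.{column}"
    if cast_to_text:
        ref = f"CAST({ref} AS TEXT)"
    vector = (
        "to_tsvector('pg_catalog.simple', "
        f"regexp_replace(coalesce({ref}, ''), '[-@.]', ' ', 'g'))"
    )
    if weight is None:
        # Unweighted form, as in the upgraded schema ("-" line).
        return vector
    # Weighted form, as in the clean schema ("+" line).
    return f"setweight({vector}, '{weight}')"


def search_vector_expression(columns, weighted=True):
    """Concatenate per-column vectors left to right, as in the diff."""
    parts = [
        column_vector(name, weight if weighted else None, cast)
        for name, weight, cast in columns
    ]
    expression = parts[0]
    for index, part in enumerate(parts[1:], start=1):
        expression = f"{expression} || {part}"
        # Every intermediate concatenation is parenthesised; the last is not.
        if index < len(parts) - 1:
            expression = f"({expression})"
    return expression


if __name__ == "__main__":
    print("upgraded:", search_vector_expression(WEIGHTED_COLUMNS, weighted=False))
    print("clean:   ", search_vector_expression(WEIGHTED_COLUMNS, weighted=True))
```

Comparing the "clean" output with the NEW.search_vector assignment in the trigger function of a live database is one way to check whether an upgraded instance is missing the weights.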


arikfr · 2018-10-31T16:18:12Z

@Jakdaw did you try changing the weights to see whether it returns better search results?

Jakdaw · 2018-10-31T18:07:19Z

I haven't yet deployed a Redash build that includes the changes you mentioned in #2798, but I can confirm that manual queries against Postgres using the full-text index work much better with the weights in place!

jezdez · 2018-11-01T17:11:47Z

Oh, that is odd. It seems like a missing step in the data migration for sure.

arikfr added this to the Next milestone Mar 10, 2019