Implementing custom column types #55

Fingel · 2021-02-02T18:56:22Z

Hello,

I just recently stumbled upon Piccolo and I'm very impressed so far with how much thought and effort has been put into it. Modern Python desperately needs an ORM that can keep up with the next generation of web development libraries. Piccolo looks to be well on its way to filling that need.

I have a project in mind that I could use Piccolo with, but it does have a use case that requires the use of a PostGIS geography type column. I've been looking through https://github.com/piccolo-orm/piccolo/blob/master/piccolo/columns/base.py and it doesn't look too daunting to subclass Column and go from there. If I can get something useable, even with using raw for selects, it would be pretty great.

Since there doesn't seem to be any documentation on implementing custom columns, I figured I'd post an issue, even if there isn't really a problem yet, just in case anyone has any advice or thoughts.

dantownsend · 2021-02-02T23:39:35Z

@Fingel Thanks for the kind words.

You're right - it should be possible to create custom column types without too much hassle by subclassing Column. I should mention this in the docs.

I don't know much about the Geography column type, but you'll probably need something like this:

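import typing as t

from piccolo.columns import Column


class Geography(Column):
    # Python type used for values of this column.
    value_type = str

    def __init__(
        self,
        shape: str = "POINT",
        number: t.Optional[int] = None,
        default: t.Union[str, t.Callable[[], str], None] = "",
        **kwargs,
    ) -> None:
        self._validate_default(default, (str, None))

        self.shape = shape
        self.number = number
        self.default = default
        # Pass the constructor arguments on so they are recorded with the column.
        kwargs.update({"shape": shape, "number": number, "default": default})
        super().__init__(**kwargs)

    @property
    def column_type(self):
        # SQL type used in DDL: GEOGRAPHY(shape) or GEOGRAPHY(shape, number).
        if self.number:
            return f"GEOGRAPHY({self.shape}, {self.number})"
        else:
            return f"GEOGRAPHY({self.shape})"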

I'm interested to know how you get along. If you're able to implement PostGIS functionality, I'd like to merge it into the main piccolo repo, or have a separate piccolo_gis repo.

Fingel · 2021-02-03T00:04:36Z

Hi @dantownsend ,

I did start out writing something that looks like your example. I wanted to make it easy on myself, so I defined a Point(Column) which looked a lot like your example, with plans to generalize it into a full Geography type later.

The first issue I ran into when trying to create a migration was this:

module 'piccolo.columns.column_types' has no attribute 'Point'

piccolo/piccolo/apps/migrations/auto/migration_manager.py

Line 192 in fcf58ac

column_class = getattr(column_types, column_class_name)

It looks like the migration manager assumes all classes are defined within piccolo.columns.column_types. I could fork the project and work in there, with the assumption that maybe support for external columns could be added later?

The next hurdle will be querying. PostGIS uses special functions like so:

SELECT * FROM source WHERE ST_Dwithin(source.location, 'SRID=4035;POINT(1 1)', 100)

for example.

Here is where it looks a little more difficult. I'm still going through the code, but I'm not quite sure yet how a custom method like this could be implemented, so that you could do something like this:

MyTable.objects().where(MyTable.location.ST_Dwithin('SRID=4035;POINT(1 1)', 100)).run_sync()

Any ideas on how to approach that?

Alternatively, Piccolo could support raw SQL inside a where clause, which would also go a long way towards helping other niche use cases. You could then do something like this:

MyTable.objects().where(
    (MyTable.name=='foo') &
    (raw("ST_Dwithin(location, 'SRID=4035;POINT(1 1)', 100)"))
).run_sync()

But I have no idea of the feasibility of that.
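As a rough illustration of the two ideas above (a raw SQL fragment that can be combined with `&`, and a helper that wraps `ST_DWithin`), here is a minimal, library-independent sketch. The names `Clause`, `Raw`, `And` and `st_dwithin` are hypothetical and are not Piccolo's API; the sketch only shows how such clauses could render to SQL with the values kept separate from the query text.

```python
from dataclasses import dataclass
from typing import Any, List, Tuple


class Clause:
    """Base class for where-clause nodes; `&` joins two nodes with AND."""

    def __and__(self, other: "Clause") -> "And":
        return And(self, other)

    def render(self) -> Tuple[str, List[Any]]:
        raise NotImplementedError


@dataclass
class Raw(Clause):
    """A raw SQL fragment (hypothetical); `{}` marks where a value goes."""

    template: str
    values: Tuple[Any, ...] = ()

    def render(self) -> Tuple[str, List[Any]]:
        # Swap each `{}` for a `%s` placeholder so values are passed to the
        # database driver separately instead of being pasted into the SQL.
        return self.template.replace("{}", "%s"), list(self.values)


@dataclass
class And(Clause):
    """Two clauses joined with AND."""

    left: Clause
    right: Clause

    def render(self) -> Tuple[str, List[Any]]:
        left_sql, left_values = self.left.render()
        right_sql, right_values = self.right.render()
        return f"({left_sql}) AND ({right_sql})", left_values + right_values


def st_dwithin(column: str, geometry: str, distance: float) -> Raw:
    """Hypothetical helper that wraps PostGIS ST_DWithin as a raw clause."""
    return Raw(f'ST_DWithin("{column}", {{}}, {{}})', (geometry, distance))


clause = Raw("name = {}", ("foo",)) & st_dwithin(
    "location", "SRID=4035;POINT(1 1)", 100
)
sql, values = clause.render()
print(sql)
print(values)
```

Keeping the values out of the SQL string is a deliberate choice here: the geometry literal and distance usually come from user input, so they are better handed to the driver as parameters.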

dantownsend · 2021-02-03T00:38:12Z

@Fingel I think those issues are all solvable, but will require some changes to Piccolo.

1. Custom column types could be registered in piccolo_conf.py, which solves the migration issue.
2. In terms of adding custom methods like ST_Dwithin, the best analogy so far is the JSONB column type, which has an arrow method. Something similar to this could be implemented.
3. With raw SQL in where clauses - this also shouldn't be too tricky, and I can add this.

I should be able to get 1 and 3 done in the next day or so. Otherwise you might want to fork it for now.
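To make the registration idea concrete, here is a small self-contained sketch of a name-based lookup that checks the built-in column classes first and then falls back to user-registered ones. Everything in it (`builtin_columns`, `register_column`, `resolve_column_class`, the stand-in `Column` classes) is hypothetical and only illustrates the pattern; it is not how Piccolo resolves column classes.

```python
import types
from typing import Dict, Type


class Column:
    """Stand-in base class for this sketch (not piccolo.columns.Column)."""


class Varchar(Column):
    """Stand-in for a built-in column type."""


# Stand-in for the module that holds the built-in column classes.
builtin_columns = types.SimpleNamespace(Varchar=Varchar)

# Custom column classes, keyed by class name.
_CUSTOM_COLUMNS: Dict[str, Type[Column]] = {}


def register_column(cls: Type[Column]) -> Type[Column]:
    """Class decorator that makes a custom column discoverable by name."""
    _CUSTOM_COLUMNS[cls.__name__] = cls
    return cls


def resolve_column_class(name: str) -> Type[Column]:
    """Look up a column class by name: built-ins first, then the registry."""
    builtin = getattr(builtin_columns, name, None)
    if builtin is not None:
        return builtin
    try:
        return _CUSTOM_COLUMNS[name]
    except KeyError:
        raise LookupError(f"Unknown column class: {name!r}") from None


@register_column
class Point(Column):
    """A custom column defined outside the built-in module."""


print(resolve_column_class("Varchar").__name__)
print(resolve_column_class("Point").__name__)
```

With a plain `getattr` on the built-in module alone, the second lookup is the one that would fail with the "has no attribute 'Point'" error reported above.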

Fingel · 2021-02-03T23:49:47Z

Hi @dantownsend
I was able to implement geometry/geography columns in my fork here:
https://github.com/piccolo-orm/piccolo/compare/master...Fingel:feature/postgis_funcs?expand=1

Some questions:

1. Geography fields in particular really need to use gist indexes instead of btree. I noticed that gist is referenced here:

piccolo/piccolo/query/methods/create_index.py

Line 16 in c0d34ea

gist = "gist"

but I do not believe anything other than btree is implemented. What do you think about adding the ability for columns to specify which kind of index they should use?

2. Database extensions. In order for these columns to work, the PostGIS extension needs to be installed. If you want to test it out, I'd recommend spinning up an instance of the PostGIS Docker image. It's kept up to date with PostgreSQL, but comes with the extension installed. People not using this image will have to install PostGIS manually. One nice thing (which GeoDjango does) is to automatically run "CREATE EXTENSION IF NOT EXISTS postgis;" so that any new database created will have the extension installed automatically. This is really helpful for test databases, for example. How about a way to specify custom SQL to be run at certain hooks, or something like that?

Looking forward to continuing on this project!

dantownsend · 2021-02-04T00:26:35Z

@Fingel You've made some great progress.

With the index types, it would be easy enough to expose this in the Column constructor as an index_type arg. What complicates matters is making it work with migrations, if someone was to change the index_type. It's not impossible to do, but anything which touches migrations usually takes a bit longer to implement.
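As a sketch of what an index_type option would ultimately have to produce, the snippet below builds the PostgreSQL DDL for an index with a chosen method, plus the statements a migration might emit if the method changed. The `IndexMethod` enum, the function names, and the `<table>_<column>` index naming are assumptions made for illustration, not Piccolo's implementation.

```python
from enum import Enum
from typing import List


class IndexMethod(str, Enum):
    """Index methods used in this sketch (hypothetical enum)."""

    btree = "btree"
    hash = "hash"
    gist = "gist"
    gin = "gin"


def index_name(table: str, column: str) -> str:
    # Assumed naming scheme for this sketch: <table>_<column>.
    return f"{table}_{column}"


def create_index_sql(
    table: str, column: str, method: IndexMethod = IndexMethod.btree
) -> str:
    """Build a CREATE INDEX statement that uses the given index method."""
    return (
        f'CREATE INDEX "{index_name(table, column)}" ON "{table}" '
        f'USING {method.value} ("{column}")'
    )


def change_index_method_sql(
    table: str, column: str, new_method: IndexMethod
) -> List[str]:
    """Statements a migration could run when the index method changes."""
    return [
        f'DROP INDEX IF EXISTS "{index_name(table, column)}"',
        create_index_sql(table, column, new_method),
    ]


print(create_index_sql("source", "location", IndexMethod.gist))
```

The second function hints at why migrations are the harder part: changing the method means dropping and recreating the index, so the migration system has to track the previous value as well as the new one.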

The PostgresEngine has a prep_database method, which currently just sets up the uuid extension.

piccolo/piccolo/engine/postgres.py

Line 270 in c0d34ea

def prep_database(self):

Extending this to also set up other extensions would be pretty straightforward. PostgresEngine would just need to accept a list of extension names.
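A minimal sketch of that idea, independent of Piccolo: given a list of extension names, build the statements a database preparation step could run. The function name `extension_statements` is hypothetical, and how the statements get executed is left out.

```python
from typing import Iterable, List


def extension_statements(extensions: Iterable[str]) -> List[str]:
    """Build one CREATE EXTENSION statement per extension name."""
    statements = []
    for name in extensions:
        # Quote the name as an identifier (needed for names such as
        # uuid-ossp) and double any embedded quotes.
        quoted = '"' + name.replace('"', '""') + '"'
        statements.append(f"CREATE EXTENSION IF NOT EXISTS {quoted}")
    return statements


for statement in extension_statements(["uuid-ossp", "postgis"]):
    print(statement)
```

Using IF NOT EXISTS keeps the step idempotent, so it can run every time the engine starts, including against freshly created test databases.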

I've added the ability to run raw SQL in where clauses: #57

If this is what you were expecting, I'll release it tomorrow. Unfortunately Travis CI is painfully slow now, and I really need to switch to Github Actions.

Fingel · 2021-02-09T19:15:38Z

Hi,

I think the raw where PR is great, I'm glad it was merged.

I still think a PostGIS extension for Piccolo would be great. For example, it would allow us to make working with GIS fields a little more user friendly by being able to leverage tools like Shapely to convert the textual representation of geometries into useful Python objects. This would require installing additional dependencies you probably wouldn't want in Piccolo core.
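To illustrate the kind of conversion meant here without pulling in Shapely, the sketch below parses a 2D point in (E)WKT text form, such as 'SRID=4035;POINT(1 1)', into a small Python object. `PointValue` and `parse_point` are hypothetical stand-ins for what a library like Shapely would provide, and the sketch assumes the value is fetched from the database as text.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class PointValue:
    """Minimal stand-in for a geometry object (hypothetical)."""

    x: float
    y: float
    srid: Optional[int] = None


# Matches an optional "SRID=<n>;" prefix followed by "POINT(x y)".
_EWKT_POINT = re.compile(
    r"^\s*(?:SRID=(?P<srid>\d+);)?\s*"
    r"POINT\s*\(\s*(?P<x>\S+)\s+(?P<y>\S+)\s*\)\s*$",
    re.IGNORECASE,
)


def parse_point(text: str) -> PointValue:
    """Parse a 2D point written as WKT or EWKT text."""
    match = _EWKT_POINT.match(text)
    if match is None:
        raise ValueError(f"Not a 2D (E)WKT point: {text!r}")
    srid = match.group("srid")
    return PointValue(
        x=float(match.group("x")),
        y=float(match.group("y")),
        srid=int(srid) if srid else None,
    )


print(parse_point("SRID=4035;POINT(1 1)"))
```

A real extension would cover the other geometry types and binary formats too, which is where a dedicated dependency starts to pay for itself.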

I'm not really sure how to proceed at this point. I could keep working in my branch, but it would be good to know whether creating an extension would be possible, or whether I should keep things super simple so that the fields can be included in core. Thoughts?

dantownsend · 2021-02-09T20:55:40Z

@Fingel I've made a couple of updates, which should make using custom column types easier.

1. You can now specify an extensions argument in PostgresEngine, and it will try and create that extension when the engine starts. This is currently in master.
2. I've just created a pull request, which should fix the issue of custom column types in migrations: allow custom column types in migrations #69

There are still some limitations though. PostGIS may support custom DDL statements, which the Piccolo migrations don't support. This could mean some more refactoring of Piccolo.

Merging the PostGIS stuff into core is still an option - shapely could be an optional dependency.

Can you think of any other blockers for integrating GIS with Piccolo, either as part of the main library, or a separate package?

Fingel · 2021-02-10T00:25:33Z

@dantownsend This is good stuff! I will try factoring the GIS column types out of my fork and test it with #69. If it goes well and I can develop the columns as a separate package, that would be great. Why don't we see how that goes and if the package looks good enough we can consider merging it back into core?

dantownsend · 2021-02-10T08:17:39Z

@Fingel Cool - makes sense.

dantownsend · 2021-09-16T14:51:46Z

Going to close this for now - custom column types should now be possible.

dantownsend mentioned this issue Feb 4, 2021

Added WhereRaw #57

Merged

dantownsend mentioned this issue Feb 9, 2021

allow custom column types in migrations #69

Merged

dantownsend closed this as completed Sep 16, 2021
