Problem: lack of distinct options to process data #215

yrashk · 2023-06-30T13:03:38Z

Your choices are either SQL with all its expressive power, potential parallelization, optimizations, etc. or a few imperative or functional languages, starting from PL/pgSQL.

Other approaches are not quite available.

Solution: provide a proof-of-concept integration of Prolog

It uses SWI-Prolog as the implementation of choice. It's not the fastest one, or ISO-compliant. But it does integrate relatively well and packs a lot of features. It's actively developed and has a good community. Being a relative newcomer to Prolog, I can't tell if this is a perfect decision long-term, but it does make sense now.

Why Prolog?

As I was working on Postgres package management, I ended up choosing Prolog as the language for describing constraints and rules for building packages in varieties of configurations and settings. And since I was integrating it with C codebase, I thought: "hey, maybe this is a good candidate for a language in Postgres!". It certainly allows to process data from the database in a different, unique, and valuable way.

This current implementation is incomplete but shows the general idea and even implements a sandboxed (trusted) version of the language using library(sandbox).

ghost · 2023-06-30T13:04:52Z

👇 Click on the image for a new way to code review

Legend

Your choices are either SQL with all its expressive power, potential parallelization, optimizations, etc. or a few imperative or functional languages, starting from PL/pgSQL. Other approaches are not quite available. Solution: provide a proof-of-concept integration of Prolog It uses SWI-Prolog as the implementation of choice. It's not the fastest one, or ISO-compliant. But it does integrate relatively well and packs a lot of features. It's actively developed and has a good community. Being a relative newcomer to Prolog, I can't tell if this is a perfect decision long-term, but it does make sense now. Why Prolog? As I was working on Postgres package management, I ended up choosing Prolog as the language for describing constraints and rules for building packages in varieties of configurations and settings. And since I was integrating it with C codebase, I thought: "hey, maybe this is a good candidate for a language in Postgres!". It certainly allows to process data from the database in a different, unique, and valuable way. This current implementation is incomplete but shows the general idea and even implements a sandboxed (trusted) version of the language using `library(sandbox)`.

Solution: ensure we use older API on Postgres 13

This is limiting their usefulness. Solution: implement basic support for these Note that it currently only supports "materialize" method mostly for simplicity's sake. It's possible to do the function call convention by saving the engine in the context. Can be done at a later point.

It's not describing and everything and describes something it doesn't do (printing). Solution: adjust the commentary to reflect the reality

Mostly because we don't check the results of some functions. Solution: make them all required to pass successfully I wrote a `PL_require(cond)` macro for this.

Not sure if they are reported correctly if set is not expected. Solution: write a test

It does something "funny" when it encounters such types: it creates an `unsupported` atom. This is not a good behaviour. Solution: raise an exception instead

Solution: ensure PL_require is called by its correct name It was originally called PL_iff but was then renamed using IDE and it didn't rename the #ifndef-outed name.

This is because initialization is delayed until first invocation. Solution: initialize with the extension in _PG_init

This is less than ideal as the output of it is less predictable. Solution: just throw an error

This will make it harder to discover it. Solution: list it

Solution: fix the target of the comment

Solution: switch to the development branch They are generally quite stable and pack a lot of new features that can be useful.

Solution: update to a newer one

In particular, it uses `success` field that has been removed. Solution: use `error` alone

Solution: switch it to migrations

When building in isolation, it fails to load omni_prolog_stub: ``` open_shared_object/3: dlopen(omni_prolog_stub.so, 0x0001): tried: 'omni_prolog_stub.so' (no such file) ``` Solution: ensure the target that requires it depends on it

Solution: upgrade to 9.1.21

The error reporting suggests `$omni_load_code/2` could not be found. Solution: ensure we're not deallocating the path to the library when handing it off to SWI Prolog's initializer.

Solution: ensure we allocate the results in the correct context Results were previously allocated in (primarily) "SPI Proc" context which was deleted before the return from the language handler.

Solution: start writing tests Currently, dynamic predicates are shared. Static aren't.

yrashk marked this pull request as draft June 30, 2023 13:03

yrashk marked this pull request as ready for review July 2, 2023 18:40

yrashk force-pushed the prolog branch 3 times, most recently from 64eb362 to 795c1a2 Compare July 5, 2023 23:49

yrashk force-pushed the prolog branch from 795c1a2 to a8c6d8f Compare July 16, 2023 17:30

yrashk force-pushed the prolog branch from eeb9db1 to 859354d Compare August 7, 2023 20:11

yrashk force-pushed the prolog branch from 859354d to 962151e Compare September 5, 2023 20:18

yrashk force-pushed the prolog branch from 962151e to 018d76f Compare September 19, 2023 18:15

yrashk force-pushed the prolog branch from 018d76f to 01306cc Compare October 21, 2023 17:16

yrashk added 19 commits December 17, 2023 07:01

Problem: omni_prolog's query/3 does not compile for Postgres 13

a6e0046

Solution: ensure we use older API on Postgres 13

Problem: commentary on prolog function introspection

0b16080

It's not describing and everything and describes something it doesn't do (printing). Solution: adjust the commentary to reflect the reality

Problem: a bunch of warnings in omni_prolog.c

03ede29

Mostly because we don't check the results of some functions. Solution: make them all required to pass successfully I wrote a `PL_require(cond)` macro for this.

Problem: handling multiple results in Prolog functions

3311945

Not sure if they are reported correctly if set is not expected. Solution: write a test

Problem: plprolog doesn't support all common Postgres types yet

8be1ae4

It does something "funny" when it encounters such types: it creates an `unsupported` atom. This is not a good behaviour. Solution: raise an exception instead

Problem: omni_prolog doesn't build on Postgres 13 again

6d84bd7

Solution: ensure PL_require is called by its correct name It was originally called PL_iff but was then renamed using IDE and it didn't rename the #ifndef-outed name.

Problem: first Prolog function invocation takes longer time

5a4dbac

This is because initialization is delayed until first invocation. Solution: initialize with the extension in _PG_init

Problem: using syntax error for testing Prolog error handling

f996d03

This is less than ideal as the output of it is less predictable. Solution: just throw an error

Problem: omni_prolog is not listed on the roadmap

12ee9f3

This will make it harder to discover it. Solution: list it

Problem: plprologu is documented as trusted in the comment

b1fa27e

Solution: fix the target of the comment

Problem: older version of SWI Prolog

86f1f14

Solution: switch to the development branch They are generally quite stable and pack a lot of new features that can be useful.

Problem: outdated SWI Prolog version

ecde970

Solution: update to a newer one

Problem: omni_prolog uses outdated pg_yregress syntax

592fa1f

In particular, it uses `success` field that has been removed. Solution: use `error` alone

Problem: omni_prolog uses old script layout

a9fd5d4

Solution: switch it to migrations

Problem: building omni_prolog

9074464

When building in isolation, it fails to load omni_prolog_stub: ``` open_shared_object/3: dlopen(omni_prolog_stub.so, 0x0001): tried: 'omni_prolog_stub.so' (no such file) ``` Solution: ensure the target that requires it depends on it

Problem: using older version of SWI Prolog

bdd2a6f

Solution: upgrade to 9.1.21

Problem: Prolog function calls failing

e54f58c

The error reporting suggests `$omni_load_code/2` could not be found. Solution: ensure we're not deallocating the path to the library when handing it off to SWI Prolog's initializer.

Problem: Prolog's results of query returning garbage

69006be

Solution: ensure we allocate the results in the correct context Results were previously allocated in (primarily) "SPI Proc" context which was deleted before the return from the language handler.

yrashk force-pushed the prolog branch from 01306cc to 69006be Compare December 17, 2023 18:04

Problem: unclear what isolation exists between Prolog functions

a8d0a93

Solution: start writing tests Currently, dynamic predicates are shared. Static aren't.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problem: lack of distinct options to process data #215

Problem: lack of distinct options to process data #215

yrashk commented Jun 30, 2023

ghost commented Jun 30, 2023 •

edited by ghost

Legend

Problem: lack of distinct options to process data #215

Are you sure you want to change the base?

Problem: lack of distinct options to process data #215

Conversation

yrashk commented Jun 30, 2023

ghost commented Jun 30, 2023 • edited by ghost

Legend

ghost commented Jun 30, 2023 •

edited by ghost