pg_llm - PostgreSQL LLM Integration Extension

This PostgreSQL extension enables direct integration with various Large Language Models (LLMs). It supports multiple LLM providers and features like session management, parallel inference, and more.

Features

  • Support for remote LLM providers
  • Support for locally hosted models
  • Dynamic model management (add/remove models at runtime)
  • Persistent model metadata
  • Integrated logging
  • Streaming responses
  • Session-based multi-turn conversations
  • Parallel inference across multiple models
  • Automatic selection of the highest-scoring (most confident) response, with fallback to a local model when confidence is low
  • Encryption of sensitive information
  • Audit logging
  • Session state management
  • SQL execution through extension functions, with LLM-generated optimization suggestions
  • Natural language queries: ask questions in plain language to retrieve database data
  • Report generation and intelligent analysis: built-in statistical analysis and visualization suggestions
  • Full observability: complete recording of the decision process and intermediate results
  • Accuracy improvements through a RAG knowledge base and feedback learning
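The confidence-scored selection with local fallback can be pictured with a small client-side sketch. Everything below (the response shape, the field names, and the 0.65 threshold) is illustrative only, not part of the extension's API:

```python
# Illustrative sketch of confidence-based model selection with local fallback.
# The response dict shape and the default threshold are assumptions.

def select_response(responses, local_response, threshold=0.65):
    """Pick the highest-scoring remote response, or fall back to the local model."""
    best = max(responses, key=lambda r: r["confidence"], default=None)
    if best is None or best["confidence"] < threshold:
        return local_response  # low confidence (or no answers): use the local model
    return best

responses = [
    {"model": "qianwen-chat", "confidence": 0.82, "text": "..."},
    {"model": "deepseek-chat", "confidence": 0.41, "text": "..."},
]
local = {"model": "deepseek-r1-local", "confidence": 0.50, "text": "..."}

print(select_response(responses, local)["model"])  # highest-confidence remote model
print(select_response([], local)["model"])         # no remote answers: local fallback
```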

Prerequisites

  • PostgreSQL 14.0 or later
  • C++20 compatible compiler
  • libcurl
  • OpenSSL
  • jsoncpp
  • CMake 3.15 or later (for CMake build)

Dependencies

The pg_llm extension depends on the following libraries:

  • OpenSSL: For secure connections to LLM APIs
  • cURL: For making HTTP requests to LLM APIs
  • JsonCpp: For parsing JSON responses from LLM APIs
  • PostgreSQL logging subsystem: For extension runtime logging

The build system will automatically check for these dependencies and provide instructions if they are missing.

Installation

  1. Configure PostgreSQL environment:
# Add PostgreSQL binaries to your PATH
# First, locate your PostgreSQL installation's bin directory:
# You can use the command:
which psql
# or
pg_config --bindir

# Then add the path to your shell configuration file
# For macOS (add to ~/.zshrc or ~/.bash_profile):
export PATH=/path/to/postgresql/bin:$PATH

# For Linux (add to ~/.bashrc):
export PATH=/path/to/postgresql/bin:$PATH

# Apply changes
source ~/.zshrc  # or ~/.bash_profile or ~/.bashrc
  2. Install dependencies:
# Ubuntu/Debian
sudo apt-get install postgresql-server-dev-all libcurl4-openssl-dev libjsoncpp-dev libssl-dev cmake pkg-config

# macOS
brew install postgresql curl jsoncpp openssl cmake pkg-config
  3. Build and install pg_llm:
cd pg_llm
mkdir build && cd build

# Configure (choose one of the following build types)
# Debug build
cmake -DCMAKE_BUILD_TYPE=Debug ..

# Release build
cmake -DCMAKE_BUILD_TYPE=Release ..

# Address Sanitizer build
cmake -DCMAKE_BUILD_TYPE=ASan ..

# Build
make

# Install
sudo make install
  4. Configure PostgreSQL to load the extension:

Since pg_llm implements the _PG_init function for initialization, it must be loaded via shared_preload_libraries. Add the following to your postgresql.conf file:

# Add pg_llm to shared_preload_libraries
shared_preload_libraries = 'pg_llm'
  5. Restart PostgreSQL to load the extension:
# For systemd-based systems
sudo systemctl restart postgresql

# For macOS
brew services restart postgresql

# For other systems
pg_ctl restart -D /path/to/data/directory
  6. Create the extension in your database:
-- Enable the extensions (pgvector first, then pg_llm)
CREATE EXTENSION vector;
CREATE EXTENSION pg_llm;

-- Verify installation
SELECT * FROM pg_available_extensions WHERE name = 'pg_llm';

Usage

Adding Models

  1. Alibaba Tongyi Qianwen:
SELECT pg_llm_add_model(
    'qianwen',
    'qianwen-chat',
    'your-api-key',
    '{
        "model_name": "qwen-turbo",
        "api_endpoint": "https://dashscope.aliyuncs.com/api/v1/services/aigc/text-generation/generation",
        "access_key_id": "your-access-key-id",
        "access_key_secret": "your-access-key-secret"
    }'
);

Single-turn Chat

SELECT pg_llm_chat('gpt4-chat', 'What is PostgreSQL?');

Multi-turn Chat

-- pg_llm_create_session() returns a session id; pass it to the follow-up calls
SELECT pg_llm_create_session();
SELECT pg_llm_multi_turn_chat('qianwen-chat', '5PN2qmWqBlQ9wQj99nsQzldVI5ZuGXbE', 'Who are you?');
SELECT pg_llm_multi_turn_chat('qianwen-chat', '5PN2qmWqBlQ9wQj99nsQzldVI5ZuGXbE', 'What was the previous question?');

Parallel Multi-model Chat

SELECT pg_llm_parallel_chat(
    'What are the advantages of PostgreSQL?',
    ARRAY['gpt4', 'deepseek-chat', 'hunyuan-chat', 'qianwen-chat']
);

Streaming Chat

SELECT *
FROM pg_llm_chat_stream(
  'qianwen-chat',
  'Explain MVCC in PostgreSQL',
  '{}'::jsonb
);
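Since pg_llm_chat_stream is a set-returning function, client code typically concatenates the streamed rows into the final answer. A minimal Python sketch of that reassembly (the per-row shape and the empty-string end sentinel are assumptions, not the extension's documented protocol):

```python
# Reassemble a streamed response from chunk rows.
# Each row is assumed to carry a text fragment; the real row shape may differ.

def assemble_stream(rows):
    """Join streamed text chunks, stopping at an empty sentinel chunk if present."""
    parts = []
    for chunk in rows:
        if chunk == "":            # assumed end-of-stream sentinel
            break
        parts.append(chunk)
    return "".join(parts)

rows = ["MVCC lets readers ", "and writers proceed ", "without blocking each other.", ""]
print(assemble_stream(rows))
```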

Structured JSON APIs

SELECT pg_llm_chat_json('qianwen-chat', 'Summarize PostgreSQL in one paragraph');

SELECT pg_llm_parallel_chat_json(
  'Which PostgreSQL features matter most for analytics workloads?',
  ARRAY['deepseek-r1-local', 'qianwen-chat'],
  '{"confidence_threshold": 0.65}'::jsonb
);

SELECT pg_llm_text2sql_json(
  'qianwen-chat',
  'Show the latest 10 orders',
  NULL,
  true,
  '{"enable_rag": true}'::jsonb
);

Session State

SELECT pg_llm_create_session(8);
SELECT pg_llm_update_session_state('session-id', '{"topic":"finance"}'::jsonb);
SELECT pg_llm_get_session('session-id');
SELECT * FROM pg_llm_get_session_messages('session-id');
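The exact merge semantics of pg_llm_update_session_state are not spelled out here; a common design is a shallow JSON merge, sketched below purely as an illustration of what updating `{"topic":"finance"}` might do (this is an assumption, not the extension's documented behavior):

```python
# Shallow-merge sketch for session state updates (assumed semantics).

def update_session_state(current: dict, patch: dict) -> dict:
    """Return a new state with top-level keys from patch overriding current."""
    merged = dict(current)
    merged.update(patch)
    return merged

state = {"topic": "finance", "language": "en"}
state = update_session_state(state, {"topic": "operations"})
print(state)   # {'topic': 'operations', 'language': 'en'}
```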

SQL Analysis And Reporting

SELECT pg_llm_execute_sql_with_analysis(
  'qianwen-chat',
  'SELECT region, SUM(revenue) AS total_revenue FROM sales GROUP BY region',
  '{}'::jsonb
);

SELECT pg_llm_generate_report(
  'qianwen-chat',
  'SELECT region, SUM(revenue) AS total_revenue FROM sales GROUP BY region',
  '{}'::jsonb
);

Knowledge Base And Feedback

SELECT pg_llm_add_knowledge(
  'ops-runbook',
  'PostgreSQL VACUUM reclaims dead tuples and updates visibility information.',
  '{"domain":"operations"}'::jsonb,
  '{"chunk_size": 128}'::jsonb
);
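The chunk_size option suggests that documents are split into fixed-size pieces before indexing. A rough Python sketch of character-based chunking follows; the extension's real strategy (token-based splitting, overlap, sentence boundaries) may well differ:

```python
# Character-based chunking sketch; pg_llm's actual splitting strategy is unknown
# and may use tokens, overlap, or sentence boundaries instead.

def chunk_text(text: str, chunk_size: int = 128) -> list[str]:
    """Split text into consecutive chunks of at most chunk_size characters."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

doc = "PostgreSQL VACUUM reclaims dead tuples and updates visibility information."
for piece in chunk_text(doc, chunk_size=40):
    print(repr(piece))
```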

SELECT * FROM pg_llm_search_knowledge('How does VACUUM help?', '{"limit": 3}'::jsonb);

SELECT pg_llm_record_feedback(
  '00000000-0000-0000-0000-000000000000'::uuid,
  5,
  'Helpful answer',
  '{"tag":"positive"}'::jsonb
);

Audit And Trace

SELECT * FROM pg_llm_get_audit_log('{"limit": 20}'::jsonb);
SELECT pg_llm_get_trace('00000000-0000-0000-0000-000000000000'::uuid);

Removing Models

SELECT pg_llm_remove_model('gpt4-chat');

Development Guide

Please refer to CONTRIBUTING.md for detailed development guidelines, including:

  • Code organization
  • Building for development
  • Development workflow
  • Adding new models
  • Coding standards
  • Commit conventions

Security Considerations

  • API keys are encrypted before storage
  • All sensitive information is handled securely
  • Access control is managed through PostgreSQL's permission system
  • Comprehensive audit logging

Contributing

  1. Fork the repository
  2. Create your feature branch
  3. Commit your changes
  4. Push to the branch
  5. Create a Pull Request

Coding Standards

  • Use C++20 features appropriately
  • Follow PostgreSQL coding conventions
  • Add comprehensive comments
  • Include unit tests for new features
  • Update documentation as needed

License

This project is licensed under the PostgreSQL License - see the LICENSE file for details.

Support

For issues and feature requests, please create an issue in the GitHub repository.
