
PMM - High Availability

Percona Monitoring and Management High Availability - PMM HA

This method provides the means to:

  • Use a running PMM instance
  • Prepare it to act as a Primary
  • Install a second PMM on a different machine
  • Prepare it to act as a Secondary
  • Establish replication

Prerequisites

  • Docker 23.0.3 or higher
  • The Docker installation will fail on Amazon Linux or RHEL9/EL9 (unless you are on an s390x architecture machine), since the Percona Easy-Install script relies on the Get Docker script for the Docker install. You will need to install Docker yourself in those cases
  • SSH access to the host servers
  • sudo capabilities
  • Ports 443 and 9000 accessible from outside the Primary host machine

Install & Run

Clone the repo and run the pmm.sh script:

git clone https://github.com/nethalo/pmmha.git
cd pmmha
bash pmm.sh

You will be presented with the available options. The first one is straightforward: Install PMM from scratch.

Set Primary & Replica

Both the Primary and the Replica require some preparation. Follow the steps below:

Setup a Replica

  1. Choose the option Set PMM Replica.
  2. Confirm.
  3. Enter the info for Host, User, and Password of the Primary PMM.
  4. The steps will be performed. Wait for them to finish and you are all set!

Setup a Primary

  1. Choose the option Set PMM Primary.
  2. Confirm.
  3. No additional info needed :)
  4. Confirm and the steps will be performed. You are all set!

What is under the hood?

Simply put, there are four main things replicated:

  • VictoriaMetrics time series data
  • Inventory and configuration info from PostgreSQL
  • ClickHouse metrics table
  • SQLite info: Grafana Dashboards/Users/Roles/etc, Alerts, PMM Managed Backups (for MongoDB)

VictoriaMetrics

Federation is what is used here. A new scrape job is configured to gather metrics via the federate endpoint of the Primary and store them locally on the Secondary:

scrape_configs:
  - job_name: pmmha
    honor_timestamps: true
    scrape_interval: 2s
    scrape_timeout: 1s
    metrics_path: /prometheus/federate?match[]={__name__=~".*"}
    scheme: $scheme
    tls_config:
      insecure_skip_verify: true
    basic_auth:
      username: $user
      password: $pass
    static_configs:
      - targets:
          - "$host:$port"

PostgreSQL

A pg_dump of the pmm-managed database is taken and stored in a File table inside the Primary's ClickHouse. The Secondary then reads the contents of that table via ClickHouse's remote() function and restores the dump.

The File table is defined as:

CREATE TABLE IF NOT EXISTS pmm.pgpmm (dump String) ENGINE = File(RawBLOB);

And the dump is taken with:

pg_dump -Upmm-managed --dbname=pmm-managed --inserts --data-only --disable-triggers > /srv/clickhouse/data/pmm/pgpmm/data.RawBLOB
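The restore side on the Secondary is then, conceptually, a matter of pulling that single-row dump out of the Primary's ClickHouse and feeding it to psql. A minimal sketch, assuming $pmmserver points at the Primary (the exact invocation in pmm.sh may differ):

# Read the stored dump via remote() and replay it into the local pmm-managed database;
# TSVRaw avoids any escaping of the SQL text
clickhouse-client --query="SELECT dump FROM remote('$pmmserver', pmm.pgpmm)" \
  --format=TSVRaw | psql -Upmm-managed --dbname=pmm-managed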

ClickHouse

For QAN data, the same remote() functionality is used. However, to achieve data deduplication, an intermediate table is created with the ReplacingMergeTree engine, so that when a merge is forced the data is consolidated.

The remote functionality is as simple as this query:

select * from remote('$pmmserver', pmm.metrics)
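A minimal sketch of the deduplication flow described above (the staging table name and the ORDER BY key columns here are illustrative, not necessarily what pmm.sh creates): pull the Primary's rows into a ReplacingMergeTree table, then force a merge so duplicate rows collapse:

-- Staging table with the metrics structure, but using ReplacingMergeTree
-- (queryid/period_start as the sorting key is an assumption for this sketch)
CREATE TABLE IF NOT EXISTS pmm.metrics_staging
ENGINE = ReplacingMergeTree
ORDER BY (queryid, period_start)
AS SELECT * FROM remote('$pmmserver', pmm.metrics);

-- On subsequent syncs, pull rows from the Primary again
INSERT INTO pmm.metrics_staging SELECT * FROM remote('$pmmserver', pmm.metrics);

-- Force a merge so ReplacingMergeTree consolidates duplicate rows
OPTIMIZE TABLE pmm.metrics_staging FINAL;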

SQLite

For SQLite data, the same remote() functionality is used again.

The dump is made per table, to separate the dashboard tables from the rest due to their size:

sqlite3 /srv/grafana/grafana.db ".dump --nosys --data-only --newlines --preserve-rowids dashboard dashboard_version" > /srv/clickhouse/data/pmm/sqlitedash/data.RawBLOB

The remote functionality is as simple as this query:

clickhouse-client --format=PrettySpaceNoEscapes --multiquery --database pmm --query="SET output_format_pretty_max_rows=10000000000000; SET output_format_pretty_max_column_pad_width=10; SET output_format_pretty_max_value_width=100000000; select * from remote('$pmmserver', pmm.sqlitedash)"
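Since the dump was produced by sqlite3 ".dump", what comes back from remote() is plain SQL text, so conceptually the restore is just piping it into sqlite3. A rough sketch, assuming pmm.sqlitedash is a File(RawBLOB) table with a single dump column like pmm.pgpmm above (the exact flags pmm.sh uses may differ):

# Replay the dashboard dump into the local Grafana database
clickhouse-client --database pmm \
  --query="SELECT dump FROM remote('$pmmserver', pmm.sqlitedash)" \
  --format=TSVRaw | sqlite3 /srv/grafana/grafana.db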
