Comments (3)
This is missing. Consider that at the moment our "replica cluster" story is not yet complete. In this specific case, we should monitor if the WAL receiver is up. Our recommendation is to always use the WAL archive as a fallback mechanism.
"Cluster is healthy" refers to the local replica cluster. It means that all replicas are up, and so is the designated primary.
from cloudnative-pg.
I checked and we already monitor this: https://www.postgresql.org/docs/current/monitoring-stats.html#MONITORING-PG-STAT-WAL-RECEIVER-VIEW
Look at the pg_replication metric in the Prometheus exporter.
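For anyone who wants to check this by hand, here is a minimal sketch of a "WAL receiver is up" test. It assumes the single row of pg_stat_wal_receiver has already been fetched on the instance (for example with SELECT pid, status, sender_host FROM pg_stat_wal_receiver) and is passed in as a dict, or as None when the view is empty. The function names are hypothetical, and this is not how the operator's exporter computes its metric.

```python
from typing import Mapping, Optional


def wal_receiver_is_up(row: Optional[Mapping[str, object]]) -> bool:
    """Return True when the pg_stat_wal_receiver row describes an active receiver.

    `row` is the single row of the view as a mapping, or None when the view
    returned no rows (no WAL receiver process is running on the instance).
    """
    if row is None:
        return False
    # A running receiver has a pid; "streaming" is the status reported
    # while it is actively receiving WAL from the sender.
    return row.get("pid") is not None and row.get("status") == "streaming"


def describe_wal_receiver(row: Optional[Mapping[str, object]]) -> str:
    """Build a short human-readable summary, including the upstream host."""
    if row is None:
        return "WAL receiver is not running"
    state = "up" if wal_receiver_is_up(row) else "not streaming"
    return f"WAL receiver {state} (status={row.get('status')}, sender_host={row.get('sender_host')})"


if __name__ == "__main__":
    # Hypothetical sample row; the host name is made up for illustration.
    sample = {"pid": 1234, "status": "streaming", "sender_host": "source-cluster-rw"}
    print(describe_wal_receiver(sample))
    print(describe_wal_receiver(None))
```

The sender_host column is the useful part when working with a replica cluster, since it shows which upstream the instance is actually streaming from.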
Hi @gbartolini
Thanks for your feedback
I looked at the content of the "pg_stat_wal_receiver" view and I have the impression that the data in pg_stat_wal_receiver only concerns the primary pod and the replica pods within the same cluster.
I've checked the primary and replica clusters, both of which only contain data for their replica pods. The hosts correspond only to their replica pods.
So I'm afraid the pg_replication metric is only valid within the same cluster.
The cnpg_pg_replication_streaming_replicas value (2) matches only the cluster's own replica pods.
Also, only the replica pods' names and addresses are present in the metric labels; there is no information about the replica cluster or its pods.
I'll also have a look at WAL archiving.