Commit graph

27 commits

Author SHA1 Message Date
benoit df744bf7dc Use isort to automatically sort imports 2023-03-20 14:56:11 +01:00
benoit dff95eae2f Fixing logging issues in the previous modifications 2023-03-20 12:25:32 +01:00
benoit 7eea4c94be Reorganise for the urllib3 > requests change 2023-03-20 12:25:32 +01:00
benoit 2443505ad6 Change logging
We now use our own logger.
When debug is set (-vvv), we also display urllib3's debug info.
2023-03-20 12:25:32 +01:00
benoit 7815f3379c Reverse the test for node_is_pending
Since the desired state is for there to be no restart pending state, it
makes more sense to modify the service logic so that the return code
reflects this. As a result, the test for the service `node_is_pending`
has been reversed.
2023-03-20 12:25:32 +01:00
benoit 9cd80f5af8 Move from urllib3 to requests 2023-03-20 12:25:32 +01:00
benoit 0800fc72e9 Add more info on states and roles for cluster_node_count 2023-03-10 10:16:01 +01:00
benoit 6f45618e08 Update node_is_alive to mention that it's a liveness check. 2023-03-10 10:16:01 +01:00
benoit 7286638121 -e/endpoints spec update
* it is now possible to specify a comma-separated list of endpoints
* the documentation has been updated to explain that:
  + for node services, if several addresses are specified, they should
    point to different interfaces on the same server.
  + for cluster services, several addresses should be used because we
    want the cluster status, so the more APIs we try, the better our
    chances of getting a reply.
2023-03-10 10:16:01 +01:00
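The endpoint handling described in the commit above can be sketched as follows. The helper names `parse_endpoints` and `first_reply`, the `fetch` callable, and the example addresses are hypothetical; the sketch only illustrates splitting a comma-separated value and trying each endpoint until one replies, and is not the project's actual code.

```python
from typing import Any, Callable, List


def parse_endpoints(value: str) -> List[str]:
    """Split a comma-separated -e/--endpoints value, dropping empty items."""
    return [item.strip() for item in value.split(",") if item.strip()]


def first_reply(endpoints: List[str], fetch: Callable[[str], Any]) -> Any:
    """Return the first successful reply, trying the endpoints in order."""
    last_error = None
    for endpoint in endpoints:
        try:
            return fetch(endpoint)
        except OSError as exc:  # ConnectionError and timeouts are OSErrors
            last_error = exc
    raise ConnectionError(f"no endpoint replied: {endpoints}") from last_error


def fake_fetch(url: str) -> dict:
    """Stand-in for an HTTP call: the first made-up node is down."""
    if url == "http://10.0.0.1:8008":
        raise ConnectionError("connection refused")
    return {"answered_by": url}


endpoints = parse_endpoints("http://10.0.0.1:8008, http://10.0.0.2:8008")
print(first_reply(endpoints, fake_fetch))
```

For a cluster service any node's reply is good enough, which is why the loop stops at the first success instead of querying every address.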
benoit 48d2656ec7 Update check_patroni's description to mention Patroni's API 2023-03-10 10:16:01 +01:00
benoit 275901006b Add spellcheck + tox in requirements-dev.txt 2023-03-02 17:32:18 +01:00
benoit 908669f073 Add a --save option when state files are used
The checks `cluster_config_has_changed` and `node_tl_has_changed` use a
state file to store the previous value of the config hash and the
timeline.

Previously the check would fail if something changed, but the new value
would be saved directly. This behaviour has changed. The new value
is saved only if `--save` is passed to the check.

This mimics the way [check_pgactivity] manages this kind of check.

[check_pgactivity]: https://github.com/OPMDG/check_pgactivity
2023-03-02 17:32:18 +01:00
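A minimal sketch of the `--save` behaviour described in the commit above, assuming a JSON state file and a hypothetical `has_changed` helper. The file format, the function names, and the choice to treat a missing state file as "unchanged" are assumptions, not taken from the project.

```python
import json
import tempfile
from pathlib import Path
from typing import Optional


def read_state(path: Path) -> Optional[str]:
    """Return the previously saved value, or None if no state file exists."""
    if not path.exists():
        return None
    return json.loads(path.read_text())["value"]


def has_changed(path: Path, current: str, save: bool = False) -> bool:
    """Compare current with the saved value; persist it only when save is set."""
    previous = read_state(path)
    # Assumption: with no saved state there is nothing to compare against.
    changed = previous is not None and previous != current
    if save:
        path.write_text(json.dumps({"value": current}))
    return changed


with tempfile.TemporaryDirectory() as tmp:
    state = Path(tmp) / "state.json"
    has_changed(state, "timeline-1", save=True)        # record a baseline
    print(has_changed(state, "timeline-2"))            # change reported, not saved
    print(has_changed(state, "timeline-2"))            # still reported
    print(has_changed(state, "timeline-2", save=True)) # reported, then saved
    print(has_changed(state, "timeline-2"))            # no longer reported
```

Without `save`, the check keeps reporting the change on every run until someone acknowledges it by running with the flag.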
benoit aa1de928d3 An attempt at correctly ordering imports 2022-02-07 15:11:05 +01:00
benoit 9ed9b6466d Read state file in cli to pass the info to the checks and summaries 2022-02-07 15:01:50 +01:00
benoit 4de20fefdc Node and Cluster services reviews 2022-02-07 14:18:14 +01:00
benoit 7898011c40 Update the README and help 2022-02-07 11:03:12 +01:00
benoit 561c3ed9da Fix doc layout and threshold doc 2021-12-31 11:30:17 +01:00
benoit 86f8bdb395 Readme: perfdata for node_tl_has_changed 2021-12-08 17:23:56 +01:00
benoit df58901fd9 Typo Readme 2021-12-08 17:14:38 +01:00
benoit 6c696a03ee Mypy fix
Stop using ctx.parent.params to get the verbose and timeout parameters
parsed in main and use ctx.obj instead.

ctx.parent is typed as Optional[Context], which forces us to test
whether it is None before using it. The test is useless in our case
because we know it's not None, and the resulting code is ugly.

The mypy error:

Item "None" of "Optional[Context]" has no attribute "params"
[union-attr]
2021-09-11 00:36:57 +02:00
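The typing issue behind the commit above can be illustrated without click itself. The `Context` class below is a simplified stand-in written for this sketch, not click's real class; it only mirrors the three attributes discussed (`params`, `parent`, `obj`) and click's habit of inheriting `obj` from the parent context. The function names are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class Context:
    """Stand-in for click's Context, reduced to the attributes discussed."""

    params: Dict[str, Any] = field(default_factory=dict)
    parent: Optional["Context"] = None
    obj: Any = None

    def __post_init__(self) -> None:
        # Like click, inherit obj from the parent when none is given.
        if self.obj is None and self.parent is not None:
            self.obj = self.parent.obj


def timeout_via_parent(ctx: Context) -> int:
    # Without this guard, mypy reports the [union-attr] error.
    if ctx.parent is None:
        raise RuntimeError("no parent context")
    return ctx.parent.params["timeout"]


def timeout_via_obj(ctx: Context) -> int:
    # obj is typed Any, so there is no Optional to narrow.
    return ctx.obj["timeout"]


options = {"timeout": 5, "verbose": 0}
main_ctx = Context(params=options, obj=options)
sub_ctx = Context(parent=main_ctx)
print(timeout_via_parent(sub_ctx), timeout_via_obj(sub_ctx))
```

Both accessors return the same value; the second simply avoids the None check that the type checker would otherwise demand.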
benoit ca95250f2e Display default values in help 2021-09-10 09:14:19 +02:00
benoit e18ce97d66 Add default value for verbose and fix the type in main parameters 2021-09-09 17:39:12 +02:00
benoit d4e974da51 --max-lag additions and fixups
* add --max-lag to cluster_has_replica
* change --lag to --max-lag in node_is_replica
* update tests
* update README.md
2021-08-13 11:00:43 +02:00
benoit dd8130a459 New service cluster_is_in_maintenance 2021-08-12 13:15:37 +02:00
benoit db3f008b81 More Typing 2021-08-12 13:07:28 +02:00
benoit 1e6adc6a1a First working version 2021-08-12 13:07:10 +02:00
benoit aa17162871 Initial commit 2021-07-02 12:37:03 +02:00