Qumomf

Qumomf is a Tarantool vshard high availability tool which supports discovery and recovery.

vshard configuration consistency (prefer replica which has the same configuration as master),
which upstream status did replica have before the crash,
how replica is far from the master comparing LSN to the master LSN,
last time when replica received data or heartbeat signal from the master,
user promotion rules based on the instance priorities.

You can define your own promotion rules which will influence on master election during a failover. Each instance has a priority set via config. Negative priority excludes follower from the election process.

Recovery hooks

Hooks invoked through the recovery process via shell, in particular bash.

These hooks are available:

PreFailover: executed immediately before qumomf takes recovery action. Failure (non-zero exit code) of any of these processes aborts the recovery. Hint: this gives you the opportunity to abort recovery based on some internal state of your system.
PostSuccessfulFailover: executed at the end of successful recovery.
PostUnsuccessfulFailover: executed at the end of unsuccessful recovery.

Any process command that starts with "&" will be executed asynchronously, and a failure for such process is ignored.

Qumomf executes lists of commands sequentially, in order of definition.

A naive implementation might look like:

hooks:
  shell: bash
  pre_failover:
    - "echo 'Will recover from {failureType} on {failureCluster}' >> /tmp/qumomf_recovery.log"
  post_successful_failover:
    - "echo 'Recovered from {failureType} on {failureCluster}. Set: {failureReplicaSetUUID}; Failed: {failedURI}; Successor: {successorURI}' >> /tmp/qumomf_recovery.log"
  post_unsuccessful_failover:
    - "echo 'Failed to recover from {failureType} on {failureCluster}. Set: {failureReplicaSetUUID}; Failed: {failedURI}' >> /tmp/qumomf_recovery.log"

Hooks arguments and environment

Qumomf provides all hooks with failure/recovery related information, such as the UUID/URI of the failed instance, UUID/URI of promoted instance, type of failure, name of cluster, etc.

This information is passed independently in two ways, and you may choose to use one or both:

Environment variables:

QUM_FAILURE_TYPE
QUM_FAILED_UUID
QUM_FAILED_URI
QUM_FAILURE_CLUSTER
QUM_FAILURE_REPLICA_SET_UUID
QUM_COUNT_FOLLOWERS
QUM_COUNT_WORKING_FOLLOWERS
QUM_COUNT_REPLICATING_FOLLOWERS
QUM_COUNT_INCONSISTENT_VSHARD_CONF
QUM_IS_SUCCESSFUL

And, if a recovery was successful:

QUM_SUCCESSOR_UUID
QUM_SUCCESSOR_URI

Command line text replacement.

Qumomf replaces the following tokens in your hook commands:

{failureType}
{failedUUID}
{failedURI}
{failureCluster}
{failureReplicaSetUUID}
{countFollowers}
{countWorkingFollowers}
{countReplicatingFollowers}
{countInconsistentVShardConf}
{isSuccessful}

And, if a recovery was a successful:

{successorUUID}
{successorURI}

API

Qumomf exposes several debug endpoints:

/debug/metrics - runtime and app metrics in Prometheus format,
/debug/health - health check,
/debug/about - the app version and build date.

API documentation for getting information about cluster states, recoveries and problems.

Hacking

Feel free to open issues and pull requests with your ideas how to improve qumomf.

To run unit and integration tests:

make env_up
make run_tests
make env_down

Name		Name	Last commit message	Last commit date
Latest commit History 104 Commits
.github/workflows		.github/workflows
api		api
cmd/qumomf		cmd/qumomf
config		config
example		example
internal		internal
scripts		scripts
.gitignore		.gitignore
.golangci.yml		.golangci.yml
.goreleaser.yml		.goreleaser.yml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Qumomf

Table of Contents

Discovery

Configuration

How to add a new cluster

Topology recovery

Idle

Smart

Recovery hooks

Hooks arguments and environment

API

Hacking

About

Releases 9

Packages

Contributors 3

Languages

License

shmel1k/qumomf

Folders and files

Latest commit

History

Repository files navigation

Qumomf

Table of Contents

Discovery

Configuration

How to add a new cluster

Topology recovery

Idle

Smart

Recovery hooks

Hooks arguments and environment

API

Hacking

About

Resources

License

Stars

Watchers

Forks

Releases 9

Packages 0

Contributors 3

Languages

Packages