EOS BPs: Auto Failover for Producing Nodes by blockmatrix

eos · @blockmatrix · Jun 18 '18 (edited)

$0.79

EOS BPs: Auto Failover for Producing Nodes

It is imperative for active BPs to ensure their producing nodes are reliable, and that in the event of failure they can continue to sign blocks from standby nodes without any human intervention. 

Currently, there aren't too many "ideal" solutions for this - various [issues](https://github.com/EOSIO/eos/issues/4025) have been raised on the EOS github to help make this process easier for us Block Producers, but until they have been shipped we must do the best we can with the tools currently at our disposal. 

At Block Matrix we have been battle testing an automated failover solution using `keepalived` in the event of the `nodeos` processing being killed. We now have a lightweight solution in place, which auto promotes a backup node via the producer API. You can watch this in action here: 

https://www.youtube.com/watch?v=OuB40yd0z4M

We have put together the code for this over on our [Github](https://github.com/BlockMatrixNetwork/eos-bp-failover), with some explanation around the process and a special addendum for AWS users to combat the multicast/unicast issue which will prevent a vanilla `keepalived` solution from working within their environment.

We have several improvements to this, catering for issues where `nodeos` continues to run but stalls or stops signing blocks - once we have the relevant updates from the EOS dev team we will extend our examples to include them. 

Happy HA'ing to all BPs!

---

[Block Matrix](https://blockmatrix.network) are an EOS block producer candidate, producer name: `blockmatrix1`

👍 bitspace, tsto, eos-costarica, aclarkuk82

`author`	blockmatrix
`permlink`	eos-bps-auto-failover-for-producing-nodes
`category`	eos
`json_metadata`	{"tags":["eos","blockproducers","ha","keepalived","failover"],"image":["https://img.youtube.com/vi/OuB40yd0z4M/0.jpg"],"links":["https://github.com/EOSIO/eos/issues/4025","https://www.youtube.com/watch?v=OuB40yd0z4M","https://github.com/BlockMatrixNetwork/eos-bp-failover","https://blockmatrix.network"],"app":"steemit/0.1","format":"markdown"}
`created`	2018-06-18 14:44:42
`last_update`	2018-06-18 14:46:15
`depth`	0
`children`	0
`last_payout`	2018-06-25 14:44:42
`cashout_time`	1969-12-31 23:59:59
`total_payout_value`	0.749 HBD
`curator_payout_value`	0.040 HBD
`pending_payout_value`	0.000 HBD
`promoted`	0.000 HBD
`body_length`	1,539
`author_reputation`	213,606,231,566
`root_title`	"EOS BPs: Auto Failover for Producing Nodes"
`beneficiaries`	`[]`
`max_accepted_payout`	1,000,000.000 HBD
`percent_hbd`	10,000
`post_id`	61,234,482
`net_rshares`	370,735,505,311
`author_curate_reward`	""

properties (23)vote details (4)

voter	rshares	pct
bitspace	369,241,997,856	100%
aclarkuk82	192,845,807	100%
tsto	979,851,128	100%
eos-costarica	320,810,520	100%