create account

Current bugs/problems with HF16 by someguy123

View this thread on: hive.blogpeakd.comecency.com
· @someguy123 · (edited)
$106.24
Current bugs/problems with HF16
So far, HF16 has been a pretty bad release for myself. While I did successfully upgrade to HF16, I didn't do so without casualties, I missed about 4 blocks, ruining the fact that I had only missed 3 blocks since I first begun in August. This was not due to the witness going offline, no, this was due to the first problem: 

#### Disclaimer: I'm not trying to bad mouth the developers at Steemit Inc., STEEM is a very complicated system, and I envy them for having the skills to work on it. But I'd like to get these issues out to the public so that everyone is aware.

# It's very disk intensive, and any other steem-based server (or other high disk usage) running on the same physical server may cause you to miss blocks

Because of the Shared DB system, STEEM creates a large virtual memory file on disk, and uses it as a sort-of [Page File](https://en.wikipedia.org/wiki/Paging). This is supposed to reduce RAM, however in some cases it doesn't actually help (more in the next section).

![](https://i.imgur.com/WYhUeWq.png)

Thanks to this Shared DB system, STEEM is capable of thrashing your disk 24/7, to the point where you may be unable to run any other steem-based server on the same physical server, even if they're isolated within VMs. The optimisations given out in the release page for Linux systems don't seem to address this problem completely, and may need tuning.

I learned this lesson, as my GOLOS witness was running on one of my backup witness nodes for STEEM - this had never been an issue before, until now, thanks to the intense disk usage of HF16, my GOLOS node missed almost 200 blocks before I figured out HF16 was to blame.

Thanks to the pressure on the disk, it can cause problems for people trying to run their node on a VPS provider such as DigitalOcean or Linode, due to the disk throttling normally in place. Even for those that don't throttle, the heavy disk usage may result in you getting suspended.

# Big variation in RAM usage

![](https://i.imgur.com/fK6MLb3.png)
Above screenshot - 2 witness servers running on similar hardware, same settings.

I have servers that use 100mb of RAM, I have servers using 1.5gb of RAM, I have another server chewing up 6gb of RAM. @pharesim has a server chewing up 9gb of RAM. Despite them all being witnesses and running on similar hardware, the RAM usage cannot be predicted, it's seemingly arbitrary.

This could be the sign of a memory leak in the STEEM code, and should be addressed, as it may affect the reliability of the STEEM server over time, e.g. witnesses who need 24/7 uptime.

# General random block misses

In the #witness-blocks channel, there have been more misses than ever, some people like @pharesim and @riverhead have missed seemingly hundreds of blocks on STEEM thanks to strange issues with HF16 (completely out of their control thanks to HF16, this is not shaming). I've personally missed about 5 blocks so far, and still rising, because HF16 isn't very reliable.

# Server randomly stops syncing

This is a problem that should definitely be investigated. One of my STEEM nodes randomly stopped syncing, yet still believed it was supposed to be producing blocks. I'm not alone on this, some others such as @pharesim have also experienced this bug.

This problem results in missed blocks if it affects a witness node.

---

Do you like what I'm doing for STEEM/Steemit?
----
[Vote for me to be a witness](https://steemit.com/~witnesses) - every vote counts. 
----
Don't forget to follow me for more like this.
----

----
👍  , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and 217 others
👎  , , , , , , , , ,
properties (23)
authorsomeguy123
permlinkcurrent-bugs-problems-with-hf16
categorysteem
json_metadata{"tags":["steem","steemit","witness-category","dev","witness-blocks"],"users":["pharesim","riverhead"],"image":["https://i.imgur.com/WYhUeWq.png","https://i.imgur.com/fK6MLb3.png"],"links":["https://en.wikipedia.org/wiki/Paging","https://steemit.com/~witnesses"],"app":"steemit/0.1","format":"markdown"}
created2016-12-05 10:01:12
last_update2016-12-05 10:09:54
depth0
children10
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value95.785 HBD
curator_payout_value10.454 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length3,519
author_reputation103,945,664,283,580
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,921,436
net_rshares111,295,328,483,345
author_curate_reward""
vote details (291)
@abit ·
And worse the page file is not fully pre-allocated, but increasing on the fly..

Workaround: get a machine with large RAM, put it in /dev/shm, then pray.

By the way, Steemit, Inc. has officially suggested in the blog post that node operators should tweak kernel vm parameters, which will increase the possibility of data loss for other services running in the same machine. So better use a dedicated box for Steemd if you care your data.
👍  ,
properties (23)
authorabit
permlinkre-someguy123-current-bugs-problems-with-hf16-20161205t114805754z
categorysteem
json_metadata{"tags":["steem"]}
created2016-12-05 11:48:15
last_update2016-12-05 11:48:15
depth1
children2
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length438
author_reputation141,171,499,037,785
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,921,880
net_rshares278,333,623,674
author_curate_reward""
vote details (2)
@someguy123 ·
I have used the parameters suggested, and added them to steem-in-a-box via `./run.sh optimize`, but it's far from perfect, still causes a lot of issues from the disk usage.

Also the workaround is a little ridiculous, since this was supposed to reduce RAM usage...
properties (22)
authorsomeguy123
permlinkre-abit-re-someguy123-current-bugs-problems-with-hf16-20161205t115214932z
categorysteem
json_metadata{"tags":["steem"]}
created2016-12-05 11:52:15
last_update2016-12-05 11:52:15
depth2
children1
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length264
author_reputation103,945,664,283,580
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,921,897
net_rshares0
@abit ·
It did reduce RAM usage if you need to run 4 steemd instances in one server, which would have consumed you 100G+ RAM with 0.15 or earlier, but with 0.16 now it needs less than 30G.
properties (22)
authorabit
permlinkre-someguy123-re-abit-re-someguy123-current-bugs-problems-with-hf16-20161205t134530757z
categorysteem
json_metadata{"tags":["steem"]}
created2016-12-05 13:45:39
last_update2016-12-05 13:45:39
depth3
children0
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length180
author_reputation141,171,499,037,785
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,922,591
net_rshares0
@dantheman ·
$3.18
Thanks for this report.  Various ram usage is based on OS caching and depends on how long it has been running.  

It shouldn't be thrashed that much.  

OS X does much better with same code.  We will continue looking for ways to address Linux.  In mean time using a RAM disk should fix it.
👍  , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
properties (23)
authordantheman
permlinkre-someguy123-current-bugs-problems-with-hf16-20161205t121645021z
categorysteem
json_metadata{"tags":["steem"]}
created2016-12-05 12:16:45
last_update2016-12-05 12:16:45
depth1
children0
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value2.416 HBD
curator_payout_value0.761 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length289
author_reputation240,292,002,602,347
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,922,007
net_rshares17,695,257,432,217
author_curate_reward""
vote details (38)
@good-karma ·
It is working alright after following suggested changes, though haven't been able to test it extensively, yet to see if node keeps running as smoothly.
properties (22)
authorgood-karma
permlinkre-someguy123-current-bugs-problems-with-hf16-20161205t124302465z
categorysteem
json_metadata{"tags":["steem"]}
created2016-12-05 12:43:06
last_update2016-12-05 12:43:06
depth1
children0
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length151
author_reputation656,210,817,936,836
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,922,114
net_rshares0
@jacobcards ·
I wish I could speak this language...I feel like such a muggle :(
👍  , , , , , ,
properties (23)
authorjacobcards
permlinkre-someguy123-current-bugs-problems-with-hf16-20161205t144102369z
categorysteem
json_metadata{"tags":["steem"]}
created2016-12-05 14:41:03
last_update2016-12-05 14:41:03
depth1
children2
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length65
author_reputation48,954,175,480,649
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,923,050
net_rshares283,054,082,299
author_curate_reward""
vote details (7)
@anotherjoe ·
haha, you just posted for 99% of us!
properties (22)
authoranotherjoe
permlinkre-jacobcards-re-someguy123-current-bugs-problems-with-hf16-20161205t170606205z
categorysteem
json_metadata{"tags":["steem"]}
created2016-12-05 17:06:06
last_update2016-12-05 17:06:06
depth2
children0
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length36
author_reputation40,326,779,382,210
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,924,169
net_rshares0
@chris4210 ·
Totally agree,  sometimes we all feel like muggels.
properties (22)
authorchris4210
permlinkre-jacobcards-re-someguy123-current-bugs-problems-with-hf16-20161205t144102369z-2016125t203846934z
categorysteem
json_metadata{"tags":"steem","app":"esteem/1.3.2","format":"markdown+html"}
created2016-12-05 19:38:57
last_update2016-12-05 19:38:57
depth2
children0
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length51
author_reputation40,042,535,242,490
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,925,506
net_rshares0
@kingscrown ·
nicely done!
properties (22)
authorkingscrown
permlinkre-someguy123-current-bugs-problems-with-hf16-20161205t150447422z
categorysteem
json_metadata{"tags":["steem"]}
created2016-12-05 15:04:48
last_update2016-12-05 15:04:48
depth1
children0
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length12
author_reputation2,114,606,667,003,741
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,923,226
net_rshares0
@pfunk ·
I have a seed-node using the [recommended kernel settings](https://steemit.com/steem/@steemitblog/steem-0-16-0-official-release) using about 15GB of memory out of 32. Before I restarted it last night it was at about 16GB.

I have two witness nodes at 8.6 and 8.4GB use.
properties (22)
authorpfunk
permlinkre-someguy123-current-bugs-problems-with-hf16-20161205t185310504z
categorysteem
json_metadata{"tags":["steem"],"links":["https://steemit.com/steem/@steemitblog/steem-0-16-0-official-release"]}
created2016-12-05 18:53:15
last_update2016-12-05 18:53:15
depth1
children0
last_payout2017-01-05 13:15:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length269
author_reputation221,632,045,904,452
root_title"Current bugs/problems with HF16"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id1,925,060
net_rshares0