-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
no metrics during sync progress #15
Comments
Based on the socket timeout error message, I suspect the RPC server was totally down during that time. Were you running version 0.20.0? Do you have the logs from the bitcoin node? |
No, it was responding to
I upgraded to v0.20.1 while this was happening. Didn't help.
Yes, then just had the regular logs. No errors. e.g.
After sync finished, the monitor started working again. |
Got it. Then the
The timeout sent into the rpc client should be the same I've honestly never run into a similar issue, even during syncs. Are you running the node and the exporter together on an under-powered machine? Right now there's a global |
I'm running on bitcoind and exporter both on t3a.small with magnetic storage - it's meets my needs and the sync was reasonably fast.
Yes, looks like this is the root cause is there. It's currently all or nothing. Allowing some metrics would be a much better behavior. Also, is there a way to prioritize blockchaininfo even if other RPC's throw exceptions? |
Just curious because my node starts around 1GB memory usage and climbs pretty quickly from there. My instance is pretty unconstrained, memory-wise, and I only see Hopefully I can reproduce by artificially limiting the available memory.
I'll try to just make each metric independent, instead of prioritizing them. |
@alevchuk - I just pushed a new branch with a rewrite of the metric refresh to run the RPC calls in parallel and also be more lenient with failures. https://github.com/jvstein/bitcoin-prometheus-exporter/tree/issue_15/async_refresh Are you able to give it a test against your node? |
i repro'd again on latest master before applying the patch (stopped bitcoind for 1 hour to test the high-io sync). then switched to the branch and run bitcoind again with still some lag and high-io sync. got this crash yet not sure it this was before or after the "Verifying last 6 blocks at level 3" which takes a few minutes before oppening network ports |
getting that's when starting the monitoring after |
My node was down for a few days, after resuming it, there were no metrics while the sync was happening.
Misc:
Monitoring is most needed during unordinary conditions like this one. I don't think it makes sense to export any other metrics during sync, yet sync progress metric is important during sync. For example, it enables sync performance debugging.
The text was updated successfully, but these errors were encountered: