Michael Vines
5f5824d78d
Rework cluster metrics dashboard to support the modern clusters
2020-03-11 14:14:56 -07:00
carllin
d3712dd26d
Factor repair from gossip ( #8044 )
2020-02-11 13:11:48 -07:00
carllin
fe590da3b6
Revert "Factor repair from gossip ( #8044 )" ( #8143 )
...
This reverts commit e61257695f
.
2020-02-06 11:44:20 -08:00
carllin
e61257695f
Factor repair from gossip ( #8044 )
2020-01-31 14:23:50 -08:00
Justin Starry
87598c7612
Consolidate tx error counters and update metrics dashboard ( #7724 )
...
automerge
2020-01-16 23:26:50 -08:00
Greg Fitzgerald
b5dba77056
Rename blocktree to blockstore ( #7757 )
...
automerge
2020-01-13 13:13:52 -08:00
Ryo Onodera
865c42465a
Cap file size for snapshot data files ( #7182 )
...
* save limit deserialize
* save
* Save
* Clean up
* rustfmt
* rustfmt
* Just comment out to please CI
* Fix ci...
* Move code
* Rustfmt
* Crean up control flow
* Add another comment
* Introduce predetermined constant limit on snapshot data files (deserialize side)
* Introduce predetermined constant limit on snapshot data files (serialize side)
* rustfmt
* Tweak message
* Revert dynamic memory limit
* Limit size of snapshot data file (de)serialization
* Fix test breakage
* Clean up
* Fix uses formatting
* Rename: deserialize_{for,from}_snapshot
* Simplify comment
* Use Slot
* Provide slot for status cache
* Align variable name with snapshot_status_cache_file_path
* Define serialize_snapshot_data_file_with_metrics
* Fix build.......
* De-marco serialize_snapshot_data_file_with_metrics
* Revert u64 => Slot
2020-01-10 09:49:36 +09:00
Sagar Dhawan
0ae7e86fcb
Dashboard: Move y-axis items with different units either side of the graph ( #7573 )
...
automerge
2019-12-19 15:22:43 -08:00
Tyera Eulberg
3513f4ee84
Rename drone to faucet ( #7508 )
2019-12-16 14:05:17 -07:00
carllin
146bc95c16
Fix sigverify metrics ( #7393 )
...
* Add filter
* Collapse logs
* Fix dashboard
2019-12-10 11:28:07 -08:00
Michael Vines
a33bcac52f
Fix stable metrics graph: "Bank Height / Slot Distance ($hostid)"
2019-12-09 22:56:54 -07:00
Sagar Dhawan
79199711b8
Add gpu resource usage tracking ( #7075 )
2019-11-21 08:33:02 -08:00
Sagar Dhawan
100b9dd12a
Fix num nodes metrics ( #7068 )
...
* Fix num nodes metric
* Fix node count metrics
2019-11-20 17:00:31 -08:00
carllin
70322d1ff8
Add error logging to dead slots ( #7000 )
2019-11-16 02:54:51 -08:00
Sagar Dhawan
cab2232aba
Fix System Stats script ( #6985 )
...
automerge
2019-11-15 13:25:40 -08:00
Sagar Dhawan
3ce6248f8c
Add CPU and RAM usage to Metrics ( #6968 )
...
* Add CPU usage to Metrics
* Add RAM usage and rename to system-stats
* Shellcheck
* Remove SC exception
* Address review comments
2019-11-14 20:36:34 -08:00
Michael Vines
56fc15f44d
Fix units on dead slots graph
2019-11-07 08:26:13 -07:00
carllin
0559212df7
log bench ( #6643 )
2019-10-30 19:51:44 -07:00
carllin
59f3dc3b6b
Fix PohRecorder Metrics ( #6644 )
...
* Update Poh Recorder Dashboard
* Update PohRecorder logging
2019-10-30 18:55:29 -07:00
Michael Vines
e738019c48
Add Ramp TPS table
2019-10-29 16:18:58 -07:00
Pankaj Garg
2747c9db23
Fix metrics dashboard layout ( #6419 )
2019-10-17 13:39:50 -07:00
carllin
ccb4e32ee0
ReplayStage metrics ( #6358 )
...
* ReplayStage metrics
* Add more metrics
* Refactor get_slot_entries_with_shred_count() to detect wasted work
* Update dashboard
* Update broadcast slots to micros
* Add broadcast dashboard
2019-10-16 14:32:18 -07:00
Greg Fitzgerald
322fcea6e5
More fullnode to validator renaming ( #6337 )
2019-10-11 13:30:52 -06:00
Justin Starry
8d0236e3f1
Rename bank height to block_height and expose method ( #6199 )
...
* Rename bank bank_height to block_height
* Expose block_height method
2019-10-01 14:55:39 -04:00
carllin
701d90a41d
Remove some AccountStorage Serialization ( #6047 )
...
* Remove serialization of AccountStorageEntry fields
* Add metric for evaluating BankRc serialization time
* Serialize AppendVec current len
* Add dashboard metrics
* Move flush of AppendVecs to packaging thread
2019-09-25 18:07:41 -07:00
Pankaj Garg
e2a5ec9cd2
Change formula used in erasure statistics graph ( #6102 )
...
automerge
2019-09-25 14:57:16 -07:00
Pankaj Garg
de3765ab70
Add erasure recovery stats to dashboard ( #6079 )
...
automerge
2019-09-24 19:59:42 -07:00
Michael Vines
9d805dfc59
Tweak Bank Slot Distance graph
2019-09-24 14:52:29 -07:00
Michael Vines
1aec9e38fa
Restore default time range to now-5m
2019-09-16 08:45:27 -07:00
Michael Vines
c490a50c91
Restore blocktree_error graph
2019-09-14 21:41:48 -07:00
Michael Vines
140d4ccf77
Add dead slot table to stablity section
2019-09-14 20:54:23 -07:00
Michael Vines
1609765740
Adjust snapshot metrics layout
2019-08-27 20:56:15 -07:00
Michael Vines
0c62cf8980
Add metrics for snapshot generation ( #5677 )
2019-08-27 13:04:20 -07:00
Michael Vines
81bb208a62
Add open file descriptor monitoring ( #5655 )
2019-08-26 15:17:19 -07:00
anatoly yakovenko
4fa795b026
bank slot distance ( #5545 )
2019-08-16 15:00:12 -07:00
Michael Vines
cd89f280b7
Remove decimal point from node count
2019-08-11 09:28:59 -07:00
Justin Starry
12bb05c320
Fix dashboard mean tx/s stat ( #5455 )
2019-08-07 16:50:58 -04:00
Tyera Eulberg
efd09ecd37
Revert fork metrics ( #5427 )
...
* Revert "Remove duplicate row (#5419 )"
This reverts commit a81dd80d60
.
* Revert "Log fork stake-percentage in metrics, and display (#5406 )"
This reverts commit 92e419f1c7
.
2019-08-05 15:53:36 -07:00
Tyera Eulberg
a81dd80d60
Remove duplicate row ( #5419 )
2019-08-05 11:45:52 -06:00
Tyera Eulberg
92e419f1c7
Log fork stake-percentage in metrics, and display ( #5406 )
...
* Log fork stake percentage data
* Add fork stake percentage to dashboard
* Call out parent slot
2019-08-02 19:16:23 -06:00
Justin Starry
43cc9fcb1d
Update mean tx/s to use the correct counter ( #5390 )
2019-08-01 15:30:36 -04:00
Justin Starry
47b5ba44e9
Add tag suffix to remaining metrics host_id queries ( #5388 )
2019-08-01 14:43:13 -04:00
Justin Starry
e95397e0a8
Clarify that host_id is a tag in metrics influx queries ( #5387 )
2019-08-01 14:34:07 -04:00
Justin Starry
a41c7451f1
Add testnet prefix to the metrics queries without it ( #5376 )
2019-07-31 21:07:25 -04:00
Justin Starry
84368697af
Fix metrics when leader does not report metrics ( #5291 )
2019-07-30 16:18:33 -04:00
Greg Fitzgerald
6c49b10784
Purge remaining uses of Locktower ( #5076 )
...
automerge
2019-07-13 00:24:15 -07:00
Dan Albert
b4aebbd991
Increment InfluxDB to 1.7 ( #4931 )
2019-07-03 17:44:49 -07:00
anatoly yakovenko
29611fb61d
tower consensus naming ( #4598 )
...
s/locktower/tower/g
2019-06-24 13:41:23 -07:00
Michael Vines
8628f33d0b
Fix HostId field in the testnet dashboard
2019-06-13 16:09:09 -07:00
Michael Vines
e1444a9b00
Add curl retries
2019-06-13 15:05:07 -07:00