Commit Graph

91 Commits

Author SHA1 Message Date
HaoranYi a26fe6d1be
fix node count query (#28259)
* fix node count query

* fix column name
2022-10-06 11:39:39 -05:00
sakridge 14bc55eddb
Use net-stats-validator metric on the dashboard (#25171) 2022-05-12 20:20:20 +00:00
Haoran Yi 7de339cb5c remove vote instruction counter query from grafana dashboard configuration 2022-04-26 14:38:44 -05:00
joeaba 3b9654771d
change metrics to internal-metrics 2022-01-15 00:08:45 +05:30
Justin Starry 85eb37fab0
Merge pull request from GHSA-8v47-8c53-wwrc
* Track transaction check time separately from account loads

* banking packet process metrics

* Remove signature clone in status cache lookup

* Reduce allocations when converting packets to transactions

* Add blake3 hash of transaction messages in status cache

* Bug fixes

* fix tests and run fmt

* Address feedback

* fix simd tx entry verification

* Fix rebase

* Feedback

* clean up

* Add tests

* Remove feature switch and fall back to signature check

* Bump programs/bpf Cargo.lock

* clippy

* nudge benches

* Bump `BankSlotDelta` frozen ABI hash`

* Add blake3 to sdk/programs/Cargo.lock

* nudge bpf tests

* short circuit status cache checks

Co-authored-by: Trent Nelson <trent@solana.com>
2021-04-13 00:28:08 -06:00
sakridge ff2242d0e3
Update dashboard with new poh metrics (#15991) 2021-03-22 11:38:57 -07:00
sakridge 391a1ea760
Add panel to show slot production (#14615) 2021-01-17 08:35:01 -08:00
sakridge f0e9341450
Add dropped hash to dashboard (#13972) 2020-12-05 18:02:01 -08:00
Tyera Eulberg be7c30df35
Update Current Leader query (#12465) 2020-09-24 20:42:30 -06:00
sakridge 4bd0ced87d
Back out dashboard (#11791) 2020-08-21 21:48:37 -07:00
sakridge b10f874f49
Reorganize dashboard (#11639)
Split out sections for gossip, repair, leader.
2020-08-14 14:42:33 -07:00
sakridge bb8ccbb022
Add retransmit stats (#11588) 2020-08-12 12:26:14 -07:00
carllin a7ea340f22
Track votes from gossip for optimistic confirmation (#11209)
* Add check in cluster_info_vote_listenere to see if optimstic conf was achieved
Add OptimisticConfirmationVerifier

* More fixes

* Fix merge conflicts

* Remove gossip notificatin

* Add dashboards

* Fix rebase

* Count switch votes as well toward optimistic conf

* rename

Co-authored-by: Carl <carl@solana.com>
2020-07-28 09:33:27 +00:00
carllin 7a14e359d7
Switch to using weighted repair in RepairService (#10735)
* Plumb votes into repair service

* Remove refactoring

* Fix tests

* Switch to using RepairWeight for generating repairs

* Revert "Weight repair slots based on vote stake (#10741)"

This reverts commit cabd0a09c3.

* Update logging

Co-authored-by: Carl <carl@solana.com>
2020-07-10 05:52:54 +00:00
sakridge ea708b0d84
Add blockstore insert metrics to dashboard (#10787) 2020-06-24 18:21:26 -07:00
sakridge 4e04282564
Add correct erasure metrics to dashboard (#10781) 2020-06-24 13:16:56 -07:00
sakridge b172b3369e
Add new split poh/transaction verify times to dashboard (#10706) 2020-06-18 22:21:11 -07:00
Michael Vines 769f456a22 Add RPC Send Transaction Service graphs 2020-06-10 18:14:06 -07:00
sakridge d59fd7d512
Add gossip/poh to dashboard (#10449) 2020-06-09 10:25:10 -07:00
Michael Vines 156387aba4
LedgerCleanupService no longer causes an OOM and actually purges (#10199)
* cleanup_ledger() now services new_root_receiver while purging
* purge_slots() now fully deletes before compacting
* Add ledger pruning grafana graph
2020-05-24 21:41:54 -07:00
sakridge 9222bc2b35
Add 30s option to metrics. (#10081)
10s too short, 1m too long, 30s just right.
2020-05-16 10:46:17 -07:00
carllin 01ab1d1369
Add metrics for logging time taken in replaystage steps (#9933)
automerge
2020-05-08 03:46:29 -07:00
Michael Vines 75b25e33f6
Adjust dashboard time range 2020-04-19 09:24:42 -07:00
Michael Vines dbaebe101c
Clean up metrics dashboard 2020-04-17 18:08:59 -07:00
carllin 7aa4d401f7
Fix broadcast metrics (#9461)
* Rework broadcast metrics to support multiple threads

* Update dashboards

Co-authored-by: Carl <carl@solana.com>
2020-04-15 15:22:16 -07:00
carllin 4522e85ac4
Add Metrics/Dashboards tracking block production (#9342)
* Add metric tracking blocks/dropped blocks

Co-authored-by: Carl <carl@solana.com>
2020-04-08 14:35:24 -07:00
sakridge c67703e7a3
Make repair metrics less chatty (#9094) 2020-03-29 14:43:58 -07:00
Michael Vines 35dd52e9ba
Remove SLP from grafana 2020-03-24 12:23:30 -07:00
Michael Vines 5f5824d78d Rework cluster metrics dashboard to support the modern clusters 2020-03-11 14:14:56 -07:00
carllin d3712dd26d Factor repair from gossip (#8044) 2020-02-11 13:11:48 -07:00
carllin fe590da3b6
Revert "Factor repair from gossip (#8044)" (#8143)
This reverts commit e61257695f.
2020-02-06 11:44:20 -08:00
carllin e61257695f
Factor repair from gossip (#8044) 2020-01-31 14:23:50 -08:00
Justin Starry 87598c7612 Consolidate tx error counters and update metrics dashboard (#7724)
automerge
2020-01-16 23:26:50 -08:00
Greg Fitzgerald b5dba77056 Rename blocktree to blockstore (#7757)
automerge
2020-01-13 13:13:52 -08:00
Ryo Onodera 865c42465a
Cap file size for snapshot data files (#7182)
* save limit deserialize

* save

* Save

* Clean up

* rustfmt

* rustfmt

* Just comment out to please CI

* Fix ci...

* Move code

* Rustfmt

* Crean up control flow

* Add another comment

* Introduce predetermined constant limit on snapshot data files (deserialize side)

* Introduce predetermined constant limit on snapshot data files (serialize side)

* rustfmt

* Tweak message

* Revert dynamic memory limit

* Limit size of snapshot data file (de)serialization

* Fix test breakage

* Clean up

* Fix uses formatting

* Rename: deserialize_{for,from}_snapshot

* Simplify comment

* Use Slot

* Provide slot for status cache

* Align variable name with snapshot_status_cache_file_path

* Define serialize_snapshot_data_file_with_metrics

* Fix build.......

* De-marco serialize_snapshot_data_file_with_metrics

* Revert u64 => Slot
2020-01-10 09:49:36 +09:00
Sagar Dhawan 0ae7e86fcb Dashboard: Move y-axis items with different units either side of the graph (#7573)
automerge
2019-12-19 15:22:43 -08:00
Tyera Eulberg 3513f4ee84
Rename drone to faucet (#7508) 2019-12-16 14:05:17 -07:00
carllin 146bc95c16
Fix sigverify metrics (#7393)
* Add filter

* Collapse logs

* Fix dashboard
2019-12-10 11:28:07 -08:00
Michael Vines a33bcac52f
Fix stable metrics graph: "Bank Height / Slot Distance ($hostid)" 2019-12-09 22:56:54 -07:00
Sagar Dhawan 79199711b8
Add gpu resource usage tracking (#7075) 2019-11-21 08:33:02 -08:00
Sagar Dhawan 100b9dd12a
Fix num nodes metrics (#7068)
* Fix num nodes metric

* Fix node count metrics
2019-11-20 17:00:31 -08:00
carllin 70322d1ff8
Add error logging to dead slots (#7000) 2019-11-16 02:54:51 -08:00
Sagar Dhawan cab2232aba Fix System Stats script (#6985)
automerge
2019-11-15 13:25:40 -08:00
Sagar Dhawan 3ce6248f8c
Add CPU and RAM usage to Metrics (#6968)
* Add CPU usage to Metrics

* Add RAM usage and rename to system-stats

* Shellcheck

* Remove SC exception

* Address review comments
2019-11-14 20:36:34 -08:00
Michael Vines 56fc15f44d
Fix units on dead slots graph 2019-11-07 08:26:13 -07:00
carllin 0559212df7
log bench (#6643) 2019-10-30 19:51:44 -07:00
carllin 59f3dc3b6b
Fix PohRecorder Metrics (#6644)
* Update Poh Recorder Dashboard

* Update PohRecorder logging
2019-10-30 18:55:29 -07:00
Michael Vines e738019c48
Add Ramp TPS table 2019-10-29 16:18:58 -07:00
Pankaj Garg 2747c9db23
Fix metrics dashboard layout (#6419) 2019-10-17 13:39:50 -07:00
carllin ccb4e32ee0
ReplayStage metrics (#6358)
* ReplayStage metrics

* Add more metrics

* Refactor get_slot_entries_with_shred_count() to detect wasted work

* Update dashboard

* Update broadcast slots to micros

* Add broadcast dashboard
2019-10-16 14:32:18 -07:00