Commit Graph

354 Commits

Author SHA1 Message Date
Illia Bobyr 4f7e45bb24
metrics: Submit metrics when exiting. Refactor `MetricsAgent::run()`. (#718)
There are a few minor issues this change addresses:

1. When we send points to the `MetricsWriter` we are calling
   `Instant::now()` twice, using the first result in the metrics stats,
   and using the seconds value for `last_write_time`.  Yet, on the next
   upload, we would use `last_write_time` as a reference point.

   We upload metrics using a network call, so it is far from
   instantaneous.  This creates a minor discrepancy in our time
   reporting.

   Good news is that we do not really need to call `Instant::now()`
   twice at all, as we can use the same value for both stats and
   `last_write_time`.

2. We did not report metrics stats if we did not have any points
   accumulated.  It seems better to always report metric stats,
   including when no points have been accumulated.  In practice, this
   does not happen for the validator, as validators always report
   something during a 10-second accumulation interval.

3. We did not upload any points when the metrics thread was existing.
   This may cause a short number of metrics not to be reported.

4. `collect_points()` was always converting both `points` and `counters`
   into a vector of `DataPoint`, even if the final length was over the
   specified `max_points`.  In the `mainnet-beta` we have values of up
   to 5m points lost, so it could be a small optimization if we drop
   them sooner.
2024-04-11 22:02:44 -07:00
Andrew Fitzgerald 2a9ed3ee92
Remove ability to submit metrics with no fields (#35133) 2024-02-16 11:10:15 -08:00
Bork Bork f8d01df1db
chore(docs): proofreading (#35172)
* proofread

* proofread
2024-02-10 17:46:07 -07:00
joao 05dae592f4
Enhancements to Solana's Documentations for Better Clarity (#34386)
* fix grammatical error in Incident Response Process section

* Fix typo in setup instructions

* Fixed grammatical errors in documentation: corrected article usage, pluralization, and singular form in database reference

* Fix typos in documentation: Correct 'In oder' to 'In order', fix 'enviroment' to 'environment', correct 'reults' to 'results' in Ping Results section, and change 'engress' to 'egress' in CUDA section.

* Fix grammatical errors in documentation

* Fix grammatical issues in documentation

Correct subject-verb agreement in transaction balance description and clarify execution of instructions in a transaction.
2023-12-12 15:26:14 -05:00
Jeff Biseda 0f82662a7f
allow empty string for SOLANA_METRICS_CONFIG sanity checking (#33515) 2023-10-11 09:58:39 -07:00
Alexander Meißner 9e703f85de
Upgrades Rust to 1.72.0 & nightly-2023-08-25 (#32961)
* allow pedantic invalid cast lint

* allow lint with false-positive triggered by `test-case` crate

* nightly `fmt` correction

* adapt to rust layout changes

* remove dubious test

* Use transmute instead of pointer cast and de/ref when check_aligned is false.

* Renames clippy::integer_arithmetic to clippy::arithmetic_side_effects.

* bump rust nightly to 2023-08-25

* Upgrades Rust to 1.72.0

---------

Co-authored-by: Trent Nelson <trent@solana.com>
2023-09-01 07:26:13 +00:00
Alexander Meißner a8be70fa7a
General cleanup (#32980)
* Fixes `cargo::` in build.rs

* Fixes overshadowing reexports.

* Removes unnecessary `as *const u8`.

* Removes unnecessary `.into_iter()`.

* cargo clippy
2023-08-24 21:44:19 +02:00
Jeff Biseda 58cca78067
sanity check metrics configuration (#32799) 2023-08-11 14:38:33 -07:00
Yihau Chen ce929bead2
chore: remove unused code (#32195) 2023-06-20 10:48:44 +08:00
Yihau Chen 021d2cdb71
fix: metrics deploy script (#32074)
fix: cert path
2023-06-12 22:15:09 +08:00
Yihau Chen a52ca5d38e
fix: update metrics deploy script (#32073)
fix: cert path
2023-06-12 21:57:34 +08:00
Yihau Chen 40d1b2409b
ci: update metrics related deploying code (#32072) 2023-06-12 21:44:30 +08:00
Yihau Chen da5c64d9bf
ci: update kapacitor config (#32069)
* add two new kapacitor post endpoints

* use kapacitor.sh to start

* add pager duty channels

* fix shell check
2023-06-12 20:23:44 +08:00
Yihau Chen 228e92a024
ci: update kapacitor.conf (#32007) 2023-06-09 10:51:31 +08:00
axleiro 989e61318b
fix prometheus path reference (#32003)
* fix prometheus path reference

* add additional Discord handlers

* fix trailing whitespaces

---------

Co-authored-by: joeaba <joe@solana.com>
2023-06-07 02:56:55 +00:00
Yihau Chen 557155d34b
ci: fix env name (#31985) 2023-06-06 14:03:11 +00:00
Jeff Biseda 6371240746
Add support for optional metric values via Option<_> (#31975) 2023-06-05 15:01:26 -07:00
axleiro 8628845836
add Discord handler to Kapacitor (#31972) 2023-06-05 11:35:02 -05:00
joeaba 038ca31cfa
Add stackpath development servers (#31949)
Add stackpath servers
2023-06-02 11:21:10 -05:00
Andrew Fitzgerald 5953768cd8
remove lock from metrics agent singleton (#31785) 2023-05-24 12:31:08 -07:00
Will Hickey 3293ca4a52
Add canaries to metrics config (#31517) 2023-05-10 12:12:50 -05:00
joeaba f42955c93e
remove inactive hosts (#31521) 2023-05-05 23:02:09 -05:00
joeaba ed4cc52250
add ssh connection check (#31472) 2023-05-03 20:22:30 -05:00
joeaba 74315d2d62
fix influx status check (#31456) 2023-05-02 23:24:52 -05:00
joeaba f6259fa4b4
add influx servers (#31218) 2023-04-16 19:42:25 -05:00
axleiro 577cc185fd
increase docker mem allocation (#31197) 2023-04-14 03:06:23 -05:00
joeaba 68d57b1b9f
update influx enterprise scripts (#31117)
* update influx enterprise scripts
2023-04-10 09:10:54 -05:00
joeaba 9df1ac9d60
fix session timeout (#31057)
* fix session timeout
2023-04-05 11:12:29 -05:00
joeaba 80b25726e1
update metrics status scripts (#31037)
* update metrics status scripts

* add exit condition
2023-04-04 09:03:57 -05:00
joeaba eedb92a6c0
refactor container status check (#30998)
* refactor container status check

* remove blank line at EOF

* add pagerduty integration

Co-authored-by: axleiro <83293196+axleiro@users.noreply.github.com>

* fix discord webhook reference

* remove webhook references

---------

Co-authored-by: axleiro <83293196+axleiro@users.noreply.github.com>
2023-03-30 22:35:21 -05:00
joeaba ecd3279af9
fix grafana configuration (#30961)
* fix grafana configuration
2023-03-29 13:21:50 -05:00
joeaba 77aac98672
adding metrics deployment scripts (#30926)
* adding metrics deployment scripts

* fix trailing whitespaces

* fix more trailing whitespaces

* fix typos

* fix trailing whitespace

* fix loops

* update env vars

* fix shellcheck source

* add source references
2023-03-28 12:26:26 -05:00
Illia Bobyr c449a15c30
metrics: datapoint macro: Allow trailing comma (#30317)
Rust grammar allows trailing commas in most places where a list of
elements are accepted.  It simplifies cases when the list is generated
by a macro, allowing the macro to avoid special cases for a one element
list vs longer lists.

As such, it is a common practice to allow trailing commas in macros as
well.
2023-03-17 12:57:46 -07:00
Yihau Chen df3ef111f7
chore: workspace inheritance (#29893)
* introduce workspace.package

* introduce workspace.dependencies

* read version from root cargo.toml

* pass check when version = { workspace = true }

* don't bump version when version = { workspace = true }

* including workspace Cargo.toml when bump version

* programs/sbf use workspace inheritance

* fix increasing cargo version ignore program/sbf/Cargo.toml
2023-02-23 22:01:54 +08:00
Michael Vines 5136ed3448
Update homepage value for all crates (#30444) 2023-02-23 02:20:18 +00:00
Will Hickey 04a6a631bc
Bump version to v1.16 (#30028) 2023-01-31 17:48:33 -06:00
joeaba a12bf8c003
Update maintainers references (#29997)
* update maintainers references

* chore: update maintainers reference
2023-01-31 08:07:13 -05:00
Yihau Chen 9193b4221d
Revert "chore: workspace inheritance (#29509)" (#29892)
This reverts commit a67d239dde.
2023-01-25 15:50:41 +08:00
Yihau Chen a67d239dde
chore: workspace inheritance (#29509)
* introduce workspace.package

* introduce workspace.dependencies

* read version from root cargo.toml

* pass check when version = { workspace = true }

* don't bump version when version = { workspace = true }

* including workspace Cargo.toml when bump version

* programs/sbf use workspace inheritance

* fix increasing cargo version ignore program/sbf/Cargo.toml
2023-01-25 13:59:59 +08:00
behzad nouri 5c9beef498
fixes errors from clippy::useless_conversion (#29534)
https://rust-lang.github.io/rust-clippy/master/index.html#useless_conversion
2023-01-05 18:05:32 +00:00
Jon Cinque aa593a31f8
logger: Update to env_logger 0.9.3 (#29510) 2023-01-04 21:35:37 +01:00
behzad nouri 9524c9dbff patches errors from clippy::uninlined_format_args
https://rust-lang.github.io/rust-clippy/master/index.html#uninlined_format_args
2022-12-06 19:32:15 +00:00
HaoranYi a26fe6d1be
fix node count query (#28259)
* fix node count query

* fix column name
2022-10-06 11:39:39 -05:00
dependabot[bot] 2f24a357b6
chore: bump reqwest from 0.11.11 to 0.11.12 (#28003)
* chore: bump reqwest from 0.11.11 to 0.11.12

Bumps [reqwest](https://github.com/seanmonstar/reqwest) from 0.11.11 to 0.11.12.
- [Release notes](https://github.com/seanmonstar/reqwest/releases)
- [Changelog](https://github.com/seanmonstar/reqwest/blob/master/CHANGELOG.md)
- [Commits](https://github.com/seanmonstar/reqwest/compare/v0.11.11...v0.11.12)

---
updated-dependencies:
- dependency-name: reqwest
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

* [auto-commit] Update all Cargo lock files

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: dependabot-buildkite <dependabot-buildkite@noreply.solana.com>
2022-09-23 10:09:27 -06:00
Will Hickey c0e4379f43
Whickey/version v1.15 (#27739)
* Bump version to v1.13.0
* Bump version to v1.14.0
* Bump version to v1.15.0
2022-09-13 09:06:15 -05:00
Michael Vines 3f4731b37f Standardize thread names
Tenets:
1. Limit thread names to 15 characters
2. Prefix all Solana-controlled threads with "sol"
3. Use Camel case. It's more character dense than Snake or Kebab case
2022-08-20 07:49:39 -07:00
dependabot[bot] a0d1f4ef88
chore: bump serial_test from 0.8.0 to 0.9.0 (#27097)
Bumps [serial_test](https://github.com/palfrey/serial_test) from 0.8.0 to 0.9.0.
- [Release notes](https://github.com/palfrey/serial_test/releases)
- [Commits](https://github.com/palfrey/serial_test/compare/v0.8.0...v0.9.0)

---
updated-dependencies:
- dependency-name: serial_test
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-15 15:53:12 -06:00
Will Hickey ed8c224374
Bump version to v1.12 (#26967) 2022-08-06 13:20:30 -05:00
github-actions[bot] fbf1bf6d86
Bump Version to 1.11.6 (#26906)
Co-authored-by: willhickey <willhickey@users.noreply.github.com>
2022-08-03 12:48:43 -05:00
github-actions[bot] 5d038b9d2a
Bump Version to 1.11.5 (#26758)
Co-authored-by: willhickey <willhickey@users.noreply.github.com>
2022-07-25 13:05:14 -06:00