solana

Commit Graph

Author	SHA1	Message	Date
Brooks Prumo	7aa5f6b833	Add CLI args for incremental snapshots (#19694) Add `--incremental-snapshots` flag to enable incremental snapshots. This will allow setting `--full-snapshot-interval-slots` and `--incremental-snapshot-interval-slots`. Also added `--maximum-incremental-snapshots-to-retain`. Co-authored-by: Michael Vines <mvines@gmail.com>	2021-09-10 15:59:26 -05:00
Michael Vines	4386e09710	Reduce wait for supermajority threshold back to 80%	2021-09-09 21:17:35 -07:00
Jeff Washington (jwash)	456bf15012	AccountsIndexConfig -> AccountsDbConfig (#19687)	2021-09-08 04:30:38 +00:00
Jeff Washington (jwash)	d3f938f0cf	Remove Copy from AccountsIndexConfig. Not all types will support it (#19686)	2021-09-07 20:09:40 -05:00
Brooks Prumo	a0552e5b46	Make startup aware of Incremental Snapshots (#19600)	2021-09-07 20:43:43 +00:00
Brooks Prumo	fe8ba81ce6	Rename to is_valid instead of is_invalid (#19670)	2021-09-07 09:31:54 -05:00
Brooks Prumo	9d9482b9d8	Plumb `maximum_incremental_snapshot_archives_to_retain` (#19640)	2021-09-06 18:01:56 -05:00
Brooks Prumo	1828579580	Pass SnapshotConfig to SnapshotPackagerService (#19616)	2021-09-03 21:42:32 +00:00
Brooks Prumo	7ab0aec61f	Rename maximum_full_snapshot_archives_to_retain (#19610) To prepare for adding maximum_incremental_snapshot_archives_to_retain, rename the current field in SnapshotConfig.	2021-09-03 11:28:10 -05:00
Brooks Prumo	e9374d32a3	Revert "Make startup aware of Incremental Snapshots (#19550)" (#19599) This reverts commit `d45ced0a5d`.	2021-09-02 19:14:41 -05:00
Brooks Prumo	d45ced0a5d	Make startup aware of Incremental Snapshots (#19550)	2021-09-02 19:05:15 -05:00
Lijun Wang	8378e8790f	Accountsdb replication installment 2 (#19325) This is the 2nd installment for the AccountsDb replication. Summary of Changes The basic Google Protocol Buffers protocol for replicating updated slots and accounts. tonic/tokio is used for transporting the messages. The basic framework of the client and server for replicating slots and accounts -- the persisting of accounts on the replica side will be done in the next PR -- right now -- the accounts are streamed to the replica-node and dumped. Replication for information about Bank is also not done in this PR -- to be addressed in the next PR to limit the change size. Functionality used by both the client and server side is encapsulated in the replica-lib crate. There is no impact to the existing validator by default. Tests: Observe the confirmed slots replicated to the replica-node. Observe the accounts for the confirmed slot are received at the replica-node side.	2021-09-01 14:10:16 -07:00
Brooks Prumo	fe9ee9134a	Make background services aware of incremental snapshots (#19401) AccountsBackgroundService now knows about incremental snapshots. It is now also in charge of deciding if an AccountsPackage is destined to be a SnapshotPackage or not (or just used by AccountsHashVerifier). !!! New behavior changes !!! Taking snapshots (both bank and archive) MUST succeed. This is required because of how the last full snapshot slot is calculated, which is used by AccountsBackgroundService when calling `clean_accounts()`. File system calls are now unwrapped and will result in a crash. As Trent told me: "Well I think if a snapshot fails due to some IO error, it's very likely that the operator is going to have to intervene before it works. We should exit error in this case, otherwise the validator might happily spin for several more hours, never successfully writing a complete snapshot, before something else brings it down. This would leave the validator's last local snapshot many more slots behind than it would be had we exited outright and potentially force the operator to abandon ledger continuity in favor of a quick catchup" Other errors will set the `exit` flag to `true`, and the node will gracefully shut down. Fixes #19167 Fixes #19168	2021-08-31 18:33:27 -05:00
behzad nouri	8ad52fa095	implements copy-on-write for vote-accounts (#19362) Bank::vote_accounts redundantly clones vote-accounts HashMap even though an immutable reference will suffice: https://github.com/solana-labs/solana/blob/95c998a19/runtime/src/bank.rs#L5174-L5186 This commit implements copy-on-write semantics for vote-accounts by wrapping the underlying HashMap in Arc<...>.	2021-08-30 15:54:01 +00:00
Brooks Prumo	6d939811e9	Name snapshots consistently (#19346 ) #### Problem Snapshot names are overloaded, and there are multiple terms that mean the same thing. This is confusing. Here's a list of ones in the codebase that I've found: ``` - snapshot_dir - snapshots_dir - snapshot_path - snapshot_output_dir - snapshot_package_output_path - snapshot_archives_dir ``` #### Summary of Changes For all the ones that are about the directory where snapshot archives are stored, ensure they are `snapshot_archives_dir`. For the ones about the (bank) snapshots directory, set to `bank_snapshots_dir`. Co-authored-by: Michael Vines <mvines@gmail.com>	2021-08-21 15:41:03 -05:00
Jeff Washington (jwash)	7c70f2158b	accounts_index_bins to AccountsIndexConfig (#19257) * accounts_index_bins to AccountsIndexConfig * rename param bins -> config * rename BINS_FOR* to ACCOUNTS_INDEX_CONFIG_FOR*	2021-08-17 14:50:01 -05:00
Brooks Prumo	f9986c66b8	Make SnapshotPackagerService aware of Incremental Snapshots (#19254) Add a field to SnapshotPackage that is an enum for SnapshotType, so archive_snapshot_package() will do the right thing. Fixes #19166	2021-08-17 13:01:59 -05:00
behzad nouri	7a789e0763	filters for recent contact-infos when checking for live stake (#19204 ) Contact-infos are saved to disk: https://github.com/solana-labs/solana/blob/9dfeee299/gossip/src/cluster_info.rs#L1678-L1683 and restored on validator start-up: https://github.com/solana-labs/solana/blob/9dfeee299/core/src/validator.rs#L450 Staked nodes entries will not expire until an epoch after. So when the validator checks for online stake it is erroneously picking up contact-infos restored from disk, which breaks the entire wait-for-supermajority logic: https://github.com/solana-labs/solana/blob/9dfeee299/core/src/validator.rs#L1515-L1561 This commit adds an extra check for the age of contact-info entries and filters out old ones.	2021-08-13 12:12:40 +00:00
Jeff Washington (jwash)	e91988c977	cli for num account index bins (#19085)	2021-08-11 11:45:25 -05:00
Michael Vines	7ddda30126	`solana-test-validator` now uses FileTowerStorage	2021-08-11 00:20:46 -07:00
Michael Vines	e9722474eb	Move tower storage into its own module	2021-08-11 00:20:46 -07:00
Brooks Prumo	fd937548a0	Move SnapshotArchiveInfo and friends into its own module (#19114)	2021-08-08 07:57:06 -05:00
Brooks Prumo	00890957ee	Add snapshot_utils::bank_from_latest_snapshot_archives() (#18983) While reviewing PR #18565, an issue was brought up to refactor some code around verifying the bank after rebuilding from snapshots. A new top-level function has been added to get the latest snapshot archives and load the bank then verify. Additionally, new tests have been written and existing tests have been updated to use this new function. Fixes #18973 While resolving the issue, it became clear there was some additional low-hanging fruit this change enabled. Specifically, the functions `bank_to_xxx_snapshot_archive()` now return their respective `SnapshotArchiveInfo`. And on the flip side, `bank_from_snapshot_archives()` now takes `SnapshotArchiveInfo`s instead of separate paths and archive formats. This bundling simplifies bank rebuilding.	2021-08-06 20:16:06 -05:00
Michael Vines	397801a2d8	Extract tower storage details from Tower struct	2021-08-06 10:04:37 -07:00
Jeff Washington (jwash)	14361906ca	for all tests, bank::new -> bank::new_for_tests (#19064)	2021-08-05 08:42:38 -05:00
Jeff Washington (jwash)	3280ae3e9f	add validator option --accounts-db-skip-shrink (#19028) * add validator option --accounts-db-skip-shrink * typo	2021-08-04 17:28:33 -05:00
Brooks Prumo	ca14475085	Add incremental_snapshot_archive_interval_slots to SnapshotConfig (#19026) This commit also renames `snapshot_interval_slots` to `full_snapshot_archive_interval_slots`, updates the comments on the fields, and makes appropriate updates where SnapshotConfig is used.	2021-08-04 14:40:20 -05:00
Trent Nelson	71f6d839f9	validator: remove disused cuda config argument	2021-07-29 03:08:52 +00:00
Trent Nelson	8ed0cd0fff	validator: check target CPU features earlier	2021-07-29 03:08:52 +00:00
Trent Nelson	c435f7b3e3	validator: add avx2 runtime check	2021-07-29 03:08:52 +00:00
Trent Nelson	e641f257ef	test-validator: move feature check earlier in startup	2021-07-29 03:08:52 +00:00
Trent Nelson	59641623d1	Improve check for Apple M1 silicon under Rosetta	2021-07-29 03:08:52 +00:00
Jack May	f1b9f97aef	remove avx error on macos (#18923)	2021-07-27 16:34:04 -07:00
behzad nouri	d2d5f36a3c	adds validator flag to allow private ip addresses (#18850)	2021-07-23 15:25:03 +00:00
Brooks Prumo	d1debcd971	Add incremental snapshot utils (#18504) This commit adds high-level functions for creating and loading-from incremental snapshots, plus all low-level functions required to perform those tasks. This commit does not add taking incremental snapshots as part of a running validator, nor starting up a node with an incremental snapshot; just laying ground work. Additionally, `snapshot_utils` and `serde_snapshot` have been refactored to use common code paths for the different snapshots. Also of note, some renaming has happened: 1. Snapshots are now either `full_` or `incremental_` throughout the codebase. If not specified, the code applies to both. 2. Bank snapshots now are called "bank snapshots" (before they were called "slot snapshots", "bank snapshots", or just "snapshots"). The one exception is within `Bank`, where they are still just "snapshots", because they are already "bank snapshots". 3. Snapshot archives now have `_archive` in the code. This should clear up an ambiguity between bank snapshots and snapshot archives.	2021-07-22 14:40:37 -05:00
sakridge	7f2254225e	Move entry/poh to own crate to speed up poh bench build (#18225)	2021-07-14 14:16:29 +02:00
Tao Zhu	b6dff12923	update ledger tool to restore cost table from blockstore (#18489) * update ledger tool to restore cost model from blockstore when compute-slot-cost * Move initialize_cost_table into cost_model, so the function can be tested and shared between validator and ledger-tool * refactor and simplify a test	2021-07-07 23:44:51 -05:00
Michael Vines	b6792a3328	Add ability to change the validator identity at runtime	2021-07-01 17:50:04 -07:00
Tao Zhu	5e424826ba	Persist cost table to blockstore (#18123) * Add `ProgramCosts` Column Family to blockstore, implement LedgerColumn; add `delete_cf` to Rocks * Add ProgramCosts to the compaction exclusion list alongside TransactionStatusIndex in one place: `excludes_from_compaction()` * Write cost table to blockstore after `replay_stage` replayed active banks; add stats to measure persist time * Delete programs from `ProgramCosts` in blockstore when they are removed from cost_table in memory * Only try to persist to blockstore when cost_table is changed. * Restore cost table during validator startup * Offload `cost_model` related operations from replay main thread to dedicated service thread, add channel to send execute_timings between these threads; * Move `cost_update_service` to its own module; replay_stage is now decoupled from cost_model.	2021-07-01 11:32:41 -05:00
Brooks Prumo	89a3e4f91e	Move SnapshotConfig into its own module (#18331) Also move ArchiveFormat to snapshot_utils, and do not reexport SnapshotVersion.	2021-07-01 08:55:26 -05:00
Trent Nelson	d269975784	Revert "Clean up build warning" This reverts commit `17a173ebb5`.	2021-06-24 19:57:52 -06:00
behzad nouri	598093b5db	adds shred-version to ip-echo-server response When starting a validator, the node initially joins gossip with shred_version = 0, until it adopts the entrypoint's shred-version: https://github.com/solana-labs/solana/blob/9b182f408/validator/src/main.rs#L417 Depending on the load on the entrypoint, adopting the entrypoint's shred-version through gossip sometimes becomes very slow, and causes several problems in gossip because we have to partially support shred_version == 0, which is a source of leaking crds values from one cluster to another. e.g. see https://github.com/solana-labs/solana/pull/17899 and the other linked issues there. In order to remove shred_version == 0 from gossip, this commit adds shred-version to ip-echo-server response. Once the entrypoints are updated, on validator start-up, if --expected_shred_version is not specified we will obtain shred-version from the entrypoint using ip-echo-server.	2021-06-21 19:37:16 +00:00
Alexander Meißner	6514096a67	chore: cargo +nightly clippy --fix -Z unstable-options	2021-06-18 10:42:46 -07:00
Michael Vines	fa04531c7a	Extricate RpcCompletedSlotsService from RetransmitStage	2021-06-16 16:20:35 -07:00
Trent Nelson	5bc6c89adc	validator: run poh speed test earlier in start up	2021-06-16 21:27:08 +00:00
Lijun Wang	269d995832	Make account shrink configurable #17544 (#17778) 1. Added both options for measuring space usage using total accounts usage and for individual store shrink ratio using an enum. Validator CLI options: --accounts-shrink-optimize-total-space and --accounts-shrink-ratio 2. Added code for selecting candidates based on total usage in a separate function select_candidates_by_total_usage 3. Added unit tests for the new functions added 4. The default implementation is kept at 0.8 shrink ratio with --accounts-shrink-optimize-total-space set to true Fixes #17544	2021-06-09 21:21:32 -07:00
Tao Zhu	ae27fcbcda	replay stage feed back program cost (#17731) * replay stage feeds back realtime per-program execution cost to cost model; * program cost execution table is initialized into empty table, no longer populated with hardcoded numbers; * changed cost unit to microsecond, using value collected from mainnet; * add ExecuteCostTable with fixed capacity for security concern, when its limit is reached, programs with old age AND less occurrence will be pushed out to make room for new programs.	2021-06-09 17:10:59 -05:00
Tyera Eulberg	544b3c0d17	Create solana-poh and move remaining rpc modules to solana-rpc (#17698) * Create solana-poh crate * Move BigTableUploadService to solana-ledger * Add solana-rpc to workspace * Move dependencies to solana-rpc * Move remaining rpc modules to solana-rpc * Single use statement solana-poh * Single use statement solana-rpc	2021-06-04 09:23:06 -06:00
Tyera Eulberg	3a647c4bea	Rename ValidatorExit and move to sdk (#17728)	2021-06-04 03:06:13 +00:00
carllin	96ba2edfeb	Switch EpochSlots to be frozen slots, not completed slots (#17168)	2021-06-03 00:20:00 +00:00
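
Commit 8ad52fa095 above describes copy-on-write semantics obtained by wrapping a HashMap in Arc<...>. The general technique can be sketched as follows. This is only an illustration: the `VoteAccounts` struct here, its `String` keys, its `u64` stake values, and its method names are hypothetical stand-ins and are not taken from the Solana codebase.

```rust
use std::collections::HashMap;
use std::sync::Arc;

/// Hypothetical stand-in for a vote-accounts container (not the real Solana type).
#[derive(Clone, Default)]
struct VoteAccounts {
    // Cloning the struct only bumps the Arc reference count; the map is shared.
    inner: Arc<HashMap<String, u64>>,
}

impl VoteAccounts {
    /// Reads go through the shared map and never copy it.
    fn get(&self, key: &str) -> Option<u64> {
        self.inner.get(key).copied()
    }

    /// Writes call Arc::make_mut, which clones the map only when another
    /// handle still points at the same allocation (copy-on-write).
    fn insert(&mut self, key: &str, stake: u64) {
        Arc::make_mut(&mut self.inner).insert(key.to_string(), stake);
    }

    /// True when both handles still point at the same underlying map.
    fn shares_storage_with(&self, other: &Self) -> bool {
        Arc::ptr_eq(&self.inner, &other.inner)
    }
}
```

With this shape, readers that only need an immutable view can clone the handle cheaply, and the cost of copying the map is paid only by a writer that mutates while other handles are still alive.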


333 Commits