solana

Commit Graph

Author	SHA1	Message	Date
Jeff Washington (jwash)	47716a5e01	async hash verify on load (#26208 ) * verify accounts hash in bg on startup * fix some tests and loading from genesis * add extra state for when background thread has completed	2022-07-15 14:29:56 -05:00
Jeff Washington (jwash)	602da5e51f	add accounts db config to bank tests (#26517 )	2022-07-10 19:42:06 -05:00
Brooks Prumo	9ec38a3191	Cleanup snapshot integration tests (#26390 )	2022-07-05 09:23:23 -05:00
Brooks Prumo	877fedadac	Remove StatusCacheRc type and use StatusCache directly (#26184 )	2022-06-24 08:38:56 -05:00
Brooks Prumo	23c50a2389	Add StatusCache::root_slot_deltas() and use it (#26170 )	2022-06-23 15:19:06 -05:00
Yueh-Hsuan Chiang	5b67960c76	(Refactor) Move blocktore options related stuff to blockstore_options.rs (#25509 ) #### Problem blockstore_db.rs has a mutual dependency between blockstore_metrics.rs. #### Summary of Changes This PR removes the mutual dependency by moving the option-related stuff out from blockstore_db.rs to its new home --- blockstore_options.rs. By doing this, we address the mutual dependency and also make the code cleaner.	2022-05-26 16:59:26 -07:00
DimAn	2fa9bc3e70	Add options to store full and/or incremental snapshots in separate locations (#24247 )	2022-05-10 16:37:41 -04:00
Justin Starry	4e58b3870c	Update all BankForks methods to return owned values (#24801 )	2022-04-28 18:51:00 +00:00
behzad nouri	454ef38e43	caches StakeAccount instead of Delegation in Stakes The commit makes values in stake_delegations map in Stakes struct generic. Stakes<Delegation> is equivalent to the old code and is used for backward compatibility in BankFieldsTo{Serialize,Deserialize}. But banks cache Stakes<StakeAccount> which includes the entire stake account and StakeState deserialized from account. Doing so, will remove the need to load stake account from accounts-db when working with stake-delegations.	2022-04-21 15:28:41 +00:00
Michael Vines	0e2e0c8b7d	Extract most storage-related services from the Tvu abstraction	2022-04-19 19:23:58 -07:00
Jeff Washington (jwash)	210f6a6fab	move hash calculation out of acct bg svc (#23689 ) * move hash calculation out of acct bg svc * pr feedback	2022-04-08 10:42:03 -05:00
Jeff Washington (jwash)	c27150b1a3	reserialize_bank_fields_with_hash (#23916 ) * reserialize_bank_with_new_accounts_hash * Update runtime/src/serde_snapshot.rs Co-authored-by: Brooks Prumo <brooks@prumo.org> * Update runtime/src/serde_snapshot/tests.rs Co-authored-by: Brooks Prumo <brooks@prumo.org> * Update runtime/src/serde_snapshot/tests.rs Co-authored-by: Brooks Prumo <brooks@prumo.org> * pr feedback Co-authored-by: Brooks Prumo <brooks@prumo.org>	2022-04-07 14:05:57 -05:00
Jeff Washington (jwash)	550ca7bf92	compare contents of serialized banks instead of exact file format (#24141 ) * compare contents of serialized banks instead of exact file format * Update runtime/src/snapshot_utils.rs Co-authored-by: Brooks Prumo <brooks@prumo.org> * Update runtime/src/snapshot_utils.rs Co-authored-by: Brooks Prumo <brooks@prumo.org> * pr feedback * get rid of clone * pr feedback Co-authored-by: Brooks Prumo <brooks@prumo.org>	2022-04-06 21:55:44 -05:00
Jeff Washington (jwash)	fddd162645	reserialize bank in ahv by first writing to temp file in abs (#23947 )	2022-04-06 21:39:26 -05:00
Brooks Prumo	c322842257	Replace channel with Mutex<Option> for AccountsPackage (#24013 )	2022-04-06 05:47:19 -05:00
Jeff Washington (jwash)	c24de17278	remove index hash calculation as an option (#23928 )	2022-03-25 15:32:53 -05:00
Jeff Washington (jwash)	db5d68f01f	HashCalc uses self.accounts_hash_cache_path (#23882 )	2022-03-24 09:31:55 -05:00
Yueh-Hsuan Chiang	ae75b1a25f	(LedgerStore) Add compression type (#23578 ) This PR adds `--rocksdb-ledger-compression` as a hidden argument to the validator for specifying the compression algorithm for TransactionStatus. Available compression algorithms include `lz4`, `snappy`, `zlib`. The default value is `none`. Experimental results show that with lz4 compression, we can achieve ~37% size-reduction on the TransactionStatus column family, or ~8% size-reduction of the ledger store size.	2022-03-22 02:27:09 -07:00
Yueh-Hsuan Chiang	f999eef452	(LedgerStore) Rename BlockstoreAdvancedOptions to LedgerColumnOptions (#23764 ) This PR renames BlockstoreAdvancedOptions to LedgerColumnOptions, as we will pass-down this struct to LedgerColumn to allow it to perform metric reporting.	2022-03-18 11:13:35 -07:00
Yueh-Hsuan Chiang	1e20bd8f9a	(LedgerStore) Include storage type as a tag in RocksDB metric reporting (#23523 ) #### Summary of Changes This PR further enables group by operation on storage type in blockstore_rocksdb_cfs metrics. Such group-by allows us to further compare the performance metrics between rocks-level and rocks-fifo. To make things extensible, this PR introduces BlockstoreAdvancedOptions and move shred_storage_type. All fields in BlockstoreAdvancedOptions will support group-by operation in blockstore_rocksdb_cfs. Dependency: #23580	2022-03-11 15:17:34 -08:00
Brooks Prumo	9bbccbe27c	Use `AsRef<Path>` instead of `PathBuf` for parameters (#23560 )	2022-03-09 16:08:33 -06:00
HaoranYi	a1c45d5acb	typo (#23564 )	2022-03-09 15:11:21 -06:00
HaoranYi	181fffb916	rename status filename to be consistent (#23501 )	2022-03-07 17:34:35 +00:00
Yueh-Hsuan Chiang	b8b7163b66	(Ledger Store) Report RocksDB Column Family Metrics (#22503 ) This PR enables blockstore to periodically report RocksDB column family properties. The reported properties are under blockstore_rocksdb_cfs, and the properties also support group by operation on cf_name.	2022-03-05 16:13:03 -08:00
HaoranYi	4f0070a5c6	unittest for bind two consecutive ports (#23008 ) * minor fix of comments in fork-selection tests * fix doc link * add unittest for bind_two_consecutive_in_range	2022-03-02 09:10:29 -06:00
Yueh-Hsuan Chiang	8c872e9ce0	(LedgerStore/FIFO) Refactor FIFO options and sanity check. (#23131 )	2022-02-19 00:58:38 -08:00
Yueh-Hsuan Chiang	1add82aa9e	(Ledger Store Benchmark) Add flags for checking shred storage size. (#22451 )	2022-02-18 19:35:28 -08:00
Yueh-Hsuan Chiang	824446710b	(Ledger Store Benchmark) Enable FIFO compmaction in the benchmark. (#22162 )	2022-02-12 00:43:25 -08:00
Yueh-Hsuan Chiang	f8db314134	(Ledger Store Benchmark) Display storage size of all data shreds (#22445 ) * (Ledger Store) APIs for obtaining physical size of all data and coding shreds * (Ledger Store Benchmark) Display total data shred storage size.	2022-01-19 19:33:08 -08:00
Jeff Biseda	8b66625c95	convert std::sync::mpsc to crossbeam_channel (#22264 )	2022-01-11 02:44:46 -08:00
Yueh-Hsuan Chiang	5771c36d3f	Rename open_with_access_type() to open_with_options() (#22123 )	2022-01-07 12:11:43 -08:00
Yueh-Hsuan Chiang	47b74e28ec	Add CLEANUP_SERVICE flag to ledger cleanup benchmark (#22108 )	2022-01-05 23:46:02 -08:00
Yueh-Hsuan Chiang	44d61465f1	(Ledger store benchmark - 3/N) Add comments about the benchmark and its arguments (#22160 ) * Avoid shred generation in ledger_cleanup.rs * Update comment for test_ledger_cleanup_compaction to include benchmark information.	2022-01-04 23:35:55 -10:00
Yueh-Hsuan Chiang	e201b41341	Avoid shred generation in ledger_cleanup.rs (#22091 )	2022-01-04 15:29:43 -10:00
Yueh-Hsuan Chiang	557d35ec79	Remove duplicate code in ledger_cleanup_compaction_test (#22204 )	2021-12-31 11:19:33 -05:00
Yueh-Hsuan Chiang	f479ab7af2	ledger_cleanup test improvement (1/N) -- make the test lockless and simplify the logic (#22090 )	2021-12-30 20:18:47 -10:00
steviez	e83ca4bb28	Clean up test_ledger_cleanup_compaction prints (#21875 ) - Use info!()/warn!() over println!()/eprintln!() - Make status prints consistent - Add default RUST_LOG filter to see test printouts - Adjust reported data to show shreds and rates we care about	2021-12-16 11:24:29 -06:00
steviez	4a9d7318d1	Rework test parameters to be shreds instead of entries (#21780 ) The number of shreds that result from a given number of entries is variable and in our test case, somewhat unintuitive to think about when trying to determine how much data we're pushing into the blockstore. So, this change converts the unit of test parameters from entries to shreds. This change also cleans up some variable naming for clarity and prints.	2021-12-13 23:34:43 -06:00
Yueh-Hsuan Chiang	65194c7ae8	Add NUM_WRITERS to ledger_cleanup to enable multiple writers. (#21729 ) Summary: * Add NUM_WRITERS to ledger_cleanup to enable multiple writers. (Note that our insert_shreds() is still single threaded because it has a lock that limits only one writer at a time.) * Make pre-generated slots more performent by directly inserting into the shared queue. Otherwise, the main-thread which prepares the slots will be slower than the writers. * Correct the shred insertion time -- before this diff it did not wait for joining all writer threads.	2021-12-10 09:42:51 -08:00
Michael Vines	b8837c04ec	Reformat imports to a consistent style for imports rustfmt.toml configuration: imports_granularity = "One" group_imports = "One"	2021-12-03 09:19:13 -08:00
Michael Vines	ba9dfa0d22	Remove frozen account support	2021-11-29 08:38:11 -08:00
sakridge	a8d78e89d3	Move test-validator to own module to reduce core dependencies (#20658 ) * Move test-validator to own module to reduce core dependencies * Fix a few TestValidator paths * Use solana_test_validator crate for solana_test_validator bin * Move client int tests to separate crate Co-authored-by: Tyera Eulberg <tyera@solana.com>	2021-10-29 01:27:07 +00:00
Jeff Washington (jwash)	95e91a4863	disable gossip publish of snapshots when using filler accts (#20824 )	2021-10-20 18:07:29 +00:00
Michael Vines	7027d56064	Resolve nightly-2021-10-05 clippy complaints	2021-10-06 10:37:58 -07:00
Lijun Wang	fe97cb2ddf	AccountsDb plugin framework (#20047 ) Summary of Changes Create a plugin mechanism in the accounts update path so that accounts data can be streamed out to external data stores (be it Kafka or Postgres). The plugin mechanism allows Data stores of connection strings/credentials to be configured, Accounts with patterns to be streamed PostgreSQL implementation of the streaming for different destination stores to be plugged in. The code comprises 4 major parts: accountsdb-plugin-intf: defines the plugin interface which concrete plugin should implement. accountsdb-plugin-manager: manages the load/unload of plugins and provide interfaces which the validator can notify of accounts update to plugins. accountsdb-plugin-postgres: the concrete plugin implementation for PostgreSQL The validator integrations: updated streamed right after snapshot restore and after account update from transaction processing or other real updates. The plugin is optionally loaded on demand by new validator CLI argument -- there is no impact if the plugin is not loaded.	2021-09-30 14:26:17 -07:00
Pavel Strakhov	65227f44dc	Optimize RPC pubsub for multiple clients with the same subscription (#18943 ) * reimplement rpc pubsub with a broadcast queue * update tests for new pubsub implementation * fix: fix review suggestions * chore(rpc): add additional pubsub metrics * integrate max subscriptions check into SubscriptionTracker to reduce locking * separate subscription control from tracker * limit memory usage of items in pubsub broadcast queue, improve error handling * add more pubsub metrics * add final count metrics to pubsub * add metric for total number of subscriptions * fix small review suggestions * remove by_params from SubscriptionTracker and add node_progress_watchers map instead * add subscription tracker tests * add metrics for number of pubsub notifications as a counter * ignore clippy lint in TokenCounter * fix underflow in token counter * reduce queue capacity in pubsub tests * fix(rpc): fix test timeouts * fix race in account subscription test * Add RpcSubscriptions::new_for_tests Co-authored-by: Pavel Strakhov <p.strakhov@iconic.vc> Co-authored-by: Nikita Podoliako <n.podoliako@zubr.io> Co-authored-by: Tyera Eulberg <tyera@solana.com>	2021-09-17 13:40:14 -06:00
Jeff Washington (jwash)	b57e86abf2	cache account hash info (#19426 ) * cache account hash info * ledger_path -> accounts_hash_cache_path	2021-09-13 20:39:26 -05:00
Brooks Prumo	62c8bcf565	Add default() to SnapshotConfig (#19776 )	2021-09-12 13:44:27 -05:00
Jeff Washington (jwash)	456bf15012	AccountsIndexConfig -> AccountsDbConfig (#19687 )	2021-09-08 04:30:38 +00:00
Brooks Prumo	4a5f83d3a7	Add fn to check when to take snapshots (#19682 )	2021-09-07 23:26:35 +00:00
Brooks Prumo	a0552e5b46	Make startup aware of Incremental Snapshots (#19600 )	2021-09-07 20:43:43 +00:00
Brooks Prumo	9d9482b9d8	Plumb `maximum_incremental_snapshot_archives_to_retain` (#19640 )	2021-09-06 18:01:56 -05:00
Brooks Prumo	1828579580	Pass SnapshotConfig to SnapshotPackagerService (#19616 )	2021-09-03 21:42:32 +00:00
Brooks Prumo	5e25ee5ebe	Add maximum_incremental_snapshot_archives_to_retain to SnapshotConfig (#19612 )	2021-09-03 20:21:32 +00:00
Brooks Prumo	7ab0aec61f	Rename maximum_full_snapshot_archives_to_retain (#19610 ) To prepare for adding maximum_incremental_snapshot_archives_to_retain, rename the current field in SnapshotConfig.	2021-09-03 11:28:10 -05:00
Brooks Prumo	e9374d32a3	Revert "Make startup aware of Incremental Snapshots (#19550 )" (#19599 ) This reverts commit `d45ced0a5d`.	2021-09-02 19:14:41 -05:00
Brooks Prumo	d45ced0a5d	Make startup aware of Incremental Snapshots (#19550 )	2021-09-02 19:05:15 -05:00
Brooks Prumo	1d5a8ebc6a	Revert "Add LastFullSnapshotSlot to SnapshotConfig (#19341 )" (#19529 ) This reverts commit `4d361af976`.	2021-08-31 22:03:19 -05:00
Brooks Prumo	fe9ee9134a	Make background services aware of incremental snapshots (#19401 ) AccountsBackgroundService now knows about incremental snapshots. It is now also in charge of deciding if an AccountsPackage is destined to be a SnapshotPackage or not (or just used by AccountsHashVerifier). !!! New behavior changes !!! Taking snapshots (both bank and archive) MUST succeed. This is required because of how the last full snapshot slot is calculated, which is used by AccountsBackgroundService when calling `clean_accounts()`. File system calls are now unwrapped and will result in a crash. As Trent told me: >Well I think if a snapshot fails due to some IO error, it's very likely that the operator is going to have to intervene before it works. We should exit error in this case, otherwise the validator might happily spin for several more hours, never successfully writing a complete snapshot, before something else brings it down. This would leave the validator's last local snapshot many more slots behind than it would be had we exited outright and potentially force the operator to abandon ledger continuity in favor of a quick catchup Other errors will set the `exit` flag to `true`, and the node will gracefully shutdown. Fixes #19167 Fixes #19168	2021-08-31 18:33:27 -05:00
Brooks Prumo	1a4bede918	Remove filter_snapshot_storages_for_incremental_snapshot() (#19349 ) Now that we can get just the subset of snapshot storages we want, the extra filtering afterwards is redundant.	2021-08-23 09:45:29 -05:00
Brooks Prumo	6d939811e9	Name snapshots consistently (#19346 ) #### Problem Snapshot names are overloaded, and there are multiple terms that mean the same thing. This is confusing. Here's a list of ones in the codebase that I've found: ``` - snapshot_dir - snapshots_dir - snapshot_path - snapshot_output_dir - snapshot_package_output_path - snapshot_archives_dir ``` #### Summary of Changes For all the ones that are about the directory where snapshot archives are stored, ensure they are `snapshot_archives_dir`. For the ones about the (bank) snapshots directory, set to `bank_snapshots_dir`. Co-authored-by: Michael Vines <mvines@gmail.com>	2021-08-21 15:41:03 -05:00
Brooks Prumo	234461f779	Add base_slot to get_snapshot_storages() (#19348 )	2021-08-20 16:23:43 -05:00
Brooks Prumo	4d361af976	Add LastFullSnapshotSlot to SnapshotConfig (#19341 )	2021-08-20 17:06:53 +00:00
Jeff Washington (jwash)	7c70f2158b	accounts_index_bins to AccountsIndexConfig (#19257 ) * accounts_index_bins to AccountsIndexConfig * rename param bins -> config * rename BINS_FOR* to ACCOUNTS_INDEX_CONFIG_FOR*	2021-08-17 14:50:01 -05:00
Brooks Prumo	f9986c66b8	Make SnapshotPackagerService aware of Incremental Snapshots (#19254 ) Add a field to SnapshotPackage that is an enum for SnapshotType, so archive_snapshot_package() will do the right thing. Fixes #19166	2021-08-17 13:01:59 -05:00
Brooks Prumo	176036aa58	Rename AccountsPacakge to SnapshotPackage and AccountsPackagePre to AccountsPackage (#19231 ) Renaming these types to better communicate their usages, which will further diverge as incremental snapshot support is added. With the new names, AccountsPacakge now refers to the type between AccountsBackgroundProcess and AccountsHashVerifier, and SnapshotPackage refers to the type between AccountsHashVerifier and SnapshotPackagerService.	2021-08-13 16:08:09 -05:00
Jack May	0b50bb2b20	Deprecate FeeCalculator returning APIs (#19120 )	2021-08-13 09:08:20 -07:00
Brooks Prumo	9d8594a046	Add test for snapshots with background services (#19158 ) Add a test for snapshots that spins up AccountsBackgroundService, AccountsHashVerifier, and SnapshotPackagerService. Currently there is not a test for snapshots that spins up the background services fully. This means that there's not a current test that I can use when adding incremental snapshot support to these three services. Fixes #19014	2021-08-12 10:18:20 -05:00
Brooks Prumo	faf99f4760	Move Bank::get_incremental_snapshot_storages() into snapshot_utils (#19155 ) Filtering out storages for incremental snapshots will be needed by the background services for incremental snapshot support, but there is not a Bank at that point. Since the filtering doesn't apply only to Bank, and more to snapshots, move the functionality into snapshot_utils.	2021-08-10 14:04:11 -05:00
Jeff Washington (jwash)	651343688d	Use option for account index bucket parameter (#19150 )	2021-08-10 11:32:25 -05:00
Jeff Washington (jwash)	47e0d9aa95	plumb more accounts_index bins (#19123 )	2021-08-10 05:45:46 -05:00
Brian Anderson	e4b66a5913	Fix deserialization of RPC errors in HttpSender (#19110 ) Fixes #15576	2021-08-09 10:45:00 -07:00
Brooks Prumo	fd937548a0	Move SnapshotArchiveInfo and friends into its own module (#19114 )	2021-08-08 07:57:06 -05:00
Brooks Prumo	00890957ee	Add snapshot_utils::bank_from_latest_snapshot_archives() (#18983 ) While reviewing PR #18565, as issue was brought up to refactor some code around verifying the bank after rebuilding from snapshots. A new top-level function has been added to get the latest snapshot archives and load the bank then verify. Additionally, new tests have been written and existing tests have been updated to use this new function. Fixes #18973 While resolving the issue, it became clear there was some additional low-hanging fruit this change enabled. Specifically, the functions `bank_to_xxx_snapshot_archive()` now return their respective `SnapshotArchiveInfo`. And on the flip side, `bank_from_snapshot_archives()` now takes `SnapshotArchiveInfo`s instead of separate paths and archive formats. This bundling simplifies bank rebuilding.	2021-08-06 20:16:06 -05:00
Jeff Washington (jwash)	ca37873e16	rework bank::new_with_paths (#19087 ) * rework bank::new_with_paths * missing 1 bench	2021-08-06 09:30:40 -05:00
Jeff Washington (jwash)	14361906ca	for all tests, bank::new -> bank::new_for_tests (#19064 )	2021-08-05 08:42:38 -05:00
Jeff Washington (jwash)	3280ae3e9f	add validator option --accounts-db-skip-shrink (#19028 ) * add validator option --accounts-db-skip-shrink * typo	2021-08-04 17:28:33 -05:00
Brooks Prumo	ca14475085	Add incremental_snapshot_archive_interval_slots to SnapshotConfig (#19026 ) This commit also renames `snapshot_interval_slots` to `full_snapshot_archive_interval_slots`, updates the comments on the fields, and make appropriate updates where SnapshotConfig is used.	2021-08-04 14:40:20 -05:00
Oliver	06e08c4840	move package_snapshots to AccountsPackagePre ctors (#18997 ) This PR solves #18815. Note that I had to make the snapshot prefix constants inside `snapshot_utils.rs` public at the crate level in order to make this work. I'm not sure whether or not introducing this dependency is entirely good, either way the `snapshot_utils.rs` file needs a lot of rework so things will move around, I believe this does the work in the meantime. Any feedback will be greatly appreciated.	2021-08-04 09:03:03 -05:00
Brooks Prumo	b05fb87f22	Add test_bank_forks_incremental_snapshot() (#18565 ) This commit builds on PR #18504 by adding a test to core/tests/snapshot.rs for Incremental Snapshots. The test adds banks to bank forks in a loop and takes both full snapshots and incremental snapshots at intervals, and validates they are rebuild-able. For background info about Incremental Snapshots, see #17088. Fixes #18829 and #18972	2021-07-29 16:46:54 -05:00
Tyera Eulberg	8596db8f53	Bump jsonrpc crates and remove old tokio (#18779 ) * Bump jsonrpc crates and replace old tokio * Bump tokio * getBlockTime * getBlocks * getBlocksWithLimit, getInflationReward * getBlock * getFirstAvailableBlock * getTransaction * getSignaturesForAddress * getSignatureStatuses * Remove superfluous runtime	2021-07-26 12:32:17 -06:00
behzad nouri	d2d5f36a3c	adds validator flag to allow private ip addresses (#18850 )	2021-07-23 15:25:03 +00:00
Brooks Prumo	d1debcd971	Add incremental snapshot utils (#18504 ) This commit adds high-level functions for creating and loading-from incremental snapshots, plus all low-level functions required to perform those tasks. This commit does not add taking incremental snapshots as part of a running validator, nor starting up a node with an incremental snapshot; just laying ground work. Additionally, `snapshot_utils` and `serde_snapshot` have been refactored to use a common code paths for the different snapshots. Also of note, some renaming has happened: 1. Snapshots are now either `full_` or `incremental_` throughout the codebase. If not specified, the code applies to both. 2. Bank snapshots now are called "bank snapshots" (before they were called "slot snapshots", "bank snapshots", or just "snapshots"). The one exception is within `Bank`, where they are still just "snapshots", because they are already "bank snapshots". 3. Snapshot archives now have `_archive` in the code. This should clear up an ambiguity between bank snapshots and snapshot archives.	2021-07-22 14:40:37 -05:00
Jeff Washington (jwash)	d092fa1f03	add ledger-tool verify verify-accounts-index option (#18375 ) * add ledger-tool verify verify-accounts-index option * comment, merge, respond to feedback, cleanup	2021-07-13 11:06:18 -05:00
Brooks Prumo	45d54b1fc6	Add SnapshotArchiveInfo and refactor functions in snapshot_utils (#18232 )	2021-07-01 12:20:56 -05:00
Brooks Prumo	89a3e4f91e	Move SnapshotConfig into its own module (#18331 ) Also move ArchiveFormat to snapshot_utils, and do not reexport SnapshotVersion.	2021-07-01 08:55:26 -05:00
Alexander Meißner	6514096a67	chore: cargo +nightly clippy --fix -Z unstable-options	2021-06-18 10:42:46 -07:00
Jeff Washington (jwash)	dbd4dc04b0	ledger tool limit_load_slot_count_from_snapshot avoids assert failures (#17974 )	2021-06-15 15:39:22 -05:00
Jeff Washington (jwash)	f558b9b6bf	verify bank hash on startup with ledger tool option (#17939 )	2021-06-15 11:52:12 -05:00
Jeff Washington (jwash)	471b34132e	add metrics for startup (#17913 ) * add metrics for startup * roll timings up higher * fix test * fix duplicate	2021-06-14 17:46:49 -05:00
Jeff Washington (jwash)	e6bbd4b3f0	add metrics to handle_snapshot_requests (#17937 )	2021-06-14 15:46:19 -05:00
Lijun Wang	269d995832	Make account shrink configurable #17544 (#17778 ) 1. Added both options for measuring space usage using total accounts usage and for individual store shrink ratio using an enum. Validator CLI options: --accounts-shrink-optimize-total-space and --accounts-shrink-ratio 2. Added code for selecting candidates based on total usage in a separate function select_candidates_by_total_usage 3. Added unit tests for the new functions added 4. The default implementations is kept at 0.8 shrink ratio with --accounts-shrink-optimize-total-space set to true Fixes #17544	2021-06-09 21:21:32 -07:00
Ryo Onodera	1f97b2365f	Avoid full-range compactions with periodic filtered b.g. ones (#16697 ) * Update rocksdb to v0.16.0 * Promote the infrequent and important log to info! * Force background compaction by ttl without manual compaction * Fix test * Support no compaction mode in test_ledger_cleanup_compaction * Fix comment * Make compaction_interval customizable * Avoid major compaction with periodic filtering... * Adress lazy_static, special cfs and range check * Clean up a bit and add comment * Add comment * More comments... * Config code cleanup * Add comment * Use .conflicts_with() * Nullify unneeded delete_range ops for special CFs * Some clean ups * Clarify the locking intention * Ensure special CFs' consistency with PurgeType::CompactionFilter * Fix comment * Fix bad copy paste * Fix various types... * Don't use tuples * Add a unit test for compaction_filter * Fix typo... * Remove flag and just use new behavior always * Fix wrong condition negation... * Doc. about no set_last_purged_slot in purge_slots * Write a test and fix off-by-one bug.... * Apply suggestions from code review Co-authored-by: Tyera Eulberg <teulberg@gmail.com> * Follow up to github review suggestions * Fix line-wrapping * Fix conflict Co-authored-by: Tyera Eulberg <teulberg@gmail.com>	2021-05-28 16:42:56 +09:00
Jeff Washington (jwash)	6b9d8d41a3	add --limit_load_slot_count_from_snapshot to ledger-tool (#17417 )	2021-05-26 10:36:12 -05:00
Tyera Eulberg	9a5330b7eb	Move gossip modules into solana-gossip crate (#17352 ) * Move gossip modules to solana-gossip * Update Protocol abi digest due to move * Move gossip benches and hook up CI * Remove unneeded Result entries * Single use statements	2021-05-26 09:15:46 -06:00
behzad nouri	e867d7f3b8	removes Crds::lookup and lookup_versioned (#17438 )	2021-05-24 18:21:54 +00:00
behzad nouri	9d112cf41f	encapsulates purged values bookkeeping into crds module (#17265 ) For all code paths (gossip push, pull, purge, etc) that remove or override a crds value, it is necessary to record hash of values purged from crds table, in order to exclude them from subsequent pull-requests; otherwise the next pull request will likely return outdated values, wasting bandwidth: https://github.com/solana-labs/solana/blob/ed51cde37/core/src/crds_gossip_pull.rs#L486-L491 Currently this is done all over the place in multiple modules, and this has caused bugs in the past where purged values were not recorded. This commit encapsulated this bookkeeping into crds module, so that any code path which removes or overrides a crds value, also records the hash of purged value in-place.	2021-05-24 13:47:21 +00:00
behzad nouri	71de021177	adds metric for turbine retransmit tree mismatch In order to remove port-based forwarding logic in turbine, we need to first track how often the turbine retransmit/broadcast trees mismatch across nodes. One consistency condition is that if the node is on the critical path (i.e. the first node in each neighborhood), then we expect that the packet arrives at tvu socket as opposed to tvu-forwards. This commit adds a metric to track how often above condition is not met.	2021-05-21 17:10:56 +00:00
behzad nouri	2adce67260	extends crds values timeouts if stakes are unknown (#17261 ) If stakes are unknown, then timeouts will be short, resulting in values being purged from the crds table, and consequently higher pull-response load when they are obtained again from gossip. In particular, this slows down validator start where almost all values obtained from entrypoint are immediately discarded.	2021-05-21 15:55:22 +00:00
Tao Zhu	0781fe1b4f	Upgrade Rust to 1.52.0 (#17096 ) * Upgrade Rust to 1.52.0 update nightly_version to newly pushed docker image fix clippy lint errors 1.52 comes with grcov 0.8.0, include this version to script * upgrade to Rust 1.52.1 * disabling Serum from downstream projects until it is upgraded to Rust 1.52.1	2021-05-19 09:31:47 -05:00
Tyera Eulberg	827355a6b1	Create solana-rpc crate and move subscriptions (#17320 ) * Move non_circulating_supply to runtime * Add solana-rpc crate and move max_slots * Move subscriptions to solana-rpc * Single use statements	2021-05-19 00:54:28 -06:00
behzad nouri	0e646d10bb	prunes received-cache only once per unique owner's key (#17039 )	2021-05-13 13:50:16 +00:00
Lijun Wang	9c42a89a43	Issue #17008 -- make snapshot archives to hold on to configurable. (#17158 ) * purge_old_snapshot_archives is changed to take an extra argument 'maximum_snapshots_to_retain' to control the max number of latest snapshot archives to retain. Note the oldest snapshot is always retained as before and is not subjected to this new options. * The validator and ledger-tool executables are modified with a CLI argument --maximum-snapshots-to-retain. And the options are propagated down the call chains. Their corresponding shell scripts were changed accordingly. * SnapshotConfig is modified to have an extra field for the maximum_snapshots_to_retain * Unit tests are developed to cover purge_old_snapshot_archives	2021-05-12 10:32:27 -07:00
Jeff Washington (jwash)	f39dda00e0	type AccountSecondaryIndexes = HashSet (#17108 )	2021-05-10 14:22:48 +00:00
behzad nouri	22c02b917e	reads gossip push messages off crds ordinal index Having an ordinal index on crds values based on insert order allows to efficiently filter values using a cursor. In particular CrdsGossipPush::push_messages hash-map can be replaced with a cursor, saving on the bookkeepings, purging, etc	2021-05-09 22:40:41 +00:00
behzad nouri	fa86a335b0	implements cursor for gossip crds table queries (#16952 ) VersionedCrdsValue.insert_timestamp is used for fetching crds values inserted since last query: https://github.com/solana-labs/solana/blob/ec37a843a/core/src/cluster_info.rs#L1197-L1215 https://github.com/solana-labs/solana/blob/ec37a843a/core/src/cluster_info.rs#L1274-L1298 So it is crucial that insert_timestamp does not go backward in time when new values are inserted into the table. However std::time::SystemTime is not monotonic, or due to workload, lock contention, thread scheduling, etc, ... new values may be inserted with a stalled timestamp way in the past. Additionally, reading system time for the above purpose is inefficient/unnecessary. This commit adds an ordinal index to crds values indicating their insert order. Additionally, it implements a new Cursor type for fetching values inserted since last query.	2021-05-06 14:04:17 +00:00
behzad nouri	7cea2c4466	validates gossip addresses before sending pull-requests IP addresses need to be validated before sending packets to them. This commit, sends a ping packet to nodes before any pull requests. Pull requests are then only sent to the nodes which have responded with the correct hash of their respective ping packet.	2021-05-03 18:21:06 +00:00
behzad nouri	25054bfd35	retains peer's contact-info when making pull requests (#16715 ) ClusterInfo::new_pull_requests has to lookup contact-infos: https://github.com/solana-labs/solana/blob/a1ef2bd74/core/src/cluster_info.rs#L1663-L1673 when it was already available when making pull requests: https://github.com/solana-labs/solana/blob/a1ef2bd74/core/src/crds_gossip_pull.rs#L232	2021-04-28 13:19:12 +00:00
Justin Starry	75b8434b76	Add TPU client for sending txs to the current leader tpu port (#16736 ) * Add TPU client for sending txs to the current leader tpu port * Update tpu_client.rs	2021-04-23 09:35:12 +08:00
Justin Starry	a7e65c0034	RPC: use finalized as default pubsub commitment level (#16659 ) * RPC: use finalized as default pubsub commitment level * update docs * Fix tests	2021-04-20 08:19:54 +00:00
Tyera Eulberg	7dfb51c0b4	Cli: move airdrop to rpc requests (#16557 ) * Add recent_blockhash to requestAirdrop * Move tx confirmation to separate method * Add RpcClient airdrop methods * Request cli airdrop via RpcClient * Pass optional faucet_addr into TestValidator and fix tests * Update client/src/rpc_client.rs Co-authored-by: Michael Vines <mvines@gmail.com> Co-authored-by: Michael Vines <mvines@gmail.com>	2021-04-15 06:25:23 +00:00
Tyera Eulberg	70f3f7e679	Move obsolete rpc endpoints to separate api for removal (#16500 ) * Move obsolete rpc methods to separate api for removal * Remove obsolete method from docs * Fix test using obs method	2021-04-12 20:33:40 -06:00
behzad nouri	570fd3f810	makes turbine peer computation consistent between broadcast and retransmit (#14910 ) get_broadcast_peers is using tvu_peers: https://github.com/solana-labs/solana/blob/84e52b606/core/src/broadcast_stage.rs#L362-L370 which is potentially inconsistent with retransmit_peers: https://github.com/solana-labs/solana/blob/84e52b606/core/src/cluster_info.rs#L1332-L1345 Also, the leader does not include its own contact-info when broadcasting shreds: https://github.com/solana-labs/solana/blob/84e52b606/core/src/cluster_info.rs#L1324 but on the retransmit side, slot leader is removed only _after_ neighbors and children are computed: https://github.com/solana-labs/solana/blob/84e52b606/core/src/retransmit_stage.rs#L383-L384 So the turbine broadcast tree is different between the two stages. This commit: * Removes retransmit_peers. Broadcast and retransmit stages will use tvu_peers consistently. * Retransmit stage removes slot leader _before_ computing children and neighbors.	2021-03-24 13:34:48 +00:00
behzad nouri	f2865dfd63	requires stakes for propagating crds values through gossip (#15561 )	2021-03-12 15:50:14 +00:00
Justin Starry	918d04e3f0	Add more slot update notifications (#15734 ) * Add more slot update notifications * fix merge * Address feedback and add integration test * switch to datapoint * remove unused shred method * fix clippy * new thread for rpc completed slots * remove extra constant * fixes * rely on channel closing * fix check	2021-03-12 21:44:06 +08:00
Michael Vines	5df36aec7d	Pacify clippy	2021-02-19 20:08:41 -08:00
Trent Nelson	7f7370c306	Re-allow clippy::integer_arithmetic at crate-level	2021-02-17 13:55:08 -07:00
sakridge	b24cb9840e	Speedup ledger cleanup test (#15304 ) Just clone to produce shreds and use a separate insert thread.	2021-02-17 08:59:25 -08:00
Jeff Washington (jwash)	ba02452d75	add validator flag no-accounts-db-index-hashing (#15350 ) * add validator flag no_accounts_db_index_hashing * add validator flag no_accounts_db_index_hashing	2021-02-16 21:13:48 +00:00
sakridge	5b8f046c67	More configurable rocksdb compaction (#15213 ) rocksdb compaction can cause long stalls, so make it more configurable to try and reduce those stalls and also to coordinate between multiple nodes to not induce stall at the same time.	2021-02-14 10:16:30 -08:00
carllin	990bb426a9	Fix flaky test test_concurrent_snapshot_packaging (#15252 )	2021-02-11 16:03:51 -08:00
Jeff Washington (jwash)	fabecdc86c	use thread pool for non-index hash calculations (#15149 )	2021-02-05 19:48:55 +00:00
behzad nouri	6fd5ec0e4c	caches descendants in bank forks (#15107 )	2021-02-05 18:00:45 +00:00
Tyera Eulberg	d1563f0ccd	Bump tonic, prost, tarpc, tokio (#15013 ) * Update tonic & prost, and regenerate proto * Reignore doc code * Revert pull #14367, but pin tokio to v0.2 for jsonrpc * Bump backoff and goauth -> and therefore tokio * Bump tokio in faucet, net-utils * Bump remaining tokio, plus tarpc	2021-02-05 00:21:53 -07:00
Jeff Washington (jwash)	600ff0d915	calculate hash from store instead of index (#15034 ) * calculate hash from store instead of index * restore update hash in abs	2021-02-04 09:00:33 -06:00
behzad nouri	0ad063f4e9	adds flag to disable duplicate instance check (#15006 )	2021-02-03 16:26:17 +00:00
Tyera Eulberg	98aa1fa4ea	Upgrade jsonrpc crates to v17.0.0 (#15018 ) * Upgrade to jsonrpc 17.0.0 * Fix test * tree Co-authored-by: Michael Vines <mvines@gmail.com>	2021-02-02 19:53:08 -07:00
Tom Parker-Shemilt	01230a0105	Remove serial_test_derive dependency (#14891 )	2021-01-28 22:35:31 -07:00
Tyera Eulberg	ffa5c7dcc8	Deprecate commitment variants (#14797 ) * Deprecate commitment variants * Add new CommitmentConfig builders * Add helpers to avoid allowing deprecated variants * Remove deprecated transaction-status code * Include new commitment variants in runtime commitment; allow deprecated as long as old variants persist * Remove deprecated banks code * Remove deprecated variants in core; allow deprecated in rpc/rpc-subscriptions for now * Heavier hand with rpc/rpc-subscription commitment * Remove deprecated variants from local-cluster * Remove deprecated variants from various tools * Remove deprecated variants from validator * Update docs * Remove deprecated client code * Add new variants to cli; remove deprecated variants as possible * Don't send new commitment variants to old clusters * Retain deprecated method in test_validator_saves_tower * Fix clippy matches! suggestion for BPF solana-sdk legacy compile test * Refactor node version check to handle commitment variants and transaction encoding * Hide deprecated variants from cli help * Add cli App comments	2021-01-26 19:23:07 +00:00
behzad nouri	e1021d9f83	removes redundant epoch stakes cache in retransmit (#14781 ) Following `d6d76219b`, staked nodes computed from vote accounts are already cached in runtime::Stakes, so the caching in retransmit_stage is redundant.	2021-01-24 21:15:09 +00:00
Michael Vines	bf1943e489	Add solana-test-validator --warp-slot argument	2021-01-22 21:17:02 -08:00
behzad nouri	8e581601d6	patches crds vote-index assignment bug (#14438 ) If tower is full, old votes are evicted from the front of the deque: https://github.com/solana-labs/solana/blob/2074e407c/programs/vote/src/vote_state/mod.rs#L367-L373 whereas recent votes if expire are evicted from the back: https://github.com/solana-labs/solana/blob/2074e407c/programs/vote/src/vote_state/mod.rs#L529-L537 As a result, from a single tower_index scalar, we cannot infer which crds-vote should be overwritten: https://github.com/solana-labs/solana/blob/2074e407c/core/src/crds_value.rs#L576 In addition there is an off by one bug in the existing code. tower_index is bounded by MAX_LOCKOUT_HISTORY - 1: https://github.com/solana-labs/solana/blob/2074e407c/core/src/consensus.rs#L382 So, it is at most 30, whereas MAX_VOTES is 32: https://github.com/solana-labs/solana/blob/2074e407c/core/src/crds_value.rs#L29 Which means that this branch is never taken: https://github.com/solana-labs/solana/blob/2074e407c/core/src/crds_value.rs#L590-L593 so crds table alwasys keeps 29 oldest votes by wallclock, and then only overrides the 30st one each time. (i.e a tally of only two most recent votes).	2021-01-21 13:08:07 +00:00
behzad nouri	b5fd0ed859	rewrites turbine retransmit peers computation (#14584 )	2021-01-19 04:18:47 +00:00
carllin	6dfad0652f	Cache account stores, flush from AccountsBackgroundService (#13140 )	2021-01-11 17:00:23 -08:00
Michael Vines	a95675a7ce	Avoid tmp snapshot backlog in SnapshotPackagerService under high load (#14516 )	2021-01-11 10:21:15 -08:00
Michael Vines	7be6770808	Rename CompressionType to ArchiveFormat	2021-01-09 09:07:49 -08:00
carllin	5affd8aa72	Add secondary indexes (#14212 )	2020-12-31 18:06:03 -08:00
behzad nouri	691031fefd	limits number of crds values returned when responding to pull requests (#13739 ) Crds values buffered when responding to pull-requests can be very large taking a lot of memory. Added a limit for number of buffered crds values based on outbound data budget.	2020-12-18 18:45:12 +00:00
Michael Vines	7143aaa89b	Clippy	2020-12-14 08:03:29 -08:00
carllin	55fc963595	Move slot cleanup to AccountsBackgroundService (#13911 ) * Move bank drop to AccountsBackgroundService * Send to ABS on drop instead, protects against other places banks are dropped * Fix Abi * test Co-authored-by: Carl Lin <carl@solana.com>	2020-12-13 01:22:34 +00:00
Michael Vines	bbad3fe501	TestValidator now implements Drop, no need to close() it	2020-12-11 04:17:38 +00:00
Michael Vines	0a9ff1dc9d	Initial solana-test-validator command-line program	2020-12-11 04:17:38 +00:00
Michael Vines	73111b005f	Reduce the number of snapshots	2020-12-01 11:13:37 -08:00
Michael Vines	43b82b31e5	More TestValidator cleanup	2020-11-26 08:56:25 +00:00
Michael Vines	b5f7e39be8	TestValidator public interface cleanup	2020-11-25 17:04:37 -08:00
behzad nouri	b58f69297f	makes crds fields private (#13703 ) Crds fields should maintain several invariants between themselves, so exposing them as public fields can be bug prone. In addition these invariants are asserted on every write: https://github.com/solana-labs/solana/blob/9668dd85d/core/src/crds.rs#L138-L154 https://github.com/solana-labs/solana/blob/9668dd85d/core/src/crds.rs#L239-L262 which adds extra instructions and is not optimal. Should these fields be private the asserts will be redundant.	2020-11-19 20:57:40 +00:00
Tyera Eulberg	ef99689592	Improve TestValidator instantiation (#13627 ) * Add TestValidator::new_with_fees constructor, and warning for low bootstrap_validator_lamports * Add logging to solana-tokens integration test to help catch low bootstrap_validator_lamports in the future * Reasonable TestValidator mint_lamports	2020-11-16 23:27:36 -07:00
behzad nouri	8f0796436a	shares the lock on gossip when processing prune messages (#13339 ) Processing prune messages acquires an exclusive lock on gossip: https://github.com/solana-labs/solana/blob/55b0428ff/core/src/cluster_info.rs#L1824-L1825 This can be reduced to a shared lock if active-sets are changed to use atomic bloom filters: https://github.com/solana-labs/solana/blob/55b0428ff/core/src/crds_gossip_push.rs#L50	2020-11-05 15:42:00 +00:00
behzad nouri	ae91270961	implements ping-pong packets between nodes (#12794 ) https://hackerone.com/reports/991106 > It’s possible to use UDP gossip protocol to amplify DDoS attacks. An attacker > can spoof IP address in UDP packet when sending PullRequest to the node. > There's no any validation if provided source IP address is not spoofed and > the node can send much larger PullResponse to victim's IP. As I checked, > PullRequest is about 290 bytes, while PullResponse is about 10 kB. It means > that amplification is about 34x. This way an attacker can easily perform DDoS > attack both on Solana node and third-party server. > > To prevent it, need for example to implement ping-pong mechanism similar as > in Ethereum: Before accepting requests from remote client needs to validate > his IP. Local node sends Ping packet to the remote node and it needs to reply > with Pong packet that contains hash of matching Ping packet. Content of Ping > packet is unpredictable. If hash from Pong packet matches, local node can > remember IP where Ping packet was sent as correct and allow further > communication. > > More info: > https://github.com/ethereum/devp2p/blob/master/discv4.md#endpoint-proof > https://github.com/ethereum/devp2p/blob/master/discv4.md#wire-protocol The commit adds a PingCache, which maintains records of remote nodes which have returned a valid response to a ping message, and on-the-fly ping messages pending a pong response from the remote node. When handling pull-requests, those from addresses which have not passed the ping-pong check are filtered out, and additionally ping packets are added for addresses which need to be (re)verified.	2020-10-28 17:03:02 +00:00
behzad nouri	37c8842bcb	scans crds table in parallel for finding old labels (#13073 ) From runtime profiles, the majority time of ClusterInfo::handle_purge https://github.com/solana-labs/solana/blob/0776fa05c/core/src/cluster_info.rs#L1605-L1626 is spent scanning crds table finding old labels: https://github.com/solana-labs/solana/blob/0776fa05c/core/src/crds.rs#L175-L197 This can be done in parallel given that gossip thread-pool: https://github.com/solana-labs/solana/blob/0776fa05c/core/src/cluster_info.rs#L1637-L1641 is idle when handle_purge is invoked: https://github.com/solana-labs/solana/blob/0776fa05c/core/src/cluster_info.rs#L1681	2020-10-23 14:17:37 +00:00

1 2 3 4 5 ...

499 Commits