* Use client certs in QUIC to get peer's stake
* fixes to cert processing
* integrate the code
* clippy
* more cleanup
* sort cargo deps
* test fixes
* info -> debug
* Remove UseQuic type
Move to storing the UdpSocket on ConnectionCache and accepting a bool
* Remove use_quic from ConnectionCache constructor
Replace with separate with_udp constructor to force callers to choose
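For illustration, a minimal sketch of the constructor split, with placeholder internals rather than the real ConnectionCache fields:

```rust
// A sketch of forcing callers to choose UDP explicitly; the fields are
// illustrative placeholders, not the real ConnectionCache internals.
pub struct ConnectionCache {
    use_quic: bool,
    connection_pool_size: usize,
}

impl ConnectionCache {
    /// The default constructor uses QUIC.
    pub fn new(connection_pool_size: usize) -> Self {
        Self { use_quic: true, connection_pool_size }
    }

    /// Callers that want UDP must opt in explicitly.
    pub fn with_udp(connection_pool_size: usize) -> Self {
        Self { use_quic: false, connection_pool_size }
    }
}
```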
* allow initial hash calc to occur in bg
* validator_initialized -> startup_verification_complete
* add infos for leader and vote
* rework snapshot for startup verification
* change to assert
Shred slot and parent are not verified until window-service, by which
point resources have already been wasted sig-verifying and
deserializing the shreds. This commit moves that verification earlier
in the pipeline, into fetch stage.
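A minimal sketch of the kind of early check fetch stage can apply, assuming a hypothetical `should_discard_shred` helper rather than the actual shred sanitization code:

```rust
/// Hypothetical early check: discard a shred whose slot/parent fields are
/// inconsistent before spending cycles on sig-verify and deserialization.
fn should_discard_shred(slot: u64, parent_offset: u16, root: u64) -> bool {
    // Parent slot implied by the shred header.
    let Some(parent) = slot.checked_sub(u64::from(parent_offset)) else {
        return true; // offset reaches below slot 0: malformed
    };
    // The parent must be strictly earlier than the shred's slot and must
    // not be older than the local root.
    parent >= slot || parent < root
}
```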
* Make sure to root local slots even with hard fork
* Address review comments
* Cleanup a bit
* Further clean up
* Further clean up a bit
* Add comment
* Tweak hard fork reconciliation code placement
* Connection pool in connection cache and handle connection errors
1. The connection cache now has a pool of connections per address, configurable, default 4
2. The connections per address share a lazily initialized endpoint
3. Handle connection issues better, avoid race conditions
4. Various log improvements to help debug connection issues
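A rough sketch of the pooling scheme under illustrative types (placeholder `Endpoint`/`Connection`, pool assumed pre-filled), not the PR's actual code:

```rust
use std::collections::HashMap;
use std::net::SocketAddr;
use std::sync::{Arc, OnceLock};

const DEFAULT_CONNECTION_POOL_SIZE: usize = 4;

struct Endpoint;   // placeholder for the lazily created QUIC endpoint
struct Connection; // placeholder for one QUIC connection

struct ConnectionPool {
    endpoint: Arc<OnceLock<Endpoint>>, // shared by all connections to one address
    connections: Vec<Arc<Connection>>, // assumed filled at construction
    next: usize,
}

impl ConnectionPool {
    /// Round-robin over the (configurable, default 4) connections.
    fn borrow(&mut self) -> Arc<Connection> {
        self.endpoint.get_or_init(|| Endpoint); // create endpoint on first use
        let conn = self.connections[self.next % self.connections.len()].clone();
        self.next += 1;
        conn
    }
}

struct ConnectionCache {
    pools: HashMap<SocketAddr, ConnectionPool>,
}
```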
#### Problem
blockstore clean and compact is quite slow with wait-for-supermajority purge and can take 20-30 minutes
as described in #25710.
#### Summary of Changes
This PR removes the compaction logic in backup_and_clear_blockstore, as the
actual restoration from a bad fork is handled by `blockstore.purge_slots`
(which is done by issuing a rocksdb range-delete that makes the bad fork
unavailable).
Compaction is irrelevant to the shred version, as its main job in this context
is to reclaim disk storage from the deleted slots, which we can leave to
rocksdb's automatic background compaction.
Fixes #25710
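For illustration, a minimal sketch of a slot-range delete using the `rocksdb` crate directly; the column family name and big-endian slot keys are assumptions, not the blockstore's actual schema:

```rust
use rocksdb::DB;

/// Mark [from_slot, to_slot] unavailable with a single range-delete; the
/// freed space is reclaimed later by RocksDB's background compaction.
fn purge_slots(db: &DB, from_slot: u64, to_slot: u64) -> Result<(), rocksdb::Error> {
    let cf = db.cf_handle("data_shred").expect("column family exists");
    db.delete_range_cf(cf, from_slot.to_be_bytes(), (to_slot + 1).to_be_bytes())
}
```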
* client: Remove static connection cache, plumb it instead
* Add TpuClient::new_with_connection_cache to not break downstream
* Refactor get_connection and RwLock into ConnectionCache
* Fix merge conflicts from new async TpuClient
* Remove `ConnectionCache::set_use_quic`
* Move DEFAULT_TPU_USE_QUIC to client, use ConnectionCache::default()
Add some CPU utilization metrics, such as: number of vCPUs, clock frequency, average load across different time intervals, and total number of threads
* Spawn QUIC server to receive forwarded txs
* Update validator port range
* forward votes using UDP
* no forwarding from unstaked nodes
* forwarding stats in banking stage
* fix test builds
* fix lifetime of forward sender
#### Problem
blockstore_db.rs and blockstore_metrics.rs have a mutual dependency.
#### Summary of Changes
This PR removes the mutual dependency by moving the option-related code
out of blockstore_db.rs to its new home, blockstore_options.rs.
By doing this, we address the mutual dependency and also make the code cleaner.
* panic when test times out
* nonblocking send when dropping banks
* debug log
* timeout for tvu
* unused variable
* timeout for tpu
* Revert "debug log"
This reverts commit da780a3301a51d7c496141a85fcd35014fe6dff5.
* add timeout const
* fix typo
* Revert "nonblocking send when when droping banks".
I will create another pull request for this.
This reverts commit 088c98ec0facf825b5eca058fb860deba6d28888.
* Update core/src/tpu.rs
Co-authored-by: Trent Nelson <trent.a.b.nelson@gmail.com>
* Update core/src/tpu.rs
Co-authored-by: Trent Nelson <trent.a.b.nelson@gmail.com>
* Update core/src/tvu.rs
Co-authored-by: Trent Nelson <trent.a.b.nelson@gmail.com>
* Update core/src/tvu.rs
Co-authored-by: Trent Nelson <trent.a.b.nelson@gmail.com>
* Update core/src/validator.rs
Co-authored-by: Trent Nelson <trent.a.b.nelson@gmail.com>
Co-authored-by: Trent Nelson <trent.a.b.nelson@gmail.com>
* initial work for poh timing report service
* add poh_timing_report_service to validator
* fix comments
* clippy
* improve test coverage
* delete record when complete
* rename shred full to slot full.
* debug logging
* fix slot full
* remove debug comments
* adding fmt trait
* derive default
* default for poh timing reporter
* better comments
* remove commented code
* fix test
* more test fixes
* delete timestamps for slots that are older than root_slot
* debug log
* record poh start end in bank reset
* report full to start time instead
* fix poh slot offset
* report poh start for normal ticks
* fix typo
* refactor out poh point report fn
* rename
* optimize delete - delete only when last_root changed
* change log level to trace
* convert if to match
* remove redundant check
* fix SlotPohTiming comments
* review feedback on poh timing reporter
* review feedback on poh_recorder
* add test case for out-of-order arrival of timing points and incomplete timing points
* refactor poh_timing_points into its own mod
* remove option for poh_timing_report service
* move poh_timing_point_sender to constructor
* clippy
* better comments
* more clippy
* more clippy
* add slot poh timing point macro
* clippy
* assert in test
* comments and display fmt
* fix check
* assert format
* revise comments
* refactor
* extract send fn
* revert reporting_poh_timing_point
* align logging
* small refactor
* move type declaration to the top of the module
* replace macro with constructor
* clippy: remove redundant closure
* review comments
* simplify poh timing point creation
Co-authored-by: Haoran Yi <hyi@Haorans-MacBook-Air.local>
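A minimal sketch of the overall flow, assuming illustrative type and variant names: timing points are sent over a channel and aggregated per slot by the report service.

```rust
use crossbeam_channel::{unbounded, Receiver};

/// Illustrative timing points; the PR's actual variants differ.
#[derive(Debug)]
enum PohTimingPoint {
    PohSlotStart { slot: u64, timestamp_ms: u64 },
    PohSlotEnd { slot: u64, timestamp_ms: u64 },
    FullSlotReceived { slot: u64, timestamp_ms: u64 },
}

fn report_service(receiver: Receiver<PohTimingPoint>) {
    while let Ok(point) = receiver.recv() {
        // Aggregate per slot; once a slot has all its points, report the
        // full-to-start offset, and drop entries older than the last root.
        println!("timing point: {point:?}");
    }
}

fn main() {
    let (sender, receiver) = unbounded();
    let service = std::thread::spawn(move || report_service(receiver));
    sender
        .send(PohTimingPoint::PohSlotStart { slot: 1, timestamp_ms: 0 })
        .unwrap();
    drop(sender); // closing the channel ends the service loop
    service.join().unwrap();
}
```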
* run validator_exit_test sequentially
* limit validator exit run to its own serial run subset
add 10ms delay in the validator exit tests
* fix intermittent validator exit failure
* no sleep
* undo the code move
* transaction-status: Add return data to meta
* Add return data to simulation results
* Use pretty-hex for printing return data
* Update arg name, make TransactionRecord struct
* Rename TransactionRecord -> ExecutionRecord
This PR renames BlockstoreAdvancedOptions to LedgerColumnOptions, as we will
pass this struct down to LedgerColumn to allow it to perform metric reporting.
#### Summary of Changes
This PR further enables the group-by operation on storage type in blockstore_rocksdb_cfs metrics.
Such a group-by allows us to further compare the performance metrics between rocks-level and
rocks-fifo.
To make things extensible, this PR introduces BlockstoreAdvancedOptions and moves shred_storage_type
into it. All fields in BlockstoreAdvancedOptions will support the group-by operation in blockstore_rocksdb_cfs.
Dependency: #23580
#### Summary of Changes
This PR adds two hidden arguments to the validator that allow users to use RocksDB's FIFO compaction for storing shreds.
--shred-storage <SHRED_STORAGE>
    EXPERIMENTAL: Controls how RocksDB compacts shreds. *WARNING*: You will lose your ledger data
    when you switch between options. Possible values are: 'level': stores shreds using RocksDB's
    default (level) compaction. 'fifo': stores shreds under RocksDB's FIFO compaction. This option
    is more efficient on disk-write-bytes of the ledger store. [default: level]
    [possible values: level, fifo]
--shred-storage-size <SHRED_STORAGE_SIZE_BYTES>
    The shred storage size in bytes. The suggested value is 50% of your ledger storage size in
    bytes. [default: 268435456000]
#### Problem
Transaction logs are not being saved to the database through the plugin interface.
#### Summary of Changes
Retain the transaction logs when the transaction notification plugin is loaded.
Fixes lijunwangs/solana-accountsdb-plugin-postgres#6
Tpu::new() now matches Tvu::new() in taking a config struct to reduce the
argument list. Additionally, Rust supports partial moves, so there is no need
to clone the Tvu sockets out of the Node object.
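A minimal sketch of the argument-struct pattern, with illustrative field names:

```rust
use std::net::UdpSocket;

// Bundling the sockets lets Tpu::new() take one struct, and Rust's partial
// moves let callers move these out of a larger Node struct without cloning.
struct TpuSockets {
    transactions: Vec<UdpSocket>,
    transaction_forwards: Vec<UdpSocket>,
    vote: Vec<UdpSocket>,
    broadcast: Vec<UdpSocket>,
}
```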
Adds the TransactionNotifierInterface for notifying of transactions.
Changes transaction_status_service to notify the notifier of the transaction data.
Adds an interface to query the plugin's interest in transaction data.
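A rough sketch of what such an interface can look like; trait and method names here are approximations, not the exact definitions:

```rust
/// A sketch of a transaction notifier trait; names and argument types are
/// illustrative, not the exact interface added by this change.
pub trait TransactionNotifier {
    /// Called by transaction_status_service for each processed transaction.
    fn notify_transaction(&self, slot: u64, signature: &str, status_json: &str);
}

/// A sketch of how a plugin advertises interest, letting the service skip
/// the extra work when no plugin wants transaction data.
pub trait PluginInterest {
    fn transaction_notifications_enabled(&self) -> bool;
}
```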
#### Problem
Slot status can be used in other scenarios in addition to account information, such as transactions and blocks. The current implementation is too tightly coupled.
#### Summary of Changes
Decoupled the slot status notification from accounts notification. Created a new slot status notification module.
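A minimal sketch of the decoupled notification, with illustrative names:

```rust
/// Illustrative slot status enum and notifier trait, decoupled from account
/// notifications; not the exact module added by this change.
#[derive(Clone, Copy, Debug)]
pub enum SlotStatus {
    Processed,
    Confirmed,
    Rooted,
}

pub trait SlotStatusNotifier {
    /// Invoked whenever a slot changes status, independently of any
    /// account update notifications.
    fn notify_slot_status(&self, slot: u64, parent: Option<u64>, status: SlotStatus);
}
```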
* Move test-validator to own module to reduce core dependencies
* Fix a few TestValidator paths
* Use solana_test_validator crate for solana_test_validator bin
* Move client int tests to separate crate
Co-authored-by: Tyera Eulberg <tyera@solana.com>
Support connection pooling and multiple threads for Postgres db operations. Performance improved from 1,500 RPS to 40,000 RPS, measured during validator start.
Support multiple plugins at the same time.
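A minimal sketch of the fan-out idea, assuming crossbeam-channel and placeholder work items; each worker would own one pooled Postgres connection:

```rust
use crossbeam_channel::unbounded;
use std::thread;

fn main() {
    // Hypothetical work items; the real plugin batches SQL statements.
    let (sender, receiver) = unbounded::<String>();

    let workers: Vec<_> = (0..8)
        .map(|_| {
            let receiver = receiver.clone();
            thread::spawn(move || {
                while let Ok(stmt) = receiver.recv() {
                    // Each worker would execute `stmt` on its own pooled
                    // Postgres connection, so writes proceed in parallel.
                    drop(stmt);
                }
            })
        })
        .collect();

    sender.send("INSERT INTO account ...".to_string()).unwrap();
    drop(sender); // close the channel so workers exit
    for worker in workers {
        worker.join().unwrap();
    }
}
```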
Now that CRDS supports incremental snapshot hashes,
SnapshotPackagerService needs to push 'em!
This commit does two main things:
1. SnapshotPackagerService now knows about incremental snapshot hashes,
and will push SnapshotPackage::IncrementalSnapshot hashes to CRDS.
2. At startup, when loading from a full + incremental snapshot, the
hashes need to be passed all the way to SnapshotPackagerService so it
can push these starting hashes to CRDS. Those values have been piped
through.
Fixes #20441 and #20423
* Add separate vote processing tpu port
* Add feature to send to tpu vote port
* Add vote rejecting sigverify mode
* use packet.meta.is_simple_vote_tx in place of deserialization
* consolidate code that identifies vote tx at common path for cpu and gpu
* new key for feature set
* banking forward tpu vote
* add tpu vote port to dockerfile and other review changes
* Simplify thread id compare
* fix a test; updated cluster_info ABI change
Co-authored-by: Tao Zhu <tao@solana.com>
Co-authored-by: sakridge <sakridge@gmail.com>
#### Summary of Changes
Create a plugin mechanism in the accounts update path so that accounts data can be streamed out to external data stores (be it Kafka or Postgres). The plugin mechanism allows:
1. Data stores of connection strings/credentials to be configured,
2. Accounts with patterns to be streamed,
3. PostgreSQL implementation of the streaming for different destination stores to be plugged in.
The code comprises 4 major parts:
1. accountsdb-plugin-intf: defines the plugin interface which concrete plugins should implement.
2. accountsdb-plugin-manager: manages the load/unload of plugins and provides interfaces through which the validator can notify plugins of account updates.
3. accountsdb-plugin-postgres: the concrete plugin implementation for PostgreSQL.
4. The validator integrations: updates are streamed right after snapshot restore and after account updates from transaction processing or other real updates.
The plugin is optionally loaded on demand by new validator CLI argument -- there is no impact if the plugin is not loaded.
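A rough sketch of the plugin interface idea, with approximate names and signatures rather than the published accountsdb-plugin-intf definitions:

```rust
use std::error::Error;

/// Illustrative account payload; the real interface passes more fields.
pub struct ReplicaAccountInfo<'a> {
    pub pubkey: &'a [u8],
    pub lamports: u64,
    pub data: &'a [u8],
}

/// A sketch of the plugin trait that concrete plugins implement; method
/// names and signatures are approximations.
pub trait AccountsDbPlugin {
    fn name(&self) -> &'static str;

    /// Called once at load time with the plugin's config file, which holds
    /// connection strings/credentials and account patterns to stream.
    fn on_load(&mut self, config_file: &str) -> Result<(), Box<dyn Error>>;

    /// Called for each account update, both right after snapshot restore
    /// and for real updates from transaction processing.
    fn update_account(
        &mut self,
        account: ReplicaAccountInfo,
        slot: u64,
        is_startup: bool,
    ) -> Result<(), Box<dyn Error>>;
}
```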
* reimplement rpc pubsub with a broadcast queue
* update tests for new pubsub implementation
* fix: fix review suggestions
* chore(rpc): add additional pubsub metrics
* integrate max subscriptions check into SubscriptionTracker to reduce locking
* separate subscription control from tracker
* limit memory usage of items in pubsub broadcast queue, improve error handling
* add more pubsub metrics
* add final count metrics to pubsub
* add metric for total number of subscriptions
* fix small review suggestions
* remove by_params from SubscriptionTracker and add node_progress_watchers map instead
* add subscription tracker tests
* add metrics for number of pubsub notifications as a counter
* ignore clippy lint in TokenCounter
* fix underflow in token counter
* reduce queue capacity in pubsub tests
* fix(rpc): fix test timeouts
* fix race in account subscription test
* Add RpcSubscriptions::new_for_tests
Co-authored-by: Pavel Strakhov <p.strakhov@iconic.vc>
Co-authored-by: Nikita Podoliako <n.podoliako@zubr.io>
Co-authored-by: Tyera Eulberg <tyera@solana.com>
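A minimal sketch of the broadcast-queue idea using tokio's broadcast channel; the types and the capacity are illustrative, not the PR's actual implementation:

```rust
use tokio::sync::broadcast;

#[derive(Clone, Debug)]
struct Notification {
    subscription_id: u64,
    payload: String,
}

#[tokio::main]
async fn main() {
    // Bounded queue: slow subscribers lag and drop notifications instead
    // of growing memory without limit.
    let (sender, _) = broadcast::channel::<Notification>(1024);

    let mut receiver = sender.subscribe();
    let subscriber = tokio::spawn(async move {
        while let Ok(notification) = receiver.recv().await {
            println!("got: {notification:?}");
        }
    });

    sender
        .send(Notification { subscription_id: 1, payload: "{}".into() })
        .unwrap();
    drop(sender); // closing the channel ends the subscriber loop
    subscriber.await.unwrap();
}
```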
Add `--incremental-snapshots` flag to enable incremental snapshots.
This will allow setting `--full-snapshot-interval-slots` and
`--incremental-snapshot-interval-slots`.
Also added `--maximum-incremental-snapshots-to-retain`.
Co-authored-by: Michael Vines <mvines@gmail.com>
This is the 2nd installment of the AccountsDb replication.
#### Summary of Changes
The basic Google protocol buffer protocol for replicating updated slots and accounts; tonic/tokio is used for transporting the messages.
The basic framework of the client and server for replicating slots and accounts. The persisting of accounts on the replica side will be done in the next PR; right now, the accounts are streamed to the replica-node and dumped. Replication of information about the Bank is also not done in this PR, to be addressed in the next PR to limit the change size.
Functionality used by both the client and server sides is encapsulated in the replica-lib crate.
There is no impact to the existing validator by default.
Tests:
Observe the confirmed slots replicated to the replica-node.
Observe the accounts for the confirmed slot are received at the replica-node side.
AccountsBackgroundService now knows about incremental snapshots. It is
now also in charge of deciding if an AccountsPackage is destined to be a
SnapshotPackage or not (or just used by AccountsHashVerifier).
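A minimal sketch of that decision, under assumed interval parameters and illustrative names:

```rust
/// Illustrative decision logic: where should an accounts package go?
enum PackageDestination {
    FullSnapshot,
    IncrementalSnapshot { base_slot: u64 },
    AccountsHashVerificationOnly,
}

fn classify(
    slot: u64,
    full_interval: u64,
    incremental_interval: u64,
    last_full_snapshot_slot: Option<u64>,
) -> PackageDestination {
    if slot % full_interval == 0 {
        PackageDestination::FullSnapshot
    } else if let Some(base_slot) = last_full_snapshot_slot {
        if slot % incremental_interval == 0 {
            // An incremental snapshot only makes sense on top of a full one.
            PackageDestination::IncrementalSnapshot { base_slot }
        } else {
            PackageDestination::AccountsHashVerificationOnly
        }
    } else {
        PackageDestination::AccountsHashVerificationOnly
    }
}
```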
!!! New behavior changes !!!
Taking snapshots (both bank and archive) **MUST** succeed.
This is required because of how the last full snapshot slot is
calculated, which is used by AccountsBackgroundService when calling
`clean_accounts()`.
File system calls are now unwrapped and will result in a crash on failure. As Trent told me:
>Well I think if a snapshot fails due to some IO error, it's very likely that the operator is going to have to intervene before it works. We should exit error in this case, otherwise the validator might happily spin for several more hours, never successfully writing a complete snapshot, before something else brings it down. This would leave the validator's last local snapshot many more slots behind than it would be had we exited outright and potentially force the operator to abandon ledger continuity in favor of a quick catchup
Other errors will set the `exit` flag to `true`, and the node will gracefully shutdown.
Fixes #19167, fixes #19168
#### Problem
Snapshot names are overloaded, and there are multiple terms that mean the same thing. This is confusing. Here's a list of ones in the codebase that I've found:
```
- snapshot_dir
- snapshots_dir
- snapshot_path
- snapshot_output_dir
- snapshot_package_output_path
- snapshot_archives_dir
```
#### Summary of Changes
For all the ones that are about the directory where snapshot archives are stored, ensure they are `snapshot_archives_dir`. For the ones about the (bank) snapshots directory, set to `bank_snapshots_dir`.
Co-authored-by: Michael Vines <mvines@gmail.com>
While reviewing PR #18565, an issue was brought up about refactoring some
code around verifying the bank after rebuilding from snapshots. A new
top-level function has been added to get the latest snapshot archives,
load the bank, and then verify it. Additionally, new tests have been
written and existing tests have been updated to use this new function.
Fixes #18973
While resolving the issue, it became clear there was some additional
low-hanging fruit this change enabled. Specifically, the functions
`bank_to_xxx_snapshot_archive()` now return their respective
`SnapshotArchiveInfo`. And on the flip side,
`bank_from_snapshot_archives()` now takes `SnapshotArchiveInfo`s instead
of separate paths and archive formats. This bundling simplifies bank
rebuilding.
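A minimal sketch of the bundling, with approximate field names:

```rust
use std::path::PathBuf;

/// Illustrative archive format enum; the real one has more variants.
pub enum ArchiveFormat {
    TarBzip2,
    TarZstd,
}

/// Illustrative bundling of archive metadata; field names approximate the
/// real SnapshotArchiveInfo.
pub struct SnapshotArchiveInfo {
    pub path: PathBuf,
    pub slot: u64,
    pub hash: [u8; 32], // stand-in for the bank hash type
    pub archive_format: ArchiveFormat,
}
```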
This commit also renames `snapshot_interval_slots` to
`full_snapshot_archive_interval_slots`, updates the comments on the
fields, and makes appropriate updates where SnapshotConfig is used.
This commit adds high-level functions for creating and loading-from
incremental snapshots, plus all low-level functions required to perform
those tasks. This commit **does not** add taking incremental snapshots
as part of a running validator, nor starting up a node with an
incremental snapshot; just laying ground work.
Additionally, `snapshot_utils` and `serde_snapshot` have been
refactored to use a common code path for the different snapshot types.
Also of note, some renaming has happened:
1. Snapshots are now either `full_` or `incremental_` throughout the
codebase. If not specified, the code applies to both.
2. Bank snapshots now are called "bank snapshots"
(before they were called "slot snapshots", "bank snapshots", or
just "snapshots"). The one exception is within `Bank`, where they
are still just "snapshots", because they are already "bank
snapshots".
3. Snapshot archives now have `_archive` in the code. This
should clear up an ambiguity between bank snapshots and snapshot
archives.
* update ledger tool to restore cost model from blockstore when compute-slot-cost
* Move initialize_cost_table into cost_model, so the function can be tested and shared between validator and ledger-tool
* refactor and simplify a test
* Add `ProgramCosts` Column Family to blockstore, implement LedgerColumn; add `delete_cf` to Rocks
* Add ProgramCosts to the compaction exclusion list alongside TransactionStatusIndex in one place: `excludes_from_compaction()`
* Write cost table to blockstore after `replay_stage` replayed active banks; add stats to measure persist time
* Delete programs from `ProgramCosts` in blockstore when they are removed from the in-memory cost_table
* Only try to persist to blockstore when cost_table is changed.
* Restore cost table during validator startup
* Offload `cost_model`-related operations from the replay main thread to a dedicated service thread; add a channel to send execute_timings between these threads
* Move `cost_update_service` to its own module; replay_stage is now decoupled from cost_model.
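A minimal sketch of the persist-on-change bookkeeping, with illustrative types (`ProgramId` standing in for Pubkey):

```rust
use std::collections::HashMap;

type ProgramId = [u8; 32]; // stand-in for Pubkey

/// Illustrative persist-on-change bookkeeping for the cost table.
struct CostTable {
    costs: HashMap<ProgramId, u64>, // program -> execution cost units
    dirty: bool,
}

impl CostTable {
    fn upsert(&mut self, program: ProgramId, units: u64) {
        let previous = self.costs.insert(program, units);
        self.dirty |= previous != Some(units);
    }

    /// Only touch the blockstore's ProgramCosts column when something
    /// actually changed since the last persist.
    fn maybe_persist(&mut self, persist: impl FnOnce(&HashMap<ProgramId, u64>)) {
        if self.dirty {
            persist(&self.costs);
            self.dirty = false;
        }
    }
}
```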
When starting a validator, the node initially joins gossip with
shred_version = 0, until it adopts the entrypoint's shred-version:
https://github.com/solana-labs/solana/blob/9b182f408/validator/src/main.rs#L417
Depending on the load on the entrypoint, adopting the entrypoint's
shred-version through gossip sometimes becomes very slow, and causes
several problems in gossip because we have to partially support
shred_version == 0, which is a source of leaking crds values from one
cluster to another; e.g. see
https://github.com/solana-labs/solana/pull/17899
and the other linked issues there.
In order to remove shred_version == 0 from gossip, this commit adds
shred-version to the ip-echo-server response. Once the entrypoints are
updated, on validator start-up, if --expected-shred-version is not
specified we will obtain the shred-version from the entrypoint using
ip-echo-server.
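A minimal sketch of the extended response shape, with illustrative names:

```rust
use std::net::IpAddr;

/// Illustrative response shape: alongside the echoed public address, the
/// entrypoint now reports its shred version (None if not yet known).
struct IpEchoServerResponse {
    address: IpAddr,
    shred_version: Option<u16>,
}
```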