solana-with-rpc-optimizations

Commit Graph

Author	SHA1	Message	Date
Will Hickey	3096b64f9d	Update error that results when snapshot is missing (#24839 )	2022-06-21 13:06:37 -05:00
Will Hickey	51f26dc96e	Bump version to 1.11.1 (#26104 )	2022-06-21 12:07:46 -05:00
behzad nouri	47e62add5b	removes feature gate code adding shred-type to shred seed (#25963 ) The feature is already activated on all clusters, and does not impact processing of ledger/snapshots.	2022-06-20 14:39:24 +00:00
Tyera Eulberg	2866ca4b1c	Add ledger-tool bigtable upload loop (#26030 ) * Add ledger-tool bigtable upload loop * Limit range on caller side, switch to while loop, and remove now-obsolete option	2022-06-17 19:31:13 +00:00
behzad nouri	31b3e0e15a	adds metric tracking wasted data buffer in shreds (#25972 )	2022-06-16 16:14:00 +00:00
Brooks Prumo	b4b191e446	Enforce accounts data size limit per block in ReplayStage (#25524 )	2022-06-15 20:35:33 -05:00
Brian Anderson	db9004bd0f	Fix doc warnings (#25953 )	2022-06-14 21:55:08 -06:00
Tyera Eulberg	8a3d48b0ee	Reduce 2 iterators to one (#25973 )	2022-06-14 22:49:58 +00:00
Michael Vines	b4237f3f2c	Do not exclude failed simple vote transactions from consensus	2022-06-12 22:11:23 -07:00
Yueh-Hsuan Chiang	591986eb01	Helper function for creating ShredStorageType::RocksFifo (#25569 ) #### Problem Currently, the creation of ShredStorageType::RocksFifo is hard coded in validator/src/main.rs. But this common code will also need to be used in other places like ledger-tool. #### Summary of Changes This PR creates a helper functionShredStorageType::rocks_fifo that takes a total shred_storage_size and equally allocates to data-shred and coding-shred storage.	2022-06-08 07:58:58 +08:00
behzad nouri	5f04512d3a	adds a new shred variant embedding merkle tree hashes of the erasure batch (#25237 ) Coding shreds can only be signed once erasure codings are already generated. Therefore coding shreds recovered from erasure codings lack slot leader's signature and so cannot be retransmitted to the rest of the cluster. shred/merkle.rs implements a new shred variant where we generate merkle tree for each erasure encoded batch and each shred includes: * root of the merkle tree (Hash truncated to 20 bytes). * slot leader's signature of the root of the merkle tree. * merkle tree nodes along the branch the shred belongs to, where hashes are trimmed to 20 bytes during tree construction. This schema results in the same signature for all shreds within an erasure batch. When recovering shreds from erasure codes, we can reconstruct merkle tree for the batch and for each recovered shred also recover respective merkle tree branch; then snap the slot leader's signature from any of the shreds received from turbine and retransmit all recovered code or data shreds. Backward compatibility is achieved by encoding shred variant at byte 65 of payload (previously shred-type at this position): * 0b0101_1010 indicates a legacy coding shred, which is also equal to ShredType::Code for backward compatibility. * 0b1010_0101 indicates a legacy data shred, which is also equal to ShredType::Data for backward compatibility. * 0b0100_???? indicates a merkle coding shred with merkle branch size indicated by the last 4 bits. * 0b1000_???? indicates a merkle data shred with merkle branch size indicated by the last 4 bits. Merkle root and branch are encoded at the end of the shred payload.	2022-06-07 22:41:03 +00:00
behzad nouri	6c9f2eac78	removes fec_set_offset from UnfinishedSlotInfo (#25815 ) If the blockstore has shreds for a slot, it should not recreate the slot: https://github.com/solana-labs/solana/blob/ff68bf6c2/ledger/src/leader_schedule_cache.rs#L142-L146 https://github.com/solana-labs/solana/pull/15849/files#r596657314 Therefore in broadcast stage if UnfinishedSlotInfo is None, then fec_set_offset will be zero: https://github.com/solana-labs/solana/blob/ff68bf6c2/core/src/broadcast_stage/standard_broadcast_run.rs#L111-L120 As a result fec_set_offset will always be zero, and is so redundant and can be removed.	2022-06-07 22:17:37 +00:00
Yueh-Hsuan Chiang	8674c96a66	Make the default values of FIFO compaction consistent with validator args (#25778 ) #### Problem When FIFO compaction is used, the size ratio between data shred and coding shred is set to 1:1 based on the `--rocksdb_fifo_shred_storage_size` arg. However, BlockstoreRocksFifoOptions::default() uses a slightly optimized 5:4 ratio instead, and the default() function is only used in benchmarks. #### Summary of Changes This PR makes both validator argument and BlockstoreRocksFifoOptions::default() to use 1:1 ratio between data and coding shred size.	2022-06-07 15:24:58 +08:00
behzad nouri	5dbf7d8f91	removes raw indexing into packet data (#25554 ) Packets are at the boundary of the system where, vast majority of the time, they are received from an untrusted source. Raw indexing into the data buffer can open attack vectors if the offsets are invalid. Validating offsets beforehand is verbose and error prone. The commit updates Packet::data() api to take a SliceIndex and always to return an Option. The call-sites are so forced to explicitly handle the case where the offsets are invalid.	2022-06-03 01:05:06 +00:00
behzad nouri	81231a89b9	adds support for different variants of ShredCode and ShredData The commit implements two new types: pub enum ShredCode { Legacy(legacy::ShredCode), } pub enum ShredData { Legacy(legacy::ShredData), } Following commits will extend these types by adding merkle variants: pub enum ShredCode { Legacy(legacy::ShredCode), Merkle(merkle::ShredCode), } pub enum ShredData { Legacy(legacy::ShredData), Merkle(merkle::ShredData), }	2022-06-02 18:55:50 +00:00
behzad nouri	a913068512	embeds versioning into shred binary In preparation of https://github.com/solana-labs/solana/pull/25237 which adds a new shred variant with merkle tree branches, the commit embeds versioning into shred binary by encoding a new ShredVariant type at byte 65 of payload replacing previously ShredType at this offset. enum ShredVariant { LegacyCode, // 0b0101_1010 LegacyData, // 0b0101_1010 } * 0b0101_1010 indicates a legacy coding shred, which is also equal to ShredType::Code for backward compatibility. * 0b1010_0101 indicates a legacy data shred, which is also equal to ShredType::Data for backward compatibility. Following commits will add merkle variants to this type: enum ShredVariant { LegacyCode, // 0b0101_1010 LegacyData, // 0b1010_0101 MerkleCode(/proof_size:/ u8), // 0b0100_???? MerkleData(/proof_size:/ u8), // 0b1000_???? }	2022-06-02 18:55:50 +00:00
apfitzge	934da5ef99	Fix pre-check of blockstore slots during load_bank_forks (#25632 ) Fix pre-check of blockstore slts during load_bank_forks. Now iterates from starting_slot to halt_slot via slot_meta.next_slots to confirm they are connected.	2022-06-01 20:19:42 -05:00
behzad nouri	29cfa04c05	records number of residual data shreds which don't make a full batch (#25693 ) Data shreds are batched into MAX_DATA_SHREDS_PER_FEC_BLOCK shreds for each erasure batch. If there are residual shreds not making a full batch, then we cannot generate coding shreds and need to buffer shreds until there is a full batch; This may add latency to coding shreds generation and broadcast. In order to evaluate upcoming changes removing this buffering logic, this commit adds metrics tracking residual number of data shreds which don't make a full batch.	2022-06-02 00:32:32 +00:00
steviez	17995c7e67	Cleanup BlockstoreInsertionMetrics (#25618 ) * Move BlockstoreInsertionMetrics to blockstore_metrics.rs * Specify unit (us) in metric fields	2022-06-01 10:54:11 -05:00
Yueh-Hsuan Chiang	bcff88bf42	Use the new datapoint macro for RocksDB column family metrics (#25505 ) #### Summary of Changes Use the new datapoint macro that supports group-by for RocksDB column family metrics. By using the new macro, we can further remove large chunks of boilerplate code that try to work around the previous datapoint macro that does not support group-by.	2022-05-31 09:26:57 -07:00
Yueh-Hsuan Chiang	24634b6e25	Use the new datapoint macro that supports group-by for RocksDB read/write metrics. (#25392 ) #### Summary of Changes Use the new datapoint macro that supports group-by for RocksDB read/write perf metrics.	2022-05-26 22:17:29 -07:00
Yueh-Hsuan Chiang	5b67960c76	(Refactor) Move blocktore options related stuff to blockstore_options.rs (#25509 ) #### Problem blockstore_db.rs has a mutual dependency between blockstore_metrics.rs. #### Summary of Changes This PR removes the mutual dependency by moving the option-related stuff out from blockstore_db.rs to its new home --- blockstore_options.rs. By doing this, we address the mutual dependency and also make the code cleaner.	2022-05-26 16:59:26 -07:00
dependabot[bot]	7f4128947b	chore: bump lru from 0.7.5 to 0.7.6 (#25572 ) * chore: bump lru from 0.7.5 to 0.7.6 Bumps [lru](https://github.com/jeromefroe/lru-rs) from 0.7.5 to 0.7.6. - [Release notes](https://github.com/jeromefroe/lru-rs/releases) - [Changelog](https://github.com/jeromefroe/lru-rs/blob/master/CHANGELOG.md) - [Commits](https://github.com/jeromefroe/lru-rs/compare/0.7.5...0.7.6) --- updated-dependencies: - dependency-name: lru dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * [auto-commit] Update all Cargo lock files Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: dependabot-buildkite <dependabot-buildkite@noreply.solana.com>	2022-05-26 19:05:02 +00:00
dependabot[bot]	86d308ae50	chore: bump prost from 0.10.3 to 0.10.4 (#25574 ) * chore: bump prost from 0.10.3 to 0.10.4 Bumps [prost](https://github.com/tokio-rs/prost) from 0.10.3 to 0.10.4. - [Release notes](https://github.com/tokio-rs/prost/releases) - [Commits](https://github.com/tokio-rs/prost/compare/v0.10.3...v0.10.4) --- updated-dependencies: - dependency-name: prost dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * [auto-commit] Update all Cargo lock files Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: dependabot-buildkite <dependabot-buildkite@noreply.solana.com>	2022-05-26 17:30:15 +00:00
behzad nouri	de612c25b3	removes shred wire layout specs from sigverify (#25520 ) sigverify_shreds relies on wire layout specs of shreds: https://github.com/solana-labs/solana/blob/0376ab41a/ledger/src/sigverify_shreds.rs#L39-L46 https://github.com/solana-labs/solana/blob/0376ab41a/ledger/src/sigverify_shreds.rs#L298-L305 In preparation of https://github.com/solana-labs/solana/pull/25237 which adds a new shred variant with different layout and signed message, this commit removes shred layout specification from sigverify and instead encapsulate that in shred module.	2022-05-26 13:06:27 +00:00
behzad nouri	cafa85bfbb	includes shred-type when computing turbine broadcast seed (#25556 ) Indices for code and data shreds of the same slot overlap; and so they will have the same random number generator seed when shuffling cluster nodes for turbine broadcast. This results in the same propagation path for code and data shreds of the same index and effectively smaller sample size for re-transmitter nodes. For example a 32:32 batch (32 code + 32 data shreds), is retransmitted through _at most_ 32 unique nodes, whereas ideally we want ~64 unique re-transmitters. This commit adds shred-type to seed function so that code and data sherds of the same (slot, index) will (most likely) have different propagation paths.	2022-05-25 20:31:53 +00:00
behzad nouri	880684565c	limits read access into Packet data to Packet.meta.size (#25484 ) Bytes past Packet.meta.size are not valid to read from. The commit makes the buffer field private and instead provides two methods: * Packet::data() which returns an immutable reference to the underlying buffer up to Packet.meta.size. The rest of the buffer is not valid to read from. * Packet::buffer_mut() which returns a mutable reference to the entirety of the underlying buffer to write into. The caller is responsible to update Packet.meta.size after writing to the buffer.	2022-05-25 16:52:54 +00:00
Jeff Biseda	61c5a471e8	preserve optimistic_slot in blockstore (#25311 )	2022-05-24 12:03:28 -07:00
Justin Starry	cad1c41ce2	Add Packet::deserialize_slice convenience method	2022-05-24 17:31:14 +08:00
steviez	ec7ca411dd	Make PacketBatch packets vector non-public (#25413 ) Upcoming changes to PacketBatch to support variable sized packets will modify the internals of PacketBatch. So, this change removes usage of the internal packet struct and instead uses accessors (which are currently just wrappers of Vector functions but will change down the road).	2022-05-23 15:30:15 -05:00
Jeff Washington (jwash)	41f30a2383	stop logging misleading bank hash mismatch (#25427 )	2022-05-23 08:43:25 -05:00
Michael Vines	9d9773bd2a	Use write! instead of format! to pacify clippy	2022-05-22 22:22:21 -07:00
Michael Vines	b05c7d91ed	Fix derive_partial_eq_without_eq clippy lint	2022-05-22 22:22:21 -07:00
Brooks Prumo	f8842032c6	clippy: fix "this let-binding has unit value" warnings (#25429 )	2022-05-22 12:17:59 -04:00
Yueh-Hsuan Chiang	d3dc2db9fb	(LedgerStore) Rate-limit RocksDB perf sample by a minimum time interval (#25100 ) #### Problem The current RocksDB read/write perf metrics do not include the total operation nanos and thus we have to include all fields that might contribute to the total operation nanos. #### Summary of Changes This PR includes the total operation nanos in RocksDB's read/write perf and reduces the number of reported fields in its perf metric.	2022-05-21 16:42:33 -07:00
Jeff Biseda	8caf0aabd1	framework to preserve optimistic_slot in blockstore (#25362 )	2022-05-20 16:46:23 -07:00
Yueh-Hsuan Chiang	de2033f2f2	(LedgerStore) Rate-limit RocksDB perf sample by a minimum time interval (#25093 ) #### Problem When the number of RocksDB read/write operations spikes, its payload size might exceed the limit (413 Payload Too Large). #### Summary of Changes This PR rate-limit the perf-sampling of RocksDB read/write operations by one second in addition to the existing sampling that is configurable via the hidden validator argument --rocksdb-perf-sample-interval.	2022-05-20 10:54:27 -07:00
Michael Vines	c54e06355f	voteSubscribe pubsub notification now includes the vote transaction signature (#25291 )	2022-05-19 18:28:46 -07:00
dependabot[bot]	6e5612dd55	chore: bump libc from 0.2.125 to 0.2.126 (#25332 ) * chore: bump libc from 0.2.125 to 0.2.126 Bumps [libc](https://github.com/rust-lang/libc) from 0.2.125 to 0.2.126. - [Release notes](https://github.com/rust-lang/libc/releases) - [Commits](https://github.com/rust-lang/libc/compare/0.2.125...0.2.126) --- updated-dependencies: - dependency-name: libc dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * [auto-commit] Update all Cargo lock files Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: dependabot-buildkite <dependabot-buildkite@noreply.solana.com>	2022-05-19 13:58:12 -06:00
behzad nouri	be1d606dea	adds sanity checks to Shred::reference_tick_from_data Shred::reference_tick_from_data should check if payload is indeed a data shred and has valid size.	2022-05-18 21:56:22 +00:00
behzad nouri	e2bbc3913d	separates out data vs code shreds at the type level Working towards revising shred struct to embed versioning so that a new variant can contain merkle tree hashes of the erasure batch. To ease out migration the commit adds more type-safety by distinguishing data vs code shreds at the type level. Additionally having both data and coding headers in each shred is redundant as only one is relevant for each shred. The revised shred type in this commit will only have one type-specific header. https://github.com/solana-labs/solana/blob/c785f1ffc/ledger/src/shred.rs#L198-L203	2022-05-18 21:56:22 +00:00
dependabot[bot]	542bd0ec3c	chore: bump rayon from 1.5.2 to 1.5.3 (#25242 ) * chore: bump rayon from 1.5.2 to 1.5.3 Bumps [rayon](https://github.com/rayon-rs/rayon) from 1.5.2 to 1.5.3. - [Release notes](https://github.com/rayon-rs/rayon/releases) - [Changelog](https://github.com/rayon-rs/rayon/blob/master/RELEASES.md) - [Commits](https://github.com/rayon-rs/rayon/compare/v1.5.2...v1.5.3) --- updated-dependencies: - dependency-name: rayon dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * [auto-commit] Update all Cargo lock files Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: dependabot-buildkite <dependabot-buildkite@noreply.solana.com>	2022-05-18 09:39:57 -06:00
behzad nouri	9b13b1b712	adds const_assert_eq for shred constants (#25288 ) Adding const_assert_eq: * Documents explicitly what the constants are equal to. * Prevents introducing bugs by silently changing the constants as the code is updated.	2022-05-17 22:44:35 +00:00
buffalu	6bcadc755e	Speedup bigtable block upload by factor of 8-10x (#24534 ) Added multiple blockstore read threads. Run the bigtable upload in tokio::spawn context. Run bigtable tx and tx-by-addr uploads in tokio::spawn context.	2022-05-17 00:21:05 -06:00
Yueh-Hsuan Chiang	5625959f7e	(LedgerStore) Change perf_samples_counter from Arc<AtomicUsize> to AtomicUsize (#25043 ) #### Problem After #25042, each LedgerColumn has its own BlockstoreRocksDbWritePerfMetrics and BlockstoreRocksDbReadPerfMetrics instances. As it has total ownership, its member field does not need to use Arc. #### Summary of Changes Change perf_samples_counter from Arc<AtomicUsize> to AtomicUsize under BlockstoreRocksDbWritePerfMetrics and BlockstoreRocksDbReadPerfMetrics.	2022-05-16 11:31:07 -07:00
Tyera Eulberg	bc005e3408	Add configurable limit to number of blocks to check before Bigtable upload (#24716 ) * Add ConfirmedBlockUploadConfig, no behavior changes * Add comment * A little DRY cleanup * Add configurable limit to number of blocks to check in Blockstore and Bigtable before uploading * Limit blockstore and bigtable look-ahead * Exit iterator early when reach ending_slot * Use rooted_slot_iterator instead of slot_meta_iterator * Only check blocks in the ledger	2022-05-13 07:34:02 +00:00
Jason	08da486c05	additional costs in block capacity calc (#25059 ) * Added additional costs to block capacity computation, and pushed alloc of CostModel all the way to the top of the call chain, instead of reallocing * Fix two compiler errors * Update block processing to propagate computed costs, rather than re-computing deeper in the call stack * Clippy fix * Reformatting fix after merge * Add CostModel::sum_without_bpf	2022-05-12 13:52:20 -05:00
Yueh-Hsuan Chiang	b2dcda8980	(LedgerStore) Move metric sample counters out from LedgerColumnOptions (#25042 ) #### Problem LedgerColumnOptions contain two fields, perf_read_counter and perf_write_counter, that are not really options but internal counters. #### Summary of Changes This PR introduces BlockstoreRocksDbPerfSamplingStatus, a struct that holds internal status for RocksDB perf sampling and moves perf_read_counter and perf_write_counter out from LedgerColumnOptions.	2022-05-10 16:13:19 -07:00
Pankaj Garg	c838e15234	Unset needs_unlock for rebatched transactions batches (#25095 ) * Unset needs_unlock for rebatched transactions batches * address review comments	2022-05-10 13:39:08 -07:00
DimAn	2fa9bc3e70	Add options to store full and/or incremental snapshots in separate locations (#24247 )	2022-05-10 16:37:41 -04:00

1 2 3 4 5 ...

958 Commits