solana

Commit Graph

Author	SHA1	Message	Date
behzad nouri	72537e7e07	bypasses rayon thread-pool for single entry batches (#28077 ) With no parallelization, thread-pool only adds overhead.	2022-09-26 21:32:58 +00:00
behzad nouri	f49beb0cbc	caches reed-solomon encoder/decoder instance (#27510 ) ReedSolomon::new(...) initializes a matrix and a data-decode-matrix cache: https://github.com/rust-rse/reed-solomon-erasure/blob/273ebbced/src/core.rs#L460-L466 In order to cache this computation, this commit caches the reed-solomon encoder/decoder instance for each (data_shards, parity_shards) pair.	2022-09-25 18:09:47 +00:00
behzad nouri	97c9af4c6b	plumbs through flag to generate merkle variant of shreds	2022-09-23 16:45:18 +00:00
behzad nouri	7d3f3b2f7d	generates merkle shreds from ledger entries The commit adds methods to convert &[Entry] to vector of Merkle shreds.	2022-09-23 16:45:18 +00:00
Brennan Watt	dfdb422fb1	Minor shred constant cleanup (#27472 ) * Minor shred constant cleanup to eliminate magic number	2022-08-30 18:53:05 -07:00
Michael Vines	3f4731b37f	Standardize thread names Tenets: 1. Limit thread names to 15 characters 2. Prefix all Solana-controlled threads with "sol" 3. Use Camel case. It's more character dense than Snake or Kebab case	2022-08-20 07:49:39 -07:00
behzad nouri	c0b63351ae	recovers merkle shreds from erasure codes (#27136 ) The commit * Identifies Merkle shreds when recovering from erasure codes and dispatches specialized code to reconstruct shreds. * Coding shred headers are added to recovered erasure shards. * Merkle tree is reconstructed for the erasure batch and added to recovered shreds. * The common signature (for the root of Merkle tree) is attached to all recovered shreds.	2022-08-19 21:07:32 +00:00
behzad nouri	b3b57a0f07	adjusts max coding shreds per slot (#27083 ) As a consequence of removing buffering when generating coding shreds: https://github.com/solana-labs/solana/pull/25807 more coding shreds are generated than data shreds, and so MAX_CODE_SHREDS_PER_SLOT needs to be adjusted accordingly. The respective value is tied to ERASURE_BATCH_SIZE.	2022-08-12 18:02:01 +00:00
behzad nouri	ac91cdab74	removes buffering when generating coding shreds in broadcast (#25807 ) Given the 32:32 erasure recovery schema, the current implementation requires exactly 32 data shreds to generate coding shreds for the batch (except for the final erasure batch in each slot). As a result, when serializing ledger entries to data shreds, if the number of data shreds is not a multiple of 32, the coding shreds for the last batch cannot be generated until there are more data shreds to complete the batch to 32 data shreds. This adds latency in generating and broadcasting coding shreds. In addition, with Merkle variants for shreds, data shreds cannot be signed and broadcast until coding shreds are also generated. As a result both code and data shreds will be delayed before broadcast if we still require exactly 32 data shreds for each batch. This commit instead always generates and broadcasts coding shreds as soon as any number of data shreds are available. When serializing entries to shreds: * if the number of resulting data shreds is less than 32, then more coding shreds will be generated so that the resulting erasure batch has the same recovery probabilities as a 32:32 batch. * if the number of data shreds is more than 32, then the data shreds are split uniformly into erasure batches with _at least_ 32 data shreds in each batch. Each erasure batch will have the same number of code and data shreds. For example: * If there are 19 data shreds, 27 coding shreds are generated. The resulting 19(data):27(code) erasure batch has the same recovery probabilities as a 32:32 batch. * If there are 107 data shreds, they are split into 3 batches of 36:36, 36:36 and 35:35 data:code shreds each. A consequence of this change is that code and data shred indices will no longer align, as there will be more coding shreds than data shreds (not only in the last batch in each slot but also in the intermediate ones).	2022-08-11 12:44:27 +00:00
behzad nouri	403b2e4841	records num data shreds obtained from serializing entries (#26888 )	2022-08-03 17:07:40 +00:00
behzad nouri	1181510531	infers erasure batches from FEC-set indices of data shreds (#26873 ) data_shreds_to_coding_shreds relies on the hardcoded MAX_DATA_SHREDS_PER_FEC_BLOCK batches of data shreds: https://github.com/solana-labs/solana/blob/e74ad90cd/ledger/src/shredder.rs#L175-L183 This hardcoded logic is unnecessary since the shreds belonging to the same erasure batch can be identified from a run of the same FEC-set index.	2022-08-02 16:05:27 +00:00
behzad nouri	f534b8981b	maps number of data shreds to erasure batch size (#25917 ) In preparation for https://github.com/solana-labs/solana/pull/25807 which reworks erasure batch sizes, this commit: * adds a helper function mapping the number of data shreds to the erasure batch size. * adds ProcessShredsStats to Shredder::entries_to_shreds in order to replace and remove entries_to_data_shreds from the public interface.	2022-06-23 13:27:54 +00:00
behzad nouri	31b3e0e15a	adds metric tracking wasted data buffer in shreds (#25972 )	2022-06-16 16:14:00 +00:00
behzad nouri	5f04512d3a	adds a new shred variant embedding merkle tree hashes of the erasure batch (#25237 ) Coding shreds can only be signed once erasure codings are already generated. Therefore coding shreds recovered from erasure codings lack slot leader's signature and so cannot be retransmitted to the rest of the cluster. shred/merkle.rs implements a new shred variant where we generate merkle tree for each erasure encoded batch and each shred includes: * root of the merkle tree (Hash truncated to 20 bytes). * slot leader's signature of the root of the merkle tree. * merkle tree nodes along the branch the shred belongs to, where hashes are trimmed to 20 bytes during tree construction. This schema results in the same signature for all shreds within an erasure batch. When recovering shreds from erasure codes, we can reconstruct merkle tree for the batch and for each recovered shred also recover respective merkle tree branch; then snap the slot leader's signature from any of the shreds received from turbine and retransmit all recovered code or data shreds. Backward compatibility is achieved by encoding shred variant at byte 65 of payload (previously shred-type at this position): * 0b0101_1010 indicates a legacy coding shred, which is also equal to ShredType::Code for backward compatibility. * 0b1010_0101 indicates a legacy data shred, which is also equal to ShredType::Data for backward compatibility. * 0b0100_???? indicates a merkle coding shred with merkle branch size indicated by the last 4 bits. * 0b1000_???? indicates a merkle data shred with merkle branch size indicated by the last 4 bits. Merkle root and branch are encoded at the end of the shred payload.	2022-06-07 22:41:03 +00:00
behzad nouri	6c9f2eac78	removes fec_set_offset from UnfinishedSlotInfo (#25815 ) If the blockstore has shreds for a slot, it should not recreate the slot: https://github.com/solana-labs/solana/blob/ff68bf6c2/ledger/src/leader_schedule_cache.rs#L142-L146 https://github.com/solana-labs/solana/pull/15849/files#r596657314 Therefore in broadcast stage if UnfinishedSlotInfo is None, then fec_set_offset will be zero: https://github.com/solana-labs/solana/blob/ff68bf6c2/core/src/broadcast_stage/standard_broadcast_run.rs#L111-L120 As a result fec_set_offset will always be zero, so it is redundant and can be removed.	2022-06-07 22:17:37 +00:00
behzad nouri	81231a89b9	adds support for different variants of ShredCode and ShredData The commit implements two new types: pub enum ShredCode { Legacy(legacy::ShredCode), } pub enum ShredData { Legacy(legacy::ShredData), } Following commits will extend these types by adding merkle variants: pub enum ShredCode { Legacy(legacy::ShredCode), Merkle(merkle::ShredCode), } pub enum ShredData { Legacy(legacy::ShredData), Merkle(merkle::ShredData), }	2022-06-02 18:55:50 +00:00
behzad nouri	29cfa04c05	records number of residual data shreds which don't make a full batch (#25693 ) Data shreds are batched into MAX_DATA_SHREDS_PER_FEC_BLOCK shreds for each erasure batch. If there are residual shreds not making a full batch, then we cannot generate coding shreds and need to buffer shreds until there is a full batch. This may add latency to coding shreds generation and broadcast. In order to evaluate upcoming changes removing this buffering logic, this commit adds metrics tracking the residual number of data shreds which don't make a full batch.	2022-06-02 00:32:32 +00:00
behzad nouri	de612c25b3	removes shred wire layout specs from sigverify (#25520 ) sigverify_shreds relies on wire layout specs of shreds: https://github.com/solana-labs/solana/blob/0376ab41a/ledger/src/sigverify_shreds.rs#L39-L46 https://github.com/solana-labs/solana/blob/0376ab41a/ledger/src/sigverify_shreds.rs#L298-L305 In preparation for https://github.com/solana-labs/solana/pull/25237 which adds a new shred variant with different layout and signed message, this commit removes shred layout specification from sigverify and instead encapsulates it in the shred module.	2022-05-26 13:06:27 +00:00
behzad nouri	be1d606dea	adds sanity checks to Shred::reference_tick_from_data Shred::reference_tick_from_data should check if payload is indeed a data shred and has valid size.	2022-05-18 21:56:22 +00:00
behzad nouri	e2bbc3913d	separates out data vs code shreds at the type level Working towards revising shred struct to embed versioning so that a new variant can contain merkle tree hashes of the erasure batch. To ease migration, the commit adds more type-safety by distinguishing data vs code shreds at the type level. Additionally, having both data and coding headers in each shred is redundant as only one is relevant for each shred. The revised shred type in this commit will only have one type-specific header. https://github.com/solana-labs/solana/blob/c785f1ffc/ledger/src/shred.rs#L198-L203	2022-05-18 21:56:22 +00:00
behzad nouri	a01291069a	initializes thread-pools with lazy_static instead of thread_local (#24853 ) In addition to thread_local -> lazy_static change, a number of thread-pools are initialized with get_max_thread_count to achieve parity with the older code in terms of number of validator threads.	2022-05-05 20:00:50 +00:00
behzad nouri	eff59193db	enforces that LAST_SHRED_IN_SLOT is also DATA_COMPLETE_SHRED (#24892 ) A data shred cannot be LAST_SHRED_IN_SLOT if not also DATA_COMPLETE_SHRED. So LAST_SHRED_IN_SLOT should also imply DATA_COMPLETE_SHRED: https://github.com/solana-labs/solana/blob/74b586ae7/ledger/src/shredder.rs#L116-L117 https://github.com/solana-labs/solana/blob/74b586ae7/core/src/broadcast_stage/standard_broadcast_run.rs#L80-L81 However current shred constructs allow specifying a shred which is LAST_SHRED_IN_SLOT but not DATA_COMPLETE_SHRED: https://github.com/solana-labs/solana/blob/74b586ae7/ledger/src/shred.rs#L117-L118 https://github.com/solana-labs/solana/blob/74b586ae7/ledger/src/shred.rs#L272-L273 The commit updates ShredFlags so that if a shred is not DATA_COMPLETE_SHRED it cannot be LAST_SHRED_IN_SLOT either.	2022-05-02 23:33:53 +00:00
behzad nouri	e812430e28	defines shred flags using bitflags crate (#24874 ) Shred flags use raw bit-masking ops, which lack type-safety: https://github.com/solana-labs/solana/blob/a829ddc92/ledger/src/shred.rs#L112-L114 This commit instead uses bitflags crate to define shred flags.	2022-05-01 19:25:15 +00:00
behzad nouri	4b62c93f87	removes erasure::Session, using Reed-Solomon constructs directly The extra wrapping and indirection by the Session struct is not used in any form. The commit removes Session and instead uses Reed-Solomon constructs directly.	2022-04-29 23:02:29 +00:00
behzad nouri	08e1727926	minor edits in shred and shredder (#24841 ) Removed Default implementation for ShredType. ShredType should always be explicitly specified, and not rely on default values. Simplified single-arg Shred Error variants to use shorter syntax. Renamed erasure blocks to shards, to be consistent with reed_solomon crate and not to confuse with FEC blocks.	2022-04-29 19:42:15 +00:00
behzad nouri	008860bb36	removes SHRED_PAYLOAD_SIZE from shred public interface (#24806 )	2022-04-28 23:42:37 +00:00
behzad nouri	1e86d3ab59	moves shredder out of the shred module (#24755 ) Working towards embedding versioning into shreds binary; so Shredder can no longer depend on internals of Shred.	2022-04-27 22:00:04 +00:00
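
The shred variant byte described in commit 5f04512d3a (byte 65 of the payload) can be illustrated with a small decoder. This is only a sketch built from the bit patterns quoted in that commit message; the names ShredVariant, branch_size and decode_shred_variant are hypothetical and are not taken from the ledger crate, whose actual types may differ.

```rust
/// Illustrative model of the variant byte at offset 65 of a shred payload.
/// The enum and field names are assumptions made for this sketch.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ShredVariant {
    LegacyCode,
    LegacyData,
    MerkleCode { branch_size: u8 },
    MerkleData { branch_size: u8 },
}

/// Decodes the variant byte using the patterns from the commit message:
/// 0b0101_1010 legacy code, 0b1010_0101 legacy data,
/// 0b0100_???? merkle code, 0b1000_???? merkle data,
/// where the low 4 bits carry the merkle branch size.
fn decode_shred_variant(byte: u8) -> Option<ShredVariant> {
    match byte {
        0b0101_1010 => Some(ShredVariant::LegacyCode),
        0b1010_0101 => Some(ShredVariant::LegacyData),
        _ => match byte & 0xF0 {
            0b0100_0000 => Some(ShredVariant::MerkleCode {
                branch_size: byte & 0x0F,
            }),
            0b1000_0000 => Some(ShredVariant::MerkleData {
                branch_size: byte & 0x0F,
            }),
            // Any other bit pattern is not a known variant.
            _ => None,
        },
    }
}
```

The two legacy values have high nibbles 0101 and 1010, so they cannot collide with the merkle prefixes 0100 and 1000; matching the legacy bytes first simply makes that explicit.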

27 Commits
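
The uniform batch split described in commit ac91cdab74 can be sketched as follows. This is an assumed reading of the commit message, not the code from shredder.rs: the function name split_data_shreds and the constant DATA_SHREDS_PER_BATCH are hypothetical. The sketch covers only how data shreds are divided among batches; the number of coding shreds for batches with fewer than 32 data shreds (for example 19 data to 27 code) comes from a mapping that the commit message does not spell out, so it is left out here.

```rust
/// Nominal number of data shreds per erasure batch (the "32" in 32:32).
/// The constant name is an assumption made for this sketch.
const DATA_SHREDS_PER_BATCH: usize = 32;

/// Splits `num_data_shreds` into erasure batches as evenly as possible,
/// keeping at least 32 data shreds per batch whenever 32 or more exist.
/// Returns the number of data shreds in each batch.
fn split_data_shreds(num_data_shreds: usize) -> Vec<usize> {
    if num_data_shreds == 0 {
        return Vec::new();
    }
    // Fewer than 32 shreds still form a single (smaller) batch.
    let num_batches = (num_data_shreds / DATA_SHREDS_PER_BATCH).max(1);
    let base = num_data_shreds / num_batches;
    let extra = num_data_shreds % num_batches;
    // The first `extra` batches take one additional shred each.
    (0..num_batches)
        .map(|i| if i < extra { base + 1 } else { base })
        .collect()
}
```

With 107 data shreds this yields batches of 36, 36 and 35, matching the example in the commit message; for batches of 32 or more data shreds the message states that each batch gets the same number of coding shreds as data shreds.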