solana

Commit Graph

Author	SHA1	Message	Date
behzad nouri	5e1cf39c74	adds metrics for number of outgoing shreds in retransmit stage (#20882)	2021-10-24 13:12:27 +00:00
behzad nouri	0c0384ec32	revises turbine peers shuffling order (#20480) Turbine randomly shuffles cluster nodes on a broadcast tree for each shred. This requires knowing the stakes and nodes' contact-infos (from gossip). However gossip is subject to partitioning and propagation delays. Additionally unstaked nodes may join and leave the cluster at any moment, changing the cluster view from one node to another. This commit: * Always arranges the unstaked nodes at the bottom of turbine broadcast tree. * Staked nodes are always included regardless of if their contact-info is available in gossip or not. * Uses the unbiased WeightedShuffle construct for shuffling nodes.	2021-10-14 15:09:36 +00:00
Lijun Wang	fe97cb2ddf	AccountsDb plugin framework (#20047) Summary of Changes: Create a plugin mechanism in the accounts update path so that accounts data can be streamed out to external data stores (be it Kafka or Postgres). The plugin mechanism allows: * Connection strings/credentials of data stores to be configured. * Accounts to be streamed to be selected by patterns. * Streaming implementations for different destination stores (e.g. PostgreSQL) to be plugged in. The code comprises 4 major parts: * accountsdb-plugin-intf: defines the plugin interface which a concrete plugin should implement. * accountsdb-plugin-manager: manages the load/unload of plugins and provides interfaces through which the validator can notify plugins of account updates. * accountsdb-plugin-postgres: the concrete plugin implementation for PostgreSQL. * The validator integrations: account updates are streamed right after snapshot restore and after account updates from transaction processing or other real updates. The plugin is optionally loaded on demand by a new validator CLI argument -- there is no impact if the plugin is not loaded.	2021-09-30 14:26:17 -07:00
Brooks Prumo	a0552e5b46	Make startup aware of Incremental Snapshots (#19600)	2021-09-07 20:43:43 +00:00
behzad nouri	01a7ec8198	uses rayon thread-pool for retransmit-stage parallelization (#19486)	2021-09-07 15:15:01 +00:00
Brooks Prumo	e9374d32a3	Revert "Make startup aware of Incremental Snapshots (#19550)" (#19599) This reverts commit `d45ced0a5d`.	2021-09-02 19:14:41 -05:00
Brooks Prumo	d45ced0a5d	Make startup aware of Incremental Snapshots (#19550)	2021-09-02 19:05:15 -05:00
behzad nouri	6d9818b8e4	skips retransmit for shreds with unknown slot leader (#19472) Shreds' signatures should be verified before they reach retransmit stage, and if the leader is unknown they should fail signature check. Therefore retransmit-stage can as well expect to know who the slot leader is and otherwise just skip the shred. Blockstore checking signature of recovered shreds before sending them to retransmit stage: https://github.com/solana-labs/solana/blob/4305d4b7b/ledger/src/blockstore.rs#L884-L930 Shred signature verifier: https://github.com/solana-labs/solana/blob/4305d4b7b/core/src/sigverify_shreds.rs#L41-L57 https://github.com/solana-labs/solana/blob/4305d4b7b/ledger/src/sigverify_shreds.rs#L105	2021-09-01 15:44:26 +00:00
behzad nouri	7a8807b8bb	retransmits shreds recovered from erasure codes Shreds recovered from erasure codes have not been received from turbine and have not been retransmitted to other nodes downstream. This results in more repairs across the cluster which is slower. This commit channels through recovered shreds to retransmit stage in order to further broadcast the shreds to downstream nodes in the tree.	2021-08-17 13:44:10 +00:00
behzad nouri	3efccbffab	sends shreds (instead of packets) to retransmit stage Working towards channelling through shreds recovered from erasure codes to retransmit stage.	2021-08-17 13:44:10 +00:00
behzad nouri	6e413331b5	removes erroneous uses of Arc<...> from retransmit stage	2021-08-17 13:44:10 +00:00
behzad nouri	bf437b0336	removes packet-count metrics from retransmit stage Working towards sending shreds (instead of packets) to retransmit stage so that shreds recovered from erasure codes are as well retransmitted. Following commit will add these metrics back to window-service, earlier in the pipeline.	2021-08-17 13:44:10 +00:00
behzad nouri	b64eeb7729	removes erroneous uses of &Arc<...> from window-service	2021-08-13 17:26:31 +00:00
behzad nouri	e4be00fece	falls back on working-bank if root-bank::epoch-staked-nodes is none bank.get_leader_schedule_epoch(shred_slot) is one epoch after epoch_schedule.get_epoch(shred_slot). At epoch boundaries, shred is already one epoch after the root-slot. So we need epoch-stakes 2 epochs ahead of the root. But the root bank only has epoch-stakes for one epoch ahead, and as a result looking up epoch staked-nodes from the root-bank fails. To be backward compatible with the current master code, this commit implements a fallback on working-bank if epoch staked-nodes obtained from the root-bank is none.	2021-08-05 21:47:33 +00:00
behzad nouri	50d0e830c9	unifies cluster-nodes computation & caching across turbine stages Broadcast-stage is using epoch_staked_nodes based on the same slot that shreds belong to: https://github.com/solana-labs/solana/blob/049fb0417/core/src/broadcast_stage/standard_broadcast_run.rs#L208-L228 https://github.com/solana-labs/solana/blob/0cf52e206/core/src/broadcast_stage.rs#L342-L349 But retransmit-stage is using bank-epoch of the working-bank: https://github.com/solana-labs/solana/blob/19bd30262/core/src/retransmit_stage.rs#L272-L289 So the two are not consistent at epoch boundaries where some nodes may have a working bank (or similarly a root bank) lagging other nodes. As a result the node which obtains a packet may construct turbine broadcast tree inconsistently with its parent node in the tree and so some packets may fail to reach all nodes in the tree.	2021-08-05 21:47:33 +00:00
behzad nouri	30bec3921e	uses cluster-nodes cache in retransmit stage The new cluster-nodes cache will: * ensure cluster-nodes are recalculated if the epoch (and so the epoch staked nodes) changes. * encapsulate time-to-live eviction policy.	2021-08-05 21:47:33 +00:00
Ryo Onodera	da480bdb5f	Fix unstable retransmit-num_nodes (#18970)	2021-07-29 17:32:32 +00:00
behzad nouri	d06dc6c8a6	shares cluster-nodes between retransmit threads (#18947) cluster_nodes and last_peer_update are not shared between retransmit threads, as each thread has its own value: https://github.com/solana-labs/solana/blob/65ccfed86/core/src/retransmit_stage.rs#L476-L477 Additionally, with shared references, this code: https://github.com/solana-labs/solana/blob/0167daa11/core/src/retransmit_stage.rs#L315-L328 has a concurrency bug where the thread which does compare_and_swap, updates cluster_nodes much later after other threads have run with outdated cluster_nodes for a while. In particular, the write-lock there may block.	2021-07-29 16:20:15 +00:00
sakridge	84e78316b1	Write helper for multithread update (#18808)	2021-07-29 03:16:36 +02:00
carllin	c0704d4ec9	Plumb signal from replay to ancestor hashes service (#18880)	2021-07-26 20:59:00 -07:00
carllin	1ee64afb12	Introduce AncestorHashesService (#18812)	2021-07-23 16:54:47 -07:00
behzad nouri	d2d5f36a3c	adds validator flag to allow private ip addresses (#18850)	2021-07-23 15:25:03 +00:00
behzad nouri	04787be8b1	encapsulates turbine peers computations of broadcast & retransmit stages (#18238) Broadcast stage and retransmit stage should arrange nodes on turbine broadcast tree in exactly the same order. Additionally any changes to this ordering (e.g. updating how unstaked nodes are handled) require feature gating to keep the cluster in sync. Current implementation is scattered out over several public methods and exposes too much of implementation details (e.g. usize indices into peers vector) which makes code changes and checking for feature activations more difficult. This commit encapsulates turbine peer computations into a new struct, and only exposes two public methods, get_broadcast_peer and get_retransmit_peers, for call-sites.	2021-07-07 00:35:25 +00:00
Jeff Washington (jwash)	ec2f930475	user process.accounts_db_test_hash_calculation for debug_verify hash (#18053)	2021-06-21 10:20:27 -05:00
Michael Vines	4a12c715a3	Drop Error suffix from enum values to avoid the enum_variant_names clippy lint	2021-06-18 23:02:13 +00:00
Michael Vines	fa04531c7a	Extricate RpcCompletedSlotsService from RetransmitStage	2021-06-16 16:20:35 -07:00
behzad nouri	161838655c	removes port-based forwarding logic from turbine retransmit (#17716) Turbine retransmit logic is based on which socket it received the packet from (i.e. `packet.meta.forward`): https://github.com/solana-labs/solana/blob/708bbcb00/core/src/retransmit_stage.rs#L467-L470 This can leave the cluster vulnerable to spoofing and selective propagation of packets; see https://github.com/solana-labs/solana/issues/6672 https://github.com/solana-labs/solana/pull/7774 This commit identifies if the node is on the "critical path" based on its index in the shuffled cluster. If so, it forwards the packet to both neighbors and children; otherwise, the packet is only forwarded to the children. The metrics added in https://github.com/solana-labs/solana/pull/17351 show that the index very rarely fails to match the port, and therefore this change should be safe.	2021-06-15 13:19:41 +00:00
behzad nouri	be957f25c9	adds fallback logic if retransmit multicast fails (#17714) In retransmit-stage, based on the packet.meta.seed and resulting children/neighbors, each packet is sent to a different set of peers: https://github.com/solana-labs/solana/blob/708bbcb00/core/src/retransmit_stage.rs#L421-L457 However, current code errors out as soon as a multicast call fails, which will skip all the remaining packets: https://github.com/solana-labs/solana/blob/708bbcb00/core/src/retransmit_stage.rs#L467-L470 This can exacerbate packet loss in turbine. This commit: * keeps iterating over retransmit packets for loop even if some intermediate sends fail. * adds a fallback to UdpSocket::send_to if multicast fails. Recent discord chat: https://discord.com/channels/428295358100013066/689412830075551748/849530845052403733	2021-06-04 12:16:37 +00:00
carllin	96ba2edfeb	Switch EpochSlots to be frozen slots, not completed slots (#17168)	2021-06-03 00:20:00 +00:00
Tyera Eulberg	9a5330b7eb	Move gossip modules into solana-gossip crate (#17352) * Move gossip modules to solana-gossip * Update Protocol abi digest due to move * Move gossip benches and hook up CI * Remove unneeded Result entries * Single use statements	2021-05-26 09:15:46 -06:00
behzad nouri	ff0e623d30	removes the nested for loop from retransmit-stage The code can be simplified by just flattening the vector of packets.	2021-05-21 17:10:56 +00:00
behzad nouri	71de021177	adds metric for turbine retransmit tree mismatch In order to remove port-based forwarding logic in turbine, we need to first track how often the turbine retransmit/broadcast trees mismatch across nodes. One consistency condition is that if the node is on the critical path (i.e. the first node in each neighborhood), then we expect that the packet arrives at tvu socket as opposed to tvu-forwards. This commit adds a metric to track how often above condition is not met.	2021-05-21 17:10:56 +00:00
Tao Zhu	0781fe1b4f	Upgrade Rust to 1.52.0 (#17096) * Upgrade Rust to 1.52.0 update nightly_version to newly pushed docker image fix clippy lint errors 1.52 comes with grcov 0.8.0, include this version to script * upgrade to Rust 1.52.1 * disabling Serum from downstream projects until it is upgraded to Rust 1.52.1	2021-05-19 09:31:47 -05:00
Tyera Eulberg	827355a6b1	Create solana-rpc crate and move subscriptions (#17320) * Move non_circulating_supply to runtime * Add solana-rpc crate and move max_slots * Move subscriptions to solana-rpc * Single use statements	2021-05-19 00:54:28 -06:00
Tyera Eulberg	6e9deaf1bd	Move block-time caching earlier (#17109) * Require that blockstore block-time only be recognized slot, instead of root * Move cache_block_time to after Bank freeze * Single use statement * Pass transaction_status_sender by reference * Remove unnecessary slot-existence check before caching block time altogether * Move block-time existence check into Blockstore::cache_block_time, Blockstore no longer needed in blockstore_processor helper	2021-05-10 13:14:56 -06:00
behzad nouri	81ad795d46	removes position field in coding-shred-header CodingShredHeader.position is equal to ShredCommonHeader.index - ShredCommonHeader.fec_set_index and so is redundant. The extra position field can add bugs if not consistent with index and fec_set_index.	2021-05-10 13:20:56 +00:00
behzad nouri	9706512115	removes old runtime feature gates in gossip and turbine (#16633)	2021-04-26 17:12:02 +00:00
Michael Vines	a911ae00ba	clippy	2021-04-18 20:55:02 -07:00
carllin	52703badfa	Setup ReplayStage confirmation scaffolding for duplicate slots (#9698)	2021-03-24 23:41:52 -07:00
behzad nouri	570fd3f810	makes turbine peer computation consistent between broadcast and retransmit (#14910) get_broadcast_peers is using tvu_peers: https://github.com/solana-labs/solana/blob/84e52b606/core/src/broadcast_stage.rs#L362-L370 which is potentially inconsistent with retransmit_peers: https://github.com/solana-labs/solana/blob/84e52b606/core/src/cluster_info.rs#L1332-L1345 Also, the leader does not include its own contact-info when broadcasting shreds: https://github.com/solana-labs/solana/blob/84e52b606/core/src/cluster_info.rs#L1324 but on the retransmit side, slot leader is removed only _after_ neighbors and children are computed: https://github.com/solana-labs/solana/blob/84e52b606/core/src/retransmit_stage.rs#L383-L384 So the turbine broadcast tree is different between the two stages. This commit: * Removes retransmit_peers. Broadcast and retransmit stages will use tvu_peers consistently. * Retransmit stage removes slot leader _before_ computing children and neighbors.	2021-03-24 13:34:48 +00:00
Justin Starry	918d04e3f0	Add more slot update notifications (#15734) * Add more slot update notifications * fix merge * Address feedback and add integration test * switch to datapoint * remove unused shred method * fix clippy * new thread for rpc completed slots * remove extra constant * fixes * rely on channel closing * fix check	2021-03-12 21:44:06 +08:00
carllin	331c45decf	Report datapoint on number of retransmit shreds (#15694)	2021-03-08 17:54:53 -08:00
carllin	ae96ba3459	Plumb slot update pubsub notifications (#15488)	2021-02-28 23:29:11 -08:00
sakridge	1b59b163dd	Add max retransmit and shred insert slot (#15475)	2021-02-23 13:06:33 -08:00
behzad nouri	e1021d9f83	removes redundant epoch stakes cache in retransmit (#14781) Following `d6d76219b`, staked nodes computed from vote accounts are already cached in runtime::Stakes, so the caching in retransmit_stage is redundant.	2021-01-24 21:15:09 +00:00
Michael Vines	cbffab7850	Upgrade to Rust v1.49.0	2021-01-23 19:16:36 -08:00
behzad nouri	b5fd0ed859	rewrites turbine retransmit peers computation (#14584)	2021-01-19 04:18:47 +00:00
behzad nouri	c6ae0667e6	feature gates turbine retransmit peers patch (#14631)	2021-01-19 04:16:19 +00:00
behzad nouri	cfcca1cd3c	patches bug in turbine's neighbors computation (#14565) Removing local node's index early from the set here: https://github.com/solana-labs/solana/blob/e1b59ded4/core/src/retransmit_stage.rs#L346 distorts the order of nodes depending on which node is computing the turbine fan-out tree, and results in incorrect neighbors computation.	2021-01-13 22:25:29 +00:00
sakridge	c693ffaa08	Fix subtraction overflow in metrics (#14290)	2020-12-27 16:26:22 -08:00
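
Note on 81ad795d46: the message states that CodingShredHeader.position equals ShredCommonHeader.index - ShredCommonHeader.fec_set_index. A minimal Rust sketch of deriving the position instead of storing it is given here. The struct layout, the u32 field types, and the helper name coding_position are assumptions made for illustration; they are not the actual definitions in the solana ledger code.

```rust
/// Hypothetical, simplified stand-in for the common shred header.
/// The field set and the u32 types are assumed for this sketch only.
#[derive(Debug, Clone, Copy)]
pub struct ShredCommonHeader {
    pub index: u32,
    pub fec_set_index: u32,
}

/// Derives the position of a coding shred within its FEC set as
/// index - fec_set_index, per the relation stated in the commit message.
/// Returns None when the header is inconsistent (index < fec_set_index),
/// rather than wrapping around or panicking on underflow.
pub fn coding_position(header: &ShredCommonHeader) -> Option<u32> {
    header.index.checked_sub(header.fec_set_index)
}
```

Computing the value on demand leaves no second copy that can drift out of sync with index and fec_set_index, which is the risk the commit message names.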

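Note on 30bec3921e: the message describes a cluster-nodes cache that recalculates when the epoch changes and encapsulates a time-to-live eviction policy. The following is a rough Rust sketch of that general idea only. The type EpochCache, the method get_or_compute, the u64 epoch key, and passing the current time explicitly are all hypothetical choices for illustration and do not reflect the actual cache in the solana code base.

```rust
use std::collections::HashMap;
use std::sync::Arc;
use std::time::{Duration, Instant};

/// Hypothetical cache keyed by epoch; entries expire once older than `ttl`.
pub struct EpochCache<T> {
    ttl: Duration,
    entries: HashMap<u64, (Instant, Arc<T>)>,
}

impl<T> EpochCache<T> {
    pub fn new(ttl: Duration) -> Self {
        Self { ttl, entries: HashMap::new() }
    }

    /// Returns the value cached for `epoch`. The value is recomputed with
    /// `compute` when no entry exists for that epoch (e.g. the epoch changed)
    /// or when the existing entry has outlived the time-to-live.
    pub fn get_or_compute(
        &mut self,
        epoch: u64,
        now: Instant,
        compute: impl FnOnce() -> T,
    ) -> Arc<T> {
        // Time-to-live eviction: drop every entry created `ttl` or more ago.
        let ttl = self.ttl;
        self.entries
            .retain(|_, (created, _)| now.saturating_duration_since(*created) < ttl);
        // Insert a freshly computed value only if the epoch has no live entry.
        let entry = self
            .entries
            .entry(epoch)
            .or_insert_with(|| (now, Arc::new(compute())));
        Arc::clone(&entry.1)
    }
}
```

Taking `now` as a parameter rather than calling Instant::now() inside the method is a choice made here so that expiry can be exercised deterministically in tests.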

136 Commits