zebra

Commit Graph

Author	SHA1	Message	Date
Alfredo Garcia	7d42c63790	fix comment	2020-11-25 10:55:44 -08:00
teor	8d6ac8eece	Placate clippy	2020-11-24 20:03:21 +10:00
Henry de Valence	d90e709ce1	network: tidy peer set implementation - rename functions more descriptively - create a common `take_ready_service` function - organize poll_ functions separately	2020-11-24 20:03:21 +10:00
Henry de Valence	f36a4800b2	network: fix invariant violation in peer set Closes #1183. The peer set maintains a preselected ready service that it can use to perform power-of-two-choices (p2c) routing of requests. Ready services are stored by key (socket address) in an `IndexMap`, and the preselected service is represented by an `Option<usize>` indexing that map. This means that whenever the set of ready services changes (e.g., a service is removed from the peer set, or a service is taken to be used to process a request), the preselected index is invalidated. The original P2C-only implementation maintained this invariant but did not document it. The change to inventory-based routing introduced a bug by failing to maintain this invariant and appropriately invalidate the preselected index. However, this was only noticeable approximately 1/N of the time on the next request after an inventory-directed request, so the bug occurred infrequently. Luckily, the use of `.expect` caused the bug to be an immediate panic, making it possible to identify by inspecting all uses of the ready service map.	2020-11-24 20:03:21 +10:00
teor	6387dfe1d0	Fix individual crate compilation failures Some Zebra crates don't compile individually due to missing features in their dependencies. Add those features to each crate's dependency list.	2020-11-23 23:56:28 -08:00
Henry de Valence	add94c1c45	deps: move to tokio 0.3, tower 0.4 This change is mostly mechanical, with the exception of the changes to the `tower-batch` middleware. This middleware was adapted from `tower::buffer`, and the `tower::buffer` code was changed to implement its own bounded queue, because Tokio 0.3 removed the `mpsc::Sender::poll_send` method. See `ddc64e8d4d` for more context on the Tower changes. To match Tower as closely as possible in order to be able to upstream `tower-batch`, those changes are copied from `tower::Buffer` to `tower-batch`.	2020-11-20 10:08:16 -08:00
Henry de Valence	06dd39df54	network: bump network version for Canopy (#1333) Per https://zips.z.cash/zip-0251, nodes compatible with Canopy activation on mainnet MUST advertise protocol version 170013 or later. Once Canopy activates on testnet or mainnet, Canopy nodes SHOULD reject new connections from pre-Canopy nodes, so this also increases the minimum version.	2020-11-20 09:50:05 +10:00
Henry de Valence	a3ab589d89	consensus,state: document cancellation contracts for services This change explicitly documents cancellation contracts for our Tower services, and tries to correct a bug in the implementation of the CheckpointVerifier, which duplicates information from the state service but did not ensure that it would be kept in sync.	2020-11-17 14:56:27 -08:00
teor	ca4e792f47	Put messages in request/response order And fix a comment typo	2020-11-17 07:52:53 +10:00
Alfredo Garcia	128643d81e	Call `zebra_test::init` where needed. (#1227) * Add missing `zebra_test::init()` to zebra-chain * Add missing `zebra_test::init()` to zebra-consensus * Add missing `zebra_test::init()` to zebra-network * Add missing `zebra_test::init()` to zebra-state * Add missing `zebra_test::init()` to zebra-test * Add missing `zebra_test::init()` to zebrad	2020-11-10 10:29:25 +10:00
Henry de Valence	8e709bfa88	network: don't fail on unsolicited messages These messages might be unsolicited, or they might be a response to a request we already canceled. So don't fail the whole connection, just drop the message and move on.	2020-10-26 12:05:35 -07:00
Henry de Valence	13daefa729	network: handle request cancellation in Connection We handle request cancellation in two places: before we transition into the AwaitingResponse state, and while we are in AwaitingResponse. We need both places, or else if we started processing a request, we wouldn't process the cancellation until the timeout elapsed. The first is a check that the oneshot is not already canceled. For the second, we wait on a cancellation, either from a timeout or from the tx channel closing.	2020-10-26 12:05:35 -07:00
teor	1e97691fc8	Fix some "needless lifetime" clippy lints These lints seem to be new in clippy nightly.	2020-10-12 08:54:23 +10:00
Dimitris Apostolou	36279621f0	Fix typos	2020-10-06 12:16:41 +10:00
Henry de Valence	6dd7318d3b	deps: use Tower 0.4 from git instead of 0.3.1. This addresses at least three pain points: - we were affected by bugs that were already fixed in git, but not in the released crate; - we can use service combinators to transform requests and responses; - we can use the hedge middleware. The version in git is still marked as 0.3.1 but these changes will be part of tower 0.4: https://github.com/tower-rs/tower/issues/431	2020-09-21 14:16:56 -07:00
Deirdre Connolly	33afeb37cb	Add a comment about the short looo	2020-09-21 09:26:39 -07:00
Henry de Valence	6f3288814c	network: avoid GetPeers timeout to accelerate init The GetPeers requests sent while crawling the network are randomly load-balanced over available peers. But at the very beginning, they may be both routed to the same peer, causing network initialization to be delayed while the second one times out (since zcashd only ever responds to the first addr message). Only sending one GetPeers request per candidate set update means we crawl the network a little more slowly, but avoids hanging on start.	2020-09-21 09:26:39 -07:00
Henry de Valence	b72c249b96	network: add a metric+warning when shedding load	2020-09-21 09:26:39 -07:00
Henry de Valence	4df5632752	network: handle Message::NotFound as a response This cleans up the response processing logic a little bit along the way, but the overall division of responsibility should be better documented in a future commit.	2020-09-20 10:21:18 -07:00
Henry de Valence	64905563d1	network: remove glob import in message-handling This clarifies which parts are the handler state and which parts are the incoming message.	2020-09-20 10:21:18 -07:00
Henry de Valence	9c021025a7	network: fill in remaining request/response pairs	2020-09-20 10:21:18 -07:00
Henry de Valence	b289cb9164	network: clean up GetHeaders, GetBlocks modeling	2020-09-20 10:21:18 -07:00
Henry de Valence	3c993f33b1	network: add PeerError::WrongMessage This lets us distinguish between cases where the message was unsupported (e.g., BIP11 messages), and cases where the message was uninterpretable in context (e.g., unsolicited messages).	2020-09-20 10:21:18 -07:00
Henry de Valence	430176dd0d	network: clean up message-as-request translation	2020-09-20 10:21:18 -07:00
Henry de Valence	170f588ffb	network: document load-shedding behavior This was part of the original design and is described in the Connection internals, but we never documented it externally.	2020-09-18 18:34:25 -07:00
Henry de Valence	1d3892e1dc	network: rename alias to BoxError This is shorter and consistent with Tower (which is why we use it in the first place).	2020-09-18 18:34:25 -07:00
Henry de Valence	95f2463188	Try workaround for generator autotrait bug > Added a test that the handshake's version message matches specified fields, but the test does not compile, because rustc doesn't believe that the Box<dyn std::error::Error + Send + Sync + 'static> is 'static, and therefore isn't a Box<dyn std::error::Error + Send + Sync + 'static>. This manifests as being unable to spawn the connect_isolated task. From digging through Tokio issues I believe that this is an instance of rust-lang/rust#64552. Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2020-09-17 12:02:20 -07:00
Henry de Valence	81e8195f68	network: add connect_isolated distinguisher test This is currently broken due to a rustc bug.	2020-09-17 12:02:20 -07:00
Henry de Valence	b7472de43f	network: add a zebra_network::connect_isolated() method. The peer set provides an automatically managed connection pool, abstracting away all the details of handling individual peer connections. However, it's also useful to be able to create completely isolated and minimally-distinguishable connections to individual peers, in order to be able to send specific messages over Tor, or to implement some custom network crawler logic.	2020-09-17 12:02:20 -07:00
teor	66265dc11a	Adjust the EWMA decay for the latest sync timeout	2020-09-09 15:35:09 -07:00
teor	1f7af0a779	Update the inv message processing comment Cleanup after PR #1028.	2020-09-09 15:29:38 -07:00
teor	2a68ef5acb	Update the peerset buffer size and sync timeout Also add a bunch of comments and documentation for network-constrained nodes, and for testnet.	2020-09-08 12:44:33 -07:00
teor	e6e859dce2	Tweak sync timeouts * increase the EWMA default and decay * increase the block download retries * increase the request and block download timeouts * increase the sync timeout	2020-09-08 12:44:33 -07:00
Jane Lusby	1b17691dda	improve logging	2020-09-08 12:37:34 -07:00
Jane Lusby	81a3ad3a0d	filter inventory advertisements correctly	2020-09-08 12:37:34 -07:00
Henry de Valence	3f150eb16e	network: implement transaction request handling. (#1016 ) This commit makes several related changes to the network code: - adds a `TransactionsByHash(HashSet<transaction::Hash>)` request and `Transactions(Vec<Arc<Transaction>>)` response pair that allows fetching transactions from a remote peer; - adds a `PushTransaction(Arc<Transaction>)` request that pushes an unsolicited transaction to a remote peer; - adds an `AdvertiseTransactions(HashSet<transaction::Hash>)` request that advertises transactions by hash to a remote peer; - adds an `AdvertiseBlock(block::Hash)` request that advertises a block by hash to a remote peer; Then, it modifies the connection state machine so that outbound requests to remote peers are handled properly: - `TransactionsByHash` generates a `getdata` message and collects the results, like the existing `BlocksByHash` request. - `PushTransaction` generates a `tx` message, and returns `Nil` immediately. - `AdvertiseTransactions` and `AdvertiseBlock` generate an `inv` message, and return `Nil` immediately. Next, it modifies the connection state machine so that messages from remote peers generate requests to the inbound service: - `getdata` messages generate `BlocksByHash` or `TransactionsByHash` requests, depending on the content of the message; - `tx` messages generate `PushTransaction` requests; - `inv` messages generate `AdvertiseBlock` or `AdvertiseTransactions` requests. Finally, it refactors the request routing logic for the peer set to handle advertisement messages, providing three routing methods: - `route_p2c`, which uses p2c as normal (default); - `route_inv`, which uses the inventory registry and falls back to p2c (used for `BlocksByHash` or `TransactionsByHash`); - `route_all`, which broadcasts a request to all ready peers (used for `AdvertiseBlock` and `AdvertiseTransactions`).	2020-09-08 10:16:29 -07:00
Henry de Valence	cad38415b2	network: fix bug in inventory advertisement handling (#1022) * network: fix bug in inventory advertisement handling The RFC https://zebra.zfnd.org/dev/rfcs/0003-inventory-tracking.html described the use of a `broadcast` channel in place of an `mpsc` channel to get ring-buffer behavior, keeping a bound on the size of the channel but dropping old entries when the channel is full. However, it didn't explicitly describe how this works (the `broadcast` channel returns a `RecvError::Lagged(u64)` to inform receivers that they lost messages), so the lag-handling wasn't implemented and I didn't notice in review. Instead, the ? operator bubbled the lag error all the way up from `InventoryRegistry::poll_inventory` through `<PeerSet as Service>::poll_ready` through various Tower wrappers to users of the peer set. The error propagation is bad enough, because it caused client errors that shouldn't have happened, but there's a worse interaction. The `Service` contract distinguishes between request errors (from `Service::call`, scoped to the request) and service errors (from `Service::poll_ready`, scoped to the service). The `Service` contract specifies that once a service returns an error from `poll_ready`, the service can be assumed to be failed permanently. I believe (but haven't tested or carefully worked through the details) that this caused various tower middleware to report the entire peer set service as permanently failed due to a transient inventory "error" (more of an indicator), and I suspect that this is the cause of #1003, where all of the sync component's requests end up failing because the peer set reported that it failed permanently. I am able to reproduce #1003 locally before this change and unable to reproduce it locally after this change, though I have not tested exhaustively. * network: add metric for dropped inventory advertisements Co-authored-by: teor <teor@riseup.net> Co-authored-by: teor <teor@riseup.net>	2020-09-07 21:24:31 -07:00
Henry de Valence	9682d452ee	network: add AddressBook::potentially_connected_peers().	2020-09-07 11:13:15 -07:00
dependabot[bot]	142226ad57	build(deps): bump indexmap from 1.5.2 to 1.6.0 Bumps [indexmap](https://github.com/bluss/indexmap) from 1.5.2 to 1.6.0. - [Release notes](https://github.com/bluss/indexmap/releases) - [Commits](https://github.com/bluss/indexmap/compare/1.5.2...1.6.0) Signed-off-by: dependabot[bot] <support@github.com>	2020-09-07 07:56:39 -04:00
Alfredo Garcia	454e75e7c0	Rename old references to BlockHeaderHash and BlockHeight (#1002 ) * rename some references * Apply suggestions from code review Co-authored-by: Deirdre Connolly <durumcrustulum@gmail.com> Co-authored-by: teor <teor@riseup.net> Co-authored-by: Deirdre Connolly <durumcrustulum@gmail.com> Co-authored-by: teor <teor@riseup.net>	2020-09-04 15:40:48 -07:00
teor	b5c653ed93	Use ok_or for constants, rather than a redundant closure * Use ok_or for constants in zebra-network * Use ok_or for constants in zebra-consensus	2020-09-02 14:26:26 +10:00
Jane Lusby	88557ddd0a	address more comments	2020-09-01 21:01:38 -04:00
Jane Lusby	d933abeebf	fix typo	2020-09-01 21:01:38 -04:00
Jane Lusby	96c8809348	Implement Inventory Tracking RFC (#963) * Add .cargo to the gitignore file * Implement Inventory Tracking RFC * checkpoint * wire together the inventory registry * add comment documenting condition * make inventory registry optional	2020-09-01 14:28:54 -07:00
Henry de Valence	f91b91b6d8	network: clarify comment on Default for handshake::Builder Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2020-09-01 13:56:00 -07:00
Henry de Valence	fddba7a336	network: remove handshake::Builder::with_addr Use the listen_addr field already specified in the config. Also, derive Clone for Handshake<S>. Co-authored-by: Jane Lusby <jane@zfnd.org>	2020-09-01 13:56:00 -07:00
Henry de Valence	a5b6f39850	network: don't leak our exact time skew in handshakes.	2020-09-01 13:56:00 -07:00
Henry de Valence	1b5a824584	network: fix bug in BIP37 relay flag handling. The relay flag in the version message is used in conjunction with BIP37 to receive bloom-filtered transactions. When it is set to false, transactions are not relayed until a bloom filter is set. Since we don't implement BIP37 (it's not useful for shielded transactions), this means we'll never receive transactions.	2020-09-01 13:56:00 -07:00
Henry de Valence	60a0b8c382	network: change Handshake::new to a Builder. This allows more detailed control over the handshake parameters.	2020-09-01 13:56:00 -07:00
teor	d7e32b68e5	fix: Split a clippy allow, so its comment is clearer	2020-09-01 11:40:18 -04:00
teor	5afa24588a	fix: Remove unused dependencies	2020-08-20 14:49:17 -04:00
Henry de Valence	ebdceb5197	chain: rename TransactionHash to transaction::Hash	2020-08-17 11:46:34 -07:00
Henry de Valence	2712c4b72a	chain: rename BlockHeader to block::Header	2020-08-17 11:46:34 -07:00
Henry de Valence	103b663c40	chain: rename BlockHeight to block::Height	2020-08-17 11:46:34 -07:00
Henry de Valence	61dea90e2f	chain: rename BlockHeaderHash to block::Hash This is the first in a sequence of changes that change the block:: items to not include Block as a prefix in their name, in accordance with the Rust API guidelines.	2020-08-17 11:46:34 -07:00
Henry de Valence	948b067808	chain: move Network, NetworkUpgrade to parameters Also, avoid using star-imports of the enum variants, which pollutes the namespace.	2020-08-17 11:46:34 -07:00
Henry de Valence	dad6340cd3	chain: move BlockHeight into block	2020-08-17 11:46:34 -07:00
Henry de Valence	b36fe8f937	chain: move sha256d to serialization module. This extracts the SHA256d code from being split across two modules and puts it in one module, under serialization. The code is unchanged except for three deleted tests: * `sha256d_flush` in `sha256d_writer` (not a meaningful test); * `transactionhash_debug` (constructs an invalid transaction hash, and the behavior is tested in the next test); * `decode_state_debug` (we do not need to test the Debug output of DecodeState);	2020-08-17 11:46:34 -07:00
Alfredo Garcia	b41e33e066	Bytes read and bytes written metrics (#901) * add bytes read and written metrics * Apply suggestions from code review Co-authored-by: Jane Lusby <jlusby42@gmail.com> * store address as string * Apply suggestions from code review Co-authored-by: Henry de Valence <hdevalence@hdevalence.ca> * change addr to label Co-authored-by: Henry de Valence <hdevalence@hdevalence.ca> * remove newline Co-authored-by: Jane Lusby <jlusby42@gmail.com> Co-authored-by: Henry de Valence <hdevalence@hdevalence.ca>	2020-08-14 15:50:26 -07:00
Henry de Valence	a79ce97957	Fix sync algorithm. (#887) * checkpoint: reject older of duplicate verification requests. If we get a duplicate block verification request, we should drop the older one in favor of the newer one, because the older request is likely to have been canceled. Previously, this code would accept up to four duplicate verification requests, then fail all subsequent ones. * sync: add a timeout layer to block requests. Note that if this timeout is too short, we'll bring down the peer set in a retry storm. * sync: restart syncing on error Restart the syncing process when an error occurs, rather than ignoring it. Restarting means we discard all tips and start over with a new block locator, so we can have another chance to "unstuck" ourselves. * sync: additional debug info * sync: handle lookahead limit correctly. Instead of extracting all the completed task results, the previous code pulled results out until there were fewer tasks than the lookahead limit, then stopped. This meant that completed tasks could be left until the limit was exceeded again. Instead, extract all completed results, and use the number of pending tasks to decide whether to extend the tip or wait for blocks to finish. * network: add debug instrumentation to retry policy * sync: instrument the spawned task * sync: streamline ObtainTips/ExtendTips logic & tracing This change does three things: 1. It aligns the implementation of ObtainTips and ExtendTips so that they use the same deduplication method. This means that when debugging we only have one deduplication algorithm to focus on. 2. It streamlines the tracing output to not include information already included in spans. Both obtain_tips and extend_tips have their own spans attached to the events, so it's not necessary to add Scope: prefixes in messages. 3. It changes the messages to be focused on reporting the actual events rather than the interpretation of the events (e.g., "got genesis hash in response" rather than "peer could not extend tip"). The motivation for this change is that when debugging, the interpretation of events is already known to be incorrect, in the sense that the mental model of the code (no bug) does not match its behavior (has bug), so presenting minimally-interpreted events forces interpretation relative to the actual code. * sync: hack to work around zcashd behavior * sync: localize debug statement in extend_tips * sync: change algorithm to define tips as pairs of hashes. This is different enough from the existing description that its comments no longer apply, so I removed them. A further chunk of work is to change the sync RFC to document this algorithm. * sync: reduce block timeout * state: add resource limits for sled Closes #888 * sync: add a restart timeout constant * sync: de-pub constants	2020-08-12 16:48:01 -07:00
teor	109666cc48	fix: Tweak the network listener log (#886)	2020-08-12 14:22:54 -07:00
Henry de Valence	299afe13df	zebra-network tweaks. (#877) * network: move gossiped peer selection logic into address book. * network: return BoxService from init. * zebrad: add note on why we truncate the gossiped peer list Co-authored-by: Jane Lusby <jlusby42@gmail.com> * Remove unused .rustfmt.toml Many of these options are never actually loaded by our CI because of a channel mismatch, where they're not applied on stable but only on nightly (see the logs from a rustfmt job). This means that we can get different settings when running `cargo fmt` on the nightly and stable channels, which was causing a CI failure on this PR. Reverting back to the default rustfmt settings avoids this problem and keeps us in line with upstream rustfmt. There's no loss to us since we were using the defaults anyways. Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2020-08-11 13:07:44 -07:00
Alfredo Garcia	9c387521bd	Print endpoint addresses at startup (#867) * print tracing and metrics endpoints in startup * print network address in startup	2020-08-10 12:47:26 -07:00
teor	ee6f0de14d	refactor: Move NetworkUpgrade to zebra-chain	2020-08-10 18:54:42 +10:00
Henry de Valence	3d46ab746a	Clean up options in network config section. (#839) Closes #536. This removes: - the user-agent (we can add a mechanism to specify extra BIP14 components later, if any users ask us for that feature); - the EWMA parameters (these were put in the config just to avoid making a choice); - the peer connection timeout (we can change the default value if anyone ever has a problem with it); - the peer set request buffer size (setting this too low can make the application deadlock); The new peer interval is left in.	2020-08-06 11:29:00 -07:00
teor	c95d980bc2	doc: Explain current and minimum network protocol versions	2020-08-04 15:11:16 -04:00
teor	59eb23772d	feature: Use the Canopy testnet network protocol version Canopy will activate on testnet within the next 24 hours. To continue to use testnet, we need to upgrade the Zebra network protocol version.	2020-08-04 12:13:58 +10:00
Henry de Valence	ef0b200b82	restore Zebras to part of the name, not a comment	2020-07-29 18:46:47 -07:00
Jack Grigg	d1e0e1abf5	fix: Broadcast a valid BIP 14 user agent Closes ZcashFoundation/zebra#791.	2020-07-29 15:49:14 -04:00
teor	6be0f8ed2f	fix: Warn if the listener port is for the wrong network We'll fix the underlying defaults in #660, with the rest of the listeners.	2020-07-29 16:03:52 +10:00
teor	536668f993	fix: allow(dead_code) on some protocol version functions	2020-07-28 22:10:20 -04:00
Henry de Valence	238dec51dd	network: do not export Builder This is used to construct the Codec, which is an internal type. The export was added in `4dc307f2`.	2020-07-28 11:10:15 -07:00
teor	993532b604	feature: Add a "Genesis" network upgrade We can use this network upgrade to implement different consensus rules and chain context handling for genesis blocks. Part of the chain state design in #682.	2020-07-27 14:03:14 -04:00
Henry de Valence	4aa00ad216	Align crate versions and user-agent with NU numbers. We had a brief discussion on discord and it seemed like we had consensus on the following versioning policy: * zebrad: match major version to NU version, so we will start by releasing zebrad 3.0.0; * zebra-* libraries: start by matching zebrad's version, then increment major versions of each library as we need to make breaking changes (potentially faster than the zebrad version, always respecting semver but making no guarantees about the longevity of major releases). This commit sets all of the crate versions to 3.0.0-alpha.0 -- the -alpha.0 marks it as a prerelease not subject to perfect adherence to compatibility guarantees.	2020-07-24 11:46:37 -07:00
teor	da09965a5f	feature: Get the current minimum protocol version	2020-07-23 15:52:18 +10:00
teor	85f113bc18	doc: Add a TODO to the network protocol	2020-07-23 15:52:18 +10:00
teor	c9ee85c3b5	feature: Add network upgrade activation heights	2020-07-23 15:52:18 +10:00
Henry de Valence	cc955a2bbe	network: document Responses, add warning about unsolicited invs.	2020-07-22 17:55:52 -07:00
Jane Lusby	a722cf33f7	enable new tracing instrumentation in tokio	2020-07-22 14:39:54 -04:00
Jane Lusby	e105b4f6c5	properly document guarantee provided by zebra-network (#713) Co-authored-by: Deirdre Connolly <deirdre@zfnd.org>	2020-07-22 14:38:00 -04:00
Henry de Valence	4a41c9254d	network: avoid panic when shutting down cleanly. When the connection sees the client_rx channel close it knows it will never get any more requests, and it should terminate. But instead of terminating, it errored itself, and the method to error itself tries to pull all the outstanding client requests from the channel in order to fail them before it shuts down. This results in reading from a closed channel, causing a panic. Instead we return cleanly rather than failing (since we know there are no outstanding requests, as the channel is closed).	2020-07-22 18:04:45 +10:00
Henry de Valence	0dc2d92ad8	network: ensure dropping a Client closes the connection. This fixes a bug introduced when we added heartbeat support. Recall that we handle the Bitcoin connection state machine on a per-peer basis. Each connection has a task created from the `Connection` struct, and a `Client: tower::Service` "frontend" that passes requests to it via a channel. In the `Connection` event loop, the connection checks whether the request channel has been closed, indicating no further requests from the `Client`, in which case it shuts itself down and cleans up resources. This occurs when all of the senders have been dropped. However, this behavior broke when we introduced heartbeat support, because we spawned an additional task to send heartbeat messages along the request channel. This meant that instead of having a single sender, dropped by the `Client`, we have two senders, the `Client` and the "shadow client" task that generates heartbeat messages. This means that when the `Client` is dropped, we still have a live sender and the connection is not closed. To fix this, the `Client` now uses a `oneshot` to shut down its corresponding heartbeat task. This closes all senders.	2020-07-21 15:43:31 -07:00
teor	b0cd920fad	feature: Use the Heartwood protocol version in zebra-network	2020-07-21 10:46:07 -07:00
teor	1cb1f1c52e	fix: Put the peer set config vars together	2020-07-21 12:20:48 -04:00
dependabot[bot]	c8fe4b43d8	build(deps): bump indexmap from 1.4.0 to 1.5.0 Bumps [indexmap](https://github.com/bluss/indexmap) from 1.4.0 to 1.5.0. - [Release notes](https://github.com/bluss/indexmap/releases) - [Commits](https://github.com/bluss/indexmap/compare/1.4.0...1.5.0) Signed-off-by: dependabot[bot] <support@github.com>	2020-07-21 12:19:01 -04:00
Alfredo Garcia	fe2a468417	add favicon to generated docs (#681)	2020-07-17 16:45:29 -07:00
teor	ab6d1f5ec8	fix: Use the default Zcash port in version messages (#661) We don't provide our address yet, so the port should be ignored. But let's use the correct port, to avoid carrying this bug forward into working code.	2020-07-15 11:43:28 -07:00
Alfredo Garcia	d8834b149a	Limit protocol messages size (#645) * change body msg limit and test case * accept body at the exact limit len * test the edges of the limit value	2020-07-15 19:15:52 +10:00
Henry de Valence	fcd2f43f39	network: add warning to connection handling code.	2020-07-09 11:15:06 -07:00
Henry de Valence	217c25ef07	network: propagate tracing Spans through peer connection	2020-07-09 11:15:06 -07:00
Dimitris Apostolou	ba81d7d4c0	Fix typos	2020-07-07 11:13:49 -07:00
teor	f999ec75e6	fix: Remove a non-standard unicode character in a comment	2020-07-01 16:03:14 -04:00
Deirdre Connolly	05316dee21	Listen on 0.0.0.0, not 127.0.0.1 Turns out when your node faces the internet directly, it has to listen to those addresses directly.	2020-06-19 03:46:09 -04:00
Henry de Valence	6cc1627a5d	zebrad: apply serde(default) to config sections Each subsection has to have `serde(default)` to get the behaviour we want (delete all fields except the ones that have been changed); otherwise, we can delete only entire sections.	2020-06-18 17:43:36 -04:00
Jane Lusby	df18ac72c5	fix sharedpeererror to propagate tracing context	2020-06-17 14:38:26 -07:00
Jane Lusby	685bdaf2df	don't require absence of cancel handles Prior to this change, we required that services that are canceled do not have a cancel handle in the `cancel_handles` list, based on the assumption that the handle must have been removed in the process of canceling this service. This doesn't hold up though, because it is currently possible for us to have the same peer connect to us multiple times, the second connect removes the cancel handle of the original connect and inserts its own cancel handle in its place. In this scenario, when the first service is polled for readiness it will see that it has been canceled and go to clean itself up, but when it asserts that it doesn't have a cancel handle it will see the cancel handle of the second connect event, which uses the same key as the first connect, and fail its debug assertion. This change removes that debug assert on the assumption that it is okay for a peer to connect multiple times consecutively, and that the correct behavior in that case is to just cancel the first connection and continue as normal.	2020-06-16 13:42:31 -07:00
Jane Lusby	4b9e4520ce	cleanup API for arc based error type (#469) Co-authored-by: Jane Lusby <jane@zfnd.org>	2020-06-12 11:29:42 -07:00
George Tankersley	d8b3db5679	Use new seeder address for yolo.money	2020-06-10 21:49:25 -04:00
George Tankersley	6606bcaa62	Update list of DNS seeders This adds the Foundation's new seeders and removes Simon's defunct one.	2020-06-10 20:56:31 -04:00
Jane Lusby	431f194c0f	propagate errors out of zebra_network::init (#435) Prior to this change, the service returned by `zebra_network::init` would spawn background tasks that could silently fail, causing unexpected errors in the zebra_network service. This change modifies the `PeerSet` that backs `zebra_network::init` to store all of the `JoinHandle`s for each background task it depends on. The `PeerSet` then checks this set of futures to see if any of them have exited with an error or a panic, and if they have it returns the error as part of `poll_ready`.	2020-06-09 12:24:28 -07:00
Jane Lusby	9f802cd8dd	Wrap Transaction in Arc	2020-06-06 18:13:17 -04:00
Jane Lusby	9bcda0f9c7	Wrap Blocks in Arc throughout codebase	2020-06-05 00:36:55 -04:00
dependabot-preview[bot]	7a75522885	Bump indexmap from 1.3.2 to 1.4.0 Bumps [indexmap](https://github.com/bluss/indexmap) from 1.3.2 to 1.4.0. - [Release notes](https://github.com/bluss/indexmap/releases) - [Commits](https://github.com/bluss/indexmap/compare/1.3.2...1.4.0) Signed-off-by: dependabot-preview[bot] <support@dependabot.com>	2020-06-01 15:38:00 -04:00
dependabot-preview[bot]	145d9a1835	Bump proptest from 0.9.6 to 0.10.0 Bumps [proptest](https://github.com/altsysrq/proptest) from 0.9.6 to 0.10.0. - [Release notes](https://github.com/altsysrq/proptest/releases) - [Changelog](https://github.com/AltSysrq/proptest/blob/master/CHANGELOG.md) - [Commits](https://github.com/altsysrq/proptest/commits) Signed-off-by: dependabot-preview[bot] <support@dependabot.com>	2020-05-29 15:06:40 -04:00
dependabot-preview[bot]	e317b68b1d	Bump proptest-derive from 0.1.2 to 0.2.0 Bumps [proptest-derive](https://github.com/AltSysrq/proptest) from 0.1.2 to 0.2.0. - [Release notes](https://github.com/AltSysrq/proptest/releases) - [Changelog](https://github.com/AltSysrq/proptest/blob/master/CHANGELOG.md) - [Commits](https://github.com/AltSysrq/proptest/compare/proptest-derive-0.1.2...proptest-derive-0.2.0) Signed-off-by: dependabot-preview[bot] <support@dependabot.com>	2020-05-28 23:00:29 -04:00
Jane Lusby	4a2d2a359c	add cargo fmt to ci (#403) * add cargo fmt to ci * rebase on main * switch to stable Co-authored-by: Jane Lusby <jane@zfnd.org>	2020-05-27 19:12:25 -07:00
Jane Lusby	8c178c3ee4	fix panic in seed subcommand (#401) Co-authored-by: Jane Lusby <jane@zfnd.org> Prior to this change, the seed subcommand would consistently encounter a panic in one of the background tasks, but would continue running after the panic. This is indicative of two bugs. First, zebrad was not configured to treat panics as non-recoverable and instead defaulted to the tokio defaults, which are to catch panics in tasks and return them via the join handle if available, or to print them if the join handle has been discarded. This is likely a poor fit for zebrad as an application; we do not need to maximize uptime or minimize the extent of an outage should one of our tasks / services start encountering panics. Ignoring a panic increases our risk of observing invalid state, causing all sorts of wild and bad bugs. To deal with this we've switched the default panic behavior from `unwind` to `abort`. This makes panics fail immediately and take down the entire application, regardless of where they occur, which is consistent with our treatment of misbehaving connections. The second bug is the panic itself. This was triggered by a duplicate entry in the initial_peers set. To fix this we've switched the storage for the peers from a `Vec` to a `HashSet`, which has similar properties but guarantees uniqueness of its keys.	2020-05-27 17:40:12 -07:00
Jane Lusby	8276bed400	reinstate reject error variant	2020-05-27 15:42:29 -04:00
Jane Lusby	4dc307f2f3	fix last warnings	2020-05-27 15:42:29 -04:00
Jane Lusby	b6b35364f3	cleanup warnings throughout codebase	2020-05-27 15:42:29 -04:00
George Tankersley	df79fa75e0	Implement minimal version handshaking (#295) Co-authored-by: Deirdre Connolly <durumcrustulum@gmail.com> Co-authored-by: Henry de Valence <hdevalence@hdevalence.ca>	2020-04-13 18:33:15 -04:00
Deirdre Connolly	a5f4db7528	Move just the Network enum to -chain, keep everything else in -network	2020-03-12 22:02:17 -04:00
Deirdre Connolly	380d622b37	Fix imports	2020-03-12 22:02:17 -04:00
Deirdre Connolly	b68e1e2d55	Move Network, Magic, and magics to zebra-chain	2020-03-12 22:02:17 -04:00
Deirdre Connolly	8c0b00109f	Remove PeerError::DeadServer, unused, unneeded Resolves #251	2020-03-12 16:23:08 -04:00
Henry de Valence	ff3efd504c	Add Zebra logo to all workspace crates. Also add html_root_url attributes.	2020-02-26 21:25:35 -08:00
Henry de Valence	3ed75cb626	Tweak peer set metrics. - Add a total peers metric to prevent races between measurements of ready/unready peers (which can cause the sum to be wrong). - Add an outbound request counter.	2020-02-21 06:48:25 -05:00
Henry de Valence	94fe2c3b57	Increase the peerset request buffer size. tower-buffer uses tokio's mpsc channels, not the futures-rs mpsc channels. Unlike futures-rs mpsc channels, which have capacity n+m, where n is the buffer size and m is the number of senders, tokio channels always have buffer size n. This means that the buffer size is shared across all peer set handles. Thanks to @hawkw for sharing details of the Tokio internals!	2020-02-21 06:48:25 -05:00
Henry de Valence	5f07a25b05	Shorten the default new_peer_interval to 60s. This increases the frequency at which we crawl the network.	2020-02-21 06:48:25 -05:00
Henry de Valence	80e7ee6dae	Add metrics for inbound and outbound messages.	2020-02-21 06:48:25 -05:00
Henry de Valence	8c938af579	Spawn tasks for handshake futures. Previously, we relied on the owner of the handshake future to drive it to completion. This meant that there were cases where handshakes might never be completed, just because nothing was actively polling them.	2020-02-21 06:48:25 -05:00
Henry de Valence	43b2d35dda	Crawl for more peers when we exhaust candidates.	2020-02-21 06:48:25 -05:00
Henry de Valence	afa2c2347f	fmt	2020-02-21 06:48:25 -05:00
Henry de Valence	00edcae0c2	Add metrics for the crawler and candidate set.	2020-02-14 20:14:05 -05:00
Henry de Valence	75d3d44fb3	Metrics MVP: add two metrics and export them to Prometheus. Co-authored-by: Deirdre Connolly <deirdre@zfnd.org>	2020-02-14 20:14:05 -05:00
Henry de Valence	8000f888fd	Connect to multiple peers concurrently. The previous outbound peer connection logic got requests to connect to new peers and processed them one at a time, making single connection attempts and retrying if the connection attempt failed. This was quite slow, because many connections fail, and we have to wait for timeouts. Instead, this logic connects to new peers concurrently (up to 50 at a time).	2020-02-14 18:23:41 -05:00
Henry de Valence	7049f9d891	Add a FindBlocks request to get initial block hashes. Bitcoin does this either with `getblocks` (returns up to 500 following block hashes) or `getheaders` (returns up to 2000 following block headers, not just hashes). However, Bitcoin headers are much smaller than Zcash headers, which contain a giant Equihash solution block, and many Zcash blocks don't have many transactions in them, so the block header is often similarly sized to the block itself. Because we're aiming to have a highly parallel network layer, it seems better to use `getblocks` to implement `FindBlocks` (which is necessarily sequential) and parallelize the processing of the block downloads.	2020-02-14 18:23:41 -05:00
Henry de Valence	47cafc630f	Remove version fields from GetBlocks, GetHeaders. These are instead set by the negotiated version.	2020-02-14 18:23:41 -05:00
Henry de Valence	abcc0a6773	Add basic retry policies to zebra-network. This should be removed when https://github.com/tower-rs/tower/pull/414 lands but is good enough for our purposes for now.	2020-02-11 15:23:19 -05:00
Henry de Valence	befdb46dc3	Clean some warnings in the Bitcoin codec. This doesn't clean the warnings about unused items in the builder, since those are unused for a reason (the implementation that should use them is missing).	2020-02-10 09:03:56 -08:00
Henry de Valence	2082672b3c	Remove Response::Error. Error handling is already handled by Result; we don't need an "inner" error variant duplicating the outer one.	2020-02-10 09:03:56 -08:00
Henry de Valence	29f901add3	Rename Response::Ok to Response::Nil. This is a better name because it signals "no data in response" rather than "Ok", which is semantically mixed with `Ok/Err` of `Result`.	2020-02-10 09:03:56 -08:00
Henry de Valence	5929e05e52	Remove `PushPeers` and ignore unsolicited `addr` messages. PushPeers is more complicated to thread into the rest of our architecture (we would need to establish a data path connecting our service handling inbound requests to the network layer's auto-crawler), and since we crawl the network automatically anyways, we don't actually need to accept them in order to get updated address information. The only possible problem with this approach is that zcashd refuses to answer multiple address requests from the same connection, ostensibly for fingerprinting prevention (although it's totally happy to give exactly the same information, as long as you hang up and reconnect first, lol). It's unclear how this will interact with our design -- on the one hand, it could mean that we don't get new addr information when we ask, but on the other hand, we may have enough churn in our connection pool that this isn't a problem anyways.	2020-02-10 09:03:56 -08:00
Henry de Valence	2c0f48b587	Refactor connection logic and try a block request. Attempting to implement requests for block data revealed a problem with the previous connection logic. Block data is requested by sending a `getdata` message with hashes of the requested blocks; the peer responds with a sequence of `block` messages with the blocks themselves. However, this wasn't possible to handle with the previous connection logic, which could only convert a single Bitcoin message into a Response. Instead, we factor out the message handling logic into a Handler, which can statefully accumulate arbitrary data into a Response and signal completion. This is still pretty ugly but it does work. As a side effect, the HeartbeatNonceMismatch error is removed; because the Handler now tries to process messages until it comes to a Response, it just ignores mismatched nonces (and will eventually time out). The previous Mempool and Transaction requests were removed but could be re-added in a different form later. Also, the `Get` prefixes are removed from `Request` to tidy the name.	2020-02-10 09:03:56 -08:00
Henry de Valence	972d16518f	Make ZcashSerialize infallible mod its Writer. Closes #158. As discussed on the issue, this makes it possible to safely serialize data into hashes, and encourages serializable data to make illegal states unrepresentable.	2020-02-05 19:48:43 -05:00
Henry de Valence	b0f61c4dd2	Remove outdated comment (we use tokio codecs now)	2020-02-05 19:44:35 -05:00
Henry de Valence	ab94acf7da	fmt	2020-02-05 19:44:35 -05:00
Henry de Valence	eeb4a2470b	Remove version fields from Block, Tx messages. These are included in the Block, Transaction objects themselves, so the previous code ended up trying to deserialize two version fields per object. Closes #226.	2020-02-05 19:44:35 -05:00
Henry de Valence	51c744b1ae	Update network version number.	2020-02-05 14:06:35 -08:00
Henry de Valence	8d58dd804f	Note that tracing causes clippy false positives Thanks @hawkw for pointing this out.	2020-02-05 12:42:32 -08:00
Henry de Valence	f04f4f0b98	Apply clippy fixes	2020-02-05 12:42:32 -08:00
dependabot-preview[bot]	979cf7ac6d	Bump indexmap from 1.3.1 to 1.3.2 Bumps [indexmap](https://github.com/bluss/indexmap) from 1.3.1 to 1.3.2. - [Release notes](https://github.com/bluss/indexmap/releases) - [Commits](https://github.com/bluss/indexmap/compare/1.3.1...1.3.2) Signed-off-by: dependabot-preview[bot] <support@dependabot.com>	2020-02-05 14:14:51 -05:00
Henry de Valence	b24f53f4a1	Add From impls for InventoryHash	2020-02-04 17:48:36 -08:00
Deirdre Connolly	6d3d4c4f64	s/GetData/NotFound/ in read_notfound	2020-02-04 18:04:53 -05:00
Deirdre Connolly	1ca55846eb	Little test to exercise sha256dWriter::flush()	2020-02-04 18:04:53 -05:00
Deirdre Connolly	beb72080cb	Delete out of date comment on incomplete Message variants	2020-01-30 13:39:13 -05:00
Deirdre Connolly	53a7af82a0	Add back a missing quotemark Co-Authored-By: Henry de Valence <hdevalence@hdevalence.ca>	2020-01-28 03:48:23 -05:00
Deirdre Connolly	71d5571e39	Add roundtrip proptest for LockTime serialization/deserialization Relates to #150	2020-01-28 03:48:23 -05:00
Deirdre Connolly	d8ebeea08c	Add proptest regressions file	2020-01-28 03:48:23 -05:00
Deirdre Connolly	c2411f4315	Add a little proptest around Magic's Debug impl	2020-01-28 03:48:23 -05:00

441 Commits