zebra

Commit Graph

Author	SHA1	Message	Date
Janito Vaqueiro Ferreira Filho	54809a1b89	Don't trust reported peer `last_seen` times Due to clock skew, the peers could end up at the front of the reconnection queue or far at the back. The solution to this is to offset the reported times by the difference between the most recent reported sight (in the remote clock) and the current time (in the local clock).	2021-06-01 03:42:08 -03:00
Janito Vaqueiro Ferreira Filho	29c51d5086	Implement `MetaAddr::set_last_seen` setter method Will be used when limiting the reported last seen times for recived gossiped addresses.	2021-06-01 03:42:08 -03:00
Janito Vaqueiro Ferreira Filho	14ecc79f01	Use `DateTime32` in `validate_addrs`	2021-06-01 03:42:08 -03:00
Janito Vaqueiro Ferreira Filho	b891a96a6d	Improve ergonomics by returning `impl Iterator` Returning `impl IntoIterator` means that the caller will always be forced to call `.into_iter()`, and returning `impl Iterator` still allows them to call `.into_iter()` because it becomes the identity function.	2021-06-01 03:42:08 -03:00
teor	ebe1c9f88e	Add a DateTime32 type for 32-bit serialized times (#2210 ) * Add a DateTime32 type for 32-bit serialized times * Use DateTime32 for MetaAddr.last_seen * Create and use a `DateTime32::now` method	2021-05-31 12:52:34 +10:00
teor	a6e272bf1c	Fix a typo: BIP11 -> BIP111 (#2223 )	2021-05-28 14:50:43 +02:00
teor	5cdcc5255f	Proptest `MetaAddr` sanitization and serialization together	2021-05-26 18:13:35 -04:00
teor	9f8b4f836e	Test round-trip serialization for gossiped `MetaAddr`s	2021-05-26 18:13:35 -04:00
teor	81630d19f2	Add service sanitization to `MetaAddr::sanitize` This makes sure that deserialization and generated `MetaAddr`s are consistent.	2021-05-26 18:13:35 -04:00
teor	bf6fe175dd	Stop deriving PartialEq for MetaAddr This makes sure Ord and ParitalEq are always consistent.	2021-05-26 18:13:35 -04:00
teor	078385ae00	Canonicalise arbitrary IP addresses in proptests This makes round-trip serialization tests work.	2021-05-26 18:13:35 -04:00
teor	c0114a2c5f	Security: Stop panicking when serializing out-of-range times Zebra assumes that deserialized times are always able to be serialized. But this assumption is wrong because: - sanitization can modify times - gossiped `MetaAddr` validation can modify times	2021-05-26 18:13:35 -04:00
Pili Guerra	e3d2ae0a8a	Update versions for zebra v1.0.0-alpha.9 release (#2196 ) * Update versions for zebra v1.0.0-alpha.9 release * Update Cargo.lock	2021-05-26 13:01:39 +02:00
teor	f0549b2f7c	Derive Arbitrary impls for a bunch of chain and network types (#2179 ) Enable proptests for internal and external network protocol messages, using times with the correct protocol-specific ranges. (4 or 8 bytes.)	2021-05-24 11:10:07 -04:00
teor	57fb5c028c	Fix up some doc links (#2180 )	2021-05-21 12:06:31 -03:00
teor	2685fc746e	Remove CandidateSet state and add last seen time limit to candidate_set::validate_addrs (#2177 )	2021-05-21 02:21:13 +00:00
teor	752358d236	Fix some candidate set and meta addr doc links (#2174 ) Suggested by jvff.	2021-05-21 11:40:14 +10:00
teor	40d06657b3	Update new_gossiped_meta_addr to the latest API	2021-05-21 06:51:34 +10:00
teor	c7ea1395e7	Security: Fix CandidateSet timeout and fanout * Refactor: Split CandidateSet::update into separate functions * Security: Apply a timeout to the entire CandidateSet::update * Security: Stop using very large fanout limits during initialization Previously, Zebra used the number of resolved peer addresses. So it was possible for all peers to fail, and for Zebra to hang on the first update. And Zebra could send a fanout for each initial peer, regardless of whether their connection was successful. Also: - wait for at least one successful peer before trying an update - warn if there are no successful initial peers	2021-05-21 06:51:34 +10:00
Deirdre Connolly	bf72d6dbc0	Update zebra-network/src/peer/handshake.rs Co-authored-by: teor <teor@riseup.net>	2021-05-18 14:02:19 +10:00
teor	92828bbb29	Reliability: send local listener address to peers When peers ask for peer addresses, add our local listener address to the set of addresses, sanitize, then truncate. Sanitize shuffles addresses, so if there are lots of addresses in the address book, our address will only be sent to some peers.	2021-05-18 14:02:19 +10:00
teor	d2a8985dbc	Reliability: Add inbound canonical addresses to the address book Add canonical addresses from inbound connections to the address book, so that Zebra can use them for reconnection attempts. Use the newly added `NeverAttemptedAlternate` state for these addresses, so we try gossiped addresses first, then canonical addresses. This avoids duplicate connections to inbound peers.	2021-05-18 14:02:19 +10:00
teor	458c26f1e3	Limit initial candidate set fanout to the number of initial peers If there is a small number of initial peers, and they are slow, the initial candidate set update can appear to hang. To avoid this issue, limit the initial candidate set fanout to the number of initial peers. Once the initial peers have sent us more peer addresses, there is no need to limit the fanouts for future updates. Reported by Niklas Long of Equilibrium.	2021-05-18 07:54:03 +10:00
teor	679920f6b8	Stop trying to resolve empty initial peer lists Instead, log an error and return immediately.	2021-05-18 07:54:03 +10:00
teor	b600e82d6e	Security: Avoid silently corrupting invalid times during serialization (#2149 ) * Security: panic if an internally generated time is out of range If Zebra has a bug where it generates blocks, transactions, or meta addresses with bad times, panic. This avoids sending bad data onto the network. (Previously, Zebra would truncate some of these times, silently corrupting the underlying data.) Make it clear that deserialization of these objects is infalliable.	2021-05-17 16:53:10 -04:00
teor	b0b8b2f61a	Add extra instrumentation for initialize and handshakes (#2122 ) * Instrument the crawl task When we created the crawl task, we forgot to instrument it with the global span. This fix makes sure that the git and network span appears on crawl logs. * Instrument the connector * Improve handshake instrumentation Make some spans debug, so there are not too many spans. * Add the address to initial peer connection errors	2021-05-17 16:49:16 -04:00
teor	7969459b19	Security: Move the Verack response after the version check (#2121 ) We should do as many local checks as possible, before sending further messages.	2021-05-17 16:39:44 -04:00
teor	c40cbee42f	Remove address book peers that have changed to clients If an address book peer stops advertising the NODE_SERVICES bit, remove it from the address book.	2021-05-14 23:45:42 +10:00
teor	f541f85792	Send unspecified addresses and client services for isolated connections	2021-05-14 23:45:42 +10:00
teor	9160365d06	Fix a comment	2021-05-14 23:45:42 +10:00
teor	a8a0d6450c	Security: stop gossiping temporary inbound remote addresses to peers - stop putting inbound addresses in the address book - drop address book entries that can't be used for outbound connections - distinguish between temporary inbound and permanent outbound peer addresses - also create variants to handle proxy connections (but don't use them yet) - avoid tracking connection state for isolated connections - document security constraints for the address book and peer set	2021-05-14 23:45:42 +10:00
teor	fde8f1e4ca	Security: stop panicking on out-of-range version timestamps, Credit: Equilibrium (#2148 ) * Security: stop panicking on out-of-range version timestamps Instead, return a deserialization error, and close the connection. This issue was reported by Equilibrium.	2021-05-14 17:13:11 +10:00
Pili Guerra	500dc2e511	Update version strings for Zebra v1.0.0-alpha.8 release (#2136 ) * Update versions for zebra v1.0.0-alpha.8 release * Update tower-batch and tower-fallback version strings * Update Cargo.lock	2021-05-12 14:27:36 +02:00
teor	1f40498fcf	Clippy nightly: disable owned cmp, stop comparing bool using assert_eq (#2073 ) * Disable clippy warnings about comparing a newly created struct In Sapling, we compare canonical JubJub bytes with a supplied byte array. Since we need to perform calculations to get it into canonical form, we need to create a newly owned object. * Clippy: use assert rather than assert_eq on a bool	2021-04-27 09:57:45 -03:00
Pili Guerra	ea1446ee92	Update version strings for Zebra v1.0.0-alpha.7 release (#2056 ) * Update version strings for Zebra v1.0.0-alpha.7 release	2021-04-23 12:56:25 +00:00
teor	7b13d5573a	Make String Zcash serialization consistent with deserialization After recent changes, serialization was `write_string`, but deserialization was `zcash_deserialize`.	2021-04-21 23:58:48 -04:00
Kirill Fomichev	afac2c2846	Use the default port for configured listen addresses with no port (#2043 ) * Allow use listen address in config without port * update comments * remove not used alias * use Network::default_port * Move tests and use toml instead json * change error message * Make match more readable Co-authored-by: teor <teor@riseup.net>	2021-04-21 23:14:29 +00:00
teor	0203d1475a	Refactor and document correctness for std::sync::Mutex<AddressBook>	2021-04-21 17:14:47 -04:00
teor	905b90d6a1	Refactor and document correctness for std::sync::Mutex in ErrorSlot	2021-04-21 16:39:06 -04:00
teor	3f45735f3f	Use futures:🔒:Mutex for the nonce set	2021-04-21 01:39:49 -04:00
teor	2ed8bb00cf	Clarify CandidateSet state diagram We get inbound connections on the listener port, but the important part is the inbound connection itself.	2021-04-21 01:37:43 -04:00
teor	ad272f2bee	Make sure handshake version negotiation always has a timeout As part of this change, refactor handshake version negotiation into its own function.	2021-04-19 18:31:28 -04:00
teor	2cecd52a10	Fix comment typo	2021-04-19 10:11:22 -04:00
teor	8fb12f07a1	Fix outdated comment	2021-04-19 10:11:22 -04:00
teor	eabadb8301	Make heartbeats wait for the connection queue to empty, with a timeout Also cleanup the heartbeat code, so each heartbeat request/response runs in a future with a single timeout.	2021-04-19 10:11:22 -04:00
teor	0def12f825	Add split array serialization functions for Transaction::V5 (#2017 ) * Add functions for serializing and deserializing split arrays In Transaction::V5, Zcash splits some types into multiple arrays, with a single prefix count before the first array. Add utility functions for serializing and deserializing the subsequent arrays, with a paramater for the original array's length. * Use zcash_deserialize_bytes_external_count in zebra-network * Move some preallocate proptests to their own file And fix the test module structure so it is consistent with the rest of zebra-chain. * Add a convenience alias zcash_serialize_external_count * Explain why u64::MAX items will never be reached	2021-04-16 08:23:00 +10:00
teor	381c20b6af	Security: change the GetAddr fanout to 3 Zebra avoids having a majority of addresses from a single peer by asking 3 peers for new addresses. Also update a bunch of security comments and related documentation.	2021-04-15 13:09:14 -04:00
teor	59aa04c9b9	Stop panicking when Zebra sends a reject without extra data Also add round-trip unit tests for extra data and no extra data.	2021-04-15 12:20:33 -04:00
teor	a417c7c8c7	Use meaningful names for select! variables	2021-04-13 23:56:16 -04:00
teor	fb95de99a6	Refactor the dial result into a From impl	2021-04-13 18:52:49 -04:00
Alfredo Garcia	5ec05e91e1	update version strings for v1.0.0-alpha.6	2021-04-08 18:48:34 -04:00
teor	1626ec383a	Add InventoryHash and MetaAddr proptests (#1985 ) * Make proptest dependencies consistent between chain and network * Implement Arbitrary for InventoryHash and use it in tests * Impl Arbitrary for MetaAddr and use it in tests Also test some extreme times in MetaAddr sanitization.	2021-04-07 14:13:52 -03:00
teor	375c8d8700	Fix a deadlock between the crawler and dialer, and other hangs (#1950 ) * Stop ignoring inbound message errors and handshake timeouts To avoid hangs, Zebra needs to maintain the following invariants in the handshake and heartbeat code: - each handshake should run in a separate spawned task (not yet implemented) - every message, error, timeout, and shutdown must update the peer address state - every await that depends on the network must have a timeout Once the Connection is created, it should handle timeouts. But we need to handle timeouts during handshake setup. * Avoid hangs by adding a timeout to the candidate set update Also increase the fanout from 1 to 2, to increase address diversity. But only return permanent errors from `CandidateSet::update`, because the crawler task exits if `update` returns an error. Also log Peers response errors in the CandidateSet. * Use the select macro in the crawler to reduce hangs The `select` function is biased towards its first argument, risking starvation. As a side-benefit, this change also makes the code a lot easier to read and maintain. * Split CrawlerAction::Demand into separate actions This refactor makes the code a bit easier to read, at the cost of sometimes blocking the crawler on `candidates.next()`. That's ok, because `next` only has a short (< 100 ms) delay. And we're just about to spawn a separate task for each handshake. * Spawn a separate task for each handshake This change avoids deadlocks by letting each handshake make progress independently. * Move the dial task into a separate function This refactor improves readability. * Fix buggy future::select function usage And document the correctness of the new code.	2021-04-07 10:25:10 -03:00
teor	de6d1c93f3	Clarify a comment	2021-04-07 18:56:38 +10:00
teor	64662a758d	Move the preallocate tests into their own files (#1977 ) * Move the preallocate tests into their own files And move the MetaAddr proptest into its own file. Also do some minor formatting and cleanups. Co-authored-by: Deirdre Connolly <durumcrustulum@gmail.com>	2021-04-07 12:32:27 +10:00
Preston Evans	0daaf582e2	Implement Trusted Vector Preallocation (#1920 ) * Implement SafePreallocate. Resolves #1880 * Add proptests for SafePreallocate * Apply suggestions from code review Comments which did not include replacement code will be addressed in a follow-up commit. Co-authored-by: teor <teor@riseup.net> * Rename [Safe-> Trusted]Allocate. Add doc and tests Add tests to show that the largest allowed vec under TrustedPreallocate is small enough to fit in a Zcash block/message (depending on type). Add doc comments to all TrustedPreallocate test cases. Tighten bounds on max_trusted_alloc for some types. Note - this commit does NOT include TrustedPreallocate impls for JoinSplitData, String, and Script. These impls will be added in a follow up commit * Implement SafePreallocate. Resolves #1880 * Add proptests for SafePreallocate * Apply suggestions from code review Comments which did not include replacement code will be addressed in a follow-up commit. Co-authored-by: teor <teor@riseup.net> * Rename [Safe-> Trusted]Allocate. Add doc and tests Add tests to show that the largest allowed vec under TrustedPreallocate is small enough to fit in a Zcash block/message (depending on type). Add doc comments to all TrustedPreallocate test cases. Tighten bounds on max_trusted_alloc for some types. Note - this commit does NOT include TrustedPreallocate impls for JoinSplitData, String, and Script. These impls will be added in a follow up commit * Impl TrustedPreallocate for Joinsplit * Impl ZcashDeserialize for Vec<u8> * Arbitrary, TrustedPreallocate, Serialize, and tests for Spend<SharedAnchor> Co-authored-by: teor <teor@riseup.net>	2021-04-06 09:49:42 +10:00
teor	83b88f5b7a	Merge pull request #1972 from ZcashFoundation/peer-set-demand-deadlock-doc Document peer set deadlock resistance	2021-04-01 22:50:17 -04:00
teor	306fa88214	Document the correctness of Poll::Pending wakeups	2021-03-27 08:55:49 -04:00
teor	b329892665	Add a comment about a zcashd inv message bug	2021-03-26 11:26:59 -04:00
teor	1a159dfcb6	Add more methods for creating MetaAddrs This refactor lets us remove `MetaAddr::update_last_seen()`.	2021-03-26 07:23:49 +10:00
teor	6fe81d8992	Make MetaAddr.last_seen into a private field	2021-03-26 07:23:49 +10:00
teor	eae59de1e8	use PeerAddrState::*	2021-03-26 07:23:49 +10:00
teor	e9cdc224a2	Rewrite MetaAddr::sanitize so it's harder to misuse `sanitize` could be misused in two ways: * accidentally modifying the addresses in the address book itself * forgetting to sanitize new fields added to `MetaAddr` This change prevents accidental modification by taking `&self`, and explicitly creates a new sanitized `MetaAddr` with all fields listed.	2021-03-26 07:23:49 +10:00
Deirdre Connolly	c5bad9fac2	Rename NU5 to Nu5 to appease newly stable clippy::upper-case-acronyms (#1945 )	2021-03-26 07:22:50 +10:00
Deirdre Connolly	7efc700aca	Merge pull request #1713 from ZcashFoundation/use-groth16-batch-math Use batch optimizations, load params in groth16::Verifier, verify Spend & Output descriptions in transaction verifier	2021-03-24 12:28:25 -04:00
Deirdre Connolly	ca1d2de87d	Bump versions for v1.0.0-alpha.5 (#1932 ) Zebra's latest alpha checkpoints on Canopy activation, continues our work on NU5, and fixes a security issue. Some notable changes include: ## Added - Log address book metrics when PeerSet or CandidateSet don't have many peers (#1906) - Document test coverage workflow (#1919) - Add a final job to CI, so we can easily require all the CI jobs to pass (#1927) ## Changed - Zebra has moved its mandatory checkpoint from Sapling to Canopy (#1898, #1926) - This is a breaking change for users that depend on the exact height of the mandatory checkpoint. ## Fixed - tower-batch: wake waiting workers on close to avoid hangs (#1908) - Assert that pre-Canopy blocks use checkpointing (#1909) - Fix CI disk space usage by disabling incremental compilation in coverage builds (#1923) ## Security - Stop relying on unchecked length fields when preallocating vectors (#1925)	2021-03-22 22:05:01 -04:00
Alfredo Garcia	c5b1d0deee	move consts to start of the function	2021-03-22 11:54:31 -04:00
teor	b623acc945	Add memory DoS prevention comments	2021-03-22 11:54:31 -04:00
teor	8e18c99cdc	Avoid risky use of Read::take with untrusted lengths Zebra already uses `Read::take` to enforce message, body, and block maximum sizes. So using `Read::take` on untrusted sizes can result in short reads, without a corresponding `UnexpectedEof` error. (The old code was correct, but copying it elsewhere would have been risky.)	2021-03-22 11:54:31 -04:00
teor	609d70ae53	Stop untrusted preallocation during string deserialization This is an easy memory denial of service attack.	2021-03-22 11:54:31 -04:00
teor	4f923b90ea	Log address book metrics when peers aren't responding	2021-03-17 10:47:04 +10:00
teor	5a30268d7a	Log address metrics when the peer set has no ready peers	2021-03-17 10:47:04 +10:00
teor	6a342e93ca	Refactor AddressBook metrics into their own struct And provide an accessor function for address book metrics.	2021-03-17 10:47:04 +10:00
Alfredo Garcia	d49eaab68e	Bump versions for zebrad 1.0.0-alpha.4 (#1913 ) * Bump versions for zebrad 1.0.0-alpha.4 * add Cargo.lock	2021-03-16 21:12:37 -03:00
Jack Grigg	7a8cae9321	Tag message metrics by type	2021-03-17 09:38:07 +10:00
Jack Grigg	e51f33a4b9	Use interoperable names for common metrics These names match the equivalent metrics in zcashd, enabling common metrics to be collected across both node types.	2021-03-17 09:38:07 +10:00
teor	8fabbce037	Document and log trailing message bytes (#1888 ) * Rename a variable for consistency * Log extra trailing message bytes at debug level	2021-03-15 08:25:27 +10:00
teor	976ec912db	Document that the listed address is also advertised to peers (#1891 ) Documents a potential privacy leak, and a missing feature.	2021-03-15 08:25:07 +10:00
teor	e50692bd51	CandidateSet: Add Listener Port Connections Inbound connections on the Zcash protocol listener port perform a handshake. If the handshake is successful, it adds the peer to the AddressBook.	2021-03-09 23:05:18 -05:00
Jane Lusby	03aa6f671f	Implement outbound connection rate limiting - includes config rename with alias (#1855 ) * Implement outbound connection rate limiting * fix breaking change on config Co-authored-by: teor <teor@riseup.net>	2021-03-10 01:36:05 +00:00
Jane Lusby	e541746a50	Add initial support for NU5 to zebra (#1823 ) * Add NU5 variant to NetworkUpgrade * Add consensus branch ID for NU5 * Add network protocol versions for NU5 * Add NU5 to the protocol::version_consistent test * Make unimplemented panic messages more specific * Block target spacing doesn't change in NU5 * add comments for future updates for NU5 Co-authored-by: teor <teor@riseup.net>	2021-03-03 06:22:11 +10:00
teor	895bb43ead	Clippy: Fix inconsistent struct member orders lint	2021-03-01 23:31:18 -05:00
teor	2587a4e272	Fix a peer DNS resolution edge case (#1796 ) * Retry each peer DNS a few times individually We retry each peer individually, as well as retrying if there are no peers in the combined list. DNS failures are correlated, so all peers can fail DNS, leaving Zebra with a small list of custom-configured IP address peers. Individual retries avoid this issue. * Rename parse_peers to resolve_peers Co-authored-by: Deirdre Connolly <durumcrustulum@gmail.com>	2021-02-26 09:06:27 +10:00
teor	9c3f236075	Stop sending blocks and transactions on error	2021-02-25 08:44:57 -08:00
teor	78f162733d	Revert "leverage return value for propagating errors" This reverts commit `e6cb20e13f`.	2021-02-24 13:07:31 -08:00
teor	72e2e83828	Revert "introduce Transition enum" This reverts commit `6906f87ead`.	2021-02-24 13:07:31 -08:00
teor	a5e89f4f2b	Revert "accidental drop on mustusesender" This reverts commit `5ec8d09e0d`.	2021-02-24 13:07:31 -08:00
teor	d60226a3cf	Revert "rustfmt" This reverts commit `9d9734ea81`.	2021-02-24 13:07:31 -08:00
teor	359015b2be	Revert "Only reject pending client requests when the peer has errored" This reverts commit `e06705ed81`.	2021-02-24 13:07:31 -08:00
teor	663ed6c842	Revert "Remove remaining references to fail_with" This reverts commit `5e4bf804aa`.	2021-02-24 13:07:31 -08:00
teor	3c225550ee	Revert "rename transitions from Exit to Close" This reverts commit `cfc4717b98`.	2021-02-24 13:07:31 -08:00
teor	86dc66dfa9	Revert "deduplicate match arms in handle_client_request" This reverts commit `2adee7b31a`.	2021-02-24 13:07:31 -08:00
teor	292a4391e2	Revert "update comments throughout connection.rs" This reverts commit `651d352ce1`.	2021-02-24 13:07:31 -08:00
teor	fc44a97925	Revert "remove unnecessary Option around request timeout" This reverts commit `c3724031df`.	2021-02-24 13:07:31 -08:00
teor	e06120cd36	Revert "ensure peer/client.rs comments are up to date" This reverts commit `2266886a53`.	2021-02-24 13:07:31 -08:00
teor	1a70d807b6	Revert "make sure peer/error.s comments are up to date" This reverts commit `6f205a1812`.	2021-02-24 13:07:31 -08:00
teor	3b2077fcfd	Revert "Apply suggestions from code review" This reverts commit `736092abb8`.	2021-02-24 13:07:31 -08:00
teor	7558f74c78	Bump versions for zebrad 1.0.0-alpha.3	2021-02-23 10:39:13 -05:00
dependabot[bot]	b578d1ff2e	build(deps): bump proptest-derive from 0.2.0 to 0.3.0 Bumps [proptest-derive](https://github.com/AltSysrq/proptest) from 0.2.0 to 0.3.0. - [Release notes](https://github.com/AltSysrq/proptest/releases) - [Changelog](https://github.com/AltSysrq/proptest/blob/master/CHANGELOG.md) - [Commits](https://github.com/AltSysrq/proptest/compare/proptest-derive-0.2.0...proptest-derive-0.3.0) Signed-off-by: dependabot[bot] <support@github.com>	2021-02-22 01:33:54 -05:00
teor	d4f2f27218	Add global span to spawned network tasks (#1761 ) Closes #1575	2021-02-20 08:36:50 +10:00
ebfull	b7fddbde94	Compute the expected body length to reduce heap allocations (#1773 ) * Compute the expected body length to reduce heap allocations	2021-02-19 22:18:57 +00:00
Jane Lusby	736092abb8	Apply suggestions from code review Co-authored-by: teor <teor@riseup.net>	2021-02-19 14:11:35 -08:00
Jane Lusby	6f205a1812	make sure peer/error.s comments are up to date	2021-02-19 14:11:35 -08:00
Jane Lusby	2266886a53	ensure peer/client.rs comments are up to date	2021-02-19 14:11:35 -08:00
Jane Lusby	c3724031df	remove unnecessary Option around request timeout	2021-02-19 14:11:35 -08:00
Jane Lusby	651d352ce1	update comments throughout connection.rs	2021-02-19 14:11:35 -08:00
Jane Lusby	2adee7b31a	deduplicate match arms in handle_client_request	2021-02-19 14:11:35 -08:00
Jane Lusby	cfc4717b98	rename transitions from Exit to Close	2021-02-19 14:11:35 -08:00
teor	5e4bf804aa	Remove remaining references to fail_with	2021-02-19 14:11:35 -08:00
teor	e06705ed81	Only reject pending client requests when the peer has errored - Add an `ExitClient` transition, used when the internal client channel is closed or dropped, and there are no more pending requests - Ignore pending requests after an `ExitClient` transition - Reject pending requests when the peer has caused an error (the `Exit` and `ExitRequest` transitions) - Remove `PeerError::ConnectionDropped`, because it is now handled by `ExitClient`. (Which is an internal error, not a peer error.)	2021-02-19 14:11:35 -08:00
teor	9d9734ea81	rustfmt	2021-02-19 14:11:35 -08:00
Jane Lusby	5ec8d09e0d	accidental drop on mustusesender	2021-02-19 14:11:35 -08:00
Jane Lusby	6906f87ead	introduce Transition enum	2021-02-19 14:11:35 -08:00
Jane Lusby	e6cb20e13f	leverage return value for propagating errors	2021-02-19 14:11:35 -08:00
teor	e61b5e50a2	Diagnostics for CI port conflict failures (#1766 ) Log a "Trying..." message before each listener opens, to see if the delay is inside Zebra, or in the test harness or OS. Also report the configured and actual ports where possible, for better diagnostics.	2021-02-18 12:15:09 -03:00
teor	5424e1d8ba	Fix candidate set address state handling (#1709 ) Design: - Add a `PeerAddrState` to each `MetaAddr` - Use a single peer set for all peers, regardless of state - Implement time-based liveness as an `AddressBook` method, rather than a `PeerAddrState` variant - Delete `AddressBook.by_state` Implementation: - Simplify `AddressBook` changes using `update` and `take` modifier methods - Simplify the `AddressBook` iterator implementation, replacing it with methods that are more obviously correct - Consistently collect peer set metrics Documentation: - Expand and update the peer set documentation We can optimise later, but for now we want simple code that is more obviously correct.	2021-02-18 11:18:32 +10:00
teor	579bd4a368	Retry DNS resolution on failure (#1762 ) Otherwise, a transient DNS failure makes the node hang.	2021-02-18 07:09:02 +10:00
teor	86169f6412	Update PeerSet metrics after every change (#1727 )	2021-02-18 07:06:59 +10:00
teor	8d1c498234	Log initial peer connection failures And standardise another log message	2021-02-17 09:21:53 -05:00
teor	e85441c914	Add a correctness comment to justify the revert	2021-02-16 05:52:54 +10:00
teor	a02a00a3f5	Revert "Stop using CallAllUnordered in peer_set::add_initial_peers (#1705 )" This reverts commit `241c7ad849`.	2021-02-16 05:52:54 +10:00
teor	e7176b86da	Clarify the Response::Nil documentation	2021-02-11 09:45:42 -05:00
Deirdre Connolly	0c5daa8410	Bump versions for zebrad 1.0.0-alpha.2 Including tower-batch bump to 0.2.0, tower-fallback to 0.2.0, zebra-script to 1.0.0-alpha.3	2021-02-09 16:14:29 -05:00
Alfredo Garcia	241c7ad849	Stop using CallAllUnordered in peer_set::add_initial_peers (#1705 ) * use ServiceExt::oneshot and FuturesUnordered Co-authored-by: teor <teor@riseup.net>	2021-02-09 08:16:02 +10:00
teor	1e156a5d60	Document that connect_isolated only works on mainnet Document that connect_isolated only works on mainnet. See #1687.	2021-02-04 17:32:00 -05:00
Alfredo Garcia	d7c40af2a8	Fix shutdown panics (#1637 ) * add a shutdown flag in zebra_chain::shutdown * fix network panic on shutdown * fix checkpoint panic on shutdown	2021-02-03 19:03:28 +10:00
Alfredo Garcia	221512c733	Async DNS seeder lookups (#1662 ) * replace to_socket_addrs * refactor `resolve()` into `resolve_host()` * use `resolve_host()` to resolve config peers * add DNS_LOOKUP_TIMEOUT constant * don't block the main thread in initialize	2021-02-03 12:20:26 +10:00
teor	983e94f9e4	Add a TODO for inbound error handling cleanup	2021-02-03 08:32:10 +10:00
Alfredo Garcia	4b34482264	Add hints to port conflict and lock file panics (#1535 ) * add hint for port error * add issue filter for port panic * add lock file hint * add metrics endpoint port conflict hint * add hint for tracing endpoint port conflict * add acceptance test for resource conflics * Split out common conflict test code into a function * Add state, metrics, and tracing conflict tests * Add a full set of stderr acceptance test functions This change makes the stdout and stderr acceptance test interfaces identical. * move Zcash listener opening * add todo about hint for disk full * add constant for lock file * match path in state cache * don't match windows cache path * Use Display for state path logs Avoids weird escaping on Windows when using Debug * Add Windows conflict error messages * Turn PORT_IN_USE_ERROR into a regex And add another alternative Windows-specific port error Co-authored-by: teor <teor@riseup.net> Co-authored-by: Jane Lusby <jane@zfnd.org>	2021-01-29 22:36:33 +10:00
Deirdre Connolly	1b09538277	Bump versions for zebrad 1.0.0-alpha.1 (#1646 ) * Bump versions where appropriate Tested with cargo install --locked --path etc * Remove fixed panics from 'Known Issues' * Change to alpha release series in the README Co-authored-by: teor <teor@riseup.net>	2021-01-27 20:31:39 -05:00
teor	b551d81f8d	Explain why we stay connected on Inbound errors We might be syncing using this peer, so it's ok to just ignore any internal errors in their Inbound requests, and drop the request.	2021-01-27 12:08:49 -08:00
teor	258789ed9b	Use the rustc unknown lints attribute The clippy unknown lints attribute was deprecated in nightly in rust-lang/rust#80524. The old lint name now produces a warning. Since we're using `allow(unknown_lints)` to suppress warnings, we need to add the canonical name, so we can continue to build without warnings on nightly. But we also need to keep the old name, so we can continue to build without warnings on stable. And therefore, we also need to disable the "removed lints" warning, otherwise we'll get warnings about the old name on nightly. We'll need to keep this transitional clippy config until rustc 1.51 is stable.	2021-01-19 11:02:20 -05:00
teor	05fff8e6f7	Revert "Stop panicking when fail_with is called twice on a connection" But keep the extra error information.	2021-01-18 00:23:36 -05:00
teor	4fe81da953	Improve logging for connection state errors	2021-01-18 00:23:36 -05:00
teor	a6c1cd3c35	Stop panicking when fail_with is called twice on a connection We can't rule out the connection state changing between the state checks and any eventual failures, particularly in the presence of async code. So we turn this panic into a warning.	2021-01-18 00:23:36 -05:00
teor	44c8fafc29	Stop processing the request after failing an overloaded connection zebra-network's Connection expects that `fail_with` is only called once per connection, but the overload handling code continues to process the current request after an overload error, potentially leading to further failures. Closes #1599	2021-01-18 00:23:36 -05:00
teor	0f0fb93b5c	Update some comments in zebra-network Add ticket numbers, and update based on design decisions and new code.	2021-01-15 09:02:10 -05:00
teor	730910cd99	Upgrade to tokio 0.3.6 from crates.io And remove the tokio git dependency patch	2021-01-12 15:37:27 -05:00
Jane Lusby	15698245e1	Deduplicate metrics dependencies (#1561 ) ## Motivation This PR is motivated by the regression identified in https://github.com/ZcashFoundation/zebra/issues/1349. That PR notes that the metrics stopped working for most of the crates other than `zebrad`. ## Solution This PR resolves the regression by deduplicating the `metrics` crate dependency. During a recent change we upgraded the metrics version in `zebrad` and a couple other of our crates, but we never updated the dependencies in `zebra-state`, `zebra-consensus`, or `zebra-network`. This caused the metrics macros to attempt to retrieve the current metrics exporter through the wrong function. We would install the metrics exporter in `0.13`, but then attempt to look it up through the `0.12` crate, which contains a different instance of the metrics exporter static variable which is unset. Doing this causes the metrics macros to return `None` for the current exporter after which they just silently give up. ## Related Issues closes https://github.com/ZcashFoundation/zebra/issues/1349 ## Follow Up Work I noticed we have quite a few duplicate dependencies in our tree. We might be able to save some compilation time by auditing those and deduplicating them as much as possible. - https://github.com/ZcashFoundation/zebra/issues/1582 Co-authored-by: teor <teor@riseup.net>	2021-01-12 12:28:56 +10:00
dependabot[bot]	38ac869f57	build(deps): bump byteorder from 1.3.4 to 1.4.2 Bumps [byteorder](https://github.com/BurntSushi/byteorder) from 1.3.4 to 1.4.2. - [Release notes](https://github.com/BurntSushi/byteorder/releases) - [Changelog](https://github.com/BurntSushi/byteorder/blob/master/CHANGELOG.md) - [Commits](https://github.com/BurntSushi/byteorder/compare/1.3.4...1.4.2) Signed-off-by: dependabot[bot] <support@github.com>	2021-01-11 18:45:49 -05:00
teor	b7d0a40ee1	Revert unused instrument macros Reverts most of "Instrument some functions to try to locate the panic"	2021-01-06 13:07:23 -08:00
teor	6d3aa0002c	Ensure received client request oneshots are used via the type system The `peer::Client` translates `Request`s into `ClientRequest`s, which it sends to a background task. If the send is `Ok(())`, it will assume that it is safe to unconditionally poll the `Receiver` tied to the `Sender` used to create the `ClientRequest`. We enforce this invariant via the type system, by converting `ClientRequest`s to `InProgressClientRequest`s when they are received by the background task. These conversions are implemented by `ClientRequestReceiver`. Changes: * Revert `ClientRequest` so it uses a `oneshot::Sender` * Add `InProgressClientRequest`, which is the same as `ClientRequest`, but has a `MustUseOneshotSender` * `impl From<ClientRequest> for InProgressClientRequest` * Add a new `ClientRequestReceiver` type that wraps a `mpsc::Receiver<ClientRequest>` * `impl Stream<InProgressClientRequest> for ClientRequestReceiver`, converting the successful result of `inner.poll_next_unpin` into an `InProgressClientRequest` * Replace `client_rx: mpsc::Receiver<ClientRequest>` in `Connection` with the new `ClientRequestReceiver` type * `impl From<mpsc::Receiver<ClientRequest>> for ClientRequestReceiver`	2021-01-06 13:07:23 -08:00
teor	df1b0c8d58	Defer a timeout fix until later	2021-01-06 13:07:23 -08:00
teor	d5cfd5ad5f	Clarify the ClientRequest invariant Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2021-01-06 13:07:23 -08:00
teor	f8ff2e9c0b	Add more sends before dropping ClientRequests This fix also changes heartbeat behaviour in the following ways: * if the queue is full, the connection is closed. Previously, the sender would wait until the queue had emptied * if the queue flush fails, Zebra panics, because it can't send an error on the ClientRequest sender, so the invariant is broken	2021-01-06 13:07:23 -08:00
teor	3e711ccc8a	Instrument some functions to try to locate the panic	2021-01-06 13:07:23 -08:00
teor	fa29fca917	Panic when must-use senders are dropped before use Add a MustUseOneshotSender, which panics if its inner sender is unused. Callers must call `send()` on the MustUseOneshotSender, or ensure that the sender is canceled. Replaces an unreliable panic in `Client::call()` with a reliable panic when a must-use sender is dropped.	2021-01-06 13:07:23 -08:00
teor	b03809ebe3	Add the invalid state to an unreachable panic message	2021-01-06 13:07:23 -08:00
teor	86136c7b5c	Stop ignoring errors when the new state is AwaitingRequest The previous code would send a Nil message on the Sender, even if the result was actually an error.	2021-01-06 13:07:23 -08:00
teor	da5084a10a	Split the 3-level match using a temporary	2021-01-06 13:07:23 -08:00

1 2 3 4 5 ...

611 Commits