zebra

Commit Graph

Author	SHA1	Message	Date
teor	92c623eddf	Log each genesis download This change helps us diagnose sync hangs.	2020-10-28 11:31:04 -04:00
teor	656bd24ba7	Hedge every syncer block download request Remove the minimum data points from the syncer hedge configuragtion. When there are no data points, hedge sends the second request immediately. Where there are less than 1/(1-latency_percentile) data points (20), hedge delays the second request by the highest recent download time. This change should improve genesis and post-restart sync latency.	2020-10-28 11:31:04 -04:00
teor	ea510b7d41	Run a block sync in CI with 2 large checkpoints (#1193 ) * Run large checkpoint sync tests in CI * Improve test child output match error context * Add a debug_stop_at_height config * Use stop at height in acceptance tests And add some restart acceptance tests, to make sure the stop at height feature works correctly.	2020-10-27 19:25:29 +10:00
Henry de Valence	4c960c4e6d	zebrad: treat duplicate downloads as an error We should error if we notice that we're attempting to download the same blocks multiple times, because that indicates that peers reported bad information to us, or we got confused trying to interpret their responses.	2020-10-26 12:05:35 -07:00
Henry de Valence	4127d086ea	zebrad: clarify hedge layering motivation Co-authored-by: teor <teor@riseup.net>	2020-10-26 12:05:35 -07:00
Henry de Valence	253bab042e	sync: add a concurrency limit for block downloads	2020-10-26 12:05:35 -07:00
Henry de Valence	0a405c737d	zebrad: check state in obtaintips, not extendtips. The original sync algorithm split the sync process into two phases, one that obtained prospective chain tips, and another that attempted to extend those chain tips as far as possible until encountering an error (at which point the prospective state is discarded and the process restarts). Because a previous implementation of this algorithm didn't properly enforce linkage between segments of the chain while extending tips, sometimes it would get confused and fail to discard responses that did not extend a tip. To mitigate this, a check against the state was added. However, this check can cause stalls while checkpointing, because when a checkpoint is reached we may suddenly need to commit thousands of blocks to the state. Because the sync algorithm now has a a `CheckedTip` structure that ensures that a new segment of hashes actually extends an existing one, we don't need to check against the state while extending a tip, because we don't get confused while interpreting responses. This change results in significantly smoother progress on mainnet.	2020-10-26 12:05:35 -07:00
Henry de Valence	65e0c22fbe	state: don't pre-buffer the service There's no reason to return a pre-Buffer'd service (there's no need for internal access to the state service, as in zebra-network), but wrapping it internally removes control of the buffer size from the caller.	2020-10-26 12:05:35 -07:00
Henry de Valence	ce2ac3336f	zebrad: add debug message before state check This reveals that there may be contention in access to the state, as this takes a long time.	2020-10-26 12:05:35 -07:00
Henry de Valence	91469faf3c	zebrad: eliminate duplicate span in sync	2020-10-26 12:05:35 -07:00
Henry de Valence	b5a43f4516	zebrad: remove implementation details from docs The timeout behavior in zebra-network is an implementation detail, not a feature of the public API. So it shouldn't be mentioned in the doc comments -- if we want timeout behavior, we have to layer it ourselves.	2020-10-26 12:05:35 -07:00
Henry de Valence	1d7309afe2	zebrad: correctly handle duplicates in DownloadSet Using the cancel_handles, we can deduplicate requests. This is important to do, because otherwise when we insert the second cancel handle, we'd drop the first one, cancelling an existing task for no reason.	2020-10-26 12:05:35 -07:00
Henry de Valence	56fe4f4379	zebrad: unify sync restart logic This lets us keep the main loop simple and just write `continue 'sync;` to keep going.	2020-10-26 12:05:35 -07:00
Henry de Valence	12d25159c6	zebrad: use hedged requests in sync The hedge middleware implements hedged requests, as described in _The Tail At Scale_. The idea is that we auto-tune our retry logic according to the actual network conditions, pre-emptively retrying requests that exceed some latency percentile. This would hopefully solve the problem where our timeouts are too long on mainnet and too slow on testnet.	2020-10-26 12:05:35 -07:00
Henry de Valence	5f229d1475	zebrad: use Downloads in sync Try to use the better cancellation logic to revert to previous sync algorithm. As designed, the sync algorithm is supposed to proceed by downloading state prospectively and handle errors by flushing the pipeline and starting over. This hasn't worked well, because we didn't previously cancel tasks properly. Now that we can, try to use something in the spirit of the original sync algorithm.	2020-10-26 12:05:35 -07:00
Henry de Valence	b90581a3d7	zebrad: create a Downloads Stream for syncing. This makes two changes relative to the existing download code: 1. It uses a oneshot to attempt to cancel the download task after it has started; 2. It encapsulates the download creation and cancellation logic into a Downloads struct.	2020-10-26 12:05:35 -07:00
Henry de Valence	b636660d6a	zebrad: rename sync::Error alias to BoxError.	2020-10-26 12:05:35 -07:00
dependabot[bot]	ff51c2e0c0	build(deps): bump tracing-subscriber from 0.2.13 to 0.2.14 Bumps [tracing-subscriber](https://github.com/tokio-rs/tracing) from 0.2.13 to 0.2.14. - [Release notes](https://github.com/tokio-rs/tracing/releases) - [Commits](https://github.com/tokio-rs/tracing/compare/tracing-subscriber-0.2.13...tracing-subscriber-0.2.14) Signed-off-by: dependabot[bot] <support@github.com>	2020-10-23 15:02:02 -04:00
Henry de Valence	cab96aa1a8	zebrad: clarify config help text (#1194 )	2020-10-22 15:03:01 +10:00
Alfredo Garcia	21ad6ffc47	Reverse displayed endianness of transaction and block hashes (#1171 ) * Reverse displayed endianness of transaction and block hashes * fix zebra-checkpoints utility for new hash order * Stop using "zebrad revhex" in zebrad-hash-lookup * Rebuild checkpoint lists in new hash order This change also adds additional checkpoints to the end of each list. * Replace TransactionHash with transaction::Hash This change should have been made in #905, but we missed Debug impls and some docs. Co-authored-by: Ramana Venkata <vramana@users.noreply.github.com> Co-authored-by: teor <teor@riseup.net>	2020-10-22 07:54:02 +10:00
teor	e52a1c07a3	Ignore longer sync tests by default	2020-10-21 21:08:04 +10:00
teor	0d121833af	Add sync tests that download 2000 blocks	2020-10-21 21:08:04 +10:00
teor	6fe3cc56dd	Refactor sync test to be more flexible And add documentation	2020-10-21 00:58:08 -04:00
teor	1d35c5a0b9	Enable the zebrad sync tests by default If your test environment does not have DNS or network access, set the ZEBRA_SKIP_NETWORK_TESTS environmental variable to disable these tests.	2020-10-21 00:58:08 -04:00
Henry de Valence	eb43893de0	consensus: minimize API, clean docs This reduces the API surface to the minimum required for functionality, and cleans up module documentation. The stub mempool module is deleted entirely, since it will need to be redone later anyways.	2020-10-20 11:16:22 -04:00
teor	d9fbba8a55	Skip the sync tests when ZEBRA_SKIP_NETWORK_TESTS is set	2020-10-16 15:21:01 -04:00
teor	04ce907dbf	Remove duplicate code in zebra_test::command	2020-10-15 19:54:00 -04:00
teor	32bbc19c6b	Fix a timeout bug in zebra_test::command And add tests for the command functionality. Also document some remaining bugs (see #1140).	2020-10-15 19:54:00 -04:00
teor	92f0c934cf	Add a sync acceptance test for the Testnet	2020-10-15 19:54:00 -04:00
Alfredo Garcia	2d3c3bcc23	add systemd service file	2020-10-14 15:33:00 -04:00
Alfredo Garcia	c0a14ecc8c	move genesis parameters to zebra-chain (#1151 )	2020-10-12 14:08:23 -07:00
dependabot[bot]	76e7e3d714	build(deps): bump tracing-subscriber from 0.2.12 to 0.2.13 Bumps [tracing-subscriber](https://github.com/tokio-rs/tracing) from 0.2.12 to 0.2.13. - [Release notes](https://github.com/tokio-rs/tracing/releases) - [Commits](https://github.com/tokio-rs/tracing/compare/tracing-subscriber-0.2.12...tracing-subscriber-0.2.13) Signed-off-by: dependabot[bot] <support@github.com>	2020-10-08 15:09:32 -04:00
Jane Lusby	855f9b5bcb	Implement MVP of NonFinalizedState and integrate it with the state service (#1101 ) * implement most of the chain functions * implement fork * fix outpoint handling in Chain struct * update expect for work * split utxo into two sets * update the Chain definition * remove allow attribute in zebra-state/lib.rs * merge ChainSet type into MemoryState * Add error messages to asserts * export proptest impls for use in downstream crates * add testjob for disabled feature in zebra-chain * try to fix github actions syntax * add module doc comment * update RFC for utxos * add missing header * working proptest for Chain * propagate back results over channel * Start updating RFC to match changes * implement queued block pruning * and now it syncs wooo! * remove empty modules * setup config for proptests * re-enable missing_docs lint * update RFC to match changes in impl * add documentation * use more explicit variable names	2020-10-08 13:07:32 +10:00
dependabot[bot]	23a62a2d87	build(deps): bump inferno from 0.10.0 to 0.10.1 Bumps [inferno](https://github.com/jonhoo/inferno) from 0.10.0 to 0.10.1. - [Release notes](https://github.com/jonhoo/inferno/releases) - [Changelog](https://github.com/jonhoo/inferno/blob/master/CHANGELOG.md) - [Commits](https://github.com/jonhoo/inferno/compare/v0.10.0...v0.10.1) Signed-off-by: dependabot[bot] <support@github.com>	2020-10-06 05:31:01 -04:00
dependabot[bot]	d769f62a73	build(deps): bump color-eyre from 0.5.5 to 0.5.6 Bumps [color-eyre](https://github.com/yaahc/color-eyre) from 0.5.5 to 0.5.6. - [Release notes](https://github.com/yaahc/color-eyre/releases) - [Changelog](https://github.com/yaahc/color-eyre/blob/master/CHANGELOG.md) - [Commits](https://github.com/yaahc/color-eyre/compare/v0.5.5...v0.5.6) Signed-off-by: dependabot[bot] <support@github.com>	2020-10-05 11:26:23 -04:00
Jane Lusby	40e22808c7	disable reporting url for timeout errors (#1087 ) * disable reporting url for timeout errors * revert newline removal * switch to released color-eyre version	2020-09-21 16:15:09 -07:00
Henry de Valence	fe61090a64	zebrad: make Inbound Poll::Ready before setup. The Inbound service only needs the network setup for some requests, but it can service other requests without it. Making it return Poll::Pending until the network setup finishes means that initial network connections may view the Inbound service as overloaded and attempt to load-shed.	2020-09-21 09:26:39 -07:00
dependabot[bot]	85241a49d6	build(deps): bump hyper from 0.13.7 to 0.13.8 Bumps [hyper](https://github.com/hyperium/hyper) from 0.13.7 to 0.13.8. - [Release notes](https://github.com/hyperium/hyper/releases) - [Changelog](https://github.com/hyperium/hyper/blob/master/CHANGELOG.md) - [Commits](https://github.com/hyperium/hyper/compare/v0.13.7...v0.13.8) Signed-off-by: dependabot[bot] <support@github.com>	2020-09-21 11:58:31 -04:00
Henry de Valence	9c021025a7	network: fill in remaining request/response pairs	2020-09-20 10:21:18 -07:00
Henry de Valence	4b35fea492	zebrad: document Inbound, ChainSync responsibilities	2020-09-18 18:34:25 -07:00
Henry de Valence	65877cb4b1	zebrad: make Inbound propagate backpressure	2020-09-18 18:34:25 -07:00
Henry de Valence	55f46967b2	zebrad: serve blocks from Inbound service The original version of this commit ran into https://github.com/rust-lang/rust/issues/64552 again. Thanks to @yaahc for suggesting a workaround (using futures combinators to avoid writing an async block).	2020-09-18 18:34:25 -07:00
Henry de Valence	170f588ffb	network: document load-shedding behavior This was part of the original design and is described in the Connection internals, but we never documented it externally.	2020-09-18 18:34:25 -07:00
Henry de Valence	1d0ebf89c6	zebrad: move seed command into inbound component Remove the seed command entirely, and make the behavior it provided (responding to `Request::Peers`) part of the ordinary functioning of the start command. The new `Inbound` service should be expanded to handle all request types.	2020-09-18 18:34:25 -07:00
Henry de Valence	1d3892e1dc	network: rename alias to BoxError This is shorter and consistent with Tower (which is why we use it in the first place).	2020-09-18 18:34:25 -07:00
Jane Lusby	ca648ff27c	Enable issue-url feature in color-eyre (#1072 ) * Enable issue-url feature in color-eyre * get version automatically * and the url!	2020-09-17 15:09:18 -07:00
dependabot[bot]	ba32d27f6e	build(deps): bump tracing-subscriber from 0.2.11 to 0.2.12 (#1059 ) Bumps [tracing-subscriber](https://github.com/tokio-rs/tracing) from 0.2.11 to 0.2.12. - [Release notes](https://github.com/tokio-rs/tracing/releases) - [Commits](https://github.com/tokio-rs/tracing/compare/tracing-subscriber-0.2.11...tracing-subscriber-0.2.12) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2020-09-14 13:49:07 -07:00
Jane Lusby	a7b418bfe5	Add test for first checkpoint verification (#1018 ) * add test for first checkpoint sync Prior this this change we've not had any tests that verify our sync / network logic is well behaved. This PR cleans up the test helper code to make error reports more consistent and uses this cleaned up API to implement a checkpoint sync test which runs zebrad until it reads the first checkpoint event from stdout. Co-authored-by: teor <teor@riseup.net> * move include out of unix cfg Co-authored-by: teor <teor@riseup.net>	2020-09-11 13:39:39 -07:00
Henry de Valence	3133214e4f	zebrad: use new state API	2020-09-11 13:37:49 -07:00
teor	b1e1291f45	Log inbound peer requests at debug Logging at info was a bit too verbose. Also add a short log message.	2020-09-10 09:46:53 -07:00

1 2 3 4 5 ...

271 Commits