zebra

Commit Graph

Author	SHA1	Message	Date
teor	92eb92d1dd	Disable the nightly clippy unnecessary_wraps lint (#1403 ) It seems to be a bit broken - some of our functions return `Result` for consistency with similar functions. But the lint picks them up anyway.	2020-12-01 12:20:57 +10:00
Henry de Valence	4fa119dd1f	chain: fix consensus-critical coinbase encoding bug The `CoinbaseData` parses the block height separately from the rest of the free-form coinbase data. However, it had two bugs: 1. It did not require that the height was canonically encoded; 2. Its canonical encoding was incorrect relative to the BIP34-inherited encoding. This meant that we computed some transaction hashes incorrectly, because when we re-serialized the coinbase transaction, we would canonically serialize the coinbase transaction (using the incorrect definition of canonical, bug 2). And we didn't notice that the wrong definition of canonical encoding was being used because we accepted what we thought were non-canonically encoded heights. The relevant rules are here: `877212414a/src/script/script.h (L307-L346)` This commit changes the encoding to reject non-canonically encoded heights, and to match the correct encoding rules. We check that at least one non-canonically encoded height is correctly rejected using a new test vector. The database format increments because we saved a bunch of wrongly encoded blocks. This discrepancy was originally noticed by @teor2345, who pointed out that a previous version of the block 202 test vector (now preserved as "bad block 202") did not match the block from zcashd.	2020-12-01 10:14:44 +10:00
Henry de Valence	7c08c0c315	consensus: check Merkle roots As a side effect of computing Merkle roots, we build a list of transaction hashes. Instead of discarding these, add them to PreparedBlock and FinalizedBlock so that they can be reused rather than recomputed. This commit adds Merkle root validation to: 1. the block verifier; 2. the checkpoint verifier. In the first case, Bitcoin Merkle tree malleability has no effect, because only a single Merkle tree in each malleablity set is valid (the others have duplicate transactions). In the second case, we need to check that the Merkle tree does not contain any duplicate transactions. Closes #1385 Closes #906	2020-12-01 10:14:44 +10:00
Alfredo Garcia	4544463059	Inbound `FindBlocks` and `FindHeaders` (#1347 ) * implement inbound `FindBlocks` * Handle inbound peer FindHeaders requests * handle request before having any chain tip * Split `find_chain_hashes` into smaller functions Add a `max_len` argument to support `FindHeaders` requests. Rewrite the hash collection code to use heights, so we can handle the `stop` hash and "no intersection" cases correctly. * Split state height functions into "any chain" and "best chain" * Rename the best chain block method to `best_block` * Move fmt utilities to zebra_chain::fmt * Summarise Debug for some Message variants Co-authored-by: teor <teor@riseup.net> Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2020-12-01 07:30:37 +10:00
teor	d007c76488	Turn a chain length check into an assert	2020-12-01 07:27:30 +10:00
teor	d1ba1146d4	Add intra-doc links	2020-12-01 07:27:30 +10:00
teor	1e4ce74c93	Turn the relevant chain into a Vec before using it Some checks use the same blocks, so we take a copy of the block borrows before using them. That way, we don't have to manage the position of the iterator between checks.	2020-12-01 07:27:30 +10:00
teor	712dd9ddf3	Make a module `pub(crate)` rather than `pub`	2020-12-01 07:27:30 +10:00
teor	ec6ef93b7b	Simplify an ExpandedDifficulty division	2020-12-01 07:27:30 +10:00
teor	d64c2976e3	Rewrite iterator processing using unzip Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2020-12-01 07:27:30 +10:00
teor	91476535d3	Doc comment formatting Co-authored-by: Jane Lusby <jlusby42@gmail.com>	2020-12-01 07:27:30 +10:00
teor	678e6ad090	Implement difficulty_threshold_is_valid	2020-12-01 07:27:30 +10:00
teor	750f096a99	Implement testnet minimum difficulty	2020-12-01 07:27:30 +10:00
teor	bb9c4918bf	Implement threshold_bits	2020-12-01 07:27:30 +10:00
teor	f0a49d64bf	Split out a median_timespan function	2020-12-01 07:27:30 +10:00
teor	054d6f0525	Implement median_timespan_bounded	2020-12-01 07:27:30 +10:00
teor	75519b0ae9	Implement averaging_window_timespan	2020-12-01 07:27:30 +10:00
teor	bcabf75fe9	Replace integer lengths with named constants	2020-12-01 07:27:30 +10:00
teor	e07b0bc8da	Implement median_time And enough stubs to run it on real data.	2020-12-01 07:27:30 +10:00
teor	741c44cd55	Implement mean_target_difficulty And enough stub code to actually run it on the context.	2020-12-01 07:27:30 +10:00
teor	939c2b97a6	Implement AdjustedDifficulty creation Also: * call the difficulty check from `block_is_contextually_valid` * add a stub `difficulty_threshold_is_valid` function	2020-12-01 07:27:30 +10:00
teor	fa03b83351	Update some contextual validation comments and error messages	2020-12-01 07:27:30 +10:00
teor	1bf5ff07fb	Fix a state config comment	2020-11-30 15:57:46 -05:00
teor	176923a771	Add an info-level log when UTXO requests are pruned (#1396 ) And a debug-level log when no requests are pruned. I'm seeing some hangs during the initial sync, these logs might help identify the cause.	2020-11-26 17:26:10 +10:00
Deirdre Connolly	e11e8e1373	s/TRASPARENT/TRANSPARENT/g	2020-11-25 17:22:26 -05:00
teor	31eb0a5126	Avoid verbose default logs Temporary fix so that Zebra's default logs support a typical workflow: 1. Developer or user runs Zebra with the default config 2. They send the logs to a terminal 3. When they see a bug, they copy-paste the last few log lines into a bug report This is the same change that was merged in #1373 and reverted in #1375. We'll create a consistent logging design for Zebra in ticket #1381.	2020-11-25 10:55:15 -08:00
teor	b1bbb13978	Make debug_stop_at_height and ephemeral work together (#1339 ) * Make debug_stop_at_height and ephemeral work together * if `debug_stop_at_height` and `ephemeral` are set, delete the database files after reaching the stop height * drop or flush the database before `debug_stop_at_height` exits Zebra	2020-11-25 15:04:18 +10:00
Deirdre Connolly	2a21c86b91	I before E except after C (or uh, not-english)	2020-11-24 22:23:57 -05:00
Henry de Valence	2e0ed94b22	Revert "Downgrade a per-block log to debug level" This reverts commit `15d26e3c47`.	2020-11-24 14:39:45 -05:00
teor	15d26e3c47	Downgrade a per-block log to debug level	2020-11-24 10:56:57 -05:00
Henry de Valence	040e50b183	state: service::utxo -> service::pending_utxos	2020-11-23 22:18:43 -08:00
Henry de Valence	342eb166ff	state: track UTXO provenance This commit changes the state system and database format to track the provenance of UTXOs, in addition to the outputs themselves. Specifically, it tracks the following additional metadata: - the height at which the UTXO was created; - whether or not the UTXO was created from a coinbase transaction or not. This metadata will allow us to: - check the coinbase maturity consensus rule; - check the coinbase inputs => no transparent outputs rule; - implement lookup of transactions by utxo (using the height to find the block and then scanning the block) for a future RPC mechanism. Closes #1342	2020-11-23 22:18:43 -08:00
teor	00c52d28cd	Appease rustfmt	2020-11-23 14:16:39 +10:00
teor	acf6096103	Appease clippy stable	2020-11-23 14:16:39 +10:00
Henry de Valence	2a4a89c002	state,zebrad: tidy span levels for good INFO output This provides useful and not too noisy output at INFO level. We do an info-level message on every block commit instead of trying to do one message every N blocks, because this is useful both for initial block sync as well as continuous state updates on new blocks.	2020-11-23 14:16:39 +10:00
Henry de Valence	e0817d1747	state: introduce PreparedBlock, FinalizedBlock This change introduces two new types: - `PreparedBlock`, representing a block which has undergone semantic validation and has been prepared for contextual validation; - `FinalizedBlock`, representing a block which is ready to be finalized immediately; and changes the `Request::CommitBlock`,`Request::CommitFinalizedBlock` variants to use these types instead of their previous fields. This change solves the problem of passing data between semantic validation and contextual validation, and cleans up the state code by allowing it to pass around a bundle of data. Previously, the state code just passed around an `Arc<Block>`, which forced it to needlessly recompute block hashes and other data, and was incompatible with the already-known but not-yet-implemented data transfer requirements, namely passing in the Sprout and Sapling anchors computed during contextual validation. This commit propagates the `PreparedBlock` and `FinalizedBlock` types through the state code but only uses their data opportunistically, e.g., changing .hash() computations to use the precomputed hash. In the future, these structures can be extended to pass data through the verification pipeline for reuse as appropriate. For instance, these changes allow the sprout and sapling anchors to be propagated through the state.	2020-11-23 14:16:39 +10:00
Henry de Valence	3f78476693	state: check queued blocks for known UTXOs The behavior of a request for a UTXO from a previous block depends on whether that block has already been submitted to the state, or not: * if it has, the state should be able to find it and answer immediately. * if it has not, the state should see it in a later request. However, the previous code only checked committed blocks, not queued blocks, so if the block containing the UTXO had already arrived but had not been committed, it would never be scanned. This patch fixes the problem but is a bad solution, duplicating computation between the block verifier and the state. A better fix follows in the next commit.	2020-11-23 14:16:39 +10:00
Henry de Valence	719a48ad9e	state: shorten tracing messages Make tracing messages more concise by omitting information already contained in a parent span and by shortening messages. This makes them easier to read.	2020-11-23 14:16:39 +10:00
Henry de Valence	3192a5008d	state: add additional traces to block commit logic	2020-11-23 14:16:39 +10:00
Henry de Valence	36cd76d590	state: tidy process_queued tracing Previously, this function was instrumented with a span containing the parent hash that was the entry to the function. But it doesn't make sense to consider the work done by the function as happening in the context of the supplied parent hash (as distinct from the context of the hash of the newly arrived block, which is already contained in an outer span), so this adds noise without conveying extra context. Instead, use events that occur within the context of the existing spans.	2020-11-23 14:16:39 +10:00
Henry de Valence	f0810b028d	state,consensus,sync: shorten span lengths These changes help reduce the size of the resulting spans, making the output more compact. Together they save about 30-40 characters.	2020-11-23 14:16:39 +10:00
Henry de Valence	77b60f3a30	state: add traces for utxo scanning	2020-11-23 14:16:39 +10:00
Henry de Valence	7dfea510d5	state: remove state_trace span This turns out not to give much additional information when stacked with child spans.	2020-11-20 15:28:46 -08:00
Henry de Valence	bbd7a62b20	state: add service request count metrics These are all one metric, with the type as an attribute, so that we can display total requests, filter by a particular type, etc.	2020-11-20 17:38:21 -05:00
Henry de Valence	3bfe63e38f	state: add span to state service Here the span is added to the body of the `Service::call` implementation, not to the futures it returns, because the state service does all of the work synchronously in `call` rather than in the futures it returns. The service is skipped as a span field. We could either include or exclude the request itself. It would be useful, but the request body can be very large. Instead, we make two spans, one at info level and one at trace level, and filter that way.	2020-11-20 17:38:21 -05:00
Henry de Valence	add94c1c45	deps: move to tokio 0.3, tower 0.4 This change is mostly mechanical, with the exception of the changes to the `tower-batch` middleware. This middleware was adapted from `tower::buffer`, and the `tower::buffer` code was changed to implement its own bounded queue, because Tokio 0.3 removed the `mpsc::Sender::poll_send` method. See `ddc64e8d4d` for more context on the Tower changes. To match Tower as closely as possible in order to be able to upstream `tower-batch`, those changes are copied from `tower::Buffer` to `tower-batch`.	2020-11-20 10:08:16 -08:00
teor	ec00ee4cf0	Stop using /dev/shm on Linux (#1338 ) Some systems have a very small /dev/shm, for example, see: https://github.com/docker-library/postgres/issues/416 So we should just use the temporary directory on all operating systems. Also: * use TempDir to generate the temporary path * delete the code that we copied from sled * prefix the temporary path with the state version and network	2020-11-20 13:01:19 +10:00
Jane Lusby	4c9bb87df2	zebra-state: replace sled with rocksdb (#1325 ) ## Motivation Prior to this PR we've been using `sled` as our database for storing persistent chain data on the disk between boots. We picked sled over rocksdb to minimize our c++ dependencies despite it being a less mature codebase. The theory was if it worked well enough we'd prefer to have a pure rust codebase, but if we ever ran into problems we knew we could easily swap it out with rocksdb. Well, we ran into problems. Sled's memory usage was particularly high, and it seemed to be leaking memory. On top of all that, the performance for writes was pretty poor, causing us to become bottle-necked on sled instead of the network. ## Solution This PR replaces `sled` with `rocksdb`. We've seen a 10x improvement in memory usage out of the box, no more leaking, and much better write performance. With this change writing chain data to disk is no longer a limiting factor in how quickly we can sync the chain. The code in this pull request has: - [x] Documentation Comments - [x] Unit Tests and Property Tests ## Review @hdevalence	2020-11-18 18:05:06 -08:00
Jane Lusby	65a605520f	remove references to sled from service.rs	2020-11-18 15:09:43 -05:00
Jane Lusby	5a6a9fd51e	remove some references to sled in serialization definition module	2020-11-18 15:09:43 -05:00

1 2 3 4

199 Commits