
# Ledger Replication

At full capacity on a 1 Gbps network, Solana will generate 4 petabytes of data per year. To prevent the network from centralizing around validators that have to store the full data set, this protocol proposes a way for mining nodes to provide storage capacity for pieces of the data.

The basic idea of Proof of Replication is to encrypt a dataset with a public symmetric key using CBC encryption, then hash the encrypted dataset. The main problem with the naive approach is that a dishonest storage node can stream the encryption and delete the data as it is hashed. The simple solution is to periodically regenerate the hash based on a signed PoH value. This ensures that all the data is present during the generation of the proof, but it also requires validators to have the entirety of the encrypted data present to verify every proof of every identity. So the space required to validate is `number_of_proofs * data_size`.

## Optimization with PoH

Our improvement on this approach is to randomly sample the encrypted segments faster than it takes to encrypt them, and record the hash of those samples into the PoH ledger. Thus the segments stay in the exact same order for every PoRep, and verification can stream the data and verify all the proofs in a single batch. This way we can verify multiple proofs concurrently, each one on its own CUDA core. The total space required for verification is `1_ledger_segment + 2_cbc_blocks * number_of_identities`, with core count equal to `number_of_identities`. We use a 64-byte ChaCha CBC block size.
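To make the difference between the two space formulas concrete, here is a small arithmetic sketch. The segment size and identity count are hypothetical values picked only for illustration, and the naive case assumes one proof per identity; only the 64-byte block size comes from the text.

```python
CBC_BLOCK_BYTES = 64  # block size stated in the text


def naive_space(number_of_proofs: int, data_size: int) -> int:
    """Naive approach: every proof needs its own encrypted copy of the data."""
    return number_of_proofs * data_size


def batched_space(segment_bytes: int, number_of_identities: int) -> int:
    """PoH approach: one streamed segment plus 2 CBC blocks per identity."""
    return segment_bytes + 2 * CBC_BLOCK_BYTES * number_of_identities


# Hypothetical sizes, chosen only to make the arithmetic visible.
segment = 1 << 30   # a 1 GiB segment (assumption)
identities = 1000   # one proof per identity (assumption)

print(naive_space(identities, segment))    # 1073741824000
print(batched_space(segment, identities))  # 1073869824
```

Under these assumed numbers the batched approach needs roughly one segment of space, while the naive approach needs one segment per proof.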

## Network

Validators for PoRep are the same validators that are verifying transactions. If an archiver can prove that a validator verified a fake PoRep, then the validator will not receive a reward for that storage epoch.

Archivers are specialized _light clients_. They download a part of the ledger \(a.k.a. Segment\), store it, and provide PoReps of storing the ledger. For each verified PoRep, archivers earn a reward of SOL from the mining pool.

## Constraints

We have the following constraints:

* Verification requires generating the CBC blocks. That requires space of 2
  blocks per identity, and 1 CUDA core per identity for the same dataset. So as
  many identities as possible should be batched at once, with as many proofs for
  those identities as possible verified concurrently for the same dataset.
* Validators will randomly sample the set of storage proofs down to the set that
  they can handle, and only the creators of those chosen proofs will be
  rewarded. The validator can run a benchmark whenever its hardware configuration
  changes to determine the rate at which it can validate storage proofs.
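The second constraint can be sketched as a seeded, capacity-bounded sample. This is only an illustration: the function name, the string proof identifiers, and the use of SHA-256 plus Python's `random.Random` to turn a signature into a seed are all assumptions, not the protocol's actual selection routine. The `capacity` argument stands for whatever rate the validator's benchmark produced.

```python
import hashlib
import random


def pick_proofs_to_verify(signature: bytes, proof_ids: list[str], capacity: int) -> list[str]:
    """Deterministically sample at most `capacity` proofs, seeded by a signature."""
    # Assumption: hash the signature to get an integer seed for a stand-in RNG.
    seed = int.from_bytes(hashlib.sha256(signature).digest(), "big")
    rng = random.Random(seed)
    # Sort first so the result does not depend on the order proofs were seen in.
    ordered = sorted(proof_ids)
    return rng.sample(ordered, min(capacity, len(ordered)))


signature = b"validator signature over the turn-boundary PoH value"  # placeholder bytes
proofs = [f"proof-{i}" for i in range(10)]
chosen = pick_proofs_to_verify(signature, proofs, capacity=3)
print(chosen)
```

Because the seed is derived from the signature, anyone who knows the signature and the proof set can recompute which proofs were chosen.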

## Validation and Replication Protocol

### Constants

1. SLOTS\_PER\_SEGMENT: Number of slots in a segment of ledger data. The
   unit of storage for an archiver.
2. NUM\_KEY\_ROTATION\_SEGMENTS: Number of segments after which archivers
   regenerate their encryption keys and select a new dataset to store.
3. NUM\_STORAGE\_PROOFS: Number of storage proofs required for a storage proof
   claim to be successfully rewarded.
4. RATIO\_OF\_FAKE\_PROOFS: Ratio of fake proofs to real proofs that a storage
   mining proof claim has to contain to be valid for a reward.
5. NUM\_STORAGE\_SAMPLES: Number of samples required for a storage mining
   proof.
6. NUM\_CHACHA\_ROUNDS: Number of encryption rounds performed to generate
   encrypted state.
7. NUM\_SLOTS\_PER\_TURN: Number of slots that define a single storage epoch or
   a "turn" of the PoRep game.
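As a rough illustration of how two of these constants relate slots to segments and turns, the sketch below assumes that both are indexed by plain integer division of the slot number. The class name, the method names, and the numeric values are hypothetical; the text does not specify them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StorageConstants:
    slots_per_segment: int   # SLOTS_PER_SEGMENT
    num_slots_per_turn: int  # NUM_SLOTS_PER_TURN

    def segment_of(self, slot: int) -> int:
        """Index of the segment that contains `slot` (assumed: integer division)."""
        return slot // self.slots_per_segment

    def turn_of(self, slot: int) -> int:
        """Index of the storage epoch/turn that contains `slot`."""
        return slot // self.num_slots_per_turn

    def is_turn_boundary(self, slot: int) -> bool:
        """True when `slot` is the first slot of a turn."""
        return slot % self.num_slots_per_turn == 0


# Placeholder values for illustration only.
constants = StorageConstants(slots_per_segment=1024, num_slots_per_turn=2048)
print(constants.segment_of(2047), constants.turn_of(4096), constants.is_turn_boundary(4096))
```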

### Validator behavior

1. Validators join the network and begin looking for archiver accounts at each
   storage epoch/turn boundary.
2. Every turn, validators sign the PoH value at the boundary and use that signature
   to randomly pick proofs to verify from each storage account found at the turn boundary.
   This signed value is also submitted to the validator's storage account and will be used by
   archivers at a later stage to cross-verify.
3. Every `NUM_SLOTS_PER_TURN` slots the validator advertises the PoH value. This value
   is also served to archivers via RPC interfaces.
4. For a given turn N, all validations are locked out until turn N+3 \(a gap of 2 turns/epochs\),
   at which point all validations made during turn N are available for reward collection.
5. Any incorrect validations will be marked during the turn in between.

### Archiver behavior

1. Since an archiver is somewhat of a light client and does not download all the
   ledger data, it has to rely on validators and other archivers for information.
   Any given validator may be malicious and give incorrect information, although
   there are no obvious attack vectors that this could enable besides making the
   archiver do extra wasted work. For many of the operations there are a number of options,
   depending on how paranoid an archiver is:
   * \(a\) archiver can ask a validator
   * \(b\) archiver can ask multiple validators
   * \(c\) archiver can ask other archivers
   * \(d\) archiver can subscribe to the full transaction stream and generate
     the information itself \(assuming the slot is recent enough\)
   * \(e\) archiver can subscribe to an abbreviated transaction stream to
     generate the information itself \(assuming the slot is recent enough\)

2. An archiver obtains the PoH hash corresponding to the last turn, along with its slot.
3. The archiver signs the PoH hash with its keypair. That signature is the
   seed used to pick the segment to replicate and also the encryption key. The
   archiver mods the signature with the slot to determine which segment to
   replicate.
4. The archiver retrieves the ledger by asking peer validators and
   archivers. See 6.5.

5. The archiver then encrypts that segment with the key, using the ChaCha algorithm
   in CBC mode with `NUM_CHACHA_ROUNDS` of encryption.
6. The archiver initializes a ChaCha RNG with a signed recent PoH value as
   the seed.
7. The archiver generates `NUM_STORAGE_SAMPLES` samples in the range of the
   entry size and samples the encrypted segment with SHA-256 for 32 bytes at each
   offset value. Sampling the state should be faster than generating the encrypted
   segment.
8. The archiver sends a PoRep proof transaction, which contains its SHA state
   at the end of the sampling operation, its seed, and the samples it used, to the
   current leader, and it is put onto the ledger.

9. During a given turn the archiver should submit many proofs for the same segment,
   and based on the `RATIO_OF_FAKE_PROOFS` some of those proofs must be fake.
10. As the PoRep game enters the next turn, the archiver must submit a
    transaction with the mask of which proofs were fake during the last turn. This
    transaction will define the rewards for both archivers and validators.
11. Finally, for a turn N, as the PoRep game enters turn N + 3, the archiver's proofs for
    turn N will be counted towards its rewards.
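The segment selection and sampling steps above (3, 6, 7 and 8) can be sketched roughly as follows. This is a simplified illustration with several loud assumptions: HMAC-SHA256 stands in for the keypair signature, Python's `random.Random` stands in for the ChaCha RNG, "mods the signature with the slot" is read as reducing the signature modulo the number of complete segments at that slot, and the encryption step is skipped by using placeholder bytes. None of the function names come from the source.

```python
import hashlib
import hmac
import random

SAMPLE_BYTES = 32  # each sample covers 32 bytes, per step 7


def sign(secret_key: bytes, message: bytes) -> bytes:
    """Stand-in for a keypair signature (HMAC-SHA256; an assumption, not the real scheme)."""
    return hmac.new(secret_key, message, hashlib.sha256).digest()


def pick_segment(signature: bytes, slot: int, slots_per_segment: int) -> int:
    """Step 3, assumed reading: signature modulo the number of complete segments."""
    num_segments = slot // slots_per_segment
    if num_segments == 0:
        raise ValueError("no complete segment exists at this slot yet")
    return int.from_bytes(signature, "big") % num_segments


def sample_proof(encrypted_segment: bytes, seed: bytes, num_samples: int):
    """Steps 6-8: seeded offsets, then one running SHA-256 over the 32-byte samples."""
    if len(encrypted_segment) < SAMPLE_BYTES:
        raise ValueError("segment is shorter than one sample")
    rng = random.Random(seed)  # stand-in for the ChaCha RNG named in step 6
    last_offset = len(encrypted_segment) - SAMPLE_BYTES
    offsets = [rng.randrange(last_offset + 1) for _ in range(num_samples)]
    hasher = hashlib.sha256()
    for offset in offsets:
        hasher.update(encrypted_segment[offset:offset + SAMPLE_BYTES])
    return offsets, hasher.digest()


# Placeholder inputs for illustration only.
poh_hash = hashlib.sha256(b"poh value at the turn boundary").digest()
signature = sign(b"archiver secret key", poh_hash)
segment = pick_segment(signature, slot=10_000, slots_per_segment=1024)
encrypted = bytes(range(256)) * 16  # stands in for the encrypted segment
offsets, sha_state = sample_proof(encrypted, signature, num_samples=4)
print(segment, offsets, sha_state.hex())
```

A verifier holding the same encrypted segment and seed can rerun `sample_proof` and compare the resulting digest, which is why sampling has to be cheap relative to encryption.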

### The PoRep Game

The Proof of Replication game has 4 primary stages. For each "turn" multiple PoRep games can be in progress but each in a different stage.

The 4 stages of the PoRep Game are as follows:

1. Proof submission stage
   * Archivers: submit as many proofs as possible during this stage
   * Validators: No-op
2. Proof verification stage
   * Archivers: No-op
   * Validators: Select archivers and verify their proofs from the previous turn
3. Proof challenge stage
   * Archivers: Submit the proof mask with justifications \(for fake proofs submitted 2 turns ago\)
   * Validators: No-op
4. Reward collection stage
   * Archivers: Collect rewards for 3 turns ago
   * Validators: Collect rewards for 3 turns ago

For each turn of the PoRep game, both Validators and Archivers evaluate each stage. The stages are run as separate transactions on the storage program.
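One way to picture the overlapping games is as a pipeline in which the game that started at turn T advances one stage per turn. The sketch below assumes exactly that mapping (age in turns equals stage index), which matches the "previous turn", "2 turns ago" and "3 turns ago" wording above. The function names are illustrative.

```python
STAGES = (
    "proof submission",
    "proof verification",
    "proof challenge",
    "reward collection",
)


def stage_of(game_turn: int, current_turn: int):
    """Stage of the game that started at `game_turn`, or None if it is not active."""
    age = current_turn - game_turn
    if 0 <= age < len(STAGES):
        return STAGES[age]
    return None


def active_games(current_turn: int) -> dict[int, str]:
    """Map each in-progress game's starting turn to its stage at `current_turn`."""
    return {
        current_turn - age: STAGES[age]
        for age in range(len(STAGES))
        if current_turn - age >= 0
    }


print(active_games(5))
# {5: 'proof submission', 4: 'proof verification', 3: 'proof challenge', 2: 'reward collection'}
```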

### Finding who has a given block of ledger

1. Validators monitor the turns in the PoRep game and look at the rooted bank
   at turn boundaries for any proofs.
2. Validators maintain a map of ledger segments and corresponding archiver public keys.
   The map is updated when a validator processes an archiver's proofs for a segment.
   The validator provides an RPC interface to access this map. Using this API, clients
   can map a segment to an archiver's network address \(correlating it via the cluster\_info table\).
   The clients can then send repair requests to the archiver to retrieve segments.
3. Validators would need to invalidate this list every N turns.
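The map described in these steps could look something like the sketch below. The class and method names are hypothetical, public keys are represented as plain strings, and "invalidate every N turns" is assumed to mean clearing the whole map. The RPC layer and the cluster\_info lookup are left out.

```python
from collections import defaultdict


class SegmentMap:
    """Tracks which archiver public keys have proven storage of which segment."""

    def __init__(self, invalidate_every: int):
        self.invalidate_every = invalidate_every  # the "N" in "every N turns"
        self._archivers = defaultdict(set)
        self._last_reset_turn = 0

    def record_proof(self, segment: int, archiver_pubkey: str) -> None:
        """Called when the validator processes an archiver's proof for a segment."""
        self._archivers[segment].add(archiver_pubkey)

    def archivers_for(self, segment: int) -> set[str]:
        """What an RPC lookup might return: a copy of the known archivers."""
        return set(self._archivers.get(segment, ()))

    def on_turn(self, turn: int) -> None:
        """Drop all entries once `invalidate_every` turns have passed (assumed policy)."""
        if turn - self._last_reset_turn >= self.invalidate_every:
            self._archivers.clear()
            self._last_reset_turn = turn


segment_map = SegmentMap(invalidate_every=4)
segment_map.record_proof(7, "archiver-pubkey-A")
segment_map.record_proof(7, "archiver-pubkey-B")
print(sorted(segment_map.archivers_for(7)))
```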

## Sybil attacks

For any random seed, we force everyone to use a signature that is derived from a PoH hash at the turn boundary. Everyone uses the same count, so the same PoH hash is signed by every participant. The signatures are then each cryptographically tied to the keypair, which prevents a leader from grinding on the resulting value for more than 1 identity.

Since there are many more client identities than encryption identities, we need to split the reward among multiple clients, and prevent Sybil attacks from generating many clients to acquire the same block of data. To remain BFT we want to prevent a single human entity from storing all the replications of a single chunk of the ledger.

Our solution to this is to force the clients to continue using the same identity. If the first round is used to acquire the same block for many client identities, the second round for the same client identities will force a redistribution of the signatures, and therefore PoRep identities and blocks. Thus, to get a reward, archivers need to store the first block for free, and the network can reward long-lived client identities more than new ones.

## Validator attacks

* If a validator approves fake proofs, an archiver can easily out it by
  showing the initial state for the hash.
* If a validator marks real proofs as fake, no on-chain computation can be done
  to distinguish who is correct. Rewards would have to rely on the results from
  multiple validators to catch bad actors and to keep archivers from being denied rewards.
* A validator stealing mining proof results for itself. The proofs are derived
  from a signature from an archiver. Since the validator does not know the
  private key used to generate the encryption key, it cannot be the generator of
  the proof.

## Reward incentives

Fake proofs are easy to generate but difficult to verify. For this reason, PoRep proof transactions generated by archivers may require a higher fee than a normal transaction to represent the computational cost required by validators.

Some percentage of fake proofs is also necessary to receive a reward from storage mining.
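A hedged sketch of how a claim might be checked against `NUM_STORAGE_PROOFS` and `RATIO_OF_FAKE_PROOFS` is shown below. The text does not say whether the ratio is a minimum or an exact requirement, or whether the proof count includes fake proofs. This example assumes a minimum ratio and a count over all submitted proofs. The function name and the mask representation are illustrative.

```python
from fractions import Fraction


def claim_is_rewardable(
    fake_mask: list[bool],
    num_storage_proofs: int,
    ratio_of_fake_proofs: Fraction,
) -> bool:
    """True if the claim has enough proofs and enough fakes relative to real proofs."""
    fake = sum(fake_mask)          # True marks a proof the archiver declared fake
    real = len(fake_mask) - fake
    if len(fake_mask) < num_storage_proofs or real == 0:
        return False
    # Assumption: the configured ratio is a lower bound on fake/real.
    return Fraction(fake, real) >= ratio_of_fake_proofs


mask = [True, False, False, True, False, False]  # 2 fake, 4 real
print(claim_is_rewardable(mask, num_storage_proofs=6, ratio_of_fake_proofs=Fraction(1, 2)))
# True
```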

## Notes

* We can reduce the cost of verifying PoRep by using PoH, and actually
  make it feasible to verify a large number of proofs for a global dataset.
* We can eliminate grinding by forcing everyone to sign the same PoH hash and
  use the signatures as the seed.
* The game between validators and archivers is over random blocks, random
  encryption identities, and random data samples. The goal of randomization is
  to prevent colluding groups from having overlap on data or validation.
* Archiver clients fish for lazy validators by submitting fake proofs that
  they can prove are fake.
* To defend against Sybil client identities that try to store the same block, we
  force the clients to store for multiple rounds before receiving a reward.
* Validators should also get rewarded for validating submitted storage proofs
  as an incentive for storing the ledger. They can only validate proofs if they
  are storing that slice of the ledger.