tendermint/docs/specification/new-spec/reactors/pex/pex.md

124 lines
5.3 KiB
Markdown
Raw Normal View History

2018-01-01 12:59:53 -08:00
# Peer Strategy and Exchange
Here we outline the design of the AddressBook
and how it used by the Peer Exchange Reactor (PEX) to ensure we are connected
to good peers and to gossip peers to others.
## Peer Types
Certain peers are special in that they are specified by the user as `persistent`,
2018-03-20 13:41:08 -07:00
which means we auto-redial them if the connection fails, or if we fail to dial
them.
2018-01-19 14:10:08 -08:00
Some peers can be marked as `private`, which means
we will not put them in the address book or gossip them to others.
2018-01-01 12:59:53 -08:00
2018-01-19 14:10:08 -08:00
All peers except private peers are tracked using the address book.
2018-01-01 12:59:53 -08:00
## Discovery
Peer discovery begins with a list of seeds.
When we have no peers, or have been unable to find enough peers from existing ones,
we dial a randomly selected seed to get a list of peers to dial.
2018-03-20 13:41:08 -07:00
On startup, we will also immediately dial the given list of `persisten_peers`,
2018-04-28 10:10:27 -07:00
and will attempt to maintain persistent connections with them. If the connections die, or we fail to dail,
2018-03-20 13:41:08 -07:00
we will redial every 5s for a a few minutes, then switch to an exponential backoff schedule,
and after about a day of trying, stop dialing the peer.
2018-04-28 10:10:27 -07:00
So long as we have less than `MinNumOutboundPeers`, we periodically request additional peers
2018-01-01 12:59:53 -08:00
from each of our own. If sufficient time goes by and we still can't find enough peers,
we try the seeds again.
2018-04-28 10:10:27 -07:00
## Listening
2018-04-28 14:14:58 -07:00
Peers listen on a configurable ListenAddr that they self-report in their
NodeInfo during handshakes with other peers. Peers accept up to (MaxNumPeers -
MinNumOutboundPeers) incoming peers.
2018-04-28 10:10:27 -07:00
2018-01-01 12:59:53 -08:00
## Address Book
Peers are tracked via their ID (their PubKey.Address()).
Peers are added to the address book from the PEX when they first connect to us or
when we hear about them from other peers.
The address book is arranged in sets of buckets, and distinguishes between
2018-01-19 14:10:08 -08:00
vetted (old) and unvetted (new) peers. It keeps different sets of buckets for vetted and
2018-04-28 14:27:51 -07:00
unvetted peers. Buckets provide randomization over peer selection. Peers are put
in buckets according to their IP groups.
2018-01-01 12:59:53 -08:00
2018-04-28 14:27:51 -07:00
A vetted peer can only be in one bucket. An unvetted peer can be in multiple buckets, and
each instance of the peer can have a different IP:PORT.
2018-01-01 12:59:53 -08:00
2018-04-28 14:27:51 -07:00
If we're trying to add a new peer but there's no space in its bucket, we'll
remove the worst peer from that bucket to make room.
2018-04-28 10:10:27 -07:00
2018-01-01 12:59:53 -08:00
## Vetting
When a peer is first added, it is unvetted.
Marking a peer as vetted is outside the scope of the `p2p` package.
For Tendermint, a Peer becomes vetted once it has contributed sufficiently
at the consensus layer; ie. once it has sent us valid and not-yet-known
votes and/or block parts for `NumBlocksForVetted` blocks.
Other users of the p2p package can determine their own conditions for when a peer is marked vetted.
If a peer becomes vetted but there are already too many vetted peers,
a randomly selected one of the vetted peers becomes unvetted.
If a peer becomes unvetted (either a new peer, or one that was previously vetted),
a randomly selected one of the unvetted peers is removed from the address book.
More fine-grained tracking of peer behaviour can be done using
2018-01-19 14:10:08 -08:00
a trust metric (see below), but it's best to start with something simple.
2018-01-01 12:59:53 -08:00
## Select Peers to Dial
When we need more peers, we pick them randomly from the addrbook with some
configurable bias for unvetted peers. The bias should be lower when we have fewer peers
2018-01-01 12:59:53 -08:00
and can increase as we obtain more, ensuring that our first peers are more trustworthy,
but always giving us the chance to discover new good peers.
We track the last time we dialed a peer and the number of unsuccessful attempts
we've made. If too many attempts are made, we mark the peer as bad.
Connection attempts are made with exponential backoff (plus jitter). Because
the selection process happens every `ensurePeersPeriod`, we might not end up
dialing a peer for much longer than the backoff duration.
2018-04-28 12:01:33 -07:00
If we fail to conenct to the peer after 16 tries (with exponential backoff), we remove from address book completely.
2018-03-20 13:41:08 -07:00
2018-01-01 12:59:53 -08:00
## Select Peers to Exchange
When were asked for peers, we select them as follows:
- select at most `maxGetSelection` peers
- try to select at least `minGetSelection` peers - if we have less than that, select them all.
- select a random, unbiased `getSelectionPercent` of the peers
Send the selected peers. Note we select peers for sending without bias for vetted/unvetted.
## Preventing Spam
There are various cases where we decide a peer has misbehaved and we disconnect from them.
When this happens, the peer is removed from the address book and black listed for
some amount of time. We call this "Disconnect and Mark".
2018-01-19 14:10:08 -08:00
Note that the bad behaviour may be detected outside the PEX reactor itself
2018-01-01 12:59:53 -08:00
(for instance, in the mconnection, or another reactor), but it must be communicated to the PEX reactor
so it can remove and mark the peer.
2018-03-20 13:41:08 -07:00
In the PEX, if a peer sends us an unsolicited list of peers,
or if the peer sends a request too soon after another one,
we Disconnect and MarkBad.
2018-01-01 12:59:53 -08:00
## Trust Metric
The quality of peers can be tracked in more fine-grained detail using a
2018-01-19 14:10:08 -08:00
Proportional-Integral-Derivative (PID) controller that incorporates
2018-01-01 12:59:53 -08:00
current, past, and rate-of-change data to inform peer quality.
While a PID trust metric has been implemented, it remains for future work
to use it in the PEX.
2018-01-19 14:10:08 -08:00
See the [trustmetric](../../../architecture/adr-006-trust-metric.md )
and [trustmetric useage](../../../architecture/adr-007-trust-metric-usage.md )
architecture docs for more details.