\setlength{\oddsidemargin}{-0.25in} % Left margin of 1 in - 0.25 in = 0.75 in
\setlength{\textwidth}{7in} % Right margin of 8.5 in - 0.75 in - 7 in = 0.75 in
\setlength{\topmargin}{-.75in} % Header starts 1 in - 0.75 in = 0.25 in below the top edge
\setlength{\textheight}{9.2in} % Text body is 9.2 in tall; the lower margin also depends on the header height and separation
\newcommand{\doctitle}{Zcash Protocol Specification}
\newcommand{\docversion}{Version 2.0-alpha-1}
\newcommand{\authors}{Sean Bowe | Daira Hopwood | Taylor Hornby | Nathan Wilcox}
pdfborderstyle={/S/U/W 0.7},
Title={\doctitle, \docversion},
\newcommand{\crossref}[1]{\autoref{#1} \emph{`\nameref*{#1}\kern -0.1em'} on p.\,\pageref*{#1}}
\newcommand{\eli}[1]{{\color{JungleGreen}\sf{Eli: #1}}}
\newcommand{\sean}[1]{{\color{blue}\sf{Sean: #1}}}
\newcommand{\taylor}[1]{{\color{red}\sf{Taylor: #1}}}
\newcommand{\daira}[1]{{\color{RedOrange}\sf{Daira: #1}}}
\newcommand{\nathan}[1]{{\color{ForestGreen}\sf{Nathan: #1}}}
\newcommand{\todo}[1]{{\color{Sepia}\sf{TODO: #1}}}
\newcommand{\MUSTNOT}{\conformance{MUST NOT}}
\newcommand{\SHOULDNOT}{\conformance{SHOULD NOT}}
\newcommand{\noteCommitment}{\term{note commitment}}
\newcommand{\noteCommitments}{\term{note commitments}}
\newcommand{\NoteCommitment}{\titleterm{Note Commitment}}
\newcommand{\NoteCommitments}{\titleterm{Note Commitments}}
\newcommand{\noteCommitmentTree}{\term{note commitment tree}}
\newcommand{\joinSplitDescription}{\term{JoinSplit description}}
\newcommand{\joinSplitDescriptions}{\term{JoinSplit descriptions}}
\newcommand{\sequenceOfJoinSplitDescriptions}{\changed{sequence of} \joinSplitDescription\changed{\term{s}}\xspace}
\newcommand{\joinSplitTransfer}{\term{JoinSplit operation}}
\newcommand{\joinSplitTransfers}{\term{JoinSplit operations}}
\newcommand{\JoinSplitTransfer}{\titleterm{JoinSplit Operation}}
\newcommand{\JoinSplitTransfers}{\titleterm{JoinSplit Operations}}
\newcommand{\joinSplitSignature}{\term{JoinSplit signature}}
\newcommand{\fullnode}{\term{full node}}
\newcommand{\fullnodes}{\term{full nodes}}
\newcommand{\blockchainview}{\term{blockchain view}}
\newcommand{\nullifierSet}{\term{nullifier set}}
\newcommand{\NullifierSet}{\titleterm{Nullifier Set}}
% Daira: This doesn't adequately distinguish between zk stuff and transparent stuff
\newcommand{\paymentAddress}{\term{payment address}}
\newcommand{\paymentAddresses}{\term{payment addresses}}
\newcommand{\viewingKey}{\term{viewing key}}
\newcommand{\viewingKeys}{\term{viewing keys}}
\newcommand{\spendingKey}{\term{spending key}}
\newcommand{\spendingKeys}{\term{spending keys}}
\newcommand{\keyTuple}{\term{key tuple}}
\newcommand{\notePlaintext}{\term{note plaintext}}
\newcommand{\notePlaintexts}{\term{note plaintexts}}
\newcommand{\NotePlaintexts}{\titleterm{Note Plaintexts}}
\newcommand{\notesCiphertext}{\term{transmitted notes ciphertext}}
\newcommand{\discloseKey}{\term{disclosure key}}
\newcommand{\incrementalMerkleTree}{\term{incremental Merkle tree}}
\newcommand{\memo}{\term{memo field}}
\newcommand{\Memos}{\titleterm{Memo Fields}}
\newcommand{\dontcare}{\kern -0.06em\raisebox{0.1ex}{\footnotesize{$\times$}}}
\newcommand{\SHAName}{\term{SHA-256 compression}}
\newcommand{\sighashType}{\term{SIGHASH type}}
\newcommand{\sighashTypes}{\term{SIGHASH types}}
\newcommand{\JoinSplitCircuit}{\term{\texttt{JoinSplit} circuit}}
\newcommand{\COMMtrapdoor}{\term{\textsf{COMM} trapdoor}}
\title{\doctitle \\
\Large \docversion}
\eli{A few general comments that will re-appear numerous times:
\item The ``intended usage'' explanations are confusing: what happens if some player does not play as intended? Suggest using standard crypto terminology: the algorithms specified in the paper refer to the \emph{honest} player(s). Any deviation is treated as non-honest. Some
deviations harm only their perpetrator, and we should state this;
other deviations threaten other honest players,
and we should explain how these are mitigated.
\item Suggest having two separate parts: (i) an abstract functionality, stated in terms
of crypto primitives with a parameterized security parameter, and (ii)
concrete instantiations. I'll call these ``abstract'' and ``instantiation'' later on.
\Zcash is an implementation of the \term{Decentralized Anonymous Payment}
scheme \Zerocash \cite{ZerocashOakland} with some adjustments to terminology,
functionality and performance. It bridges the existing \emph{transparent}
payment scheme used by \Bitcoin with a \emph{confidential} payment scheme
protected by zero-knowledge succinct non-interactive arguments of knowledge.
Changes from the original \Zerocash are highlighted in \changed{\changedcolor}.
\eli{Should this section appear in this paper? This paper describes \emph{our implementation} so what does divergence mean? Perhaps this refers to non-honest players?}
\Zcash security depends on consensus. Should your program diverge from
consensus, its security is weakened or destroyed. The cause of the divergence
doesn't matter: it could be a bug in your program, it could be an error in
this documentation which you implemented as described, or it could be that you do
everything right but other software on the network behaves unexpectedly. The
specific cause will not matter to the users of your software whose wealth is lost.
\eli{I don't understand what intended behavior means.
This paper specifies how an \emph{honest player} should act, and whenever
there's deviation we should have something that mitigates it.}
Having said that, a specification of \emph{intended} behaviour is essential
for security analysis, understanding of the protocol, and maintenance of
Zcash Core and related software. If you find any mistake in this specification,
please contact \texttt{<>}. While the production \Zcash network
has yet to be launched, please feel free to do so in public even if you believe
the mistake may indicate a security weakness.
\subsection{Integers, Bit Sequences, and Endianness}
\eli{Suggest this subsection be moved to ``instantiation''}
All integers in \emph{\Zcash-specific} encodings are unsigned, have a fixed
bit length, and are encoded in little-endian byte order. \changed{The definition of
the encryption scheme based on $\SymSpecific$ \cite{rfc7539} in \crossref{inband}
uses length fields encoded as little-endian. Also, Curve25519 public and
private keys are defined as byte sequences, which are converted from integers
using little-endian encoding.}
The notation $\hexint{}$ followed by a string of \textbf{boldface} hexadecimal
digits represents the corresponding integer converted from hexadecimal.
The notation $\ascii{...}$ denotes the given string encoded as a
sequence of bytes in US-ASCII. For example, $\ascii{abc}$ represents the
byte sequence $[\hexint{61}, \hexint{62}, \hexint{63}]$.
In bit layout diagrams, each box of the diagram represents a sequence of bits.
The bit length is given explicitly in each box, except for the case of a single
bit, or for the notation $\zeros{n}$ which represents the sequence of $n$ zero bits.
The entire diagram represents the sequence of \emph{bytes} formed by first
concatenating these bit sequences, and then treating each subsequence of 8 bits
as a byte with the bits ordered from \emph{most significant} to
\emph{least significant}. Thus the \emph{most significant} bit in each byte
is toward the left of a diagram. Where bit fields are used, the text will
clarify their position in each case.
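
As a small illustration of this rule (a sketch that is not part of the original
text; the field values are chosen here purely as an example), consider a diagram
with a 4-bit field $\mathtt{0100}$ followed by a 12-bit field $\hexint{123}$.
Concatenating the fields and cutting the result into groups of 8 bits, most
significant bit first, gives
\[
\underbrace{\mathtt{0100}}_{\mbox{\scriptsize 4 bit}}\;
\underbrace{\mathtt{0001\,0010\,0011}}_{\mbox{\scriptsize 12 bit}}
\;=\; \mathtt{01000001}\;\;\mathtt{00100011}
\]
so under this reading the diagram represents the byte sequence
$[\hexint{41}, \hexint{23}]$.
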
\todo{Update example for big-bit-endian order.}
\bitbox{1}{0} &
\bitbox{1}{1} &
\bitbox{1}{0} &
\bitbox{1}{0} &
\bitbox{16}{16 bit $\hexint{ABCD}$} &
\bitbox{12}{12 bit $\hexint{123}$} &
\bitbox{4}{4 bit $\hexint{2}$} &
\bitbox{4}{4 bit $\hexint{D}$} &
\bitbox{4}{4 bit $\hexint{C}$} &
\bitbox{4}{4 bit $\hexint{B}$} &
\bitbox{4}{4 bit $\hexint{A}$} &
\bitbox{4}{4 bit $\hexint{3}$} &
\bitbox{4}{4 bit $\hexint{2}$} &
\bitbox{4}{4 bit $\hexint{1}$} &
\bitbox{8}{8 bit $\hexint{D2}$} &
\bitbox{8}{8 bit $\hexint{BC}$} &
\bitbox{8}{8 bit $\hexint{3A}$} &
\bitbox{8}{8 bit $\hexint{12}$} &
For example, the following diagrams are all equivalent:
\item[] $\Justthebox{\exampleabox}{-1.3ex}$
\item[] $\Justthebox{\examplebbox}{-1.3ex}$
\item[] $\Justthebox{\examplecbox}{-1.3ex}$
and represent the byte sequence $[\hexint{D2}, \hexint{BC}, \hexint{3A}, \hexint{12}]$.
$\LeadingBytes{k}(x)$, where $k$ is an integer, returns the leading (initial)
$k$ bytes of $x$.
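
For instance, reading the definition literally and reusing the US-ASCII example
given earlier (this illustration is an addition, not part of the original text):
\[
\LeadingBytes{2}(\ascii{abc}) = [\hexint{61}, \hexint{62}].
\]
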
The notation $\allN{}$, used as a subscript, means the sequence of values
with indices $1$ through $\mathrm{N}$ inclusive. For example,
$\AuthPublicNew{\allNew}$ means the sequence $[\AuthPublicNew{\mathrm{1}},
\AuthPublicNew{\mathrm{2}}, ...\;\AuthPublicNew{\NNew}]$.
The symbol $\bot$ is used to indicate unavailable information or a failed decryption.
\subsection{Cryptographic Functions}
\eli{Would be good to start with ``abstract'', using only a single security parameter $\lambda$ and multiples of it, and then in ``instantiations'' have
the specific details. In more detail, in ``abstract'' we'll have
only $\CRH$ with security $\lambda$, and multiple $\PRF{a}{b}$'s all with same (?) security $\lambda$, and we demand that the PRFs are ``computationally pairwise independent'', i.e.,
$\PRF{a}{b}$ is pseudorandom given $\PRF{a'}{b'}$ whenever
$(a,b)\neq (a',b')$.}
$\CRH$ is a collision-resistant hash function. In \Zcash, it is instantiated with the
$\SHAName$ function, which takes a 512-bit block and produces a 256-bit hash. This is
different from the $\FullHashName$ function, which hashes arbitrary-length sequences.
$\PRF{x}{}$ is a pseudo-random function seeded by $x$. \changed{Four} \emph{independent}
instances of $\PRF{x}{}$ are needed in our scheme: $\PRFaddr{x}$, $\PRFnf{x}$, $\PRFpk{x}$\changed{,
and $\PRFrho{x}$}.
It is required that $\PRFnf{x}$ \changed{and $\PRFrho{x}$} be collision-resistant
across all $x$ --- i.e. it should not be feasible to find $(x, y) \neq (x', y')$
such that $\PRFnf{x}(y) = \PRFnf{x'}(y')$\changed{, and similarly for $\PRFrho{}$}. \eli{we likely need more than just not having a collision. Likely
various bad things would happen if $\PRF{x}{y}$ can be predicted given
$\PRF{x'}{y'}$. So we really need them to be pairwise distinct pseudorandom
In \Zcash, the $\SHAName$ function is used to construct all of these functions.
\bitbox{18}{1} &
\bitbox{18}{1} &
\bitbox{18}{0} &
\bitbox{18}{0} &
\bitbox{224}{252 bit $x$} &
\bitbox{56}{8 bit $t$} &
\bitbox{18}{1} &
\bitbox{18}{1} &
\bitbox{18}{1} &
\bitbox{18}{0} &
\bitbox{224}{252 bit $\AuthPrivate$} &
\bitbox{256}{256 bit $\NoteAddressRand$}
\bitbox{18}{0} &
\bitbox{18}{\iminusone} &
\bitbox{18}{0} &
\bitbox{18}{0} &
\bitbox{224}{252 bit $\AuthPrivate$} &
\bitbox{256}{256 bit $\hSig$}
\bitbox{18}{0} &
\bitbox{18}{\iminusone} &
\bitbox{18}{1} &
\bitbox{18}{0} &
\bitbox{224}{252 bit $\NoteAddressPreRand$} &
\bitbox{256}{256 bit $\hSig$}
&\setchanged \PRFaddr{x}(t) &\setchanged := \CRHbox{\addrbox} \\
\nf =\;& \PRFnf{\AuthPrivate}(\NoteAddressRand) &:= \CRHbox{\nfbox} \\
\h{i} =\;& \PRFpk{\AuthPrivate}(i, \hSig) &:= \CRHbox{\pkbox} \\
\setchanged \NoteAddressRandNew{i} =\;&\setchanged \PRFrho{\NoteAddressPreRand}(i, \hSig)
&\setchanged := \CRHbox{\rhobox}
The first four bits, i.e.\ the most significant four bits of the first byte,
are used to distinguish different uses of $\CRH$, ensuring that the functions
are independent. In addition to the inputs shown here, the bits $\mathtt{1011}$
in this position are used to distinguish uses of the full $\FullHashName$ hash
function (see \crossref{comm}).
(The specific bit patterns chosen here are motivated by the possibility of future
extensions that either increase $\NOld$ and/or $\NNew$ to 3, or that add an
additional bit to $\AuthPrivate$ to encode a new key type, or that require an
additional PRF.)
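
To make the separation concrete, the following worked table (an illustrative
sketch derived from the bit patterns above, assuming $i \in \{1, 2\}$ so that
$i - 1$ is a single bit) lists the four leading bits of each input block:
\[
\begin{array}{lll}
\mbox{Use} & \mbox{Leading bits} & \mbox{Value} \\
\PRFaddr{x} & \mathtt{1100} & \hexint{C} \\
\PRFnf{\AuthPrivate} & \mathtt{1110} & \hexint{E} \\
\PRFpk{\AuthPrivate},\ i = 1 & \mathtt{0000} & \hexint{0} \\
\PRFpk{\AuthPrivate},\ i = 2 & \mathtt{0100} & \hexint{4} \\
\PRFrho{\NoteAddressPreRand},\ i = 1 & \mathtt{0010} & \hexint{2} \\
\PRFrho{\NoteAddressPreRand},\ i = 2 & \mathtt{0110} & \hexint{6} \\
\FullHashName & \mathtt{1011} & \hexint{B}
\end{array}
\]
All seven values are distinct. For the three PRFs with a 256-bit second input, the
block length works out as $4 + 252 + 256 = 512$ bits, which matches the 512-bit
input of $\CRH$.
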
$\BlakeHashName$ is also used to construct a Key Derivation Function and as a
hash function for the computation of $\hSig$. The notation $\BlakeHash(p, x)$
represents the application of unkeyed $\BlakeHashName$ to a 16-byte personalization
string $p$ and input $x$, as defined in \cite{blake2}.
\subsection{Payment Addresses and Spending Keys}
A \keyTuple $(\SpendingKey, \PaymentAddress)$ is
generated by users \eli{``who wish to receive payments'' is an intended usage explanation, suggest removing} who wish to receive payments under this scheme.
The \paymentAddress $\PaymentAddress$ is derived \eli{it's not clear what ``derived'' means, suggest putting the formal derivations right away here} from the \spendingKey $\SpendingKey$.
The following diagram depicts the relations between key components.
Arrows point from a component to any other component(s) that can be derived
from it. \eli{formally, any $a$ can be derived from any $b$ as long as ``derived'' remains undefined, hence I think the
derivation rule should be explicitly stated here}
\eli{following sentence is an intended usage explanation, and it's very confusing: what happens if a player deviates? will it ruin the system?
Additionally, it's not clear what happens in our system, how is it ``not exposed to users''?}
The composition of \paymentAddresses and \spendingKeys
is a cryptographic protocol detail that should not normally be
exposed to users. However, user-visible operations \eli{what's a user-visible operation? what's an operation? how is it provided?} should be provided
to obtain a \paymentAddress or \viewingKey from a \spendingKey.
\changed{$\AuthPrivate$ is 252 bits.}
$\AuthPublic$, $\TransmitPrivate$, and $\TransmitPublic$ are each 256 bits.
\changed{$\AuthPublic$, $\TransmitPrivate$ and $\TransmitPublic$ are derived
as follows: \eli{this should be moved up. Additionally, the clamp/curve stuff is an ``instantiation'' and here we should have abstract PKC functionality (public key, private key)}}
\AuthPublic &:= \changed{\PRFaddr{\AuthPrivate}(0)} \\
\TransmitPrivate &:= \changed{\Clamp(\PRFaddr{\AuthPrivate}(1))} \\
\TransmitPublic &:= \changed{\CurveMultiply(\TransmitPrivate, \CurveBase)}
\item $\CurveMultiply(\bytes{n}, \bytes{q})$ performs point
multiplication of the Curve25519 public key represented by the byte
sequence $\bytes{q}$ by the Curve25519 secret key represented by the
byte sequence $\bytes{n}$, as defined in section 2 of \cite{Curve25519};
\item $\CurveBase$ is the public byte sequence representing the Curve25519
base point;
\item $\Clamp(\bytes{x})$ takes a 32-byte sequence $\bytes{x}$ as input
and returns a byte sequence representing a Curve25519 private key, with
bits ``clamped'' as described in section 3 of \cite{Curve25519}:
``clear bits 0, 1, 2 of the first byte, clear bit 7 of the last byte,
and set bit 6 of the last byte.'' Here the bits of a byte are numbered
such that bit $b$ has numeric weight $2^b$.
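
Written out on bytes (a sketch under the bit numbering just given, where
$b_0$ and $b_{31}$ are hypothetical names for the first and last bytes of the
32-byte input, and $\wedge$, $\vee$ denote bitwise AND and OR), clamping amounts to
\[
b_0 \leftarrow b_0 \wedge \hexint{F8}, \qquad
b_{31} \leftarrow (b_{31} \wedge \hexint{7F}) \vee \hexint{40},
\]
with the other 30 bytes unchanged. For example, a first byte of $\hexint{FF}$
becomes $\hexint{F8}$, and a last byte of $\hexint{FF}$ becomes $\hexint{7F}$,
while a last byte of $\hexint{00}$ becomes $\hexint{40}$.
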
\eli{following paragraph talks about intended usage, hence confusing. }
Users can accept payment from multiple parties with a single
$\PaymentAddress$ and the fact that these payments are destined to
the same payee is not revealed on the blockchain, even to the
paying parties. \emph{However}, if two parties collude to compare the
$\PaymentAddress$ each of them was given, they can trivially determine that the
addresses are the same. A payee who wishes to prevent this should create a distinct
\paymentAddress for each payer.
A \note (denoted $\NoteTuple{}$) is a tuple $\changed{(\AuthPublic, \Value,
\NoteAddressRand, \NoteCommitRand)}$ \eli{shouldn't $\NoteCommitRand$ be part of the note commitment, not the note?} which represents that a value $\Value$ is
spendable by the recipient who holds the \spendingKey $\AuthPrivate$ corresponding
to $\AuthPublic$, as described in the previous section.
\eli{following list mixes abstract and instantiation stuff}
\item $\AuthPublic$ is a 32-byte \authKeypair public key \eli{this should have been defined in the previous section} of the recipient.
\item $\Value$ is a 64-bit unsigned integer representing the value of the
\note in \zatoshi (1 \ZEC = $10^8$ \zatoshi).
\item $\NoteAddressRand$ is a 32-byte $\PRFnf{\AuthPrivate}$ preimage.
\item $\NoteCommitRand$ is a 32-byte \COMMtrapdoor.
$\NoteCommitRand$ is randomly generated by the sender. \eli{previous and next sentence should be merged into bullets above} \changed{$\NoteAddressRand$
is generated from a random seed $\NoteAddressPreRand$ using
$\PRFrho{\NoteAddressPreRand}$.} Only a commitment to these values is disclosed
publicly, which allows the tokens $\NoteCommitRand$ and $\NoteAddressRand$ to blind
the value and recipient \emph{except} to those who possess these tokens \eli{inaccurate, even if you have these tokens you'd have to exhaustively try all address keys (not all of them are necessarily known to you) and values
to find the recipient}.
\subsubsection{\NoteCommitments} \label{comm}
The underlying $\Value$ and $\AuthPublic$ are blinded with $\NoteAddressRand$
and $\NoteCommitRand$ \changed{using the collision-resistant hash function $\FullHash$}.
\eli{The note commitment of $\NoteTuple{}$, denoted $\cm$, is defined as \ldots}
The resulting hash is $\cm = \Commitment(\NoteTuple{})$.
\bitbox{24}{1} &
\bitbox{24}{0} &
\bitbox{24}{1} &
\bitbox{24}{1} &
\bitbox{24}{0} &
\bitbox{24}{0} &
\bitbox{24}{0} &
\bitbox{24}{0} &
\bitbox{256}{256 bit $\AuthPublic$} &
\bitbox{128}{64 bit $\Value$} &
\bitbox{256}{256 bit $\NoteAddressRand$}
\bitbox{256}{256 bit $\NoteCommitRand$} &
\hskip 1em $\cm := \FullHashbox{\cmbox}$
The leading byte of the $\FullHash$ input is $\hexint{B0}$.
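
As a consistency check on the layout above (a worked sum added for illustration,
using only the field widths shown in the diagram), the eight leading bits
$\mathtt{10110000}$ form the byte $\hexint{B0}$, and the total input length is
\[
8 + 256 + 64 + 256 + 256 = 840 \mbox{ bits} = 105 \mbox{ bytes}.
\]
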
A \nullifier (denoted $\nf$) is derived from the $\NoteAddressRand$ component
of a \note as $\PRFnf{\AuthPrivate}(\NoteAddressRand)$. A \note is spent by proving
knowledge of $\NoteAddressRand$ and $\AuthPrivate$ in zero knowledge while
disclosing its \nullifier $\nf$, allowing $\nf$ to be used to prevent double-spending.
\subsubsection{\NotePlaintexts and \Memos} \label{notept}
\eli{what's a transmitted note? is it different from a note? and who encrypts
them? }
Transmitted \notes are stored on the blockchain in encrypted form, together with
a \noteCommitment $\cm$.
\eli{next paragraph mentions ``note plaintext'', ``joinsplit description'',
``transmission keys'', ``transmitted notes ciphertext'' all of which are undefined yet. It seems the main thing defined here is the memo, and then a note-plaintext. In the ``abstract'' part both are easy to describe, and the rest of the details should likely be moved to ``instantiation''}
The \notePlaintexts associated with a \joinSplitDescription are encrypted to the
respective \transmitKeypair keys $\TransmitPublicNew{\allNew}$,
and the result forms part of a \notesCiphertext (see \crossref{inband}
for further details).
Each \notePlaintext (denoted $\NotePlaintext{}$) consists of
$(\Value, \NoteAddressRand, \NoteCommitRand\changed{, \Memo})$.
The first three of these fields are as defined earlier.
\changed{$\Memo$ is a 128-byte \memo associated with this \note.
The usage of the \memo is by agreement between the sender and recipient of the
\note. The memo \SHOULD be encoded either as:
\item a UTF-8 human-readable string \cite{Unicode}, padded with zero bytes; or
\item an arbitrary sequence of 128 bytes starting with a byte value of $\hexint{F5}$
or greater, which is therefore not a valid UTF-8 string.
In the former case, wallet software is expected to strip any trailing zero bytes
and then display the resulting \mbox{UTF-8} string to the recipient user, where applicable.
Incorrect UTF-8-encoded byte sequences should be displayed as replacement characters (U+FFFD).
In the latter case, the contents of the \memo \SHOULDNOT be displayed. A start byte
of $\hexint{F5}$ is reserved for use by automated software by private agreement.
A start byte of $\hexint{F6}$ or greater is reserved for use in future \Zcash
protocol extensions.
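
As an illustration of the first encoding (an example constructed here, not taken
from the original text), the two-character memo ``hi'' would be encoded as
\[
[\hexint{68}, \hexint{69}, \underbrace{\hexint{00}, \ldots, \hexint{00}}_{126\ \mbox{\scriptsize bytes}}],
\]
for a total of $2 + 126 = 128$ bytes. Stripping the trailing zero bytes recovers
the original string. The second encoding is unambiguous because the byte values
$\hexint{F5}$ through $\hexint{FF}$ never occur in well-formed UTF-8.
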
The encoding of a \notePlaintext consists of, in order:
\bitbox{192}{8 bit $\NotePlaintextLeadByte$}
&}\bitbox{192}{$\Value$ (8 bytes)} &
\bitbox{256}{$\NoteAddressRand$ (32 bytes)} &
\bitbox{256}{$\NoteCommitRand$ (\changed{32} bytes)} &
\changed{\bitbox{800}{$\Memo$ (128 bytes)}}
\item A byte, $\NotePlaintextLeadByte$, indicating this version of the
encoding of a \notePlaintext.
\item 8 bytes specifying $\Value$.
\item 32 bytes specifying $\NoteAddressRand$.
\item \changed{32} bytes specifying $\NoteCommitRand$.
\item 128 bytes specifying $\Memo$.
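
Summing the field lengths listed above (a worked check added for illustration)
gives the length of an encoded \notePlaintext:
\[
1 + 8 + 32 + 32 + 128 = 201 \mbox{ bytes}.
\]
This is 16 bytes less than the 217 bytes allotted to each entry of
$\encCiphertexts$ in a \joinSplitDescription. The difference would be accounted for
by a 16-byte authentication tag, as produced by the AEAD construction of
\cite{rfc7539}; that reading is an inference here rather than a statement of
the original text.
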
\subsection{\NoteCommitment Tree}
\eli{Prefer ``A \noteCommitmentTree'' because all that follows works for any tree and we expect it to dynamically change}
The \noteCommitmentTree is an \incrementalMerkleTree of depth $\MerkleDepth$ used to
store \noteCommitments that \joinSplitTransfers produce \eli{undefined yet because \joinSplitTransfer. Why not define it just as in the figure, it's a Merkle tree of note commitments.}. Just like the \term{unspent
transaction output set} (UTXO) used in \Bitcoin, it is used to express the existence
of value and the capability to spend it. However, unlike the UTXO, it is \emph{not}
the job of this tree \eli{intended usage. particularly confusing because we now assign a job to a tree.} to protect against double-spending, as it is append-only.
\eli{more intended usage: how does the honest player associate blocks with a tree? and what happens if a user deviates from this? We're going to describe the split-join circuit. For this purpose we only need to have defined a tree of commitments and define (or assume knowledge of) authentication paths in a Merkle tree} Blocks in the blockchain are associated (by all nodes) with the root of this tree
after all of its constituent \joinSplitDescriptions' \noteCommitments have been
entered into the tree associated with the previous block.
\eli{intended usage. Prefer to have defined (i) a nullifier set and (ii) commitment tree, and now talk about join-split. Later we'll explain how these two objects are modified.}
\eli{tx undefined yet}
Transactions insert \nullifiers into a \nullifierSet which is maintained
alongside the UTXO by all nodes.
\eli{a tx is just a string, so it doesn't insert anything. Rather, nodes process
tx's and the ``good'' ones lead to the addition of \nullifiers to the
Transactions that attempt to insert a \nullifier into this set that already
exists within it are invalid as they are attempting to double-spend.
\eli{After defining \term{transaction}, one should define what a \term{legal tx} is
(this definition depends on a particular blockchain [view]) and only then can one
talk about ``attempts'' of transactions, and insertions of \nullifiers into the
\eli{suggest to move the discussion of the split-join circuit to here, so that we first completely discuss the single-tx level (with respect to some fixed comm-tree and nullifier set). Then we'll move to the topic of consensus and blockchain.}
\subsection{The Blockchain}
\eli{For the ``abstract'' part, all we care about is that each block-chain is converted to a Merkle tree in a deterministic manner, so that all honest players agree on its structure (assuming they see the same blockchain). So the particular way the ``canonical'' tree is constructed should appear in ``instantiation''.}
At a given point in time, the \blockchainview of each \fullnode \eli{undefined. So far we had ``users''. Are there several kinds?} consists of a
sequence of one or more valid \blocks. Each \block consists of a sequence of one or
more \transactions. In a given node's \blockchainview, \treestates are chained
\eli{defined?} in an
obvious way:
\item The input \treestate of the first \block is the empty \treestate.
\item The input \treestate of the first \transaction of a \block is the final
\treestate of the immediately preceding \block.
\item The input \treestate of each subsequent \transaction in a \block is the
output \treestate of the immediately preceding \transaction.
\item The final \treestate of a \block is the output \treestate of its last
\transaction.
An \anchor is a Merkle tree root of a \treestate, and uniquely identifies that
\treestate given the assumed security properties of the Merkle tree's hash function.
Each \transaction is associated with a \sequenceOfJoinSplitDescriptions \eli{this is undefined yet. let's work bottom up: first define a join-split assuming a fixed tree}.
\todo{They also have a transparent value flow that interacts with the \joinSplitDescription's
\changed{$\vpubOld$ and} $\vpubNew$.}
Inputs and outputs are associated with a value. \eli{what're inputs/outputs of join-split?}
The total value of the outputs must not exceed the total value of the inputs.
The \anchor of the \changed{first} \joinSplitDescription in a \transaction must refer to
some earlier \block's final \treestate.
The \anchor of each subsequent \joinSplitDescription may refer either to some earlier
\block's final \treestate, or to the output \treestate of the immediately preceding
\joinSplitDescription.
These conditions act as constraints on the blocks that a \fullnode will
accept into its \blockchainview.
We rely on Bitcoin-style consensus for \fullnodes to eventually converge on their
views of valid \blocks, and therefore of the sequence of \treestates in those
\blocks.
\subparagraph{Value pool}
\eli{who maintains the value pool? is this part of the specification? seems like
more intended usage?}\eli{tx undefined yet}
Transaction inputs insert value into a \term{value pool}, and transaction outputs
remove value from this pool. The remaining value in the pool is available to miners
as a fee.
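
In symbols (a sketch using hypothetical names: $v^{\mathrm{in}}_j$ for the value
of the $j$-th input and $v^{\mathrm{out}}_k$ for the value of the $k$-th output
of a \transaction), the value left in the pool is
\[
\mathit{fee} = \sum_j v^{\mathrm{in}}_j - \sum_k v^{\mathrm{out}}_k \;\geq\; 0 .
\]
For example, inputs totalling $10$ \ZEC and outputs totalling $9.5$ \ZEC would
leave $0.5$ \ZEC, i.e.\ $5 \times 10^7$ \zatoshi, available as a fee.
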
\section{\JoinSplitTransfers and Descriptions} \label{pourdesc}
A \joinSplitDescription is data included in a \transaction \eli{lets first define split-join, then tx} that describes a \joinSplitTransfer,
i.e. a confidential value transfer. This kind of value transfer is the primary
\Zcash-specific operation performed by \transactions \eli{we keep referring to undefined tx. furthermore, it's not clear from last sentence if tx is a type of data or a process, as it performs something}; it uses, but should not be
confused with, the \JoinSplitCircuit used for the \zkSNARK proof and verification.
A \joinSplitTransfer spends $\NOld$ \notes $\cOld{\allOld}$ \eli{confusing notation, usually one uses $x_1,\ldots, x_n$ for a sequence of $n$ objects, not $x_{1\ldots,n}$ } and transparent input
$\vpubOld$, and creates $\NNew$ \notes $\cNew{\allNew}$ and transparent output
$\vpubNew$.
\subparagraph{Consensus rule:} \eli{what's a consensus rule? what happens if I deviate from it?}
Either $\vpubOld$ or $\vpubNew$ \MUST be zero. \eli{why? what harm would follow
if we allow both nonzero?}
\Zcash \transactions have the following additional fields \eli{additional to what?}:
Bytes & \heading{Name} & \heading{Data Type} & \heading{Description} \\
\Varies & $\nJoinSplit$ & \type{compactSize uint} & The number of \joinSplitDescriptions
in $\vJoinSplit$. \\ \hline
$1026 \times \nJoinSplit$ & $\vJoinSplit$ &
\type{JoinSplitDescription} \type{[$\nJoinSplit$]} &
The \sequenceOfJoinSplitDescriptions in this \transaction. \\ \hline
33 $\dagger$ & $\joinSplitPubKey$ & \type{char[33]} & An encoding of an ECDSA public verification key,
using the secp256k1 curve and parameters defined in \cite{sec2-ecdsa} and
\cite{secp256k1}. \\ \hline
64 $\dagger$ & $\joinSplitSig$ & \type{char[64]} & A signature on a prefix of the \transaction encoding,
to be verified using $\joinSplitPubKey$. \\ \hline
$\dagger$ The $\joinSplitPubKey$ and $\joinSplitSig$ fields are present if and only if
$\nJoinSplit > 0$.
The encoding of $\joinSplitPubKey$ and the data to be signed are specified in
more detail in \crossref{nonmalleability}.
Each \type{JoinSplitDescription} consists of: \eli{maybe move this up, to beginning of section?}
Bytes & \heading{Name} & \heading{Data Type} & \heading{Description} \\
\setchanged 8 &\setchanged $\vpubOldField$ &\setchanged \type{int64\_t} &\mbox{}\setchanged
A value $\vpubOld$ that the \joinSplitTransfer removes from the value pool. \\ \hline
8 & $\vpubNewField$ & \type{int64\_t} & A value $\vpubNew$ that the \joinSplitTransfer inserts
into the value pool. \\ \hline
32 & $\anchorField$ & \type{char[32]} & A Merkle root $\rt$ of the \noteCommitmentTree at
some block height in the past, or the Merkle root produced by a previous \joinSplitTransfer in
this \transaction. \sean{We need to be more specific here.} \\ \hline
64 & $\nullifiersField$ & \type{char[32][$\NOld$]} & A sequence of \nullifiers of the input
\notes $\nfOld{\allOld}$. \\ \hline
64 & $\commitments$ & \type{char[32][$\NNew$]} & A sequence of \noteCommitments for the
output \notes $\cmNew{\allNew}$. \\ \hline
\setchanged 32 &\setchanged $\ephemeralKey$ &\setchanged \type{char[32]} &\mbox{}\setchanged
A Curve25519 public key $\EphemeralPublic$. \\ \hline
434 & $\encCiphertexts$ & \type{char[217][$\NNew$]} & A sequence of ciphertext
components for the encrypted output \notes, $\TransmitCiphertext{\allNew}$. \\ \hline
\setchanged 32 &\setchanged $\randomSeed$ &\setchanged \type{char[32]} &\mbox{}\setchanged
A 256-bit seed that must be chosen independently at random for each \joinSplitDescription. \\ \hline
64 & $\vmacs$ & \type{char[32][$\NOld$]} & A sequence of message authentication tags
$\h{\allOld}$ that bind $\hSig$ to each $\AuthPrivate$ of the
\joinSplitDescription. \\ \hline
288 & $\zkproof$ & \type{char[288]} & An encoding, as determined by the libsnark library
\cite{libsnark}, of the zero-knowledge proof $\JoinSplitProof$. \\ \hline
The $\ephemeralKey$ and $\encCiphertexts$ fields together form the \notesCiphertext.
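
The field sizes in the table above can be checked against the $1026$ bytes
per \joinSplitDescription used for $\vJoinSplit$ (a worked sum added for
illustration, taking the sizes exactly as listed):
\[
8 + 8 + 32 + 64 + 64 + 32 + 434 + 32 + 64 + 288 = 1026 .
\]
The sizes $64 = 2 \times 32$ and $434 = 2 \times 217$ are consistent with
$\NOld = \NNew = 2$, which is assumed here rather than stated in this section.
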
\todo{Describe case where there are fewer than $\NOld$ real input \notes.}
\subsection{Computation of \hSigText} \label{hsig}
\eli{this is mostly ``instantiation'' but I notice we use a different crypto primitive instantiation (Blake) not mentioned previously. Let's use the ``abstract'' notation here: what is it? a CRH? PRF? requires different multiple of the security parameter $\lambda$? does it require special properties not needed by other instances of these that we used previously?}
\bitbox{72}{72 bit $\ascii{ZcashhSig}$}
\bitbox{256}{\hfill 256 bit $\nfOld{\mathrm{1}}$\hfill...\;} &
\bitbox{256}{256 bit $\nfOld{\NOld}$} &
Given a \joinSplitDescription, we define:
\hskip 1em $\hSigtag := \Justthebox{\hsigtagbox}{-1.3ex}$
\hskip 1em $\hSig := \BlakeHashbox{\hSigtag}{\hsigbox}$
\subsection{Merkle root validity}
A \joinSplitDescription is valid if $\rt$ is either a \noteCommitmentTree root found in
the blockchain, or a Merkle root produced by inserting the \noteCommitments
of a previous \joinSplitDescription in the \transaction into the \noteCommitmentTree
identified by that previous \joinSplitDescription's \anchor.
\eli{mix of ``abstract'' and ``instantiation''. Can we first describe abstractly what's going on here? I got pretty lost pretty quickly. In particular, what
are the abstract functionalities of SIGHASH, SIGHASHALL, ECDSA,
the ``compressed elliptic curve point'', are they CRH and digital signature, respectively? what security (as multiple of $\lambda$?)}
\Bitcoin defines several \sighashTypes that cover various parts of a transaction.
In \Zcash, all of these \sighashTypes are extended to cover the \Zcash-specific
fields $\nJoinSplit$, $\vJoinSplit$, and $\joinSplitPubKey$. They \emph{do not}
cover the field $\joinSplitSig$.
\subparagraph{Consensus rule:}\eli{what's a consensus rule?}
If $\nJoinSplit > 0$, the \transaction \MUSTNOT use \sighashTypes other than
Let $\dataToBeSigned$ be the hash of the \transaction using the $\SIGHASHALL$
\sighashType. Note that this \emph{excludes} all of the $\scriptSig$ fields in
the non-\Zcash-specific parts of the \transaction.
In order to ensure that a \joinSplitDescription is cryptographically bound to the
transparent inputs and outputs corresponding to $\vpubNew$ and $\vpubOld$, and
to the other \joinSplitDescriptions in the same \transaction, an ephemeral ECDSA
key pair is generated for each \transaction, and the $\dataToBeSigned$ is
signed with the private signing key of this key pair. The corresponding public
verification key is included in the \transaction encoding as $\joinSplitPubKey$.
If $\nJoinSplit$ is zero, the $\joinSplitPubKey$ and $\joinSplitSig$ fields are
omitted. Otherwise, a \transaction has a correct \joinSplitSignature if:
\item $\joinSplitSig$ can be verified as an encoding of a signature on
$\dataToBeSigned$, using the ECDSA public key encoded as $\joinSplitPubKey$; and
\item $\joinSplitSig$ has an $\ECDSAs$ value in the lower half of the possible range
(i.e. $\ECDSAs$ must be in the range from 0x1 to \linebreak
If $\ECDSAs$ is not in the given range, the signature is treated as invalid.
\bitbox{256}{256 bit $\ECDSAr$}
\bitbox{256}{256 bit $\ECDSAs$}
\bitbox{56}{1 bit $\tilde{y}_P$}
\bitbox{256}{256 bit $x_P$}
The encoding of a signature is:
\item[] $\Justthebox{\sigbox}{-1.3ex}$
where $\ECDSAr$ and $\ECDSAs$ are as defined in \cite{sec2-ecdsa}.
The encoding of a public key is as defined in section E.2.3.2 of \cite{std1363}
for a compressed elliptic curve point with $x$-coordinate $x_P$ and compressed
$y$-coordinate $\tilde{y}_P$:
\item[] $\Justthebox{\pubkeybox}{-1.3ex}$
Note that only compressed public keys are valid.
The condition enforced by the \JoinSplitCircuit specified in \crossref{nonmalleablepour}
ensures that a holder of all of $\AuthPrivateOld{\allOld}$ for each
\joinSplitDescription has authorized the use of the private signing key corresponding
to $\joinSplitPubKey$ to sign this \transaction.
A \joinSplitTransfer can be seen \eli{more intended usage}, from the perspective of the \transaction \eli{does a transaction ``see''?}, as
an input \changed{and an output simultaneously}.
\changed{$\vpubOld$ takes value from the value pool and}
$\vpubNew$ adds value to the value pool. As a result, \changed{$\vpubOld$ is
treated \eli{by whom?} like an \emph{output} value, whereas} $\vpubNew$ is treated like an
\emph{input} value.
Note that, unlike the original \Zerocash protocol \cite{ZerocashOakland}, \Zcash does not have
a distinction between Mint and Pour operations. The addition of $\vpubOld$ to a
\joinSplitDescription subsumes the functionality of both Mint and Pour. Also,
\joinSplitDescriptions are indistinguishable regardless of the number of real input
As stated in \crossref{pourdesc}, either $\vpubOld$ or $\vpubNew$ \MUST be zero.
No generality is lost because, if a \transaction in which both $\vpubOld$ and
$\vpubNew$ were nonzero were allowed, it could be replaced by an equivalent one
in which $\minimum(\vpubOld, \vpubNew)$ is subtracted from both of these values.
This restriction helps to avoid unnecessary distinctions between \transactions
according to client implementation.
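
As a non-normative illustration of this normalization (the concrete amounts are
hypothetical), suppose a \transaction were to specify $\vpubOld = 5$ and
$\vpubNew = 3$. Subtracting $\minimum(\vpubOld, \vpubNew) = 3$ from both values gives
\[ (\vpubOld,\; \vpubNew) = (5 - 3,\; 3 - 3) = (2,\; 0), \]
which leaves the difference $\vpubOld - \vpubNew = 2$ unchanged while satisfying
the rule that one of the two values is zero.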
A \transaction \eli{undefined yet} that contains one or more \joinSplitDescriptions, when entered into the
blockchain, appends all of its constituent \noteCommitments to the
\noteCommitmentTree. All of the constituent \nullifiers are also entered into the
\nullifierSet of the \blockchainview \emph{and} \mempool. A \transaction is not
valid if it attempts to add a \nullifier to the \nullifierSet that already
exists in the set.
In \Zcash, $\NOld$ and $\NNew$ are both $2$.
A valid instance of $\JoinSplitProof$ assures that given a \term{primary input}:
\item[] $(\rt, \nfOld{\allOld}, \cmNew{\allNew}, \changed{\vpubOld,\;}
\vpubNew, \hSig, \h{\allOld})$,
there exists a witness of \term{auxiliary input}:
\item[] $(\treepath{\allOld}, \cOld{\allOld}, \AuthPrivateOld{\allOld},
\cNew{\allNew}\changed{, \NoteAddressPreRand})$
\item[] for each $i \in \setofOld$: $\cOld{i} = (\AuthPublicOld{i},
\vOld{i}, \NoteAddressRandOld{i}, \NoteCommitRandOld{i})$;
\item[] for each $i \in \setofNew$: $\cNew{i} = (\AuthPublicNew{i},
\vNew{i}, \NoteAddressRandNew{i}, \NoteCommitRandNew{i})$
such that the following conditions hold:
\subparagraph{Merkle path validity}
for each $i \in \setofOld$ \changed{$\mid$ $\vOld{i} \neq 0$}:
$\treepath{i}$ must be a valid path of depth $\MerkleDepth$ from \linebreak
$\Commitment(\cOld{i})$ to \noteCommitmentTree root $\rt$.
\subparagraph{Balance}
$\changed{\vpubOld\; +} \vsum{i=1}{\NOld} \vOld{i} = \vpubNew + \vsum{i=1}{\NNew} \vNew{i}$.
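
As a non-normative illustration of the value equation above (the amounts are
hypothetical), take $\NOld = \NNew = 2$ with $\vpubOld = 0$, input values
$\vOld{1} = 7$ and $\vOld{2} = 0$, output values $\vNew{1} = 3$ and
$\vNew{2} = 2$, and $\vpubNew = 2$. Both sides of the equation then equal $7$:
\[ 0 + (7 + 0) = 2 + (3 + 2). \]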
\subparagraph{\Nullifier integrity}
for each $i \in \setofOld$:
$\nfOld{i} = \PRFnf{\AuthPrivateOld{i}}(\NoteAddressRandOld{i})$.
\subparagraph{Spend authority}
for each $i \in \setofOld$:
$\AuthPublicOld{i} = \changed{\PRFaddr{\AuthPrivateOld{i}}(0)}$.
\subparagraph{Non-malleability} \label{nonmalleablepour}
for each $i \in \setofOld$:
$\h{i} = \PRFpk{\AuthPrivateOld{i}}(i, \hSig)$.
\subparagraph{Uniqueness of $\NoteAddressRandNew{i}$} \label{uniquerho}
for each $i \in \setofNew$:
$\NoteAddressRandNew{i} = \PRFrho{\NoteAddressPreRand}(i, \hSig)$.
\subparagraph{Commitment integrity}
for each $i \in \setofNew$: $\cmNew{i} = \Commitment(\cNew{i})$.
\section{In-band secret distribution} \label{inband}
In order to transmit the secret $\Value$, $\NoteAddressRand$, and $\NoteCommitRand$
(necessary for the recipient to later spend) \changed{and also a \memo} to the
recipient \emph{without} requiring an out-of-band communication channel, the
\transmitKeypair public key $\TransmitPublic$ is used to encrypt these
secrets. The recipient's possession of the associated
$(\PaymentAddress, \SpendingKey)$ (which contains both $\AuthPublic$ and
$\TransmitPrivate$) is used to reconstruct the original \note \changed{ and \memo}.
All of the resulting ciphertexts are combined to form a \notesCiphertext.
\bitbox{64}{64 bit $\ascii{ZcashKDF}$} &
\bitbox{32}{8 bit $i\!-\!1$}
\bitbox{256}{256 bit $\hSig$}
\bitbox{256}{256 bit $\DHSecret{i}$} &
\bitbox{256}{256 bit $\EphemeralPublic$} &
\bitbox{256}{256 bit $\TransmitPublicNew{i}$} &
Let $\SymEncrypt{\Key}(\Plaintext)$ be authenticated encryption using
$\SymSpecific$ \cite{rfc7539} encryption of plaintext $\Plaintext$, with empty
``associated data'', all-zero nonce $\zeros{96}$, and 256-bit key $\Key$.
Similarly, let $\SymDecrypt{\Key}(\Ciphertext)$ be $\SymSpecific$
decryption of ciphertext $\Ciphertext$, with empty ``associated data'',
all-zero nonce $\zeros{96}$, and 256-bit key $\Key$. The result is either
the plaintext byte sequence, or $\bot$ indicating failure to decrypt.
\hskip 1.5em $\KDF(i, \hSig, \DHSecret{i}, \EphemeralPublic, \TransmitPublicNew{i}) :=
\LeadingBytes{32}(\BlakeHash(\kdftag, \kdfinput))$
\hskip 1.5em $\kdftag := \Justthebox{\kdftagbox}{-1.3ex}$
\hskip 1.5em $\kdfinput := \Justthebox{\kdfinputbox}{-1.3ex}$.
Let $\TransmitPublicNew{\allNew}$ be the \changed{Curve25519} public keys
for the intended recipient addresses of each new \note, and let
$\NotePlaintext{\allNew}$ be the \notePlaintexts. Let $\hSig$ be the
value computed in \crossref{hsig}.
Then to encrypt:
\item Generate a new Curve25519 (public, private) key pair
$(\EphemeralPublic, \EphemeralPrivate)$.
\item For $i \in \setofNew$,
\item Let $\TransmitPlaintext{i}$ be the raw encoding of $\NotePlaintext{i}$.
\item Let $\DHSecret{i} := \CurveMultiply(\EphemeralPrivate,
\item Let $\TransmitKey{i} := \KDF(i, \hSig, \DHSecret{i}, \EphemeralPublic,
\item Let $\TransmitCiphertext{i} :=
The resulting \notesCiphertext is $\changed{(\EphemeralPublic,
\subsection{Decryption by a Recipient}
Let $\PaymentAddress = (\AuthPublic, \TransmitPublic)$ be the recipient's
\paymentAddress, and let $\TransmitPrivate$ be the recipient's \changed{Curve25519}
private key. Let $\hSig$ be the value computed in \crossref{hsig}.
Let $\cmNew{\allNew}$ be the \noteCommitments of each output \note.
Then for each $i \in \setofNew$, the recipient will attempt to decrypt that ciphertext
component as follows:
\item Let $\DHSecret{i} := \CurveMultiply(\TransmitPrivate, \EphemeralPublic)$.
\item Let $\TransmitKey{i} := \KDF(i, \hSig, \DHSecret{i}, \EphemeralPublic,
\item Return $\DecryptNote(\TransmitKey{i}, \TransmitCiphertext{i}, \cmNew{i},
$\DecryptNote(\TransmitKey{i}, \TransmitCiphertext{i}, \cmNew{i}, \AuthPublic)$
is defined as follows:
\item Let $\TransmitPlaintext{i} :=
\item If $\TransmitPlaintext{i} = \bot$, return $\bot$.
\item Extract $\NotePlaintext{i} = (\ValueNew{i},
\NoteAddressRandNew{i}, \NoteCommitRandNew{i}, \Memo_i)$ from $\TransmitPlaintext{i}$.
\item If $\Commitment((\AuthPublic, \ValueNew{i}, \NoteAddressRandNew{i},
\NoteCommitRandNew{i})) \neq \cmNew{i}$, return $\bot$, else return $\NotePlaintext{i}$.
Note that this corresponds to step 3 (b) i. and ii. (first bullet point) of the
$\Receive$ algorithm shown in Figure 2 of \cite{ZerocashOakland}.
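
The key $\TransmitKey{i}$ derived during decryption matches the one used for
encryption because both parties obtain the same $\DHSecret{i}$. As a sketch,
assume (this notation is not defined elsewhere in this specification) that $B$
denotes the Curve25519 base point, that public keys are derived as
$\EphemeralPublic = \CurveMultiply(\EphemeralPrivate, B)$ and
$\TransmitPublicNew{i} = \CurveMultiply(\TransmitPrivate, B)$, and that the
sender computes its shared secret from $\EphemeralPrivate$ and
$\TransmitPublicNew{i}$. Since scalar multiplication commutes,
\[ \CurveMultiply(\EphemeralPrivate, \CurveMultiply(\TransmitPrivate, B)) =
   \CurveMultiply(\TransmitPrivate, \CurveMultiply(\EphemeralPrivate, B)), \]
so the left-hand side (the sender's value) equals the right-hand side (the
recipient's value) whenever output $i$ is addressed to this recipient.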
Testing whether a \note is unspent in a particular \blockchainview also requires
the \authKeypair private key $\AuthPrivate$; the \note is unspent if and only if
$\nf = \PRFnf{\AuthPrivate}(\NoteAddressRand)$ is not in the \nullifierSet
for that \blockchainview.
Note that a \note may change from being unspent to spent on a given \blockchainview,
as \transactions are added to that view. Also, blockchain reorganisations may cause
the \transaction in which a \note was output to no longer be on the consensus
The public key encryption used in this part of the protocol is loosely modelled on
other encryption schemes that use Diffie-Hellman over an elliptic curve, such
as ECIES or the $\CryptoBoxSeal$ algorithm defined in libsodium \cite{cryptoboxseal}.
Note that:
\item The same ephemeral key is used for all encryptions to the recipient keys
in a given \joinSplitDescription.
\item In addition to the Diffie-Hellman secret, the KDF takes as input the
seed $\hSig$, the public keys of both parties, and the index $i$.
\item The nonce parameter to $\SymSpecific$ is not used.
\item The ``IETF'' definition of $\SymSpecific$ from \cite{rfc7539} is
used; this uses a 32-bit block count and a 96-bit nonce, rather than a 64-bit
block count and 64-bit nonce as in the original definition of $\SymCipher$.
\section{Encoding Addresses and Keys}
This section describes how \Zcash encodes \paymentAddresses and \spendingKeys.
Addresses and keys can be encoded as a byte sequence; this is called
the \term{raw encoding}. This byte sequence can then be further encoded using
Base58Check. The Base58Check layer is the same as for upstream \Bitcoin
addresses \cite{Base58Check}.
SHA-256 compression function outputs are always represented as sequences of 32
The language consisting of the following encoding possibilities is prefix-free.
\subsection{Transparent Payment Addresses}
These are encoded in the same way as in \Bitcoin \cite{Base58Check}.
\subsection{Transparent Private Keys}
These are encoded in the same way as in \Bitcoin \cite{Base58Check}.
\subsection{Protected Payment Addresses}
A \paymentAddress consists of $\AuthPublic$ and $\TransmitPublic$.
$\AuthPublic$ is a SHA-256 compression function output.
$\TransmitPublic$ is a \changed{Curve25519} public key, for use with the
encryption scheme defined in \crossref{inband}.
The raw encoding of a \paymentAddress consists of:
\bitbox{72}{8 bit $\PaymentAddressLeadByte$}
&}\bitbox{256}{256 bit $\AuthPublic$} &
\bitbox{256}{\changed{256 bit} $\TransmitPublic$}
\item A byte, $\PaymentAddressLeadByte$, indicating this version of the
raw encoding of a \Zcash public address.
\item 256 bits specifying $\AuthPublic$.
\item \changed{256 bits} specifying $\TransmitPublic$, \changed{using the
normal encoding of a Curve25519 public key \cite{Curve25519}}.
\daira{check that this lead byte is distinct from other Bitcoin stuff,
and produces `z' as the Base58Check leading character.}
\nathan{what about the network version byte?}
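
As a quick check on the layout above, and assuming the three fields are
concatenated with no further padding, the raw encoding of a \paymentAddress
occupies
\[ 8 + 256 + 256 = 520 \mbox{ bits} = 65 \mbox{ bytes}. \]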
\subsection{Spending Keys}
A \spendingKey consists of $\AuthPrivate$, which is a sequence of 252 bits.
The raw encoding of a \spendingKey consists of, in order:
\bitbox{72}{8 bit $\SpendingKeyLeadByte$}
\bitbox{32}{$\zeros{4}$} &
&}\bitbox{252}{\changed{252} bit $\AuthPrivate$}
\item A byte $\SpendingKeyLeadByte$ indicating this version of the
raw encoding of a \Zcash \spendingKey.
\item 4 zero padding bits.
\item \changed{252} bits specifying $\AuthPrivate$.
The zero padding occupies the most significant 4 bits of the second byte.
\subparagraph{Note:} If an implementation represents $\AuthPrivate$
internally as a sequence of 32 bytes with the 4 bits of zero padding
intact, it will be in the correct form for use as an input to
$\PRFaddr{}$, $\PRFnf{}$, and $\PRFpk{}$ without need for bit-shifting.
Future key representations may make use of these padding bits.
\daira{check that this lead byte is distinct from other Bitcoin stuff,
and produces a suitable Base58Check leading character.}
\nathan{what about the network version byte?}
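
As a quick check on the layout above, and assuming the three fields are
concatenated with no further padding, the raw encoding of a \spendingKey
occupies
\[ 8 + 4 + 252 = 264 \mbox{ bits} = 33 \mbox{ bytes}. \]
Under the additional assumption that the 32 bytes following the lead byte are
read as a single big-endian integer $k$, the zero padding is equivalent to the
bound $0 \leq k < 2^{252}$.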
\section{Differences from the Zerocash paper}
\subsection{Transaction Structure} \label{trstructure}
\Zerocash introduces two new operations, which are described in
the paper as new transaction types, in addition to the original
transaction type of the cryptocurrency on which it is based
(e.g. \Bitcoin).
In \Zcash, there is only the original \Bitcoin transaction type,
which is extended to contain a sequence of zero or more
\Zcash-specific operations.
This allows for the possibility of chaining transfers of protected
value in a single \Zcash \transaction, e.g. to spend a protected \note
that has just been created. (In \Zcash, we refer to value stored in
UTXOs as ``transparent'', and value stored in \joinSplitTransfer output
\notes as ``protected''.)
This was not possible in the \Zerocash design without using multiple
transactions. It also allows transparent and protected transfers to
happen atomically --- possibly under the control of nontrivial script
conditions, at some cost in distinguishability.
\todo{Describe changes to signing.}
\subsection{Unification of Mints and Pours}
In the original \Zerocash protocol, there were two kinds of transaction
relating to protected \notes:
\item a ``Mint'' transaction takes value from transparent UTXOs as
input and produces a new protected \note as output.
\item a ``Pour'' transaction takes up to $\NOld$ protected
\notes as input, and produces up to $\NNew$ protected \notes and a
transparent UTXO as output.
Only ``Pour'' transactions included a \zkSNARK proof.
In \Zcash, the sequence of operations added to a \transaction
(described in \crossref{trstructure}) consists only of \joinSplitTransfers.
A \joinSplitTransfer is a Pour operation generalized to take a transparent
UTXO as input, allowing \joinSplitTransfers to subsume the functionality of
Mints. An advantage of this is that a \Zcash \transaction that takes
input from a UTXO can produce up to $\NNew$ output \notes, improving
the indistinguishability properties of the protocol. A related change
conceals the input arity of the \joinSplitTransfer: an unused (zero-value)
input is indistinguishable from an input that takes value from a \note.
This unification also simplifies the fix to the Faerie Gold attack
described below, since no special case is needed for Mints.
\Zcash adds a \memo sent from the creator of a \joinSplitDescription to
the recipient of each output \note. This feature is described in
more detail in \crossref{notept}.
\subsection{Faerie Gold attack and fix}
When a protected \note is created in \Zerocash, the creator is
supposed to choose a new $\NoteAddressRand$ value at random.
The \nullifier of the \note is derived from its \spendingKey
($\AuthPrivate$) and $\NoteAddressRand$. The \noteCommitment
is derived from the recipient address component $\AuthPublic$,
the value $\Value$, and the commitment trapdoor $\NoteCommitRand$,
as well as $\NoteAddressRand$. However, nothing prevents creating
multiple \notes with different $\Value$ and $\NoteCommitRand$
(hence different \noteCommitments) but the same $\NoteAddressRand$.
An adversary can use this to mislead a \note recipient, by sending
two \notes both of which are verified as valid by $\Receive$ (as
defined in Figure 2 of \cite{ZerocashOakland}), but only one of
which can be spent.
We call this a ``Faerie Gold'' attack --- referring to various Celtic
legends in which faeries pay mortals in what appears to be gold,
but which soon after reveals itself to be leaves, gorse blossoms,
gingerbread cakes, or other less valuable things \cite{LG2004}.
This attack does not violate the security definitions given in
\cite{ZerocashOakland}. The issue could be framed as a problem
either with the definition of Completeness, or the definition of
\item The Completeness property asserts that a validly received
\note can be spent provided that its \nullifier does not appear
on the ledger. This does not take into account the possibility
that distinct \notes, which are validly received, could have the
same \nullifier. That is, the security definition depends on
a protocol detail --\nullifiers-- that is not part of the
intended abstract security property, and that could be implemented
\item The Balance property only asserts that an adversary cannot
obtain \emph{more} funds than they have minted or received via
payments. It does not prevent an adversary from causing others'
funds to decrease. In a Faerie Gold attack, an adversary can cause
spending of a \note to reduce (to zero) the effective value of another
\note for which the attacker does not know the \spendingKey, which
violates an intuitive conception of global balance.
These problems with the security definitions need to be repaired,
but doing so is outside the scope of this specification. Here we
only describe how \Zcash addresses the immediate attack.
It would be possible to address the attack by requiring that a
recipient remember all of the $\NoteAddressRand$ values for all
\notes they have ever received, and reject duplicates (as proposed
in \cite{GGM2016}). However, this requirement would interfere
with the intended \Zcash feature that a holder of a \spendingKey
can recover access to (and be sure that they are able to spend) all
of their funds, even if they have forgotten everything but the
Instead, \Zcash enforces that an adversary must choose distinct values
for each $\NoteAddressRand$, by making use of the fact that all of the
\nullifiers in \joinSplitDescriptions that appear in a valid \blockchainview
must be distinct. The \nullifiers are used as input to $\BlakeHashName$
to derive a public value $\hSig$ which uniquely identifies the transaction,
as described in \crossref{hsig}. ($\hSig$ was already used in \Zerocash
in a way that requires it to be unique in order to maintain
indistinguishability of \joinSplitDescriptions; adding the \nullifiers
to the input of the hash used to calculate it has the effect of making
this uniqueness property robust even if the \transaction creator is an
The $\NoteAddressRand$ value for each output \note is then derived from
a random private seed $\NoteAddressPreRand$ and $\hSig$ using
$\PRFrho{\NoteAddressPreRand}$. The correct construction of
$\NoteAddressRand$ for each output \note is enforced by the circuit
(see \crossref{uniquerho}).
Now even if the creator of a \joinSplitDescription does not choose
$\NoteAddressPreRand$ randomly, uniqueness of \nullifiers and
collision resistance of both $\BlakeHashName$ and $\PRFrho{}$ will ensure
that the derived $\NoteAddressRand$ values are unique, at least for
any two \joinSplitDescriptions that get into a valid \blockchainview.
This is sufficient to prevent the Faerie Gold attack.
\subsection{Internal hash collision attack and fix}
The \Zerocash security proof requires that the composition of
$\COMM{\NoteCommitRand}$ and $\COMM{\NoteCommitS}$ is a computationally
binding commitment to its inputs $\AuthPublic$, $\Value$, and
$\NoteAddressRand$. However, the instantiation of $\COMM{\NoteCommitRand}$
and $\COMM{\NoteCommitS}$ in section 5.1 of the paper did not meet
the definition of a binding commitment at a 128-bit security level.
Specifically, the internal hash of $\AuthPublic$ and $\NoteAddressRand$
is truncated to 128 bits (motivated by providing statistical hiding
security). This allows an attacker, with a work factor on the order of
$2^{64}$, to find distinct values of $\NoteAddressRand$ with colliding
outputs of the truncated hash, and therefore the same \noteCommitment.
This would have allowed such an attacker to break the balance property
by double-spending \notes, potentially creating arbitrary amounts of
currency for themself.
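
The $2^{64}$ figure follows from the generic birthday bound, stated here as a
rough estimate rather than an exact cost: for a hash with an $n$-bit output, a
collision is expected after on the order of
\[ \sqrt{2^{n}} = 2^{n/2} \]
evaluations, which for the truncation to $n = 128$ bits gives
$2^{128/2} = 2^{64}$.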
\Zcash uses a simpler construction with a single $\FullHash$ evaluation
for the commitment. The motivation for the nested construction in \Zerocash
was to allow Mint transactions to be publicly verified without requiring
a ZK proof (as described under step 3 in section 1.3 of
\cite{ZerocashOakland}). Since \Zcash combines ``Mint'' and ``Pour''
transactions into a generalized \joinSplitTransfer which always uses a ZK proof,
it does not require the nesting. A side benefit is that this reduces the
number of $\SHA$ evaluations needed to compute each \noteCommitment from
three to two, saving a total of four $\SHA$ evaluations in the
Note that \Zcash \noteCommitments are not statistically hiding, and
so \Zcash does not support the ``everlasting anonymity'' property
described in section 8.1 of the \Zerocash paper \cite{ZerocashOakland},
even when used as described in that section. While it is possible to
define a statistically hiding, computationally binding commitment scheme
for this use at a 128-bit security level, the overhead of doing so
within the circuit was not considered to justify the benefits.
\subsection{Changes to PRF inputs and truncation}
%The need for collision resistance of \CRH(.) truncated to 253 bits was not
%explicitly stated in \ (This does not follow from collision resistance of $\CRH$.)
\subsection{In-band secret distribution}
\item The paper defines a \note as a tuple $(\AuthPublic, \Value,
\NoteAddressRand, \NoteCommitRand, \NoteCommitS, \cm)$, whereas this specification
defines it as $(\AuthPublic, \Value, \NoteAddressRand, \NoteCommitRand)$.
This is just a clarification, because the instantiation of $\COMM{\NoteCommitS}$
in section 5.1 of the paper did not use $\NoteCommitS$ (and neither does the
new instantiation of $\Commitment$). $\cm$ can be computed from the other
The inventors of \Zerocash are Eli Ben-Sasson, Alessandro Chiesa,
Christina Garman, Matthew Green, Ian Miers, Eran Tromer, and Madars
The authors would like to thank everyone with whom they have discussed
the \Zerocash protocol design; in addition to the inventors, this includes
Mike Perry, Isis Lovecruft, Leif Ryge, Andrew Miller, Zooko Wilcox,
Samantha Hulsey, and no doubt others.
The Faerie Gold attack was found by Zooko Wilcox.
The internal hash collision attack was found by Taylor Hornby.
