DS - A purely peer-to-peer, local-first, self-verifiable, portable, encrypted-by-default document stamp format for the next billions of people and years
by Josh Spooner, the last date scientist, and many giants, published 2024
Satoshi Nakamoto
In the digital realm, a visionary’s spark ignited,
Satoshi’s genius, both in code and narrative, united.
A chain of trust, decentralized and bright, your legacy forever recited.
Abstract.
A local-first, tamper-proof, portable, self-verifiable, encrypted-by-default, AI-efficient, peer-to-peer document format is within reach that allows the the next billions of people to cost-effectively record and account with reputable institutions using tamper-verifiable they trust and on-chain where they choose. Blockchain has truly unleashed unmatched decentralized innovation like nothing before, but the path beyond smart contracts to ricardian contracts is only reachable if we standardize and localize the verifiability of content blocks and timeserver and can trust that each devices is accurate at the time of sync or that date manipulation can be detected via domain-specific means. Thus, we propose a markdown format that provides verifiable standards for digesting, time-stamping, and updating the content block with canonical content-addressable reference in time (collectively, “date seal”). Archiving the deal seal and optionally publishing on-chain or via NOSTR builds a decentralized repository of date seals accessible for any party to verify the existence of a content block in time. The privacy and transmutable identity problems are solved by way of letting the authors of smart documents choose if a content block is date-sealed with personal or other identifiers or links. Date-sealing the cover page, initials, and/or signature section of a document is a portable means enabling multi-chain, multi-party ricardian contracts. Open source, local AI can provide literate, non-technical people the means of verifying the SHA-256 hashing algorithm’s in-line implementation for more accessible self-verification. [^ds/2024-08-19/baa6e318e9c1^]
1. Introduction
Data on the Internet has come to rely almost exclusively on cloud platforms serving as trusted third parties to process data and check its integrity, and crypto platforms, which have not made it usefully offline-able. While the system works well enough for most people, it still suffers from the inherent weaknesses of every centralized trust based model.
What is needed is a human and machine readable, portable document standard based on cryptographic proof instead of centralized trust, allowing any two willing parties to self-verify “date-sealed” smart documents directly with each other without the need for a trusted intermediary. Date-seal is an in-line and archived document stamp, usually displayed as a pop-up or footnote of at least the block’s digest and timestamp.
In this paper, we propose a solution to the document verifiability and agentic identity problem using a peer-to-peer distributed server to generate computational proof of the chronological order of date-seal stamps and allow identifying information to be optionally redacted. The system is secure in establishing offline-able trust as long as honest nodes sync periodically and collectively contain the longest sequence of cross-verified date-sealed blocks longer than any cooperating group of attacker nodes. [^ds/2024-08-19/c95c39517245^]
2. Date-Sealing, DS, or ds
Markdown Content Blocks
The solution we propose begins with markdown, a human-readable, AI-efficient, portable content format. Mermaid, Obsidian Canvas, and Excalidraw already demonstrate methods to incorporate graphs, charts, and graphics into markdown documents. Wiki-links ([[]]) can provide inline navigation to markdown files, foldernotes, header blocks, content blocks, and other types of information. Transclusion, sometimes referred to as “embeds”, (![[]]) can be used to display content from proximal documents within a folder (sometimes called “vault”). The ability to link local and cloud content is particularly useful as we endeavor to digest
Markdown Element
Usage
Example
[[]]
Internal link to another note
[[Note Title]] links to a note titled “Note Title”
![[ ]]
Embed another note or media
![[Image.png]] embeds an image file named “Image.png”
[[\#^]]
Link to a specific block or heading
[[Note Title\#^block-id]] links to a block with ID block-id in “Note Title”
[^]
Footnote reference
This is a sentence with a footnote.[^1]
[^]:
Footnote definition
[^1]: This is the footnote text.
^
block reference
^id links to a block of content concluded with ^id
The solution we propose begins with a timestamp and encoded title or unique identifier within the squared carrot notation [^square-carrot-notation^]1 formatted as [^${domain}/${currentDate}/${normalizedBlockHash}^]. The squared carrot link is generally not visible in reading mode and most published versions. This becomes a means of canonically referring to timestamped content within a domain, folder, or file, and a means of eventually caching and syncing content in apps and tools between devices. The timestamp need not be displayed or may be displayed relative to the device in a pop-up or foot-note. Each device provides their timestamp and source servers (or proof of time method), instead of counting on trusted time servers like most blockchains and cloud platforms. If devices are synced at an initially set schedule or interval, say weekly, assuming no networks outages, this becomes a domain-specific method for cross-verifying device time and other settings. [^ds/2024-08-19/fd3d1148c716^]
B. Header Block Hashing [^ds/2024-08-19/23136272d843^] ^ds-2024-08-19-0f2ac9f50515
Each block in a document contains a header and content that can be normalized, hashed, and combined with the date as a locally unique identifier and precise reference to that content block in time. Block normalization would include removing markdown delimiters, prefixes, and postfixes, including headers, lists, content block references, and trimming white space. The header ##### A. Time-stamping would be normalized to Time-stamping and could be canonically referenced as [^hs/2024-08-10/77dab0106edb^]. These references can be printed as squared carrot link or carrot content ref with differing presentation, verification, and cross-linking behavior demonstrated in this smart paper.
For demonstration, a table of headers and content blocks of these steps to date-sealing in this DS Smart Paper]] - using the SHA-256 cryptographic digest in hexadecimal 12-length format - would look something like the table in this toggle box. Open to inspect and verify. [^ds/2024-08-19/3803b5b6cbb3^]
Table of Content Blocks
// Function to compute a 12-character SHA-256 hashasync function computeHash(text) { const encoder = new TextEncoder(); const data = encoder.encode(text); const hashBuffer = await crypto.subtle.digest('SHA-256', data); const hashArray = Array.from(new Uint8Array(hashBuffer)); const hashHex = hashArray.map(b => b.toString(16).padStart(2, '0')).join(''); return hashHex.slice(0, 12);}// Function to normalize text by removing markdown prefixes, squared carrot links, and carrot content block refsfunction normalizeText(text) { return text .replace(/^#+\s*/, '') // Remove markdown headers .replace(/^[A-Z]\.\s*/, '') // Remove ordered list prefixes .replace(/$$\^ds\/.*?\^$$/g, '') // Remove squared carrot links .replace(/\^ds-.*?$/, '') // Remove carrot content block refs .trim(); // Trim any leading or trailing whitespace}// Function to get the current date in YYYY-MM-DD formatfunction getCurrentDate() { const now = new Date(); return now.toISOString().split('T')[0];}// Data rowsconst rows = [ "Time-stamping", "Time-stamping", "A. Time-stamping", "A--Time-stamping", "##### A. Time-stamping", "#Time-stamping", "#A--Time-stamping", "Block Header Hashing", "##### B. Block Header Hashing", "##### C. Content Block Referencing", "##### A. Time-stamping [^hs/2024-08-10/Time-stamping^]", "##### B. Block Header Hashing [^hs/2024-08-10/69aca9b9359f^]", "##### C. Content Block Referencing [^hs/2024-08-10/3826a96abbda^]"];// Current dateconst currentDate = getCurrentDate();// Prepare data for the tablePromise.all(rows.map(async text => { const normalized = normalizeText(text); const textHash = await computeHash(text); const normalizedHash = await computeHash(normalized); const squaredCarrotLink = `[^hs/${currentDate}/${normalizedHash}^]`; const squaredCarrotCode = `\`[^hs/${currentDate}/${normalizedHash}^]\``; const carrotRef = `\`^ds-${normalizedHash}-${currentDate}\``; return { text, sourceText: text, normalized, textHash, normalizedHash, squaredCarrotLink, squaredCarrotCode, carrotRef };})).then(tableData => { // Render the table dv.table( ["Markdown Block", "Source Code Block", "Normal Text Block", "MD Hash", "Normal Hash", "Squared Carrot Link", "Squared Carrot Code", "Premature Content Block Reference"], tableData.map(row => [ row.text, row.sourceText, row.normalized, row.textHash, row.normalizedHash, row.squaredCarrotLink, row.squaredCarrotCode, row.carrotRef ]) );});
Important Observations
“A—Time-stamping” is not valid markdown and thus is not normalized consistently as the other markdown blocks
“Premature” Content Block Reference should not be computed this way and is an easy mistake. The Content Block Reference should include squared carrot link as more clearly defined in the next section.
Prompt for DataviewJS script v0.94
write an Obsidian dataviewjs query that displays a table with these columns:
- markdown text block as "Markdown Block",
- source text block (unrendered markdown) as "Source Code Block",
- normalized text block as "Normal Text Block",
- 12-character hex SHA-256 hash of text block as "MD Hash",
- 12-character hex SHA-256 hash of normalized text block as "Normal Hash",
- dated squared carrot link in this format: [^cbs/{YYYY-MM-DD}/{normal-hash}^] as "Squared Carrot Link"
- dated squared carrot code in this format (unrendered markdown): `[^ds/{YYYY-MM-DD}/{normal-hash}^]` as "Squared Carrot Code"
- normalized hash tag in this format: `#ds/{normal-hash}/{YYYY-MM-DD}]` as "DS Tag"
- normalized carrot content block ref in this format (unrendered markdown): ^cbr-{normal-hash}-{YYYY-MM-DD} as "Premature Content Block Reference"
normalize text by
- stripping markdown prefix
- removing squared carrot links
- removing ordered lists, lists, and todos
- removing content block references
set formats
- dates as YYYY-MM-DD, don't forget `[0]` after splitting the now date
- hashes as 12-character hexadecimal
and the table with the following rows:
Time-stamping
Time-stamping
A. Time-stamping
A--Time-stamping
##### A. Time-stamping
#Time-stamping
#A--Time-stamping
Block Header Hashing
##### B. Block Header Hashing
##### C. Content Block Referencing
C. Content Block Referencing [^ds/2024-08-19/48855b6e6bff^] ^ds-2024-08-19-6891290385fa
Similar to the block header, a content block within a document can be verifiably referenced using the squared carrot link format like so [^cbs/${currentDate}/${normalizedHash}^]. Once a squared carrot link has been appended to the block header or content block, a canonical carrot or content block ref can be generated based upon the format ^cbr-${normalizedHash}-{CurrentDate}. After which, the content block can be linked, transcluded, and referenced by [[#^cbr-normalizedhash-currentdate|^cbr-${normalizedHash}-{CurrentDate}]]. [^ds/2024-08-19/c78de1fa4893^]
Prompt for Templater script - CBS v0.7
Tip: Install “Hotkeys for templates” community plugin and give the script CMD + SHIFT + D.
Tip: Use Meta-Bind Buttons
POE Script - CBS v0.7
write an Obsidian templater script that replaces a square carrot formatting ([^...^] ^...) prior to a newline at the end of each selected block of content in the current file
- make sure the script is idempotent
- a hash digest of the normalized content must be generated first, then a hash digest of the normalized content that includes the Square Carrot Block Link appended to the end of the line when generating seal-hash
Use the following obsidian functions:
- let selection = app.workspace.activeEditor.getSelection(); // Get currently selected text in Editor
- app.workspace.activeEditor.editor.replaceSelection(selection);
- console.log("{activity}") // add log lines to each major step in the templater script with a timestamp and last hash generated
definitions:
- Square Carrot Block Link: [^...^]
- Carrot Block Ref: ^...
- Square Carrot Block DS: [^...^] ^...
- English Square Carrot Link Format: [^ds/{YYYY-MM-DD}/{normal-hash}^] ^ds-{YYYY-MM-DD}-{seal-hash}
- normal-hash: a 12-character hex SHA-256 hash of normalized text block of selected content
- seal-hash: a hash digest of the normalized content that includes the Square Carrot Block Link appended to the end of the line "{normalized-content} [^ds/{YYYY-MM-DD}/{normal-hash}^]"
normalize text by:
- stripping markdown prefix
- removing ordered lists, lists, and todos
- removing Square Carrot Block Link and Carrot Block Ref just before the newline
set formats:
- dates as YYYY-MM-DD, don't forget `[0]` after splitting the now date
- hashes as 12-character hexadecimal
append to a file (of vault path "Notes/Digests/"{normal-hash}.md) the following format:
- ### {normal-hash}/{YYYY-MM-DD}
POE Premature Format Working CBS v0.6
write an Obsidian templater script that replaces a squared carrot link format ([^...^] ^...) prior to a newline at the end of each selected block of content in the current file, make sure the script is idempotent and thus only replaces the squared carrot link
Use the following obsidian functions:
- let selection = app.workspace.activeEditor.getSelection(); // Get currently selected text in Editor
- app.workspace.activeEditor.editor.replaceSelection(selection);
- console.log("{activity}") // add log lines to each major step in the templater script with a timestamp
definitions:
- Squared Carrot Link format: [^...^] ^...
- English Squared Carrot Link format: [^ds/{YYYY-MM-DD}/{normal-hash}^] ^ds-{YYYY-MM-DD}-{normal-hash}
- normal-hash: a 12-character hex SHA-256 hash of normalized text block of selected content
normalize text by:
- stripping markdown prefix
- removing ordered lists, lists, and todos
set formats:
- dates as YYYY-MM-DD, don't forget `[0]` after splitting the now date
- hashes as 12-character hexadecimal
append to a file (of vault path "Notes/Digests/"{normal-hash}.md) the following format:
- ### {normal-hash}/{YYYY-MM-DD}
WORKING Script - CBS v0.5
write an Obsidian templater script that replaces a squared carrot link ([^...^]) prior to a newline at the end of each selected block of content in the current file, make sure the script is idempotent and thus only replaces the squared carrot link
Use the following obsidian functions
- let selection = app.workspace.activeEditor.getSelection(); // Get currently selected text in Editor
- app.workspace.activeEditor.editor.replaceSelection(selection);
- console.log("{activity}") // add log lines to each major step in the templater script
definitions:
- Squared Carrot Link format: [^cbs/{YYYY-MM-DD}/{normal-hash}^]
- normal-hash: a 12-character hex SHA-256 hash of normalized text block of selected content
normalize text by:
- stripping markdown prefix
- removing squared carrot links
- removing ordered lists, lists, and todos
- removing content block references
set formats:
- dates as YYYY-MM-DD, don't forget `[0]` after splitting the now date
- hashes as 12-character hexadecimal
Prompt CBS v.04
write an Obsidian templater script that replaces a squared carrot link ([^ds/20240818/date-deal^]) at the end of each block of content in the current file, make sure the script is idempotent and thus updates or replaces the squared carrot link if it does not exist, learn from [obsidian-to-napkin/Templater Scripts/Send selected Obsidian text to Napkin.md at main · TfTHacker/obsidian-to-napkin · GitHub](https://github.com/TfTHacker/obsidian-to-napkin/blob/main/Templater%20Scripts/Send%20selected%20Obsidian%20text%20to%20Napkin.md)
definitions:
- Squared Carrot Link format: [^cbs/{YYYY-MM-DD}/{normal-hash}^]
- normal-hash: a 12-character hex SHA-256 hash of normalized text block
normalize text by:
- stripping markdown prefix
- removing squared carrot links
- removing ordered lists, lists, and todos
- removing content block references
set formats:
- dates as YYYY-MM-DD, don't forget `[0]` after splitting the now date
- hashes as 12-character hexadecimal
Prompt CBS v.03
write an Obsidian templater script that inserts or replaces a squared carrot link at the end of each selected block of content in the current file, make sure the script is idempotent and thus updates or replaces the squared carrot link if it does not exist, learn from [obsidian-to-napkin/Templater Scripts/Send selected Obsidian text to Napkin.md at main · TfTHacker/obsidian-to-napkin · GitHub](https://github.com/TfTHacker/obsidian-to-napkin/blob/main/Templater%20Scripts/Send%20selected%20Obsidian%20text%20to%20Napkin.md)
definitions:
- Squared Carrot Link format: [^cbs/{YYYY-MM-DD}/{normal-hash}^]
- normal-hash: a 12-character hex SHA-256 hash of normalized text block
normalize text by:
- stripping markdown prefix
- removing squared carrot links
- removing ordered lists, lists, and todos
- removing content block references
set formats:
- dates as YYYY-MM-DD, don't forget `[0]` after splitting the now date
- hashes as 12-character hexadecimal
Prompt CBS v0.3
write an Obsidian templater script that inserts or replaces a squared carrot link at the end of each selected block of content in the current file, make sure the script is idempotent and thus updates or replaces the squared carrot link if it does not exist.
definitions:
- Squared Carrot Link format: [^cbs/{YYYY-MM-DD}/{normal-hash}^]
- normal-hash: a 12-character hex SHA-256 hash of normalized text block
normalize text by:
- stripping markdown prefix
- removing squared carrot links
- removing ordered lists, lists, and todos
- removing content block references
set formats:
- dates as YYYY-MM-DD, don't forget `[0]` after splitting the now date
- hashes as 12-character hexadecimal
Prompt v0.2
write a idempotent templater script that SHA-256 hashes the content block with the cursor and appends a date and 8-chart SHA-256 digest in the following format [^2024-08-10-cc42bd42bd]
How to add Templatar script to Obsidian
Add
const crypto = require('crypto');
function hashContent(content) {
return crypto.createHash('sha256').update(content).digest('hex');
}
function formatDate(date) {
return date.toISOString().split('T')[0];
}
module.exports = async (tp) => {
// Get the current file content
const fileContent = await tp.file.content();
// Find the block where the cursor is
const cursorPosition = await tp.file.cursor();
const lines = fileContent.split('\n');
let blockStart = cursorPosition.line;
let blockEnd = cursorPosition.line;
// Find the start of the block
while (blockStart > 0 && lines[blockStart - 1].trim() !== '') {
blockStart--;
}
// Find the end of the block
while (blockEnd < lines.length - 1 && lines[blockEnd + 1].trim() !== '') {
blockEnd++;
}
// Extract the block content
const blockContent = lines.slice(blockStart, blockEnd + 1).join('\n');
// Generate the hash
const fullHash = hashContent(blockContent);
const shortHash = fullHash.substring(0, 8);
// Get the current date
const currentDate = formatDate(new Date());
// Create the formatted reference
const reference = `[^${currentDate}-${shortHash}]`;
// Check if the reference already exists
if (!fileContent.includes(reference)) {
// Append the reference to the end of the block
lines[blockEnd] += ' ' + reference;
// Join the lines back together
const newContent = lines.join('\n');
// Update the file content
await tp.file.set_content(newContent);
}
// Return an empty string to avoid inserting any text at the cursor position
return '';
}
Abstract.
A purely peer-to-peer version of electronic cash would allow online payments to be sent directly from one party to another without going through a financial institution. Digital signatures provide part of the solution, but the main benefits are lost if a trusted third party is still required to prevent double-spending. We propose a solution to the double-spending problem using a peer-to-peer network. The network timestamps transactions by hashing them into an ongoing chain of hash-based proof-of-work, forming a record that cannot be changed without redoing the proof-of-work. The longest chain not only serves as proof of the sequence of events witnessed, but proof that it came from the largest pool of CPU power. As long as a majority of CPU power is controlled by nodes that are not cooperating to attack the network, they’ll generate the longest chain and outpace attackers. The network itself requires minimal structure. Messages are broadcast on a best effort basis, and nodes can leave and rejoin the network at will, accepting the longest proof-of-work chain as proof of what happened while they were gone. 1
1. Introduction
Commerce on the Internet has come to rely almost exclusively on financial institutions serving as trusted third parties to process electronic payments. While the system works well enough for most transactions, it still suffers from the inherent weaknesses of the trust based model. Completely non-reversible transactions are not really possible, since financial institutions cannot avoid mediating disputes. The cost of mediation increases transaction costs, limiting the minimum practical transaction size and cutting off the possibility for small casual transactions, and there is a broader cost in the loss of ability to make non-reversible payments for nonreversible services. With the possibility of reversal, the need for trust spreads. Merchants must be wary of their customers, hassling them for more information than they would otherwise need. A certain percentage of fraud is accepted as unavoidable. These costs and payment uncertainties can be avoided in person by using physical currency, but no mechanism exists to make payments over a communications channel without a trusted party.
What is needed is an electronic payment system based on cryptographic proof instead of trust, allowing any two willing parties to transact directly with each other without the need for a trusted third party.
Transactions that are computationally impractical to reverse would protect sellers from fraud, and routine escrow mechanisms could easily be implemented to protect buyers. In this paper, we propose a solution to the double-spending problem using a peer-to-peer distributed timestamp server to generate computational proof of the chronological order of transactions. The system is secure as long as honest nodes collectively control more CPU power than any cooperating group of attacker nodes. [^ds/2024-08-19/33b05b4e5d3a^]
2. Transactions
We define an electronic coin as a chain of digital signatures. Each owner transfers the coin to the next by digitally signing a hash of the previous transaction and the public key of the next owner and adding these to the end of the coin. A payee can verify the signatures to verify the chain of ownership.
The problem of course is the payee can’t verify that one of the owners did not double-spend the coin. A common solution is to introduce a trusted central authority, or mint, that checks every transaction for double spending. After each transaction, the coin must be returned to the mint to issue a new coin, and only coins issued directly from the mint are trusted not to be double-spent. The problem with this solution is that the fate of the entire money system depends on the company running the mint, with every transaction having to go through them, just like a bank.
We need a way for the payee to know that the previous owners did not sign any earlier transactions. For our purposes, the earliest transaction is the one that counts, so we don’t care about later attempts to double-spend. The only way to confirm the absence of a transaction is to be aware of all transactions. In the mint based model, the mint was aware of all transactions and decided which arrived first. To accomplish this without a trusted party, transactions must be publicly announced 2, and we need a system for participants to agree on a single history of the order in which they were received. The payee needs proof that at the time of each transaction, the majority of nodes agreed it was the first received. [^ds/2024-08-19/3967f2ab8b78^]
3. Timestamp Server
The solution we propose begins with a timestamp server. A timestamp server works by taking a hash of a block of items to be timestamped and widely publishing the hash, such as in a newspaper or Usenet post [2-5]. The timestamp proves that the data must have existed at the time, obviously, in order to get into the hash. Each timestamp includes the previous timestamp in its hash, forming a chain, with each additional timestamp reinforcing the ones before it. [^ds/2024-08-19/4fab248cf4b3^] ^ds-2024-08-19-b434aa848617
4. Proof-of-Work
To implement a distributed timestamp server on a peer-to-peer basis, we will need to use a proofof-work system similar to Adam Back’s Hashcash 3, rather than newspaper or Usenet posts. The proof-of-work involves scanning for a value that when hashed, such as with SHA-256, the hash begins with a number of zero bits. The average work required is exponential in the number of zero bits required and can be verified by executing a single hash.
For our timestamp network, we implement the proof-of-work by incrementing a nonce in the block until a value is found that gives the block’s hash the required zero bits. Once the CPU effort has been expended to make it satisfy the proof-of-work, the block cannot be changed without redoing the work. As later blocks are chained after it, the work to change the block would include redoing all the blocks after it.
The proof-of-work also solves the problem of determining representation in majority decision making. If the majority were based on one-IP-address-one-vote, it could be subverted by anyone able to allocate many IPs. Proof-of-work is essentially one-CPU-one-vote. The majority decision is represented by the longest chain, which has the greatest proof-of-work effort invested in it. If a majority of CPU power is controlled by honest nodes, the honest chain will grow the fastest and outpace any competing chains. To modify a past block, an attacker would have to redo the proof-of-work of the block and all blocks after it and then catch up with and surpass the work of the honest nodes. We will show later that the probability of a slower attacker catching up diminishes exponentially as subsequent blocks are added.
To compensate for increasing hardware speed and varying interest in running nodes over time, the proof-of-work difficulty is determined by a moving average targeting an average number of blocks per hour. If they’re generated too fast, the difficulty increases. [^ds/2024-08-19/890619bdedb1^]
5. Network
The steps to run the network are as follows:
New transactions are broadcast to all nodes.
Each node collects new transactions into a block.
Each node works on finding a difficult proof-of-work for its block.
When a node finds a proof-of-work, it broadcasts the block to all nodes.
Nodes accept the block only if all transactions in it are valid and not already spent.
Nodes express their acceptance of the block by working on creating the next block in the chain, using the hash of the accepted block as the previous hash.
Nodes always consider the longest chain to be the correct one and will keep working on extending it. If two nodes broadcast different versions of the next block simultaneously, some nodes may receive one or the other first. In that case, they work on the first one they received, but save the other branch in case it becomes longer. The tie will be broken when the next proofof-work is found and one branch becomes longer; the nodes that were working on the other branch will then switch to the longer one.
New transaction broadcasts do not necessarily need to reach all nodes. As long as they reach many nodes, they will get into a block before long. Block broadcasts are also tolerant of dropped messages. If a node does not receive a block, it will request it when it receives the next block and realizes it missed one. [^ds/2024-08-19/b91872342ef6^]
6. Incentive
By convention, the first transaction in a block is a special transaction that starts a new coin owned by the creator of the block. This adds an incentive for nodes to support the network, and provides a way to initially distribute coins into circulation, since there is no central authority to issue them. The steady addition of a constant of amount of new coins is analogous to gold miners expending resources to add gold to circulation. In our case, it is CPU time and electricity that is expended.
The incentive can also be funded with transaction fees. If the output value of a transaction is less than its input value, the difference is a transaction fee that is added to the incentive value of the block containing the transaction. Once a predetermined number of coins have entered circulation, the incentive can transition entirely to transaction fees and be completely inflation free.
The incentive may help encourage nodes to stay honest. If a greedy attacker is able to assemble more CPU power than all the honest nodes, he would have to choose between using it to defraud people by stealing back his payments, or using it to generate new coins. He ought to find it more profitable to play by the rules, such rules that favour him with more new coins than everyone else combined, than to undermine the system and the validity of his own wealth. [^ds/2024-08-19/bd7459a0dd2c^]
7. Reclaiming Disk Space
Once the latest transaction in a coin is buried under enough blocks, the spent transactions before it can be discarded to save disk space. To facilitate this without breaking the block’s hash, transactions are hashed in a Merkle Tree 456, with only the root included in the block’s hash. Old blocks can then be compacted by stubbing off branches of the tree. The interior hashes do not need to be stored.
A block header with no transactions would be about 80 bytes. If we suppose blocks are generated every 10 minutes, 80 bytes * 6 * 24 * 365 = 4.2MB per year. With computer systems typically selling with 2GB of RAM as of 2008, and Moore’s Law predicting current growth of 1.2GB per year, storage should not be a problem even if the block headers must be kept in memory. [^ds/2024-08-19/fdc6b3befb1b^]
8. Simplified Payment Verification
It is possible to verify payments without running a full network node. A user only needs to keep a copy of the block headers of the longest proof-of-work chain, which he can get by querying network nodes until he’s convinced he has the longest chain, and obtain the Merkle branch linking the transaction to the block it’s timestamped in. He can’t check the transaction for himself, but by linking it to a place in the chain, he can see that a network node has accepted it, and blocks added after it further confirm the network has accepted it.
As such, the verification is reliable as long as honest nodes control the network, but is more vulnerable if the network is overpowered by an attacker. While network nodes can verify transactions for themselves, the simplified method can be fooled by an attacker’s fabricated transactions for as long as the attacker can continue to overpower the network. One strategy to protect against this would be to accept alerts from network nodes when they detect an invalid block, prompting the user’s software to download the full block and alerted transactions to confirm the inconsistency. Businesses that receive frequent payments will probably still want to run their own nodes for more independent security and quicker verification. [^ds/2024-08-19/f4245a13e08b^]
9. Combining and Splitting Value
Although it would be possible to handle coins individually, it would be unwieldy to make a separate transaction for every cent in a transfer. To allow value to be split and combined, transactions contain multiple inputs and outputs. Normally there will be either a single input from a larger previous transaction or multiple inputs combining smaller amounts, and at most two outputs: one for the payment, and one returning the change, if any, back to the sender.
It should be noted that fan-out, where a transaction depends on several transactions, and those transactions depend on many more, is not a problem here. There is never the need to extract a complete standalone copy of a transaction’s history. [^ds/2024-08-19/4d7242b47323^]
10. Privacy
The traditional banking model achieves a level of privacy by limiting access to information to the parties involved and the trusted third party. The necessity to announce all transactions publicly precludes this method, but privacy can still be maintained by breaking the flow of information in another place: by keeping public keys anonymous. The public can see that someone is sending an amount to someone else, but without information linking the transaction to anyone. This is similar to the level of information released by stock exchanges, where the time and size of individual trades, the “tape”, is made public, but without telling who the parties were.
As an additional firewall, a new key pair should be used for each transaction to keep them from being linked to a common owner. Some linking is still unavoidable with multi-input transactions, which necessarily reveal that their inputs were owned by the same owner. The risk is that if the owner of a key is revealed, linking could reveal other transactions that belonged to the same owner. [^ds/2024-08-19/a1d2c37895fd^]
11. Calculations
We consider the scenario of an attacker trying to generate an alternate chain faster than the honest chain. Even if this is accomplished, it does not throw the system open to arbitrary changes, such as creating value out of thin air or taking money that never belonged to the attacker. Nodes are not going to accept an invalid transaction as payment, and honest nodes will never accept a block containing them. An attacker can only try to change one of his own transactions to take back money he recently spent.
The race between the honest chain and an attacker chain can be characterized as a Binomial Random Walk. The success event is the honest chain being extended by one block, increasing its lead by +1, and the failure event is the attacker’s chain being extended by one block, reducing the gap by -1.
The probability of an attacker catching up from a given deficit is analogous to a Gambler’s Ruin problem. Suppose a gambler with unlimited credit starts at a deficit and plays potentially an infinite number of trials to try to reach breakeven. We can calculate the probability he ever reaches breakeven, or that an attacker ever catches up with the honest chain, as follows 7:
p = probability an honest node finds the next block
q = probability the attacker finds the next block
qz = probability the attacker will ever catch up from z blocks behind
qz={ 1 if p≤q q/ p z if pq}
Given our assumption that p > q, the probability drops exponentially as the number of blocks the attacker has to catch up with increases. With the odds against him, if he doesn’t make a lucky lunge forward early on, his chances become vanishingly small as he falls further behind.
We now consider how long the recipient of a new transaction needs to wait before being sufficiently certain the sender can’t change the transaction. We assume the sender is an attacker who wants to make the recipient believe he paid him for a while, then switch it to pay back to himself after some time has passed. The receiver will be alerted when that happens, but the sender hopes it will be too late.
The receiver generates a new key pair and gives the public key to the sender shortly before signing. This prevents the sender from preparing a chain of blocks ahead of time by working on it continuously until he is lucky enough to get far enough ahead, then executing the transaction at that moment. Once the transaction is sent, the dishonest sender starts working in secret on a parallel chain containing an alternate version of his transaction.
The recipient waits until the transaction has been added to a block and z blocks have been linked after it. He doesn’t know the exact amount of progress the attacker has made, but assuming the honest blocks took the average expected time per block, the attacker’s potential progress will be a Poisson distribution with expected value:
λ=zpq
To get the probability the attacker could still catch up now, we multiply the Poisson density for each amount of progress he could have made by the probability he could catch up from that point:
∑ k=0 ∞ k e − k! ⋅{ q/ p z−k if k≤z 1 if kz}
Rearranging to avoid summing the infinite tail of the distribution…
1−∑ k=0 z k e − k! 1−q/ p z−k
Converting to C code…
#include <math.h>double AttackerSuccessProbability(double q, int z){ double p = 1.0 - q; double lambda = z * (q / p); double sum = 1.0; int i, k; for (k = 0; k <= z; k++) { double poisson = exp(-lambda); for (i = 1; i <= k; i++) poisson *= lambda / i; sum -= poisson * (1 - pow(q / p, z - k)); } return sum;}
Running some results, we can see the probability drop off exponentially with z.
q=0.1
z=0 P=1.0000000
z=1 P=0.2045873
z=2 P=0.0509779
z=3 P=0.0131722
z=4 P=0.0034552
z=5 P=0.0009137
z=6 P=0.0002428
z=7 P=0.0000647
z=8 P=0.0000173
z=9 P=0.0000046
z=10 P=0.0000012
q=0.3
z=0 P=1.0000000
z=5 P=0.1773523
z=10 P=0.0416605
z=15 P=0.0101008
z=20 P=0.0024804
z=25 P=0.0006132
z=30 P=0.0001522
z=35 P=0.0000379
z=40 P=0.0000095
z=45 P=0.0000024
z=50 P=0.0000006
Solving for P less than 0.1%…
P < 0.001
q=0.10 z=5
q=0.15 z=8
q=0.20 z=11
q=0.25 z=15
q=0.30 z=24
q=0.35 z=41
q=0.40 z=89
q=0.45 z=340 [^ds/2024-08-19/348971e798f5^]
12. Conclusion
We have proposed a system for electronic transactions without relying on trust. We started with the usual framework of coins made from digital signatures, which provides strong control of ownership, but is incomplete without a way to prevent double-spending. To solve this, we proposed a peer-to-peer network using proof-of-work to record a public history of transactions that quickly becomes computationally impractical for an attacker to change if honest nodes control a majority of CPU power. The network is robust in its unstructured simplicity. Nodes work all at once with little coordination. They do not need to be identified, since messages are not routed to any particular place and only need to be delivered on a best effort basis. Nodes can leave and rejoin the network at will, accepting the proof-of-work chain as proof of what happened while they were gone. They vote with their CPU power, expressing their acceptance of valid blocks by working on extending them and rejecting invalid blocks by refusing to work on them. Any needed rules and incentives can be enforced with this consensus mechanism. [^ds/2024-08-19/784b7820d873^]
R.C. Merkle, “Protocols for public key cryptosystems,” In Proc. 1980 Symposium on Security and Privacy, IEEE Computer Society, pages 122-133, April 1980. ↩
H. Massias, X.S. Avila, and J.-J. Quisquater, “Design of a secure timestamping service with minimal trust requirements,” In 20th Symposium on Information Theory in the Benelux, May 1999. ↩
S. Haber, W.S. Stornetta, “Secure names for bit-strings,” In Proceedings of the 4th ACM Conference on Computer and Communications Security, pages 28-35, April 1997. ↩
W. Feller, “An introduction to probability theory and its applications,” 1957. 9 [^ds/2024-08-19/eb4e05853840^] ↩
Non-exclusive, Worldwide, Non-transferable, Fair-share Royalty License: Users are granted a license to use, reproduce, distribute, copy, create derivative works of, and make modifications to the this hashed record format in accordance with the Interpersonal Cross License Agreement 2.0. [^ds/2024-08-19/d1e227d5687a^]