Persistence Design (Not Yet Implemented)

This document outlines the proposed design for persisting the Mina ledger and other critical state to disk, reducing memory usage and enabling faster node restarts.

Status: Not yet implemented - this is a design proposal only.

Critical for Mainnet: This is one of the most important changes required to make the webnode mainnet-ready.

Overview

Currently, the Mina Rust node keeps the entire ledger in memory, which creates scalability issues for mainnet deployment where the ledger can be large. A persistent storage solution is needed to:

Reduce memory usage for both server-side nodes and webnodes
Enable faster node restarts by avoiding full ledger reconstruction
Deduplicate SNARK verification work across blocks and pools
Support partial ledger storage for light clients

Design Reference

A draft design for the persistence database is outlined in Issue #522, which proposes an approach for efficiently storing, updating, and retrieving accounts and hashes.

Note: There is a very old implementation for on-disk storage in ledger/src/ondisk/ that was never used - a lightweight key-value store implemented to avoid the RocksDB dependency.

Database Design Resources: For those implementing persistence, "Database Internals" and "Designing Data-Intensive Applications" are excellent books on database design and implementation.

Key Design Principles

Based on Issue #522, the persistence design follows these principles:

Simplicity First: The design prioritizes simplicity over optimal performance
Fixed-Size Storage: Most data (except zkApp accounts) uses fixed-size slots for predictable access patterns
Sequential Account Creation: Mina creates accounts sequentially, filling leaves from left to right in the Merkle tree, enabling an append-only design
Selective Persistence: Only epoch ledgers and the root ledger need persistence; masks can remain in-memory
Infrequent Updates: Root ledger updates occur only when the transition frontier root moves
Hashes in Memory: All Merkle tree hashes remain in RAM for quick access
Recoverable: Data corruption is not catastrophic as ledgers can be reconstructed from the network

Problems to be Solved

Memory Usage

Current State: The entire ledger is kept in memory, which can be substantial on mainnet:

Account data includes balances, nonces, zkApp state
Merkle tree structure for cryptographic proofs
Multiple ledger versions for different blockchain heights

Solution: Move account data to persistent storage while keeping frequently accessed data (like Merkle tree hashes) in memory.

Startup Time

Current State: Nodes must reconstruct the full ledger from genesis or sync from peers, which is time-consuming.

Solution: Persist confirmed ledger states to enable fast startup by loading from disk rather than network reconstruction.

SNARK Verification Deduplication

Current State: The same SNARK work may be verified multiple times across different blocks and transaction pools.

Solution: Cache verification results persistently to avoid redundant computation.

Proposed Architecture

Storage Layers

1. Root Ledger Storage

Purpose: Store the confirmed ledger at the root of the transition frontier
Update frequency: Only when transition frontier advances
Data: Account balances, nonces, zkApp state
Access pattern: Random reads, infrequent writes

2. Epoch Ledger Storage

Purpose: Store ledger snapshots for staking epoch calculations
Update frequency: Once per epoch
Data: Complete ledger state at epoch boundaries
Access pattern: Sequential reads during epoch transitions

3. Verification Cache

Purpose: Store SNARK verification results
Update frequency: High during block processing
Data: Verification status keyed by work specification
Access pattern: High read/write frequency

Data Structures

Account Storage Format

struct PersistedAccount {
    public_key: PublicKey,          // 32 bytes
    balance: u64,                   // 8 bytes
    nonce: u32,                     // 4 bytes
    delegate: Option<PublicKey>,    // 33 bytes (1 + 32)
    voting_for: StateHash,          // 32 bytes
    zkapp_state: Option<ZkAppState>, // Variable size
    // ... other fields
}

Index Structure

Account Index: Maps public keys to storage locations
Merkle Index: Maps tree positions to account locations
Height Index: Maps blockchain heights to ledger versions

Memory vs Disk Trade-offs

Keep in Memory

Merkle Tree Hashes: Fast cryptographic proof generation
Recent Transactions: Active processing requirements
Connection State: Network and consensus data
Indices: Fast lookup structures

Move to Disk

Account Data: Large, infrequently accessed in bulk
Historical Ledgers: Epoch snapshots and old states
Verification Cache: Large datasets with locality

Implementation Strategy

Phase 1: Foundation

Storage Interface: Define abstract storage traits
Account Serialization: Implement efficient encoding/decoding
Index Management: Create lookup structures
Testing Framework: Comprehensive test suite

Phase 2: Basic Persistence

Root Ledger Storage: Implement basic account persistence
Startup Recovery: Load ledger from disk on startup
Incremental Updates: Efficient account modifications
Corruption Recovery: Handle storage failures gracefully

Phase 3: Advanced Features

Epoch Ledgers: Historical snapshot storage
Verification Cache: SNARK result persistence
Compaction: Optimize storage usage over time
Partial Loading: Support for light client scenarios

Phase 4: Optimization

Performance Tuning: Optimize for real-world usage patterns
Memory Management: Fine-tune memory vs disk balance
Concurrent Access: Support multiple readers/writers
Monitoring: Add persistence-related metrics

Technical Considerations

Storage Backend Options

File-Based Storage

Pros: Simple, no external dependencies, full control Cons: Must implement indexing, compression, concurrent access

Embedded Database (e.g., RocksDB)

Pros: Battle-tested, efficient indexing, concurrent access Cons: Additional dependency, larger binary size

Custom Key-Value Store

Pros: Optimized for Mina's specific needs, lightweight Cons: More development effort, needs thorough testing

Consistency Guarantees

Atomic Updates: Ensure ledger state changes are atomic
Crash Recovery: Handle interruptions during writes
Checksum Validation: Detect storage corruption
Version Management: Track ledger version compatibility

Performance Requirements

Read Latency: Account lookups must remain fast
Write Throughput: Handle block processing rates
Memory Usage: Significant reduction from current levels
Startup Time: Faster than network reconstruction

Migration Strategy

Development Phase

Parallel Implementation: Build alongside current in-memory system
Feature Flags: Enable persistence selectively
Testing: Extensive testing with mainnet data
Benchmarking: Performance comparison with current system

Deployment Phase

Opt-in: Initially optional for testing
Gradual Rollout: Enable for specific node types
Full Migration: Make persistence default
Legacy Support: Maintain fallback to in-memory mode

Success Metrics

Memory Usage

Target: 50-80% reduction in memory usage
Measurement: RSS and heap size monitoring
Threshold: Must support mainnet ledger sizes

Performance

Startup Time: <5 minutes for full ledger load
Query Latency: <1ms for account lookups
Block Processing: No degradation in processing speed

Reliability

Data Integrity: Zero data loss during normal operation
Crash Recovery: <30 seconds to restore consistent state
Storage Corruption: Graceful degradation and recovery

Risks and Mitigation

Technical Risks

Performance Degradation: Mitigate with extensive benchmarking
Data Corruption: Implement checksums and validation
Storage Space: Monitor and optimize storage usage

Operational Risks

Migration Complexity: Provide clear upgrade paths
Backup Requirements: Document backup and recovery procedures
Monitoring Needs: Add persistence-specific observability

Existing Implementations

OCaml Node: Uses RocksDB for ledger persistence
Other Blockchains: Study approaches from Ethereum, Bitcoin
Database Systems: Learn from established database designs

Design References

Issue #522: Original persistence design proposal
Ledger Implementation: Current in-memory ledger code
Database Internals: Database design principles
DDIA: Data-intensive application patterns

Conclusion

Implementing persistent storage is critical for the Mina Rust node's mainnet readiness. The proposed design balances simplicity with performance, enabling significant memory usage reduction while maintaining the fast query performance required for blockchain operations.

The phased implementation approach allows for careful validation and optimization, ensuring that persistence improves rather than degrades node performance. Success in this area will enable the Mina Rust node to scale to mainnet requirements and support a broader range of deployment scenarios.

Overview​

Design Reference​

Key Design Principles​

Problems to be Solved​

Memory Usage​

Startup Time​

SNARK Verification Deduplication​

Proposed Architecture​

Storage Layers​

1. Root Ledger Storage​

2. Epoch Ledger Storage​

3. Verification Cache​

Data Structures​

Account Storage Format​

Index Structure​

Memory vs Disk Trade-offs​

Keep in Memory​

Move to Disk​

Implementation Strategy​

Phase 1: Foundation​

Phase 2: Basic Persistence​

Phase 3: Advanced Features​

Phase 4: Optimization​

Technical Considerations​

Storage Backend Options​

File-Based Storage​

Embedded Database (e.g., RocksDB)​

Custom Key-Value Store​

Consistency Guarantees​

Performance Requirements​

Migration Strategy​

Development Phase​

Deployment Phase​

Success Metrics​

Memory Usage​

Performance​

Reliability​

Risks and Mitigation​

Technical Risks​

Operational Risks​

Related Work​

Existing Implementations​

Design References​

Conclusion​

Overview

Design Reference

Key Design Principles

Problems to be Solved

Memory Usage

Startup Time

SNARK Verification Deduplication

Proposed Architecture

Storage Layers

1. Root Ledger Storage

2. Epoch Ledger Storage

3. Verification Cache

Data Structures

Account Storage Format

Index Structure

Memory vs Disk Trade-offs

Keep in Memory

Move to Disk

Implementation Strategy

Phase 1: Foundation

Phase 2: Basic Persistence

Phase 3: Advanced Features

Phase 4: Optimization

Technical Considerations

Storage Backend Options

File-Based Storage

Embedded Database (e.g., RocksDB)

Custom Key-Value Store

Consistency Guarantees

Performance Requirements

Migration Strategy

Development Phase

Deployment Phase

Success Metrics

Memory Usage

Performance

Reliability

Risks and Mitigation

Technical Risks

Operational Risks

Related Work

Existing Implementations

Design References

Conclusion