Day 02 Core Concepts

Write-Ahead Logging

WAL purpose, LSN, redo/undo logs, checkpoint, crash recovery sequence

~1 hour Hands-on Precision AI Academy

Today’s Objective

WAL purpose, LSN, redo/undo logs, checkpoint, crash recovery sequence

Day 2 of Database Internals in 5 Days builds directly on Day 1. You're moving from theory into applied practice. The concepts today require the foundation from yesterday, so if anything felt unclear, review it now.

Topics today: WAL, LSN, checkpoint. Each section has code you can copy and run immediately.

WAL

Understanding WAL is the core goal of Day 2. The concept is straightforward once you see it in practice — most confusion comes from skipping the mental model and jumping straight to implementation. Start with the model, then write the code.

WAL
WAL
# WAL — Working Example
# Study this pattern carefully before writing your own version

class WALExample: """ Demonstrates core WAL concepts. Replace placeholder values with your real implementation. """ def __init__(self, config: dict): self.config = config self._validate() def _validate(self): required = ['name', 'type'] for field in required: if field not in self.config: raise ValueError(f"Missing required field: {field}") def process(self) -> dict: # Core logic goes here result = { 'status': 'success', 'topic': 'WAL', 'data': self.config } return result

# Usage
example = WALExample({ 'name': 'my-implementation', 'type': 'wal'
})
output = example.process()
print(output)
Key insight: When working with WAL, always start with the simplest possible case that works end-to-end. Complexity is easier to add than simplicity is to recover.

LSN

LSN is the practical application of WAL in real projects. Once you understand the underlying model, LSN becomes the natural next step.

Pro tip: When working with LSN, always read the official documentation for the exact version you're using. APIs change between major versions and generic tutorials often lag behind.

checkpoint

checkpoint rounds out today's lesson. It connects WAL and LSN into a complete picture. You'll use all three concepts together in the exercise below.

Common Mistakes on Day 2

📝 Day 2 Exercise Write-Ahead Logging — Hands-On
  1. Set up your environment for today's topic: install required tools and verify the basics work before writing any logic.
  2. Implement a minimal working version of WAL using the code example in this lesson as your starting point.
  3. Extend your implementation to incorporate LSN — this is where the two concepts connect.
  4. Test your implementation with both valid and invalid inputs. What happens at the boundaries?
  5. Review your code: is there anything you'd name differently? Any function doing more than one thing? Refactor one thing.

Supporting Resources

Go deeper with these references.

Databass.dev
Database Internals by Alex Petrov The definitive book on storage engines, B-trees, and distributed systems fundamentals.
CMU
CMU Database Systems Course World-class free university course on database internals with lecture videos.
GitHub
rqlite — distributed SQLite Production distributed database built on SQLite — excellent architecture reference.

Day 2 Checkpoint

Before moving on, make sure you can answer these without looking:

Continue To Day 3
MVCC & Transactions