Data Migration Strategies

23 min read

When you rewrite a Node.js service in Rust, the code is only half the job. The other half is the data: a live database holding millions of rows that customers depend on right now. This page covers the three strategies that let you move from a Node-owned database to a Rust-owned one without downtime, lost writes, or a terrifying big-bang cutover.

Quick Overview

A rewrite rarely fails on the new code — it fails on the data underneath it. You cannot stop the world, dump a database, and re-import it into a Rust service while customers are writing to it. Instead you use one of three patterns, often in sequence: shared database (Rust and Node read/write the same tables), dual-write (writes go to both an old and a new store), and backfill (a batch job copies historical rows into the new store). This page shows how each looks from a TypeScript/JavaScript developer’s perspective and how to express the critical bits idiomatically in Rust, with the type-safety and error-handling guarantees that make Rust a good fit for migration tooling.

Note: This page is about moving data during a rewrite. For moving traffic incrementally see incremental.md; for keeping HTTP responses byte-compatible see api-compatibility.md; for the end-to-end service port see node-to-rust.md. For the general database APIs (sqlx, Diesel, connection pooling, schema migrations) used in the snippets here, see Section 17, starting with ../17-database/02_sqlx-transactions.md and ../17-database/09_migrations.md.

TypeScript/JavaScript Example

A typical Node service owns its database. Writes flow straight through an ORM or query builder, and the schema is whatever the Node code has accreted over the years. Here is a dual-write helper written the way a Node team usually first reaches for it — the legacy store is the source of truth, and a new store is shadowed alongside it so the request never fails just because the new store hiccups.

1
// dual-write.ts — Node 22, run with `node --experimental-strip-types`
2
class WriteError extends Error {}
3

4
const legacy = new Map<number, string>();
5
const next = new Map<number, string>();
6
let failNext = false;
7

8
async function legacyUpsert(id: number, email: string): Promise<void> {
9
  legacy.set(id, email);
10
}
11

12
async function newUpsert(id: number, email: string): Promise<void> {
13
  if (failNext) throw new WriteError("new store unavailable");
14
  next.set(id, email);
15
}
16

17
// Source of truth = legacy. The new store is best-effort.
18
async function dualWrite(id: number, email: string): Promise<void> {
19
  await legacyUpsert(id, email); // hard requirement
20
  try {
21
    await newUpsert(id, email); // shadow write, best-effort
22
  } catch (e) {
23
    const msg = e instanceof Error ? e.message : String(e);
24
    console.error(`shadow write failed for id=${id}: ${msg} (request still succeeds)`);
25
  }
26
}
27

28
await dualWrite(1, "ada@example.com");
29
console.log(`after write 1: legacy=${legacy.size} new=${next.size}`);
30

31
failNext = true; // simulate the new store going down
32
await dualWrite(2, "alan@example.com");
33
console.log(`after write 2: legacy=${legacy.size} new=${next.size}`);

Running it under Node v22 prints:

1
after write 1: legacy=1 new=1
2
shadow write failed for id=2: new store unavailable (request still succeeds)
3
after write 2: legacy=2 new=1

This works, but notice what TypeScript does not protect you from: nothing forces legacyUpsert and newUpsert to agree on the shape of a row, nothing catches a number that has silently lost integer precision, and the e instanceof Error dance is needed because a thrown value in JavaScript can be anything. These are exactly the gaps Rust closes.

Rust Equivalent

The same dual-write control flow in Rust. The legacy store returns Result, so its failure propagates with ?; the new (shadow) store’s failure is deliberately swallowed and logged, so it can never turn into a request failure.

1
// Dual-write pattern, distilled to the essential control flow.
2
// The OLD store is the source of truth; the NEW store is shadowed.
3
// A failure writing the new store must NEVER fail the request.
4

5
use std::collections::HashMap;
6

7
#[derive(Debug)]
8
struct WriteError(String);
9

10
#[derive(Default)]
11
struct LegacyStore {
12
    rows: HashMap<i64, String>,
13
}
14
impl LegacyStore {
15
    fn upsert(&mut self, id: i64, email: &str) -> Result<(), WriteError> {
16
        self.rows.insert(id, email.to_string());
17
        Ok(())
18
    }
19
}
20

21
#[derive(Default)]
22
struct NewStore {
23
    rows: HashMap<i64, String>,
24
    fail_next: bool,
25
}
26
impl NewStore {
27
    fn upsert(&mut self, id: i64, email: &str) -> Result<(), WriteError> {
28
        if self.fail_next {
29
            return Err(WriteError("new store unavailable".into()));
30
        }
31
        self.rows.insert(id, email.to_string());
32
        Ok(())
33
    }
34
}
35

36
/// Returns Ok as long as the SOURCE OF TRUTH (legacy) succeeded.
37
/// The shadow write is best-effort: log on failure, alert on drift, but don't 500.
38
fn dual_write(
39
    legacy: &mut LegacyStore,
40
    new: &mut NewStore,
41
    id: i64,
42
    email: &str,
43
) -> Result<(), WriteError> {
44
    legacy.upsert(id, email)?; // hard requirement
45

46
    if let Err(e) = new.upsert(id, email) {
47
        // In production: increment a `dual_write_shadow_failures` counter + structured log.
48
        eprintln!("shadow write failed for id={id}: {e:?} (request still succeeds)");
49
    }
50
    Ok(())
51
}
52

53
fn main() {
54
    let mut legacy = LegacyStore::default();
55
    let mut new = NewStore::default();
56

57
    dual_write(&mut legacy, &mut new, 1, "ada@example.com").unwrap();
58
    println!("after write 1: legacy={} new={}", legacy.rows.len(), new.rows.len());
59

60
    // New store hiccups — request must still succeed.
61
    new.fail_next = true;
62
    dual_write(&mut legacy, &mut new, 2, "alan@example.com").unwrap();
63
    println!("after write 2: legacy={} new={}", legacy.rows.len(), new.rows.len());
64
}

Real output:

1
after write 1: legacy=1 new=1
2
shadow write failed for id=2: WriteError("new store unavailable") (request still succeeds)
3
after write 2: legacy=2 new=1

The behavior is identical to the Node version by design — during a migration, identical behavior is the goal. The difference is that the legacy write’s Result cannot be accidentally ignored (an unused Result is a compiler warning), and the only failure path that can reach the caller is the legacy store’s, which is exactly the contract we want.

Note: These snippets use HashMap as a stand-in for a real database so they compile and run with no external services. In production each upsert would be a sqlx::query! against PostgreSQL inside a transaction — see ../17-database/02_sqlx-transactions.md.

Detailed Explanation

There are three building blocks. You usually apply them in this order over weeks or months.

1. Shared database

The simplest start: the Rust service connects to the same database the Node service already owns. No data moves. Both services read and write the same tables. This is the lowest-risk way to ship the first Rust endpoint, because the data layer does not change at all — only which process answers a given request.

The catch: the Rust types must faithfully mirror what Node persists. Node wrote the schema, so Rust is a guest in that schema and must not break it. The discipline is to make every Rust schema change additive and backward-compatible — add nullable columns, never rename or retype a column Node still reads.

1
use serde::{Deserialize, Serialize};
2

3
// A row as it exists in the LEGACY (Node-written) schema and the NEW (Rust) schema.
4
// During a shared-DB migration both services read/write the SAME table, so the
5
// Rust types must be a faithful mirror of what Node persists.
6

7
#[derive(Debug, Serialize, Deserialize, PartialEq)]
8
struct User {
9
    id: i64,
10
    email: String,
11
    // Node stored this as a Unix-millis number; we mirror it exactly.
12
    #[serde(rename = "createdAt")]
13
    created_at: i64,
14
    // Added by the Rust service. Old rows won't have it, so default it.
15
    #[serde(default)]
16
    verified: bool,
17
}
18

19
fn main() {
20
    // Simulate a row that the legacy Node service wrote (no `verified` column yet).
21
    let legacy_json = r#"{"id":1,"email":"ada@example.com","createdAt":1700000000000}"#;
22
    let user: User = serde_json::from_str(legacy_json).expect("parse legacy row");
23
    assert_eq!(user.verified, false); // defaulted, not crashed
24
    println!("parsed legacy: {user:?}");
25

26
    // What Rust now writes back — additive, so Node keeps working.
27
    let json = serde_json::to_string(&user).unwrap();
28
    println!("rust writes:   {json}");
29
}

This needs serde (cargo add serde --features derive and cargo add serde_json). Real output:

1
parsed legacy: User { id: 1, email: "ada@example.com", created_at: 1700000000000, verified: false }
2
rust writes:   {"id":1,"email":"ada@example.com","createdAt":1700000000000,"verified":false}

Three things to notice:

#[serde(rename = "createdAt")] maps Node’s camelCase field to Rust’s snake_case without changing the wire/column name.
created_at: i64 mirrors Node’s millis-as-number. In JavaScript that value is an IEEE-754 f64; as long as it stays under 2^53 it round-trips exactly, but if it is a large 64-bit identifier you must treat precision carefully (see Pitfalls below).
#[serde(default)] on verified means old rows that predate the column deserialize cleanly to false instead of erroring. That is what makes the change backward-compatible.

2. Dual-write

Once the Rust service has its own schema (or its own database), you can no longer share one table. The transition pattern is to write to both stores for a period: the legacy store stays the source of truth, the new store is shadowed. The dual_write function shown earlier is the heart of it. The rules:

The legacy (source-of-truth) write must succeed or the request fails.
The new-store write is best-effort: a failure is logged and counted, never propagated.
You alert on the shadow-failure rate. A steady stream of shadow failures means the two stores are diverging, and you must not cut over until that is near zero.

Dual-write only covers rows written after you turned it on. Everything older still lives only in the legacy store. That is what backfill is for.

3. Backfill

A batch job copies historical rows from the legacy store into the new one. A correct backfill is chunked, resumable, and idempotent:

Chunked — process a page at a time (here, keyset pagination on the primary key) so you never load the whole table into memory or hold a giant transaction.
Resumable — persist a checkpoint (the last processed id) after each chunk commits, so a crash resumes instead of restarting.
Idempotent — each row is written with an UPSERT, so re-running over already-migrated rows is a safe no-op. This matters because backfill and live dual-write run concurrently, and you will re-run after fixing bugs.

1
// Idempotent, chunked, resumable backfill — the shape that matters in production.
2
// Key ideas: process by ascending primary key, remember the last id (checkpoint),
3
// and make each row's transform a no-op if already done.
4

5
#[derive(Debug, Clone)]
6
struct LegacyUser {
7
    id: i64,
8
    email: String,
9
    // legacy stored display name as "First Last" in one column
10
    full_name: String,
11
}
12

13
#[derive(Debug, PartialEq)]
14
struct NewUser {
15
    id: i64,
16
    email: String,
17
    first_name: String,
18
    last_name: String,
19
}
20

21
/// Pure transform: trivially testable, no I/O. Splits the legacy name.
22
fn transform(row: &LegacyUser) -> NewUser {
23
    let (first, last) = row.full_name.split_once(' ').unwrap_or((&row.full_name, ""));
24
    NewUser {
25
        id: row.id,
26
        email: row.email.clone(),
27
        first_name: first.to_string(),
28
        last_name: last.to_string(),
29
    }
30
}
31

32
/// Fetch one page of rows whose id > `after`, ordered by id (keyset pagination).
33
fn fetch_chunk(all: &[LegacyUser], after: i64, limit: usize) -> Vec<LegacyUser> {
34
    all.iter().filter(|u| u.id > after).take(limit).cloned().collect()
35
}
36

37
fn main() {
38
    let legacy = vec![
39
        LegacyUser { id: 1, email: "ada@x.io".into(), full_name: "Ada Lovelace".into() },
40
        LegacyUser { id: 2, email: "alan@x.io".into(), full_name: "Alan Turing".into() },
41
        LegacyUser { id: 3, email: "grace@x.io".into(), full_name: "Grace Hopper".into() },
42
    ];
43

44
    let mut checkpoint: i64 = 0; // resume from here if the job crashes
45
    let mut migrated = 0usize;
46
    const CHUNK: usize = 2;
47

48
    loop {
49
        let chunk = fetch_chunk(&legacy, checkpoint, CHUNK);
50
        if chunk.is_empty() {
51
            break;
52
        }
53
        for row in &chunk {
54
            let new = transform(row);
55
            // UPSERT into the new table here; idempotent so re-runs are safe.
56
            println!("backfilled id={} -> {} / {}", new.id, new.first_name, new.last_name);
57
            migrated += 1;
58
            checkpoint = row.id; // persist this after each chunk commits
59
        }
60
    }
61
    println!("done: {migrated} rows, checkpoint={checkpoint}");
62

63
    // Sanity: the transform is a pure function we can unit-test.
64
    assert_eq!(
65
        transform(&legacy[0]),
66
        NewUser { id: 1, email: "ada@x.io".into(), first_name: "Ada".into(), last_name: "Lovelace".into() }
67
    );
68
}

Real output:

1
backfilled id=1 -> Ada / Lovelace
2
backfilled id=2 -> Alan / Turing
3
backfilled id=3 -> Grace / Hopper
4
done: 3 rows, checkpoint=3

The single most important design choice here is separating the pure transform from the I/O (fetch_chunk and the UPSERT). The transform has no database, no async, no error handling — so it is trivially unit-testable, and the trickiest part of any migration (the field-by-field mapping) gets full test coverage without a database. This is the same separation Rust pushes you toward everywhere; see ../22-common-patterns/README.md.

Tip: Use keyset pagination (WHERE id > $checkpoint ORDER BY id LIMIT n), never OFFSET. With OFFSET the database still scans and discards all skipped rows, so a backfill gets quadratically slower as it progresses. Keyset pagination stays flat.

Key Differences

Concern	TypeScript / Node	Rust
Row shape vs. DB schema	ORM types are often hand-maintained; drift is silent	`sqlx::query!` checks queries against the live schema at compile time
Large 64-bit ids	`number` is f64 — loses precision past 2^53	native `i64` / `u64`, exact
Ignoring a failed write	easy — a forgotten `await` or unhandled rejection	unused `Result` warns; `?` makes propagation explicit
Transform testability	possible, but I/O often tangled in	borrow checker pushes you to isolate the pure transform
Migration job crashing	unhandled rejection may abort silently	`Result` + `panic = "abort"` make failure loud and deliberate
Concurrency in backfill	event loop; CPU-bound transforms block it	`rayon` / async tasks parallelize the transform safely

The deepest difference is when you find out the new store’s schema disagrees with reality. In Node, a column rename usually surfaces as a runtime error (or worse, a silently undefined field) in production. With sqlx’s compile-time-checked queries, the same mistake fails cargo build on your laptop. During a migration — when two schemas are evolving in parallel — that shift from runtime to compile time is worth a great deal.

Note: Rust is not multi-threaded by default; concurrency is opt-in and the compiler enforces data-race freedom (“fearless concurrency”). For a CPU-heavy backfill transform, rayon’s par_iter parallelizes across cores safely, which a single Node event loop cannot do without worker threads.

Common Pitfalls

Pitfall 1: mirroring a numeric column as `String`

A JavaScript developer used to everything being loosely typed may declare a timestamp field as String because “it’s just data.” But Node wrote createdAt as a JSON number. Serde is strict and will refuse it at deserialize time:

1
use serde::Deserialize;
2

3
#[derive(Debug, Deserialize)]
4
struct User {
5
    id: i64,
6
    email: String,
7
    created_at: String, // BUG: Node wrote a numeric millis timestamp, not a string
8
}
9

10
fn main() {
11
    let legacy = r#"{"id":1,"email":"ada@x.io","created_at":1700000000000}"#;
12
    let user: User = serde_json::from_str(legacy).unwrap(); // panics here
13
    println!("{user:?}");
14
}

Running it produces this real runtime error:

1
thread 'main' panicked at src/main.rs:12:51:
2
called `Result::unwrap()` on an `Err` value: Error("invalid type: integer `1700000000000`, expected a string", line: 1, column: 53)

The fix is to type the field as i64 to match what Node actually stored. Unlike TypeScript — where a wrong type annotation is erased at runtime and the mismatch slips through — serde checks the value against the declared type and tells you exactly which field is wrong.

Pitfall 2: defaulting an auto-increment id to `i32`

JavaScript has one number type, so there is no “pick the integer width” decision. In Rust there is, and reaching for i32 (a common default in many languages) silently caps your ids at about 2.1 billion. A mature table can exceed that. This one is caught at compile time:

1
fn main() {
2
    // Node stored an auto-increment id that has grown past 2^31.
3
    // Mirroring it as i32 (the "default int" reflex) overflows.
4
    let id: i32 = 3_000_000_000; // does not compile
5
    println!("{id}");
6
}

The real rustc error:

1
error: literal out of range for `i32`
2
 --> src/main.rs:4:19
3
  |
4
4 |     let id: i32 = 3_000_000_000;
5
  |                   ^^^^^^^^^^^^^
6
  |
7
  = note: the literal `3_000_000_000` does not fit into the type `i32` whose range is `-2147483648..=2147483647`
8
  = help: consider using the type `u32` instead
9
  = note: `#[deny(overflowing_literals)]` on by default

Mirror PostgreSQL BIGSERIAL/BIGINT ids as i64. Note also that if such an id had passed through a Node service as a JavaScript number, any value above 2^53 would already have lost precision in transit — a real cross-language migration hazard, not a wrapping bug.

Pitfall 3: treating the shadow write as load-bearing

The whole point of dual-write is that the new store is not yet trusted. If you propagate its errors (new.upsert(...)? instead of logging), a flaky new store now causes customer-facing 500s — you have made the migration reduce reliability. Keep the shadow write best-effort and gate the cutover on the measured drift rate instead.

Pitfall 4: forgetting backfill and dual-write race on the same row

Backfill reads an old row, transforms it, and writes it. Meanwhile a live dual-write may have already written a newer version of that same row. If backfill blindly overwrites, it clobbers fresh data with stale data. Defend against this by making the new-store write a conditional UPSERT (ON CONFLICT ... DO UPDATE WHERE excluded.updated_at > target.updated_at) so newer data always wins. This is impossible to test by eyeballing it; make the transform pure and unit-test the conflict resolution.

Best Practices

Make schema changes additive during shared-DB. Add nullable columns; never rename or retype a column the other service still reads. Use #[serde(default)] so old rows deserialize cleanly.
Mirror integer widths exactly. PostgreSQL BIGINT → i64, INT → i32, and check whether a column is signed before choosing u*. When in doubt, prefer i64.
Separate the pure transform from I/O so the row-mapping logic gets full unit-test coverage without a database, as in the backfill example.
Use keyset pagination and a persisted checkpoint for backfills. Resumability is not optional at scale.
Make every write idempotent (UPSERT). Backfill will be re-run, and it overlaps live dual-write.
Measure drift continuously with a reconciliation job (next section) and alert on it. Cut over only when drift is near zero.
Let sqlx’s compile-time query checks guard the new schema. Run cargo sqlx prepare in CI so a schema/query mismatch fails the build, not production. See ../17-database/00_sqlx-intro.md.
Wrap multi-table writes in a transaction so a partial dual-write to the new store cannot leave it half-updated. See ../17-database/02_sqlx-transactions.md.
Keep a rollback path. Until the legacy store is retired, you can always fall back to it. Do not delete legacy data until the new store has been the source of truth, verified, for a full business cycle.

Real-World Example

Before flipping the read path to the new store, you run a reconciliation pass: sample (or fully scan) both stores, fingerprint each row’s canonical form, and report rows that are missing or divergent. The cutover is gated on this report being clean — you cut over on evidence, not optimism.

Fingerprinting via a hash lets you compare rows cheaply and detect drift even when the two stores format a value differently (trailing whitespace, casing). This uses sha2 (cargo add sha2):

1
// A reconciliation pass run AFTER backfill, BEFORE flipping the read path.
2
// It scans both stores and reports drift, so you gate the cutover on real
3
// evidence instead of hope.
4

5
use sha2::{Digest, Sha256};
6
use std::collections::BTreeMap;
7

8
#[derive(Clone)]
9
struct Row {
10
    id: i64,
11
    email: String,
12
    verified: bool,
13
}
14

15
/// Hash the CANONICAL form so the same logical row hashes identically across
16
/// stores even if they format the value differently (case, whitespace).
17
fn fingerprint(r: &Row) -> u64 {
18
    let canonical = format!("{}|{}|{}", r.id, r.email.trim().to_lowercase(), r.verified);
19
    let digest = Sha256::digest(canonical.as_bytes());
20
    u64::from_be_bytes(digest[..8].try_into().unwrap())
21
}
22

23
struct ReconcileReport {
24
    checked: usize,
25
    missing_in_new: Vec<i64>,
26
    mismatched: Vec<i64>,
27
}
28

29
fn reconcile(legacy: &BTreeMap<i64, Row>, new: &BTreeMap<i64, Row>) -> ReconcileReport {
30
    let mut report = ReconcileReport { checked: 0, missing_in_new: vec![], mismatched: vec![] };
31
    for (id, lrow) in legacy {
32
        report.checked += 1;
33
        match new.get(id) {
34
            None => report.missing_in_new.push(*id),
35
            Some(nrow) if fingerprint(lrow) != fingerprint(nrow) => report.mismatched.push(*id),
36
            Some(_) => {}
37
        }
38
    }
39
    report
40
}
41

42
fn main() {
43
    let mut legacy = BTreeMap::new();
44
    let mut new = BTreeMap::new();
45
    for (id, email, v) in [(1, "ada@x.io", true), (2, "alan@x.io", false), (3, "grace@x.io", true)] {
46
        legacy.insert(id, Row { id, email: email.into(), verified: v });
47
    }
48
    // New store is missing id=3 and has stale `verified` for id=2.
49
    new.insert(1, Row { id: 1, email: "Ada@x.io ".into(), verified: true }); // formatting differs, same logical value
50
    new.insert(2, Row { id: 2, email: "alan@x.io".into(), verified: true });  // drifted
51

52
    let report = reconcile(&legacy, &new);
53
    println!("checked: {}", report.checked);
54
    println!("missing in new: {:?}", report.missing_in_new);
55
    println!("mismatched:     {:?}", report.mismatched);
56
    let healthy = report.missing_in_new.is_empty() && report.mismatched.is_empty();
57
    println!("safe to cut over? {healthy}");
58
}

Real output:

1
checked: 3
2
missing in new: [3]
3
mismatched:     [2]
4
safe to cut over? false

Row 1 differs only in casing and whitespace, and the canonical fingerprint correctly treats it as in-sync. Row 2 has genuinely drifted (the new store has stale verified), and row 3 was never backfilled — both are flagged, and the gate refuses the cutover. In production this job runs on a schedule, exports the mismatched ids for repair, and emits a metric so the cutover decision is data-driven. For honestly measuring the performance payoff once you have cut over, see performance-gains.md.

Exercises

Exercise 1: read-path migration with fallback

Difficulty: Beginner

Objective: Implement the read side of a migration. Reads should prefer the new store, and on a miss fall back to the legacy store while lazily copying the row forward (a “read-through” cache pattern applied to migration).

Instructions: Write a function read_through(new, legacy, id) that returns the row from new if present; otherwise reads from legacy, writes it into new, and returns it; otherwise returns None. Use Option combinators, not nested if let where avoidable.

Solution

1
use std::collections::HashMap;
2

3
#[derive(Default)]
4
struct Store(HashMap<i64, String>);
5
impl Store {
6
    fn get(&self, id: i64) -> Option<&String> { self.0.get(&id) }
7
    fn put(&mut self, id: i64, v: String) { self.0.insert(id, v); }
8
}
9

10
/// Read-through: new store first; on miss, read legacy and write it forward.
11
fn read_through(new: &mut Store, legacy: &Store, id: i64) -> Option<String> {
12
    if let Some(v) = new.get(id) {
13
        return Some(v.clone());
14
    }
15
    let v = legacy.get(id)?.clone(); // `?` short-circuits to None if absent everywhere
16
    new.put(id, v.clone()); // lazy backfill on read
17
    Some(v)
18
}
19

20
fn main() {
21
    let mut new = Store::default();
22
    let mut legacy = Store::default();
23
    legacy.put(1, "ada@x.io".into());
24

25
    // First read: miss in new, hit in legacy, backfilled.
26
    let first = read_through(&mut new, &legacy, 1);
27
    println!("first read: {first:?}, new now has it: {}", new.get(1).is_some());
28

29
    // Missing everywhere.
30
    println!("unknown: {:?}", read_through(&mut new, &legacy, 99));
31
}

Real output:

1
first read: Some("ada@x.io"), new now has it: true
2
unknown: None

The ? operator on legacy.get(id)? returns None from the whole function if the row is absent everywhere — the same short-circuit you would write with ?? and early-return in TypeScript, but type-checked end to end.

Exercise 2: model the cutover as a state machine

Difficulty: Intermediate

Objective: Encode the migration phases as an enum so an illegal phase transition is unrepresentable, and expose a helper that reports whether the legacy store is still being written in a given phase.

Instructions: Define a Phase enum with variants LegacyOnly, DualWrite, Backfilling, NewPrimary, NewOnly. Add next(self) -> Option<Phase> that only advances forward (and returns None at the end), and writes_legacy(self) -> bool that is true for every phase except NewOnly. Drive it through all phases in main.

Solution

1
#[derive(Debug, Clone, Copy, PartialEq)]
2
enum Phase {
3
    LegacyOnly,   // only Node writes
4
    DualWrite,    // both written, legacy is source of truth
5
    Backfilling,  // historical rows being copied
6
    NewPrimary,   // Rust is source of truth, legacy shadowed
7
    NewOnly,      // legacy retired
8
}
9

10
impl Phase {
11
    /// Only forward transitions are legal; you never skip dual-write.
12
    fn next(self) -> Option<Phase> {
13
        use Phase::*;
14
        match self {
15
            LegacyOnly => Some(DualWrite),
16
            DualWrite => Some(Backfilling),
17
            Backfilling => Some(NewPrimary),
18
            NewPrimary => Some(NewOnly),
19
            NewOnly => None,
20
        }
21
    }
22
    fn writes_legacy(self) -> bool {
23
        !matches!(self, Phase::NewOnly)
24
    }
25
}
26

27
fn main() {
28
    let mut phase = Phase::LegacyOnly;
29
    print!("{phase:?}");
30
    while let Some(next) = phase.next() {
31
        print!(" -> {next:?}");
32
        phase = next;
33
    }
34
    println!();
35
    assert!(Phase::DualWrite.writes_legacy());
36
    assert!(!Phase::NewOnly.writes_legacy());
37
    println!("final phase still writes legacy? {}", phase.writes_legacy());
38
}

Real output:

1
LegacyOnly -> DualWrite -> Backfilling -> NewPrimary -> NewOnly
2
final phase still writes legacy? false

Because next is the only way to advance and match is exhaustive, the compiler guarantees you handle every phase and that the sequence can only move forward. A TypeScript union of string literals gives you the shape but not the exhaustiveness guarantee without extra discipline.

Exercise 3: idempotent UPSERT with conflict resolution

Difficulty: Advanced

Objective: Implement the “newer data wins” rule that prevents a backfill from clobbering a fresher dual-write (Pitfall 4).

Instructions: Model a row as { id: i64, email: String, updated_at: i64 }. Write upsert(store, incoming) that inserts the row only if there is no existing row for that id, or if incoming.updated_at is strictly greater than the stored row’s. Return a bool indicating whether the write was applied. Demonstrate that a stale backfill row does not overwrite a fresher live row.

Solution

1
use std::collections::HashMap;
2

3
#[derive(Debug, Clone)]
4
struct Row {
5
    id: i64,
6
    email: String,
7
    updated_at: i64,
8
}
9

10
/// Idempotent UPSERT with "newer wins" conflict resolution.
11
/// Returns true if the store was changed.
12
fn upsert(store: &mut HashMap<i64, Row>, incoming: Row) -> bool {
13
    match store.get(&incoming.id) {
14
        Some(existing) if existing.updated_at >= incoming.updated_at => false, // keep fresher data
15
        _ => {
16
            store.insert(incoming.id, incoming);
17
            true
18
        }
19
    }
20
}
21

22
fn main() {
23
    let mut store: HashMap<i64, Row> = HashMap::new();
24

25
    // Live dual-write lands a fresh row.
26
    let applied = upsert(&mut store, Row { id: 1, email: "new@x.io".into(), updated_at: 200 });
27
    println!("fresh write applied: {applied}");
28

29
    // Backfill replays an OLDER snapshot of the same row — must be rejected.
30
    let applied = upsert(&mut store, Row { id: 1, email: "stale@x.io".into(), updated_at: 100 });
31
    println!("stale backfill applied: {applied}");
32
    println!("stored email: {}", store[&1].email);
33

34
    // Re-running the exact same fresh write is a no-op (idempotent).
35
    let applied = upsert(&mut store, Row { id: 1, email: "new@x.io".into(), updated_at: 200 });
36
    println!("replay applied: {applied}");
37
}

Real output:

1
fresh write applied: true
2
stale backfill applied: false
3
stored email: new@x.io
4
replay applied: false

The stale backfill (updated_at: 100) is rejected even though it arrived later in wall-clock time, and a replay of the same fresh row is a no-op — exactly the idempotency and conflict-resolution guarantees a concurrent backfill needs. In SQL this is INSERT ... ON CONFLICT (id) DO UPDATE SET ... WHERE excluded.updated_at > <table>.updated_at.

Data Migration Strategies

Quick Overview

TypeScript/JavaScript Example

Rust Equivalent

Detailed Explanation

1. Shared database

2. Dual-write

3. Backfill

Key Differences

Common Pitfalls

Pitfall 1: mirroring a numeric column as String

Pitfall 2: defaulting an auto-increment id to i32

Pitfall 3: treating the shadow write as load-bearing

Pitfall 4: forgetting backfill and dual-write race on the same row

Best Practices

Real-World Example

Further Reading

Exercises

Exercise 1: read-path migration with fallback

Exercise 2: model the cutover as a state machine

Exercise 3: idempotent UPSERT with conflict resolution

Pitfall 1: mirroring a numeric column as `String`

Pitfall 2: defaulting an auto-increment id to `i32`