Native Threads with `std::thread`

In Node.js, “a thread” is an exotic, heavyweight thing — a Worker with its own V8 isolate, its own heap, and a serialization boundary you must cross with postMessage. In Rust, an OS thread is a first-class, lightweight tool, and the compiler statically prevents the data races that make threads terrifying in C++. This page covers spawning threads, joining them, moving data into them, and the modern std::thread::scope API that lets threads safely borrow from their parent.

Quick Overview

std::thread gives you real, OS-backed threads that run on multiple cores simultaneously — true parallelism, not the single-threaded concurrency of Node’s event loop. You spawn a thread with a closure, get back a JoinHandle, and call join() to wait for its result. The headline feature for a TypeScript developer: Rust’s ownership system makes threads memory-safe by construction. Code that would race in JavaScript-with-SharedArrayBuffer (or segfault in C++) simply does not compile.

Note: This page is about raw OS threads. For CPU-bound data parallelism you will usually reach for the higher-level rayon thread pool and parallel iterators instead of spawning threads by hand. To pass messages between threads, see channels. To share mutable counters without locks, see atomic operations. For async tasks (which are not threads), see Section 11: async/concurrency.

TypeScript/JavaScript Example

JavaScript is single-threaded. To get real parallelism — to use a second CPU core — you must spin up a Worker Thread, which is a separate V8 isolate with its own memory. You cannot share ordinary objects with it; you communicate by copying messages across a serialization boundary (structured clone), or by using a SharedArrayBuffer for a narrow slice of raw bytes.

// main.ts (Node v22)
import { Worker } from "node:worker_threads";

// Each worker is a heavyweight thread with its OWN heap. We send it a number,
// it sends back the sum 1..=n. The data is COPIED across the boundary.
function sumInWorker(n: number): Promise<number> {
  return new Promise((resolve, reject) => {
    const worker = new Worker(
      `
      const { parentPort, workerData } = require('node:worker_threads');
      let total = 0;
      for (let i = 1; i <= workerData; i++) total += i;
      parentPort.postMessage(total);
      `,
      { eval: true, workerData: n },
    );
    worker.on("message", resolve);
    worker.on("error", reject);
    worker.on("exit", (code) => {
      if (code !== 0) reject(new Error(`worker exited with code ${code}`));
    });
  });
}

async function main() {
  // Run several workers "in parallel" on real cores.
  const results = await Promise.all([
    sumInWorker(1000),
    sumInWorker(2000),
    sumInWorker(3000),
  ]);
  console.log(results); // [ 500500, 2001000, 4501500 ]
}

main();

Key facts about the JavaScript model:

A Worker is expensive — it boots a whole V8 isolate. You pool them, you do not create thousands.
Data is not shared. workerData and postMessage payloads are deep-copied (structured clone). The closure body cannot capture variables from main — note we had to inline the worker source as a string.
There is no compile-time protection against races on a SharedArrayBuffer; you reach for Atomics and hope you got it right.

Rust Equivalent

In Rust, a thread is just a function (closure) you hand to thread::spawn. It runs on a real OS thread, in parallel, on the same heap as the rest of your program — and the borrow checker guarantees you do not corrupt that shared heap.

use std::thread;

fn main() {
    // Spawn a thread. spawn() returns a JoinHandle<T> immediately; the closure
    // runs concurrently on another core. T is the closure's return type.
    let handle = thread::spawn(|| {
        let mut total = 0u64;
        for i in 1..=1_000 {
            total += i;
        }
        total // the closure's return value becomes the thread's result
    });

    // The main thread keeps running while the worker computes.
    println!("main thread keeps running");

    // join() blocks until the worker finishes and hands back its return value,
    // wrapped in a Result (Err if the thread panicked).
    let sum = handle.join().expect("worker thread panicked");
    println!("worker computed sum = {sum}");
}

Running it:

main thread keeps running
worker computed sum = 500500

No isolate to boot, no serialization, no message channel for a simple result — the value flows straight back through join(). The current stable toolchain is Rust 1.96.0 on the 2024 edition; cargo new selects it automatically, and everything here is in the standard library (no cargo add needed).

Detailed Explanation

`thread::spawn` and `JoinHandle`

use std::thread;

fn main() {
    let handle = thread::spawn(|| 21 * 2);
    let answer = handle.join().unwrap();
    println!("{answer}"); // 42
}

thread::spawn(f) takes a closure f and starts a new OS thread that runs it. It returns immediately — the thread runs concurrently.
The return type is JoinHandle<T>, where T is whatever the closure returns. Here T = i32.
handle.join() blocks the calling thread until the spawned thread finishes. It returns thread::Result<T> — an Ok(value) with the closure’s return value, or an Err if the thread panicked.

Compare to JavaScript: a JoinHandle<T> plays a role similar to a Promise<T>, but it is not lazy and not async — it is a handle to a thread that is already running on another core right now. And join() is a blocking wait, not an await that yields to an event loop.
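A minimal sketch of how the two result layers compose when the closure itself returns a Result: join() reports whether the thread panicked, and the inner value is whatever the closure produced. The helper name parse_in_thread is hypothetical, chosen only for this illustration.

```rust
use std::num::ParseIntError;
use std::thread;

/// Hypothetical helper: parse a number on another thread.
/// The outer layer (from `join`) reports a panic; the inner layer is the
/// closure's own `Result`.
fn parse_in_thread(input: &str) -> Result<i32, ParseIntError> {
    // The spawned closure needs owned data, so copy the text first.
    let owned = input.to_string();
    let handle = thread::spawn(move || owned.trim().parse::<i32>());

    // Blocks the calling thread until the worker is done.
    handle.join().expect("worker thread panicked")
}

fn main() {
    println!("{:?}", parse_in_thread(" 42 "));
    println!("bad input is an error: {}", parse_in_thread("forty-two").is_err());
}
```

Unlike awaiting a Promise, nothing else runs on the calling thread while join() waits; the caller is simply parked until the worker returns.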

Move closures: `move`

A spawned thread can outlive the function that created it, so by default Rust will not let the closure borrow local variables — those locals might be gone by the time the thread reads them. The move keyword transfers ownership of captured variables into the closure:

use std::thread;

fn main() {
    let data = vec![10, 20, 30, 40];

    // `move` transfers ownership of `data` INTO the thread's closure.
    let handle = thread::spawn(move || {
        let sum: i32 = data.iter().sum();
        println!("worker sees data with sum {sum}");
        sum
    });

    // `data` is no longer usable here: it was moved into the thread.
    let result = handle.join().unwrap();
    println!("main got {result}");
}

Output:

worker sees data with sum 100
main got 100

This is the big contrast with JavaScript’s Worker: there, data would be deep-copied across the boundary. In Rust, move transfers the same heap allocation — zero copy, zero serialization. After the move, the compiler statically forbids main from touching data, so there is no race: exactly one owner at a time.
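Ownership can also travel back out. The sketch below, with a hypothetical helper named sum_and_return, moves a Vec into a worker and returns the same Vec through the thread's return value, so the caller regains it after join() without any copy of the elements.

```rust
use std::thread;

/// Hypothetical helper: move a Vec into a worker, then get the same Vec
/// back through the thread's return value.
fn sum_and_return(data: Vec<i32>) -> (i32, Vec<i32>) {
    let handle = thread::spawn(move || {
        let sum: i32 = data.iter().sum();
        (sum, data) // hand ownership back to whoever joins
    });
    handle.join().expect("worker thread panicked")
}

fn main() {
    let data = vec![10, 20, 30, 40];
    // Shadow `data` with the Vec the worker gave back.
    let (sum, data) = sum_and_return(data);
    println!("sum = {sum}, data is back in main: {data:?}");
}
```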

Scoped threads: `thread::scope` (borrow instead of move)

What if you do not want to give away your data — you just want a few threads to read it (or write to disjoint parts) and then get control back? Moving works for one thread, but moving into many is impossible (you only have one value to give). Historically you wrapped everything in Arc and cloned the pointer. Since Rust 1.63, std::thread::scope offers a cleaner answer: scoped threads can borrow non-'static data because the scope guarantees they all finish before it returns.

use std::thread;

fn main() {
    let numbers = vec![1, 2, 3, 4, 5, 6, 7, 8];

    // thread::scope guarantees all spawned threads finish before it returns,
    // which lets them BORROW `numbers` instead of taking ownership.
    let total = thread::scope(|s| {
        let (left, right) = numbers.split_at(numbers.len() / 2);

        // Each handle borrows a slice of the SAME vector: no clone, no Arc.
        let h_left = s.spawn(|| left.iter().sum::<i32>());
        let h_right = s.spawn(|| right.iter().sum::<i32>());

        h_left.join().unwrap() + h_right.join().unwrap()
    });

    // `numbers` is still fully owned and usable here.
    println!("sum of {numbers:?} = {total}");
}

Output:

sum of [1, 2, 3, 4, 5, 6, 7, 8] = 36

The closures here capture left and right by shared reference — no move, no Arc, no clone. The borrow checker accepts this because scope will not return until every thread it spawned has been joined, so the borrows cannot outlive numbers. This is the idiomatic way to fan out work over data you own on the stack.

Tip: Reach for thread::scope first when you want a fixed set of threads to chew on borrowed data and you want to wait for all of them. It eliminates an entire class of Arc/clone boilerplate. Use a plain thread::spawn + move when the thread must outlive the current function or be detached.
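Scoped threads can borrow mutably as well, provided the borrows do not overlap. The following is a small sketch (the function name double_in_halves is hypothetical) that splits a slice with split_at_mut and lets two threads modify their own half in place. It also relies on the scope joining its threads implicitly, so no handle is kept.

```rust
use std::thread;

/// Hypothetical helper: double every element, using one scoped thread per half.
fn double_in_halves(values: &mut [i32]) {
    let mid = values.len() / 2;
    // Two non-overlapping mutable views of the same slice.
    let (left, right) = values.split_at_mut(mid);

    thread::scope(|s| {
        s.spawn(|| left.iter_mut().for_each(|v| *v *= 2));
        s.spawn(|| right.iter_mut().for_each(|v| *v *= 2));
        // No explicit join: the scope waits for both threads before returning.
    });
}

fn main() {
    let mut values = vec![1, 2, 3, 4, 5];
    double_in_halves(&mut values);
    println!("{values:?}");
}
```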

`available_parallelism` and collecting handles

To size your work to the machine, ask how many cores you can actually use, then spawn and join a batch:

use std::thread;

fn main() {
    let n = thread::available_parallelism().map(|n| n.get()).unwrap_or(1);
    println!("this machine reports {n} usable cores");

    // Spawn one thread per id, then join them all and collect the results.
    let handles: Vec<_> = (0..4)
        .map(|id| thread::spawn(move || id * id))
        .collect();

    let squares: Vec<i32> = handles
        .into_iter()
        .map(|h| h.join().unwrap())
        .collect();

    println!("squares = {squares:?}");
}

Output on an 8-core machine:

this machine reports 8 usable cores
squares = [0, 1, 4, 9]

available_parallelism() is the rough equivalent of Node’s os.availableParallelism(). Note the move on the inner closure: each thread captures its own id by value, so there is no shared mutable state to race over.
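To actually size the work to the reported core count, one option is to derive a chunk size from it. This is a sketch under that assumption; count_even is a hypothetical helper, and it uses scoped threads so the chunks can be borrowed.

```rust
use std::thread;

/// Hypothetical helper: count even numbers using at most one scoped thread
/// per usable core (falling back to 1 if the query fails).
fn count_even(data: &[u32]) -> usize {
    let workers = thread::available_parallelism().map(|n| n.get()).unwrap_or(1);
    // Ceiling division gives at most `workers` chunks; `.max(1)` avoids a
    // zero chunk size (which `chunks` rejects) when `data` is empty.
    let chunk_size = data.len().div_ceil(workers).max(1);

    thread::scope(|s| {
        let handles: Vec<_> = data
            .chunks(chunk_size)
            .map(|chunk| s.spawn(move || chunk.iter().filter(|&&x| x % 2 == 0).count()))
            .collect();
        handles.into_iter().map(|h| h.join().unwrap()).sum()
    })
}

fn main() {
    let data: Vec<u32> = (1..=1_000).collect();
    println!("even numbers: {}", count_even(&data));
}
```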

The `Builder` API: names and stack size

thread::spawn uses sensible defaults. For control over the thread name (shown in panics and debuggers) and stack size, use thread::Builder:

use std::thread;

fn main() {
    let handle = thread::Builder::new()
        .name("crunch-worker".to_string())
        .stack_size(4 * 1024 * 1024) // 4 MiB stack
        .spawn(|| {
            let me = thread::current();
            format!("hello from {:?}", me.name().unwrap_or("<unnamed>"))
        })
        .expect("failed to spawn thread");

    println!("{}", handle.join().unwrap());
}

Output:

hello from "crunch-worker"

Unlike bare spawn, Builder::spawn returns an io::Result<JoinHandle<T>>. Spawning an OS thread can genuinely fail (e.g., the OS refuses more threads), and the Builder API surfaces that as an error value you can handle, whereas thread::spawn panics in that situation.
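A brief sketch of handling that io::Result instead of calling expect. The helper spawn_named and the thread name "io-worker" are hypothetical, used only for illustration.

```rust
use std::io;
use std::thread;

/// Hypothetical helper: spawn a named worker and pass a spawn failure back
/// to the caller as an ordinary `io::Error`.
fn spawn_named(name: &str) -> io::Result<thread::JoinHandle<String>> {
    thread::Builder::new()
        .name(name.to_string())
        .spawn(|| {
            // Read the name back from inside the new thread.
            thread::current().name().unwrap_or("<unnamed>").to_string()
        })
}

fn main() {
    match spawn_named("io-worker") {
        Ok(handle) => println!("worker ran as {:?}", handle.join().unwrap()),
        Err(e) => eprintln!("could not spawn a thread: {e}"),
    }
}
```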

Key Differences

Aspect	Node.js `Worker`	Rust `std::thread`
Weight	Heavy (full V8 isolate)	Light (one OS thread)
Memory	Separate heap per worker	Shared heap, compiler-checked
Passing data in	Copied (structured clone)	Moved (`move`) or borrowed (`scope`) — zero-copy
Getting a result out	`postMessage` + event listener	Return value via `handle.join()`
Capturing locals	Not possible (string source / `workerData`)	Closure captures directly
Race protection	None at compile time (`Atomics` by hand)	`Send`/`Sync` enforced by the compiler
True parallelism	Yes	Yes
Cancellation	`worker.terminate()`	Cooperative (no forced kill)
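The "cooperative" cancellation row can be sketched with a shared stop flag: the worker checks the flag itself and exits its loop when asked. This is one possible approach, not the only one; run_until_stopped is a hypothetical helper, and the 10 ms pause is an arbitrary choice for the demo.

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::Arc;
use std::thread;
use std::time::Duration;

/// Hypothetical helper: run a worker until the caller raises a stop flag,
/// then return how many loop iterations the worker completed.
fn run_until_stopped() -> u64 {
    let stop = Arc::new(AtomicBool::new(false));
    let worker_stop = Arc::clone(&stop);

    let handle = thread::spawn(move || {
        let mut iterations = 0u64;
        // The worker polls the flag; nothing forces it to stop from outside.
        while !worker_stop.load(Ordering::Relaxed) {
            iterations += 1;
            thread::yield_now();
        }
        iterations
    });

    thread::sleep(Duration::from_millis(10)); // let the worker run briefly
    stop.store(true, Ordering::Relaxed); // request cancellation
    handle.join().expect("worker thread panicked")
}

fn main() {
    println!("worker stopped after {} iterations", run_until_stopped());
}
```

The iteration count varies from run to run; the point is only that the worker ends because it observed the flag.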

Why threads are safe in Rust: `Send` and `Sync`

The compiler enforces two auto-traits at the thread boundary:

Send — a type is safe to transfer ownership of to another thread. Most types are Send; notable exceptions are Rc<T> (non-atomic reference count) and raw pointers.
Sync — a type is safe to share by reference (&T) across threads. T is Sync iff &T is Send.

thread::spawn requires the closure (and everything it captures) and its return value to be Send + 'static. thread::scope’s spawn relaxes the 'static requirement to the lifetime of the scope but still requires Send, which in turn means anything the closure borrows by shared reference must be Sync. This is the mechanism that turns “did I introduce a data race?” from a runtime gamble into a compile error. In JavaScript there is no equivalent: sharing a SharedArrayBuffer incorrectly is simply a bug you find in production.
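One way to see these bounds in action is a pair of empty generic functions that only compile when the type argument satisfies the trait. The names assert_send and assert_sync are hypothetical helpers written for this sketch, not standard library items.

```rust
use std::cell::Cell;
use std::sync::{Arc, Mutex};

// Hypothetical helpers: they do nothing at runtime. A call compiles only
// if the type argument satisfies the bound.
fn assert_send<T: Send>() {}
fn assert_sync<T: Sync>() {}

fn main() {
    assert_send::<Vec<i32>>();
    assert_sync::<Vec<i32>>();
    assert_send::<Arc<Mutex<String>>>();
    assert_sync::<Arc<Mutex<String>>>();

    // Cell<i32> may be moved to another thread, but not shared by reference.
    assert_send::<Cell<i32>>();

    // Uncommenting either line below is a compile error:
    // assert_send::<std::rc::Rc<i32>>();
    // assert_sync::<Cell<i32>>();

    println!("all bounds hold");
}
```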

Threads are not async tasks

A common point of confusion for Node developers: Rust threads are not the same as async tokio::spawn tasks. A thread is a real OS thread that the kernel schedules and that can block. An async task is a lightweight state machine multiplexed onto a small pool of threads by a runtime, and it must never block. Use threads for CPU-bound work and for blocking calls; use async for high-concurrency I/O. See async vs sync.

Common Pitfalls

Pitfall 1: Borrowing a local without `move`

Coming from JavaScript, you expect the closure to just “see” the surrounding variable. With thread::spawn, it cannot, because the thread might outlive the function:

use std::thread;

fn main() {
    let data = vec![1, 2, 3];

    // does not compile (error[E0373]: closure may outlive the current function)
    let handle = thread::spawn(|| {
        println!("{:?}", data); // borrows `data`
    });

    handle.join().unwrap();
}

The real compiler error:

error[E0373]: closure may outlive the current function, but it borrows `data`, which is owned by the current function
 --> src/bin/err_borrow.rs:6:32
  |
6 |     let handle = thread::spawn(|| {
  |                                ^^ may outlive borrowed value `data`
7 |         println!("{:?}", data); // borrows `data`
  |                          ---- `data` is borrowed here
  |
help: to force the closure to take ownership of `data` (and any other referenced variables), use the `move` keyword
  |
6 |     let handle = thread::spawn(move || {
  |                                ++++

Fix: add move (transfer ownership), or — if you only need to borrow and will join before the function returns — use thread::scope so the borrow is allowed.

Pitfall 2: Using a value after moving it into a thread

move is permanent. After the move, the original binding is gone:

use std::thread;

fn main() {
    let data = vec![1, 2, 3];

    let handle = thread::spawn(move || {
        println!("{:?}", data);
    });

    // does not compile (error[E0382]: borrow of moved value: `data`)
    println!("{:?}", data);
    handle.join().unwrap();
}

The real error:

error[E0382]: borrow of moved value: `data`
  --> src/bin/err_use_after_move.rs:10:22
   |
 4 |     let data = vec![1, 2, 3];
   |         ---- move occurs because `data` has type `Vec<i32>`, which does not implement the `Copy` trait
 6 |     let handle = thread::spawn(move || {
   |                                ------- value moved into closure here
 7 |         println!("{:?}", data);
   |                          ---- variable moved due to use in closure
...
10 |     println!("{:?}", data);
   |                      ^^^^ value borrowed here after move

Fix: if both the thread and main genuinely need the data, share it with Arc (read-only) or Arc<Mutex<T>> (mutable), and clone the Arc for the thread. If main only needs the result, retrieve it via join() instead of touching the moved value. Unlike JavaScript’s copy-on-postMessage, Rust forces you to be explicit about which strategy you want.
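As a sketch of the shared-mutation strategy, the version below lets both the worker and the caller push into the same Vec through an Arc<Mutex<...>>. The function name shared_push is hypothetical; the join() before the second push is what makes the final order deterministic here.

```rust
use std::sync::{Arc, Mutex};
use std::thread;

/// Hypothetical helper: a worker and the caller both mutate one Vec.
fn shared_push() -> Vec<i32> {
    let data = Arc::new(Mutex::new(vec![1, 2, 3]));

    // Clone the pointer (not the Vec) for the worker; `data` stays usable here.
    let for_worker = Arc::clone(&data);
    let handle = thread::spawn(move || {
        for_worker.lock().unwrap().push(4);
    });
    handle.join().expect("worker thread panicked");

    // The caller still owns its own handle to the same Vec.
    data.lock().unwrap().push(5);

    // Copy the contents out while holding the lock; the guard is released
    // at the end of this statement.
    let snapshot = data.lock().unwrap().clone();
    snapshot
}

fn main() {
    println!("{:?}", shared_push());
}
```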

Pitfall 3: Sharing `Rc<T>` across threads

Rc<T> is the cheap, non-atomic reference-counted pointer. Its refcount updates are not thread-safe, so Rc is !Send and the compiler rejects it at the boundary:

use std::rc::Rc;
use std::thread;

fn main() {
    let shared = Rc::new(42);

    // does not compile (error[E0277]: `Rc<i32>` cannot be sent between threads safely)
    let handle = thread::spawn(move || {
        println!("{}", shared);
    });

    handle.join().unwrap();
}

The real error (abbreviated):

error[E0277]: `Rc<i32>` cannot be sent between threads safely
   --> src/bin/err_rc.rs:7:32
    |
  7 |     let handle = thread::spawn(move || {
    |                  ------------- ^------ ... `Rc<i32>` cannot be sent between threads safely
    |
    = help: within `{closure@...}`, the trait `Send` is not implemented for `Rc<i32>`
note: required by a bound in `spawn`

Fix: use Arc (Atomic Reference Counted), which is Send + Sync:

use std::sync::Arc;
use std::thread;

fn main() {
    let shared = Arc::new(vec![1, 2, 3]);

    let handles: Vec<_> = (0..3)
        .map(|i| {
            let shared = Arc::clone(&shared); // bump the atomic refcount, share the same data
            thread::spawn(move || {
                println!("thread {i} sees {:?}", shared);
            })
        })
        .collect();

    for h in handles {
        h.join().unwrap();
    }
}

Output (the line order varies run to run because the threads are genuinely concurrent):

thread 0 sees [1, 2, 3]
thread 1 sees [1, 2, 3]
thread 2 sees [1, 2, 3]

See reference counting for Rc vs Arc in depth.

Pitfall 4: Forgetting to `join` — the process exits and kills the thread

When main returns, the whole process exits, taking any still-running threads with it. There is no “wait for background threads” at exit:

use std::thread;
use std::time::Duration;

fn main() {
    thread::spawn(|| {
        thread::sleep(Duration::from_millis(500));
        println!("worker: this may NEVER print");
    });
    // No join(): main returns, the whole process exits, killing the worker.
    println!("main: exiting immediately");
}

Output:

main: exiting immediately

The worker’s println! never runs — the process was already gone. Fix: hold the JoinHandle and join() it before main ends (or use thread::scope, which joins for you).
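A sketch of the fix: keep the handle and join it so the caller waits for the worker. The helper name run_worker and the shorter 50 ms sleep are choices made for this illustration.

```rust
use std::thread;
use std::time::Duration;

/// Hypothetical helper: start a slow worker and wait for it to finish.
fn run_worker() -> &'static str {
    let handle = thread::spawn(|| {
        thread::sleep(Duration::from_millis(50));
        "worker: finished"
    });
    // Holding the handle and joining it blocks until the worker is done,
    // so the process cannot exit underneath it.
    handle.join().expect("worker thread panicked")
}

fn main() {
    println!("{}", run_worker());
    println!("main: exiting after the worker");
}
```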

Pitfall 5: Expecting a panic in one thread to crash the program

A panic unwinds only its own thread. The default panic behavior is unwind, so a panicking worker does not take down main — instead, join() returns Err:

use std::thread;

fn main() {
    let handle = thread::spawn(|| {
        panic!("worker exploded");
    });

    match handle.join() {
        Ok(()) => println!("worker finished cleanly"),
        Err(_) => println!("main: detected that the worker panicked, carrying on"),
    }

    println!("main is still alive");
}

Output:

thread '<unnamed>' panicked at src/bin/panic_thread.rs:5:9:
worker exploded
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
main: detected that the worker panicked, carrying on
main is still alive

The panic message is printed to stderr by the default hook, but the program keeps running. Always check the Result from join() if a thread might panic. (If your crate sets panic = "abort", the whole process aborts instead — there is no Err to observe.)
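The Err from join() carries the panic payload as a boxed Any value, so the message can often be recovered by downcasting. The sketch below assumes the usual payload types (a &'static str for a literal message, a String for a formatted one) and falls back to a generic text otherwise; run_catching is a hypothetical helper.

```rust
use std::thread;

/// Hypothetical helper: run a closure on its own thread and turn a panic
/// into an ordinary `Err(String)`.
fn run_catching<T, F>(f: F) -> Result<T, String>
where
    F: FnOnce() -> T + Send + 'static,
    T: Send + 'static,
{
    thread::spawn(f).join().map_err(|payload| {
        // Try the two common payload types before giving up.
        if let Some(msg) = payload.downcast_ref::<&str>() {
            msg.to_string()
        } else if let Some(msg) = payload.downcast_ref::<String>() {
            msg.clone()
        } else {
            "non-string panic payload".to_string()
        }
    })
}

fn main() {
    let ok = run_catching(|| 2 + 2);
    let failed: Result<(), String> = run_catching(|| panic!("worker exploded"));
    println!("{ok:?}");
    println!("{failed:?}");
}
```

The default panic hook still prints its own message to stderr; this only converts the outcome into a value the caller can act on.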

Best Practices

Prefer thread::scope for borrowing. If you have a fixed set of workers that read or write disjoint parts of stack-owned data and you will wait for all of them, scope avoids Arc/clone ceremony and keeps borrows checked.
Prefer rayon for data parallelism. Spawning one thread per item is almost always wrong. For “apply this to every element of a big collection,” use par_iter(); for divide-and-conquer, use rayon’s pool and join. They handle work-stealing and core sizing for you.
Use move deliberately. Reach for move when a thread must own its captures or outlive the spawning function. Do not sprinkle it reflexively — if scope lets you borrow, that is clearer.
Share with the right pointer. Arc<T> for read-only sharing, Arc<Mutex<T>> (or Arc<RwLock<T>>) for shared mutation, atomics for simple counters/flags. Never reach for unsafe to “just share a &mut.”
Always join (or scope). Detached threads that you never join are a resource and correctness hazard — at minimum keep the handle and join on shutdown.
Handle panics at the boundary. Inspect join()’s Result for any thread that can panic, and convert it into a clean error or a controlled shutdown.
Match thread count to cores. Use thread::available_parallelism() to size a pool; do not spawn thousands of OS threads for CPU-bound work — they will thrash.
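To illustrate the "atomics for simple counters" advice, here is a sketch in which a handful of scoped threads add to one AtomicUsize with no lock. The function name count_multiples_of_three and the worker count passed in are hypothetical; each thread tallies locally and touches the shared counter once.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::thread;

/// Hypothetical helper: count multiples of three with a shared atomic counter.
fn count_multiples_of_three(data: &[u32], workers: usize) -> usize {
    let hits = AtomicUsize::new(0);
    // At most `workers` chunks; `.max(1)` guards against a zero chunk size.
    let chunk_size = data.len().div_ceil(workers.max(1)).max(1);

    thread::scope(|s| {
        for chunk in data.chunks(chunk_size) {
            // Borrow the counter explicitly so `move` copies only references.
            let hits = &hits;
            s.spawn(move || {
                let local = chunk.iter().filter(|&&x| x % 3 == 0).count();
                // One atomic add per thread keeps contention low.
                hits.fetch_add(local, Ordering::Relaxed);
            });
        }
    });

    // The scope has joined every thread, so the counter is final.
    hits.into_inner()
}

fn main() {
    let data: Vec<u32> = (1..=300).collect();
    println!("multiples of three: {}", count_multiples_of_three(&data, 4));
}
```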

Real-World Example

A production-flavored task: compute a content hash (checksum) for several files concurrently and collect the results into a shared map. This pattern — fan out over inputs, each worker computes independently, results aggregate under a lock — is the bread and butter of native threading. It uses thread::scope to borrow the inputs and the result map (no Arc needed) and a Mutex to serialize the inserts.

use std::collections::HashMap;
use std::sync::Mutex;
use std::thread;

/// A tiny FNV-1a hash so the example needs no external crate.
/// In real code you would use a crate like `sha2` or `blake3`.
fn fnv1a(bytes: &[u8]) -> u64 {
    let mut hash: u64 = 0xcbf2_9ce4_8422_2325;
    for &b in bytes {
        hash ^= b as u64;
        hash = hash.wrapping_mul(0x0000_0100_0000_01b3);
    }
    hash
}

fn main() {
    // Simulated "files": (name, contents). In production these would be paths
    // you read with std::fs; see ./file-system.md.
    let files: Vec<(&str, Vec<u8>)> = vec![
        ("config.toml", b"[server]\nport = 8080\n".to_vec()),
        ("index.html", b"<!doctype html><h1>hi</h1>".to_vec()),
        ("data.csv", b"id,name\n1,alice\n2,bob\n".to_vec()),
        ("notes.md", b"# TODO\n- ship it\n".to_vec()),
    ];

    // Shared map guarded by a Mutex; each thread inserts its own result.
    let checksums: Mutex<HashMap<&str, u64>> = Mutex::new(HashMap::new());

    thread::scope(|s| {
        for (name, contents) in &files {
            // Borrow `name`, `contents`, and `checksums`. The scope guarantees
            // these threads end before `files`/`checksums` are dropped, so no Arc.
            s.spawn(|| {
                let digest = fnv1a(contents);
                checksums.lock().unwrap().insert(name, digest);
            });
        }
    });

    // Back on the main thread, the scope has joined every worker.
    let map = checksums.into_inner().unwrap();
    let mut sorted: Vec<_> = map.into_iter().collect();
    sorted.sort_by_key(|(name, _)| *name);
    for (name, digest) in sorted {
        println!("{name:<12} {digest:016x}");
    }
}

Output:

config.toml  5f4a2791ebf9f924
data.csv     a976785c72644ad1
index.html   3df08d3c2aac493f
notes.md     5e8adba7f295886a

Notice what the borrow checker did for us: the worker closures hold a &Mutex<HashMap<..>> and references into files, with no Arc::clone and no lifetime annotations. The scope is the proof that none of those borrows escape. The equivalent in Node would require a worker pool, copying each file’s bytes across the postMessage boundary, and reassembling the results from messages — far more moving parts.

Note: A Mutex serializes the inserts, not the hashing. The expensive fnv1a work runs fully in parallel; the lock is held only for the brief insert. Keep critical sections small. For a lock-free counter instead of a map, atomics are a better fit — see atomic operations.
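When each worker produces exactly one result, another option is to skip the lock entirely and return the result through the scoped handle. The sketch below is a variation on the example above, not a replacement for it; the function name checksums and its signature are assumptions made for illustration, and fnv1a is repeated so the block stands alone.

```rust
use std::collections::HashMap;
use std::thread;

/// Same FNV-1a routine as in the example above.
fn fnv1a(bytes: &[u8]) -> u64 {
    let mut hash: u64 = 0xcbf2_9ce4_8422_2325;
    for &b in bytes {
        hash ^= b as u64;
        hash = hash.wrapping_mul(0x0000_0100_0000_01b3);
    }
    hash
}

/// Hypothetical helper: hash each file on its own scoped thread and gather
/// the (name, digest) pairs from the join handles. No Mutex is involved.
fn checksums<'a>(files: &'a [(&'a str, Vec<u8>)]) -> HashMap<&'a str, u64> {
    thread::scope(|s| {
        let handles: Vec<_> = files
            .iter()
            .map(|(name, contents)| s.spawn(move || (*name, fnv1a(contents))))
            .collect();
        // Joining in order collects every pair into the map.
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    })
}

fn main() {
    let files = vec![
        ("config.toml", b"[server]\nport = 8080\n".to_vec()),
        ("notes.md", b"# TODO\n- ship it\n".to_vec()),
    ];
    let map = checksums(&files);
    println!("hashed {} files", map.len());
}
```

The Mutex version fits better when workers produce a variable number of results or update shared state as they go.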

Exercises

Exercise 1: Parallel sum over chunks

Difficulty: Beginner

Objective: Use thread::scope to split a slice into chunks and sum each chunk on its own thread.

Instructions: Write fn parallel_sum(data: &[u64], chunks: usize) -> u64 that divides data into roughly chunks contiguous pieces, spawns one scoped thread per piece to sum it, and returns the grand total. Verify it against the closed-form sum of 1..=1_000_000. You should not need Arc or clone.

Solution

use std::thread;

fn parallel_sum(data: &[u64], chunks: usize) -> u64 {
    // Round up so we never produce more than `chunks` pieces.
    let chunk_size = data.len().div_ceil(chunks.max(1));
    thread::scope(|s| {
        let handles: Vec<_> = data
            .chunks(chunk_size.max(1))
            .map(|chunk| s.spawn(move || chunk.iter().sum::<u64>()))
            .collect();
        handles.into_iter().map(|h| h.join().unwrap()).sum()
    })
}

fn main() {
    let data: Vec<u64> = (1..=1_000_000).collect();
    let total = parallel_sum(&data, 8);
    let expected = 1_000_000u64 * 1_000_001 / 2;
    println!("parallel sum = {total}, expected = {expected}");
    assert_eq!(total, expected);
}

Output:

parallel sum = 500000500000, expected = 500000500000

Each thread borrows its chunk (a &[u64]) directly from data; scope guarantees they finish before parallel_sum returns. div_ceil (stable since Rust 1.73) rounds the chunk size up, so a length that does not divide evenly still yields at most chunks pieces, with the last piece simply shorter than the rest.

Exercise 2: A hand-rolled worker pool over a shared queue

Difficulty: Intermediate

Objective: Build a fixed pool of worker threads that pull jobs from a shared queue and push results to a shared vector, using Arc<Mutex<...>>.

Instructions: Start with jobs: Vec<u32> of 1..=10. Spawn 4 worker threads. Each worker loops: lock the queue, pop() one job (release the lock immediately), compute n * n, then lock the results vector and push (n, n*n). Stop when the queue is empty. Join all workers, sort the results, and print them. (This is exactly the kind of boilerplate that rayon eliminates — see thread pools — but doing it by hand once builds intuition.)

Solution

use std::sync::{Arc, Mutex};
use std::thread;

fn main() {
    let jobs: Vec<u32> = (1..=10).collect();
    let queue = Arc::new(Mutex::new(jobs));
    let results = Arc::new(Mutex::new(Vec::<(u32, u32)>::new()));

    let mut handles = Vec::new();
    for _worker in 0..4 {
        let queue = Arc::clone(&queue);
        let results = Arc::clone(&results);
        handles.push(thread::spawn(move || loop {
            // Pop one job UNDER the lock, then release it before working,
            // so workers do not serialize on the compute step.
            let job = queue.lock().unwrap().pop();
            match job {
                Some(n) => {
                    let squared = n * n;
                    results.lock().unwrap().push((n, squared));
                }
                None => break, // queue drained
            }
        }));
    }

    for h in handles {
        h.join().unwrap();
    }

    // Sole owner now that all workers are joined; unwrap the Arc and the Mutex.
    let mut out = Arc::try_unwrap(results).unwrap().into_inner().unwrap();
    out.sort();
    println!("{out:?}");
}

Output:

[(1, 1), (2, 4), (3, 9), (4, 16), (5, 25), (6, 36), (7, 49), (8, 64), (9, 81), (10, 100)]

The key discipline is the size of the critical sections: each worker holds the queue lock only long enough to pop, and the results lock only long enough to push. The n * n work happens with no locks held, so it parallelizes.

Exercise 3: Per-row maxima via disjoint `&mut` borrows

Difficulty: Advanced

Objective: Use thread::scope to write results into a pre-sized output Vec in parallel by handing each thread a disjoint &mut slot — no Mutex, no atomics.

Instructions: Write fn row_maxima(matrix: &[Vec<i32>]) -> Vec<i32> that returns, for each row, the maximum element. Pre-allocate the output Vec, then use iter_mut().zip(...) to pair each output slot with its input row and hand each (&mut i32, &Vec<i32>) pair to its own scoped thread. The trick: because each thread gets a disjoint mutable reference, the borrow checker allows concurrent writes with no synchronization at all.

Solution

use std::thread;

/// Compute per-row maxima of a matrix in parallel using scoped threads.
fn row_maxima(matrix: &[Vec<i32>]) -> Vec<i32> {
    let mut maxima = vec![i32::MIN; matrix.len()];

    thread::scope(|s| {
        // Pair each output slot with its input row, then hand each pair to its
        // own thread. `iter_mut` yields DISJOINT &mut, so no lock is needed.
        for (out, row) in maxima.iter_mut().zip(matrix.iter()) {
            s.spawn(move || {
                *out = row.iter().copied().max().unwrap_or(i32::MIN);
            });
        }
    });

    maxima
}

fn main() {
    let matrix = vec![
        vec![3, 7, 2],
        vec![9, 1, 4],
        vec![5, 5, 8],
    ];
    println!("{:?}", row_maxima(&matrix)); // [7, 9, 8]
}

Output:

[7, 9, 8]

This is the payoff of Rust’s aliasing rules: iter_mut() produces non-overlapping &mut i32 handles, so the compiler knows the threads cannot conflict and lets them write concurrently with zero runtime synchronization. There is no equivalent guarantee in JavaScript — writing into a SharedArrayBuffer from multiple workers is unchecked and easy to get wrong. (In practice, for this kind of slice-parallel write you would reach for rayon’s par_iter_mut(); see parallel iterators.)
