Memory Layout: Size, Alignment, and How Rust Packs Your Data

22 min read

In TypeScript you never think about how many bytes a { price: number, venue: number } object occupies — V8 owns that decision and hides it behind pointers and hidden classes. In Rust, every type has a precise, knowable size and alignment, and the way you order struct fields can change how much memory a million of them consume. This page shows you how Rust lays out structs and enums, how field ordering interacts with padding, what #[repr(...)] controls, and the “free” niche optimization that makes Option<&T> cost nothing.

Quick Overview

Every Rust type has a size (std::mem::size_of) and an alignment (std::mem::align_of) that are fixed at compile time. Because the compiler must place each field at an address that respects its alignment, a struct can contain invisible padding bytes — and the order you declare fields in can make a struct larger or smaller. Unlike a TypeScript object (whose layout is V8’s private business), a Rust struct’s layout is something you can measure, reason about, and — when you need to — control with the #[repr(...)] attribute.

For a TypeScript/JavaScript developer this matters in two situations: when you hold huge arrays of small structs (shaving 16 bytes off a 40-byte struct saves 160 MB across 10 million elements), and when you exchange bytes with C, the network, or the GPU (where the exact layout is part of the contract).

TypeScript/JavaScript Example

1
// TypeScript - you describe the *shape*, never the byte layout.
2
interface Tick {
3
  isBuy: boolean;
4
  price: number; // IEEE-754 f64
5
  venue: number;
6
  quantity: number;
7
}
8

9
const tick: Tick = { isBuy: true, price: 101.5, venue: 7, quantity: 100 };
10
console.log(tick);
11
// { isBuy: true, price: 101.5, venue: 7, quantity: 100 }
12

13
// How big is this object in memory? You cannot say.
14
// V8 stores it as a "hidden class" + a pointer-laden object on the heap.
15
// Every `number` is a 64-bit float (or a tagged 31-bit "SMI" if it fits),
16
// `boolean` is a tagged pointer-sized slot, and the JIT may change the
17
// representation as the program runs. The developer has zero control.
18

19
// The ONLY place JavaScript exposes byte layout is typed arrays / ArrayBuffer:
20
const buf = new ArrayBuffer(16);
21
const view = new DataView(buf);
22
view.setFloat64(0, 101.5, true); // write price at offset 0, little-endian
23
view.setUint32(8, 100, true); // write quantity at offset 8
24
console.log("byteLength =", buf.byteLength); // 16
25
console.log("price back =", view.getFloat64(0, true)); // 101.5

Key points:

A plain object’s memory layout is decided by V8 and can change at runtime; you cannot query its byte size or field offsets.
Every JavaScript number is an IEEE-754 f64 (8 bytes) — there is no u8, i32, or f32. Big integers lose precision; they do not wrap.
The only way to control exact bytes is ArrayBuffer + DataView/typed arrays, and even there you compute every offset by hand.

Note: The Node output above is real: console.log(tick) prints the object’s fields, not [object Object]. That string only appears from implicit coercion like "" + tick.

Rust Equivalent

1
use std::mem::{align_of, size_of};
2

3
// Rust - the struct *is* the layout. Each field has a concrete, sized type.
4
struct Tick {
5
    is_buy: bool, // 1 byte
6
    price: f64,   // 8 bytes
7
    venue: u8,    // 1 byte
8
    quantity: u32, // 4 bytes
9
}
10

11
fn main() {
12
    // size_of and align_of are const fns evaluated at compile time.
13
    println!("size  = {}", size_of::<Tick>());
14
    println!("align = {}", align_of::<Tick>());
15

16
    let tick = Tick {
17
        is_buy: true,
18
        price: 101.5,
19
        venue: 7,
20
        quantity: 100,
21
    };
22
    println!("price = {}", tick.price);
23
}

Running it:

1
size  = 16
2
align = 8
3
price = 101.5

The fields you wrote add up to 1 + 8 + 1 + 4 = 14 bytes, yet the struct reports 16. The extra two bytes are padding, and to understand where they come from — and why Rust gets away with only two bytes of waste where C would need ten — you need to understand alignment and Rust’s freedom to reorder fields.

Detailed Explanation

Size and alignment, the two numbers every type carries

Every Rust type T has:

size_of::<T>() — how many bytes one value occupies, including any internal padding. Arrays and Vecs stride by exactly this many bytes per element.
align_of::<T>() — the byte boundary an address must be a multiple of. A u32 (align 4) can only live at addresses 0, 4, 8, …; a u64 (align 8) at 0, 8, 16, ….

Here are the primitives a TypeScript developer should memorize. All output below is real, from a probe program:

1
use std::mem::{align_of, size_of};
2

3
fn main() {
4
    macro_rules! show {
5
        ($t:ty) => {
6
            println!("{:<10} size={} align={}", stringify!($t), size_of::<$t>(), align_of::<$t>());
7
        };
8
    }
9
    show!(bool);
10
    show!(u8);
11
    show!(u16);
12
    show!(u32);
13
    show!(u64);
14
    show!(char);
15
    show!(f64);
16
    show!(usize);
17
    show!(&u8);
18
    show!(String);
19
    show!(Vec<u8>);
20
    show!(());
21
}

1
bool       size=1 align=1
2
u8         size=1 align=1
3
u16        size=2 align=2
4
u32        size=4 align=4
5
u64        size=8 align=8
6
char       size=4 align=4
7
f64        size=8 align=8
8
usize      size=8 align=8
9
&u8        size=8 align=8
10
String     size=24 align=8
11
Vec<u8>    size=24 align=8
12
()         size=0 align=1

Two things jump out for a TypeScript developer:

A char is 4 bytes, not 1 — it holds a full Unicode scalar value, not a byte.
String and Vec<u8> are 24 bytes regardless of content. That is three machine words: a heap pointer, a length, and a capacity. The actual text lives on the heap, exactly like a JavaScript string’s backing store lives off to the side.
The unit type () has size 0 — Rust has genuine zero-sized types, something JavaScript has no equivalent for.

Why padding exists

Hardware reads memory most efficiently when a value sits at an address that is a multiple of its size. To guarantee that, the compiler inserts padding bytes so each field lands on a properly aligned offset, and pads the whole struct up to a multiple of its largest field’s alignment (so that in an array, element N+1 is just as aligned as element 0).

A struct’s alignment is the maximum alignment of its fields. For Tick, the f64 forces alignment 8, so size_of::<Tick>() must be a multiple of 8 — hence 16, not 14.

Rust reorders fields for you (the big difference from C and from TypeScript)

Here is the surprise. Naively, you’d expect is_buy, price, venue, quantity to lay out like this with padding:

1
[is_buy:1][pad:7][price:8][venue:1][pad:3][quantity:4]  = 24 bytes

But Rust reported 16. That is because the default representation, called repr(Rust), gives the compiler permission to reorder fields to minimize padding. It silently sorts the fields into something like price(8), quantity(4), venue(1), is_buy(1) plus two trailing pad bytes, reaching a tight 16. C never does this — declaration order is part of C’s ABI — which is exactly why Rust can be more compact than the equivalent C struct without you lifting a finger.

You can see the difference by forbidding reordering with #[repr(C)], which pins fields to declaration order:

1
use std::mem::{align_of, size_of};
2

3
#[repr(C)] // C layout: fields stay in declaration order
4
struct CBad {
5
    flag: bool, // offset 0, then 7 bytes of padding
6
    id: u64,    // offset 8
7
    code: u16,  // offset 16, then 6 bytes of trailing padding
8
}
9

10
#[repr(C)] // same fields, ordered largest-to-smallest by hand
11
struct CGood {
12
    id: u64,    // offset 0
13
    code: u16,  // offset 8
14
    flag: bool, // offset 10, then 5 bytes of trailing padding
15
}
16

17
struct RustOrder {
18
    // default repr(Rust): compiler reorders for us
19
    flag: bool,
20
    id: u64,
21
    code: u16,
22
}
23

24
fn main() {
25
    println!("repr(C)  CBad     size={} align={}", size_of::<CBad>(), align_of::<CBad>());
26
    println!("repr(C)  CGood    size={} align={}", size_of::<CGood>(), align_of::<CGood>());
27
    println!("repr(Rust) Order  size={} align={}", size_of::<RustOrder>(), align_of::<RustOrder>());
28
}

1
repr(C)  CBad     size=24 align=8
2
repr(C)  CGood    size=16 align=8
3
repr(Rust) Order  size=16 align=8

The lesson: field order only affects size when you opt out of repr(Rust)’s reordering. Under default repr(Rust), declaration order is purely a readability choice — the compiler will pack it optimally either way. Under #[repr(C)], you are responsible for ordering largest-aligned fields first.

Inspecting offsets with `offset_of!`

Since Rust 1.77 the standard library has std::mem::offset_of!, which tells you exactly where a field sits — invaluable for repr(C) structs that mirror a C header or a wire format:

1
use std::mem::{align_of, offset_of, size_of};
2

3
#[repr(C)]
4
struct Record {
5
    a: u8,
6
    b: u32,
7
    c: u16,
8
}
9

10
fn main() {
11
    println!("offset a = {}", offset_of!(Record, a));
12
    println!("offset b = {}", offset_of!(Record, b));
13
    println!("offset c = {}", offset_of!(Record, c));
14
    println!("size  = {}", size_of::<Record>());
15
    println!("align = {}", align_of::<Record>());
16
}

1
offset a = 0
2
offset b = 4
3
offset c = 8
4
size  = 12
5
align = 4

a is one byte at offset 0; b (align 4) cannot start at offset 1, so it jumps to offset 4 (three padding bytes between); c follows at offset 8; the whole struct rounds up to 12 to keep align 4.

Key Differences

Concept	TypeScript / JavaScript	Rust
Who decides layout	V8 (hidden classes, may change at runtime)	The compiler, deterministically at compile time
Can you measure a value’s byte size?	No (only `ArrayBuffer.byteLength` for raw buffers)	Yes — `std::mem::size_of::<T>()`
Numeric widths	One type: `number` = `f64`	`u8/u16/u32/u64`, `i*`, `f32/f64`, `usize`, `bool` (1 byte), `char` (4 bytes)
Field ordering	Irrelevant (no observable layout)	Irrelevant under `repr(Rust)`; load-bearing under `repr(C)`
Padding	Hidden, not your concern	Inserted for alignment; visible in `size_of`
Controlling exact bytes	`ArrayBuffer` + manual offsets	`#[repr(C)]`, `#[repr(packed)]`, `#[repr(align(N))]`
`Option<T>` overhead	`T	null` is just a tagged slot
Enum size	N/A (unions are compile-time only)	Discriminant + largest variant, aligned

Niche optimization: why `Option<&T>` is free

This is the single most delightful layout trick in Rust. A reference (&T), a Box<T>, and an NonZero* integer can never be all-zeroes/null. The compiler treats that forbidden bit pattern as a niche and reuses it to represent None, so wrapping such a type in Option costs zero extra bytes:

1
use std::mem::size_of;
2
use std::num::NonZeroU32;
3

4
fn main() {
5
    println!("&u8                 {}", size_of::<&u8>());
6
    println!("Option<&u8>         {}", size_of::<Option<&u8>>());
7
    println!("Box<u8>             {}", size_of::<Box<u8>>());
8
    println!("Option<Box<u8>>     {}", size_of::<Option<Box<u8>>>());
9
    println!("u32                 {}", size_of::<u32>());
10
    println!("Option<u32>         {}", size_of::<Option<u32>>());
11
    println!("NonZeroU32          {}", size_of::<NonZeroU32>());
12
    println!("Option<NonZeroU32>  {}", size_of::<Option<NonZeroU32>>());
13
    println!("bool                {}", size_of::<bool>());
14
    println!("Option<bool>        {}", size_of::<Option<bool>>());
15
    println!("char                {}", size_of::<char>());
16
    println!("Option<char>        {}", size_of::<Option<char>>());
17
    println!("String              {}", size_of::<String>());
18
    println!("Option<String>      {}", size_of::<Option<String>>());
19
}

1
&u8                 8
2
Option<&u8>         8
3
Box<u8>             8
4
Option<Box<u8>>     8
5
u32                 4
6
Option<u32>         8
7
NonZeroU32          4
8
Option<NonZeroU32>  4
9
bool                1
10
Option<bool>        1
11
char                4
12
Option<char>        4
13
String              24
14
Option<String>      24

Read those pairs carefully:

Option<&u8>, Option<Box<u8>>, Option<NonZeroU32>, Option<String> are the same size as the thing inside. None reuses the null/zero bit pattern — no separate flag byte.
Option<u32> jumps from 4 to 8 bytes: a plain u32 has no spare bit pattern (all 4 billion values are valid), so the compiler must add a discriminant, and alignment rounds it up to 8.
Option<bool> stays at 1 byte: bool only uses values 0 and 1, leaving 254 niche values, so None slots into one of them.

This is the layout-level reason Rust’s Option<&T> is the right way to express “a nullable pointer” — it is exactly as cheap as the T | null you reach for in TypeScript, but checked by the compiler. The niche even nests: Option<Option<bool>> is still 1 byte.

Enum sizes: tag plus the biggest variant

A Rust enum is a tagged union: it stores a discriminant (which variant) plus enough room for the largest variant’s payload, all rounded to the enum’s alignment.

1
use std::mem::size_of;
2

3
enum Direction {
4
    North,
5
    South,
6
    East,
7
    West,
8
}
9

10
enum Shape {
11
    Circle(f64),
12
    Rectangle(f64, f64),
13
    Point,
14
}
15

16
enum Message {
17
    Quit,
18
    Move { x: i32, y: i32 },
19
    Write(String),
20
    Color(u8, u8, u8),
21
}
22

23
fn main() {
24
    println!("Direction (4 unit variants) {}", size_of::<Direction>());
25
    println!("Shape (mixed payloads)      {}", size_of::<Shape>());
26
    println!("Message (one big variant)   {}", size_of::<Message>());
27
}

1
Direction (4 unit variants) 1
2
Shape (mixed payloads)      24
3
Message (one big variant)   24

Direction carries no data, so it is just a 1-byte discriminant — like a TypeScript string-literal union compiled down to a single byte.
Shape’s largest variant is Rectangle(f64, f64) = 16 bytes; the discriminant plus alignment round the whole enum to 24.
Message is dominated by Write(String) (24 bytes); the discriminant is folded into the String’s niche, so the enum is still 24.

Warning: An enum is as big as its largest variant, even when most values are small. If one variant is huge — say Payload([u8; 256]) — every value of that enum reserves 256+ bytes. The fix is to box the fat variant; see Best Practices below.

Common Pitfalls

Pitfall 1: Taking a reference into a `#[repr(packed)]` struct

#[repr(packed)] removes all padding (alignment 1), which is handy for parsing wire formats — but a field is then potentially misaligned, and Rust forbids creating a reference to it because a misaligned reference is undefined behavior:

1
#[repr(packed)]
2
struct Packed {
3
    flag: bool,
4
    id: u64,
5
}
6

7
fn main() {
8
    let p = Packed { flag: true, id: 42 };
9
    let r = &p.id; // does not compile (error[E0793])
10
    println!("{}", r);
11
}

The real rustc error:

1
error[E0793]: reference to packed field is unaligned
2
 --> src/main.rs:9:13
3
  |
4
9 |     let r = &p.id; // does not compile (error[E0793])
5
  |             ^^^^^
6
  |
7
  = note: packed structs are only aligned by one byte, and many modern architectures penalize unaligned field accesses
8
  = note: creating a misaligned reference is undefined behavior (even if that reference is never dereferenced)
9
  = help: copy the field contents to a local variable, or replace the reference with a raw pointer and use `read_unaligned`/`write_unaligned` (loads and stores via `*p` must be properly aligned even when using raw pointers)

The fix the compiler suggests is to copy the field out first: let id = p.id; reads it by value (a Copy field), and then &id is a normal aligned reference. Reach for #[repr(packed)] only when an external format truly demands it.

Pitfall 2: Assuming declaration order controls size

Coming from C (or from over-thinking it), TypeScript developers sometimes obsess over field order in plain Rust structs. Under the default repr(Rust), order does not change the size — the compiler reorders for you. Order only matters once you add #[repr(C)]. Don’t contort your struct’s readability for layout you aren’t actually controlling.

Pitfall 3: Expecting a stable layout from `repr(Rust)`

Because the compiler is free to reorder, you must never assume a particular field offset, transmute between two repr(Rust) structs with “the same fields,” or send a repr(Rust) struct’s raw bytes across an FFI or network boundary. The layout is unspecified and may differ between compiler versions. The moment bytes matter, add #[repr(C)]. This is covered in depth alongside FFI in Section 20.

Pitfall 4: A giant enum variant bloating a hot `Vec`

1
use std::mem::size_of;
2

3
enum Cmd {
4
    Ping,
5
    Payload([u8; 256]),
6
}
7

8
fn main() {
9
    // Even a `Cmd::Ping` value reserves room for the 256-byte array.
10
    println!("Cmd = {} bytes", size_of::<Cmd>());
11
}

1
Cmd = 257 bytes

A Vec<Cmd> of mostly Pings wastes 256 bytes per element. The fix is in Best Practices.

Best Practices

Let `repr(Rust)` do the packing; only override when bytes leave your program

For ordinary in-memory types, do nothing — the default representation already minimizes padding. Add a #[repr(...)] only for a concrete reason:

Attribute	What it does	When to use it
(none) `repr(Rust)`	Compiler reorders fields, minimal padding, unspecified layout	Almost always — normal application types
`#[repr(C)]`	Declaration-order layout, C-compatible	FFI, memory-mapped files, GPU buffers, anything `transmute`d or sent over the wire
`#[repr(packed)]`	Removes all padding, alignment 1	Tight binary/wire formats; pairs with `read_unaligned`
`#[repr(align(N))]`	Raises alignment to N	Avoiding false sharing (one value per cache line)
`#[repr(u8)]` / `#[repr(u16)]`… on an enum	Fixes discriminant size and makes `as` casts well-defined	Protocol opcodes, C enums

When you must use `#[repr(C)]`, order fields largest-aligned first

Since C layout honors declaration order, put 8-byte fields, then 4-byte, then 2-byte, then 1-byte, then zero-sized. This minimizes interior padding — the difference between CBad (24 bytes) and CGood (16 bytes) above.

Box the fat variant to shrink an enum

1
use std::mem::size_of;
2

3
enum Cmd {
4
    Ping,
5
    Payload(Box<[u8; 256]>), // the 256 bytes now live on the heap
6
}
7

8
fn main() {
9
    println!("Cmd = {} bytes", size_of::<Cmd>());
10
}

1
Cmd = 8 bytes

The enum shrinks from 257 bytes to 8 (a single pointer), so a Vec<Cmd> of mostly Pings no longer wastes 256 bytes per element. You pay one heap allocation only when you actually build a Payload. Clippy’s large_enum_variant lint flags this pattern for you. Box is covered in Section 10.

Fix integer discriminants for protocol enums

1
#[repr(u8)]
2
#[derive(Debug, Clone, Copy, PartialEq)]
3
enum Opcode {
4
    Get = 0x01,
5
    Set = 0x02,
6
    Delete = 0x03,
7
}
8

9
fn main() {
10
    println!("Opcode is {} byte(s)", std::mem::size_of::<Opcode>());
11
    println!("Set on the wire = 0x{:02X}", Opcode::Set as u8);
12
}

1
Opcode is 1 byte(s)
2
Set on the wire = 0x02

#[repr(u8)] guarantees the enum is exactly one byte and that Opcode::Set as u8 yields the explicit 0x02, which is what you want when serializing a packet header.

Measure, don’t guess

size_of and align_of are const fns — drop them into a #[test] to assert a layout never regresses (assert_eq!(size_of::<Tick>(), 16)), or use the cargo bloat-style tooling and the -Zprint-type-sizes nightly flag to audit large types. Profiling and measurement come first; see When to Optimize.

Real-World Example

A market-data feed handler holds tens of millions of ticks in memory. The struct is tiny, but multiplied across the buffer, layout decides whether the working set fits in RAM (and in cache). Here we contrast a naive ordering that mirrors how you’d write the TypeScript interface against a hand-packed #[repr(C)] layout — both pinned to C order so the difference is visible:

1
use std::mem::size_of;
2

3
// Naive order, mirroring a TypeScript interface field-by-field.
4
#[allow(dead_code)]
5
#[repr(C)]
6
struct TickNaive {
7
    is_buy: bool,      // 1
8
    price: f64,        // 8  (forces 7 bytes of padding before it)
9
    venue: u8,         // 1
10
    quantity: u32,     // 4  (3 bytes of padding before it)
11
    timestamp_ns: u64, // 8
12
    flags: u16,        // 2  (6 bytes of trailing padding)
13
}
14

15
// Optimized: largest-aligned fields first.
16
#[allow(dead_code)]
17
#[repr(C)]
18
struct TickPacked {
19
    price: f64,        // 8
20
    timestamp_ns: u64, // 8
21
    quantity: u32,     // 4
22
    flags: u16,        // 2
23
    venue: u8,         // 1
24
    is_buy: bool,      // 1
25
}
26

27
fn main() {
28
    let n = 10_000_000usize;
29
    println!(
30
        "TickNaive  = {} bytes -> {} MB for {} ticks",
31
        size_of::<TickNaive>(),
32
        size_of::<TickNaive>() * n / 1_000_000,
33
        n
34
    );
35
    println!(
36
        "TickPacked = {} bytes -> {} MB for {} ticks",
37
        size_of::<TickPacked>(),
38
        size_of::<TickPacked>() * n / 1_000_000,
39
        n
40
    );
41
}

1
TickNaive  = 40 bytes -> 400 MB for 10000000 ticks
2
TickPacked = 24 bytes -> 240 MB for 10000000 ticks

Reordering fields shaved 16 bytes off each tick — a 40% memory reduction, 160 MB saved across ten million elements, with zero change to behavior. Smaller elements also mean more of them per cache line, which is the bridge to cache-efficiency.md. In JavaScript this optimization is simply unavailable: an array of plain objects is an array of pointers to heap objects whose layout V8 controls, and the only escape hatch — a single packed Float64Array/DataView — forces you to abandon named fields and compute every offset by hand.

Tip: If you control the source order and use the default repr(Rust), you get the 24-byte layout automatically — you only need to hand-order fields when #[repr(C)] is in play (here, so the struct matches an external feed format).

Exercises

Exercise 1: Shrink a `repr(C)` struct by reordering

Difficulty: Easy

Objective: See first-hand how field order changes the size of a #[repr(C)] struct.

Instructions: Define a #[repr(C)] struct Bad with fields in this order: active: bool, score: f64, level: u8, xp: u32. Print its size. Then define a second #[repr(C)] struct Good with the same four fields reordered so the struct is as small as possible, and print its size. Explain the difference. (Add #[allow(dead_code)] to silence unused-field warnings.)

Solution

1
use std::mem::size_of;
2

3
#[allow(dead_code)]
4
#[repr(C)]
5
struct Bad {
6
    active: bool, // 1 + 7 padding
7
    score: f64,   // 8
8
    level: u8,    // 1 + 3 padding
9
    xp: u32,      // 4
10
}
11

12
#[allow(dead_code)]
13
#[repr(C)]
14
struct Good {
15
    score: f64,   // 8
16
    xp: u32,      // 4
17
    level: u8,    // 1
18
    active: bool, // 1  (+2 trailing padding)
19
}
20

21
fn main() {
22
    println!("Bad  = {} bytes", size_of::<Bad>());
23
    println!("Good = {} bytes", size_of::<Good>());
24
}

Output:

1
Bad  = 24 bytes
2
Good = 16 bytes

In Bad, the bool and u8 sit before larger-aligned fields, forcing the compiler to insert padding so score and xp land on aligned offsets. In Good, ordering largest-aligned first (f64, then u32, then the two single bytes) leaves only two trailing padding bytes, shrinking the struct from 24 to 16. Note that with the default repr(Rust) both orderings would already be 16 — the reordering matters only because we opted into C layout.

Exercise 2: Predict niche-optimized sizes

Difficulty: Medium

Objective: Build intuition for when Option is free and when it costs an extra word.

Instructions: Before running anything, predict the size_of for each of these, then write a program that prints them and check yourself: Option<&u32>, Option<u32>, Option<bool>, a three-variant unit enum enum Tri { A, B, C }, Option<Tri>, and Option<Box<[u8; 256]>>. For each, say in a comment whether the niche optimization applied and why.

Solution

1
use std::mem::size_of;
2

3
#[allow(dead_code)]
4
enum Tri {
5
    A,
6
    B,
7
    C,
8
}
9

10
fn main() {
11
    // Niche applies: references are never null, so None reuses the 0 pattern.
12
    println!("Option<&u32>          {}", size_of::<Option<&u32>>()); // 8
13

14
    // No niche: every u32 bit pattern is valid, so a discriminant is added
15
    // and alignment rounds the total to 8.
16
    println!("Option<u32>           {}", size_of::<Option<u32>>()); // 8
17

18
    // Niche applies: bool only uses 0 and 1, leaving 254 spare values.
19
    println!("Option<bool>          {}", size_of::<Option<bool>>()); // 1
20

21
    // A 3-variant unit enum needs only a 1-byte discriminant...
22
    println!("Tri                   {}", size_of::<Tri>()); // 1
23

24
    // ...and it has spare discriminant values, so Option reuses one: still 1.
25
    println!("Option<Tri>           {}", size_of::<Option<Tri>>()); // 1
26

27
    // Niche applies: Box is never null, so the Option is just a pointer.
28
    println!("Option<Box<[u8;256]>> {}", size_of::<Option<Box<[u8; 256]>>>()); // 8
29
}

Output:

1
Option<&u32>          8
2
Option<u32>           8
3
Option<bool>          1
4
Tri                   1
5
Option<Tri>           1
6
Option<Box<[u8;256]>> 8

The pattern: if a type has a forbidden bit pattern (null pointer, the unused range of a bool, the spare discriminants of a small enum), Option is free. Only types that use every bit pattern — like u32 — pay for a separate discriminant.

Exercise 3: A byte-exact protocol opcode

Difficulty: Advanced

Objective: Combine #[repr(u8)], explicit discriminants, as casts, and a fallible decoder to model a one-byte wire opcode.

Instructions: Define #[repr(u8)] enum Opcode { Hello = 0x10, Data = 0x20, Bye = 0x30 } deriving Debug, Clone, Copy, PartialEq. Add an associated function from_byte(b: u8) -> Option<Opcode> that maps known bytes to variants and returns None otherwise. In main, assert the opcode is exactly one byte, encode Opcode::Data to its byte with an as cast and print it in hex, then decode both a valid byte (0x30) and an invalid one (0x99).

Solution

1
use std::mem::size_of;
2

3
#[repr(u8)]
4
#[derive(Debug, Clone, Copy, PartialEq)]
5
enum Opcode {
6
    Hello = 0x10,
7
    Data = 0x20,
8
    Bye = 0x30,
9
}
10

11
impl Opcode {
12
    fn from_byte(b: u8) -> Option<Opcode> {
13
        match b {
14
            0x10 => Some(Opcode::Hello),
15
            0x20 => Some(Opcode::Data),
16
            0x30 => Some(Opcode::Bye),
17
            _ => None,
18
        }
19
    }
20
}
21

22
fn main() {
23
    assert_eq!(size_of::<Opcode>(), 1);
24

25
    let wire: u8 = Opcode::Data as u8;
26
    println!("Data on the wire = 0x{:02X}", wire);
27

28
    println!("decode 0x30 = {:?}", Opcode::from_byte(0x30));
29
    println!("decode 0x99 = {:?}", Opcode::from_byte(0x99));
30
}

Output:

1
Data on the wire = 0x20
2
decode 0x30 = Some(Bye)
3
decode 0x99 = None

#[repr(u8)] guarantees the single-byte size and gives the as u8 cast a well-defined result (the explicit discriminant). Because a raw byte off the network can hold any of 256 values — not just the three you defined — the decoder must be fallible: from_byte returns Option<Opcode>, never an invalid Opcode. This is the type-safe alternative to casting a stray byte straight into an enum, which would be undefined behavior.

Memory Layout: Size, Alignment, and How Rust Packs Your Data

Quick Overview

TypeScript/JavaScript Example

Rust Equivalent

Detailed Explanation

Size and alignment, the two numbers every type carries

Why padding exists

Rust reorders fields for you (the big difference from C and from TypeScript)

Inspecting offsets with offset_of!

Key Differences

Niche optimization: why Option<&T> is free

Enum sizes: tag plus the biggest variant

Common Pitfalls

Pitfall 1: Taking a reference into a #[repr(packed)] struct

Pitfall 2: Assuming declaration order controls size

Pitfall 3: Expecting a stable layout from repr(Rust)

Pitfall 4: A giant enum variant bloating a hot Vec

Best Practices

Let repr(Rust) do the packing; only override when bytes leave your program

When you must use #[repr(C)], order fields largest-aligned first

Box the fat variant to shrink an enum

Fix integer discriminants for protocol enums

Measure, don’t guess

Real-World Example

Further Reading

Exercises

Exercise 1: Shrink a repr(C) struct by reordering

Exercise 2: Predict niche-optimized sizes

Exercise 3: A byte-exact protocol opcode

Inspecting offsets with `offset_of!`

Niche optimization: why `Option<&T>` is free

Pitfall 1: Taking a reference into a `#[repr(packed)]` struct

Pitfall 3: Expecting a stable layout from `repr(Rust)`

Pitfall 4: A giant enum variant bloating a hot `Vec`

Let `repr(Rust)` do the packing; only override when bytes leave your program

When you must use `#[repr(C)]`, order fields largest-aligned first

Exercise 1: Shrink a `repr(C)` struct by reordering