Deploying Axum Applications

23 min read

Quick Overview

Deploying a Rust web service is, in most respects, easier than deploying a Node app: cargo build --release produces a single, self-contained, statically-ish linked native binary — there is no node_modules to ship, no separate runtime to install on the server, and no transpile step at deploy time. This page shows how a TypeScript/JavaScript developer goes from npm run build && node dist/index.js to a Rust release build, a slim multi-stage Docker image, the handful of operational habits Rust requires (binding 0.0.0.0, reading config from the environment, graceful shutdown), and where Rust deployment genuinely differs from Node deployment.

Note: This page uses axum 0.8 (current stable 0.8.9). The current stable toolchain is Rust 1.96.0 on the latest stable edition (2024); cargo new selects it automatically. Servers are started with axum::serve(listener, app) over a tokio::net::TcpListener, never the removed Server::bind().serve() builder from older axum.

TypeScript/JavaScript Example

A typical production Express service ships a transpiled dist/, reads config from process.env, binds 0.0.0.0 so it is reachable inside a container, and exits cleanly on SIGTERM. Here is the kind of index.ts and Dockerfile that pair you would deploy:

1
// src/index.ts — Express 5, production-shaped
2
import express from "express";
3

4
const app = express();
5
app.use(express.json());
6

7
app.get("/healthz", (_req, res) => {
8
  res.json({ status: "ok" });
9
});
10

11
// Read config from the environment, with sane local defaults.
12
const port = Number(process.env.PORT ?? 8080);
13
// Bind 0.0.0.0 (all interfaces) so the socket is reachable from outside a container.
14
const host = process.env.HOST ?? "0.0.0.0";
15

16
const server = app.listen(port, host, () => {
17
  console.log(`listening on http://${host}:${port}`);
18
});
19

20
// Orchestrators (Kubernetes, `docker stop`) send SIGTERM to ask for shutdown.
21
process.on("SIGTERM", () => {
22
  server.close(() => process.exit(0));
23
});

1
# Dockerfile — a typical Node multi-stage build
2
FROM node:22-slim AS builder
3
WORKDIR /app
4
COPY package*.json ./
5
RUN npm ci
6
COPY . .
7
RUN npm run build           # tsc -> dist/
8

9
FROM node:22-slim AS runtime
10
WORKDIR /app
11
ENV NODE_ENV=production
12
COPY package*.json ./
13
RUN npm ci --omit=dev       # prod deps only, but node_modules still ships
14
COPY --from=builder /app/dist ./dist
15
EXPOSE 8080
16
CMD ["node", "dist/index.js"]

The runtime image still contains Node itself plus a production node_modules tree — commonly 150–400 MB. The deploy artifact is “interpreter + your JavaScript + its dependency tree.”

Rust Equivalent

The deploy artifact is one file: the compiled binary. First, the production-shaped server — config from the environment, 0.0.0.0 binding, structured logs, a per-request timeout, and graceful shutdown:

1
cargo add axum
2
cargo add tokio --features full
3
cargo add serde --features derive
4
cargo add tower-http --features "trace timeout"
5
cargo add tracing
6
cargo add tracing-subscriber --features env-filter

1
use std::{net::SocketAddr, time::Duration};
2

3
use axum::{
4
    extract::State,
5
    http::StatusCode,
6
    routing::get,
7
    Json, Router,
8
};
9
use serde::Serialize;
10
use tokio::signal;
11
use tower_http::{timeout::TimeoutLayer, trace::TraceLayer};
12

13
/// Runtime configuration, loaded once from the environment at startup.
14
#[derive(Clone, Debug)]
15
struct Config {
16
    /// Address to bind, e.g. "0.0.0.0:8080".
17
    bind_addr: SocketAddr,
18
    database_url: String,
19
}
20

21
impl Config {
22
    fn from_env() -> Result<Self, String> {
23
        // PORT is the de-facto standard many platforms (Render, Railway,
24
        // Fly.io, Cloud Run) inject; default to 8080 for local runs.
25
        let port: u16 = std::env::var("PORT")
26
            .unwrap_or_else(|_| "8080".to_string())
27
            .parse()
28
            .map_err(|_| "PORT must be a number".to_string())?;
29

30
        // Bind 0.0.0.0 in containers so the socket is reachable from outside
31
        // the container, not just from inside it.
32
        let host = std::env::var("HOST").unwrap_or_else(|_| "0.0.0.0".to_string());
33
        let bind_addr: SocketAddr = format!("{host}:{port}")
34
            .parse()
35
            .map_err(|_| "HOST/PORT did not form a valid socket address".to_string())?;
36

37
        // Required secrets fail loudly at startup, not on the first request.
38
        let database_url =
39
            std::env::var("DATABASE_URL").map_err(|_| "DATABASE_URL is required".to_string())?;
40

41
        Ok(Config { bind_addr, database_url })
42
    }
43
}
44

45
#[derive(Clone)]
46
struct AppState {
47
    config: Config,
48
}
49

50
#[derive(Serialize)]
51
struct Health {
52
    status: &'static str,
53
}
54

55
async fn health() -> Json<Health> {
56
    Json(Health { status: "ok" })
57
}
58

59
async fn root(State(state): State<AppState>) -> String {
60
    format!("connected to {}", state.config.database_url)
61
}
62

63
fn app(state: AppState) -> Router {
64
    Router::new()
65
        .route("/", get(root))
66
        .route("/healthz", get(health))
67
        // Per-request timeout so a slow handler cannot pin a connection forever.
68
        .layer(TimeoutLayer::with_status_code(
69
            StatusCode::REQUEST_TIMEOUT,
70
            Duration::from_secs(15),
71
        ))
72
        .layer(TraceLayer::new_for_http())
73
        .with_state(state)
74
}
75

76
/// Resolve when the process receives Ctrl-C or (on Unix) SIGTERM — the signal
77
/// orchestrators like Kubernetes and `docker stop` send to ask for shutdown.
78
async fn shutdown_signal() {
79
    let ctrl_c = async {
80
        signal::ctrl_c().await.expect("failed to install Ctrl-C handler");
81
    };
82

83
    #[cfg(unix)]
84
    let terminate = async {
85
        signal::unix::signal(signal::unix::SignalKind::terminate())
86
            .expect("failed to install SIGTERM handler")
87
            .recv()
88
            .await;
89
    };
90

91
    #[cfg(not(unix))]
92
    let terminate = std::future::pending::<()>();
93

94
    tokio::select! {
95
        _ = ctrl_c => {},
96
        _ = terminate => {},
97
    }
98
    tracing::info!("shutdown signal received, draining connections");
99
}
100

101
#[tokio::main]
102
async fn main() -> Result<(), Box<dyn std::error::Error>> {
103
    // Structured logs to stdout; the platform collects them. RUST_LOG controls
104
    // verbosity, e.g. RUST_LOG=info,tower_http=debug.
105
    tracing_subscriber::fmt()
106
        .with_env_filter(
107
            tracing_subscriber::EnvFilter::try_from_default_env()
108
                .unwrap_or_else(|_| "info,tower_http=debug".into()),
109
        )
110
        .init();
111

112
    let config = Config::from_env().map_err(|e| {
113
        tracing::error!("configuration error: {e}");
114
        e
115
    })?;
116

117
    let state = AppState { config: config.clone() };
118
    let listener = tokio::net::TcpListener::bind(config.bind_addr).await?;
119
    tracing::info!("listening on http://{}", listener.local_addr()?);
120

121
    axum::serve(listener, app(state))
122
        .with_graceful_shutdown(shutdown_signal())
123
        .await?;
124
    Ok(())
125
}

Build it for production and run it with real environment variables:

1
cargo build --release
2
PORT=8080 DATABASE_URL="postgres://localhost/app" \
3
  RUST_LOG=info,tower_http=debug \
4
  ./target/release/myapi

Real startup log and responses (captured from running the binary above and curling it):

1
2026-06-01T12:28:24.340167Z  INFO myapi: listening on http://0.0.0.0:8080
2
2026-06-01T12:28:24.979435Z DEBUG request{method=GET uri=/healthz version=HTTP/1.1}: tower_http::trace::on_request: started processing request
3
2026-06-01T12:28:24.979550Z DEBUG request{method=GET uri=/healthz version=HTTP/1.1}: tower_http::trace::on_response: finished processing request latency=0 ms status=200

1
$ curl -s http://127.0.0.1:8080/healthz
2
{"status":"ok"}
3
$ curl -s -i http://127.0.0.1:8080/healthz | head -4
4
HTTP/1.1 200 OK
5
content-type: application/json
6
content-length: 15
7
date: Mon, 01 Jun 2026 12:28:25 GMT

And when a required secret is missing, the process fails at startup (exit code 1) instead of crashing on the first request:

1
$ PORT=8080 ./target/release/myapi
2
2026-06-01T12:28:36.290088Z ERROR myapi: configuration error: DATABASE_URL is required
3
Error: "DATABASE_URL is required"
4
$ echo $?
5
1

Detailed Explanation

cargo build --release is the deploy build. Without --release, cargo build produces an unoptimized debug binary that can be an order of magnitude slower — it is for local iteration only. The release binary lands in target/release/<crate-name>. This is the single line that replaces Node’s tsc transpile and the node runtime: the output is native machine code, not JavaScript that an interpreter still has to parse and JIT at runtime. There is no warm-up: a release binary is at full speed from the first request.

Config comes from the environment. Config::from_env() mirrors process.env access in Node, but with one deliberate difference: a missing required variable (DATABASE_URL) returns an Err that propagates out of main via ?, so the process exits non-zero before it ever binds a port. In Node it is common for a missing process.env.X to be undefined and only blow up later, deep inside a request handler. Failing fast at startup means a bad deploy is caught immediately by your platform’s health check, not by your first user.

bind_addr defaults to 0.0.0.0. This is the single most common deployment mistake for newcomers. 127.0.0.1 (loopback) only accepts connections from inside the same network namespace — inside the container itself. A container that binds 127.0.0.1 will pass its own internal health check and then reject every connection from the host or the orchestrator. Binding 0.0.0.0 listens on all interfaces, which is what containers and PaaS platforms require. (SocketAddr is std’s parsed IP:port type; parsing "0.0.0.0:8080" into it validates the address at startup.)

PORT is read from the environment. Most managed platforms — Render, Railway, Fly.io, Google Cloud Run, Heroku — inject the port your service must listen on via $PORT and route external traffic to it. Hardcoding 3000 will fail on those platforms. The default of 8080 is for local runs.

TraceLayer writes structured request logs to stdout. Production logging belongs on stdout/stderr; the platform (Docker, journald, your log aggregator) is responsible for collecting it. tracing_subscriber’s EnvFilter reads the RUST_LOG variable, the Rust analogue of DEBUG=express:* — RUST_LOG=info,tower_http=debug shows info-level app logs plus debug-level HTTP traces. See middleware.md for the layer mechanics.

with_graceful_shutdown drains in-flight requests. When the process receives SIGTERM (what docker stop and Kubernetes send first, before SIGKILL), axum::serve stops accepting new connections but lets in-flight requests finish. This is the direct equivalent of Node’s server.close() in a SIGTERM handler. Without it, the binary would be killed mid-request on every deploy. The #[cfg(unix)] block adds SIGTERM on top of Ctrl-C (SIGINT); on non-Unix the terminate future is pending() (never resolves), so only Ctrl-C triggers shutdown there.

A per-request TimeoutLayer ensures one stuck handler cannot tie up a connection indefinitely. In axum 0.8 / tower-http 0.6 the constructor is TimeoutLayer::with_status_code(status, duration); the older bare TimeoutLayer::new(duration) is deprecated.

Key Differences

Concern	Node / Express	Rust / Axum
Deploy artifact	Interpreter + your JS + `node_modules` (often 150–400 MB)	One native binary (~1–5 MB), optionally a slim base image
Build step	`tsc` transpile at build; V8 JITs at runtime	`cargo build --release` produces optimized machine code; no runtime warm-up
Runtime on server	Node must be installed/present	None — the binary is self-contained (with a libc, or fully static with musl)
Startup time	Process start + module load	Process start only (no module graph to load)
Memory baseline	Tens to hundreds of MB	Typically single-digit to low-tens of MB
Missing config	Often `undefined`, fails later in a handler	`?` out of `main`, process exits non-zero at startup
Graceful shutdown	`server.close()` in a `SIGTERM` handler	`.with_graceful_shutdown(future)` on `axum::serve`
Concurrency model	Single-threaded event loop; scale with cluster/PM2	Tokio multi-threaded runtime uses all cores in one process

Note: Because one Axum process already uses all CPU cores via the Tokio work-stealing runtime, you usually do not run a process-per-core supervisor like PM2 cluster or Node’s cluster module. One container = one binary = all cores. This is covered conceptually in the async section.

The deepest difference is the dependency story. In Node, dependencies are resolved and present at runtime inside node_modules. In Rust, every crate your code uses is compiled into the binary at build time — there is nothing to install on the server. The cost is paid once, during cargo build, which is why Docker layer caching of dependencies (below) matters so much for CI speed.

Common Pitfalls

Pitfall 1: Binding `127.0.0.1` inside a container

1
// Wrong for containers: only reachable from inside the container itself.
2
let listener = tokio::net::TcpListener::bind("127.0.0.1:8080").await?;

The server starts fine and even passes a self-issued health check, but the orchestrator and the host cannot reach it — every external request is refused. Bind 0.0.0.0 (all interfaces) in any containerized or PaaS deployment:

1
// Reachable from outside the container.
2
let listener = tokio::net::TcpListener::bind("0.0.0.0:8080").await?;

Pitfall 2: Shipping (or worse, deploying) the debug binary

Running plain cargo build and copying target/debug/myapi into your image ships an unoptimized binary. Debug builds skip optimizations and embed extra debug info; they can be many times slower and substantially larger. Always build with --release for deployment, and point your Dockerfile’s COPY --from=builder at target/release/..., not target/debug/....

Pitfall 3: Hardcoding the port

1
// Breaks on Render/Railway/Fly.io/Cloud Run, which inject $PORT.
2
let listener = tokio::net::TcpListener::bind("0.0.0.0:3000").await?;

Read PORT from the environment with a local default, as in the main example. A hardcoded port means the platform routes traffic to a port nothing is listening on.

Pitfall 4: Forgetting graceful shutdown, then losing requests on every deploy

Without .with_graceful_shutdown(...), the process is terminated immediately on SIGTERM and any in-flight request is dropped — visible to users as connection resets during every rolling deploy. Wire up the shutdown future once and the problem disappears.

Pitfall 5: A `glibc` mismatch between build and runtime images

If you build on a newer Debian/Ubuntu and copy the binary into an older or different base (or a musl-based Alpine image without recompiling for musl), the binary may fail to start with a dynamic-linker error such as version 'GLIBC_2.x' not found or no such file or directory (for the missing loader). Two reliable fixes: build and run on the same glibc (e.g. rust:1.96-slim builder → gcr.io/distroless/cc-debian12 runtime, both Debian 12), or build a fully static binary against musl (rustup target add x86_64-unknown-linux-musl then cargo build --release --target x86_64-unknown-linux-musl) so there is no dynamic-linking requirement at all.

Best Practices

Shrink the release binary with a profile

A default cargo build --release of the server above produced a 2.5 MB binary. Adding a size-tuned [profile.release] to Cargo.toml brought it down to 968 KB (measured on the same code, this machine):

1
[profile.release]
2
opt-level = "z"     # optimize for size ("s" is a slightly faster middle ground)
3
lto = true          # link-time optimization across crate boundaries
4
codegen-units = 1   # one codegen unit: better optimization, slower compile
5
strip = true        # strip symbols from the binary
6
panic = "abort"     # abort on panic; drops unwinding tables (std::panic::catch_unwind can no longer recover)

Tip: opt-level = "z"/"s" optimize for size; the default release opt-level = 3 optimizes for speed. For a network service, raw binary size rarely matters as much as throughput, so many teams keep opt-level = 3 and only add lto = true, codegen-units = 1, and strip = true. Measure before choosing — panic = "abort" in particular changes runtime behavior (a panic aborts the process instead of unwinding), which is usually fine and even desirable for a stateless web service, but confirm it suits yours.

Multi-stage Docker build with dependency caching

The whole point of a multi-stage build is to compile in a fat image with the full Rust toolchain, then copy only the resulting binary into a tiny runtime image. The dependency-caching trick — build a dummy main.rs from just the manifests first — means cargo only recompiles your dependency graph when Cargo.toml/Cargo.lock change, not on every source edit:

1
# ---- Stage 1: build ----
2
# Pin the toolchain so CI builds are reproducible.
3
FROM rust:1.96-slim AS builder
4
WORKDIR /app
5

6
# Cache dependencies: copy only the manifests first, build a dummy main,
7
# then copy the real sources. The dependency layer only rebuilds when Cargo.* changes.
8
COPY Cargo.toml Cargo.lock ./
9
RUN mkdir src && echo "fn main() {}" > src/main.rs \
10
    && cargo build --release \
11
    && rm -rf src
12

13
COPY src ./src
14
# `touch` so Cargo sees the real main.rs as newer than the dummy build.
15
RUN touch src/main.rs && cargo build --release
16

17
# ---- Stage 2: runtime ----
18
# Distroless "cc" image: a glibc + libstdc++ runtime, no shell, no package
19
# manager, runs as a non-root user — a tiny attack surface.
20
FROM gcr.io/distroless/cc-debian12 AS runtime
21
WORKDIR /app
22
COPY --from=builder /app/target/release/myapi /usr/local/bin/myapi
23
ENV PORT=8080
24
EXPOSE 8080
25
USER nonroot:nonroot
26
CMD ["myapi"]

Add a .dockerignore so the local target/ directory (which can be gigabytes) is never sent to the Docker daemon:

1
target
2
.git
3
Dockerfile
4
.dockerignore

Build, run, and verify (real output from building the myapi project above with this exact Dockerfile):

1
$ docker build -t myapi:latest .
2
...
3
 => [builder 6/6] RUN touch src/main.rs && cargo build --release
4
 #13 2.878    Compiling myapi v0.1.0 (/app)
5
 #13 2.878     Finished `release` profile [optimized] target(s) in 2.04s
6
 => exporting to image ... done
7

8
$ docker images myapi:latest --format '{{.Repository}}:{{.Tag}}  {{.Size}}'
9
myapi:latest  36.2MB
10

11
# The server requires DATABASE_URL, so pass it in; the Dockerfile already sets PORT=8080.
12
$ docker run -d -e DATABASE_URL=postgres://localhost/app -p 18080:8080 myapi:latest
13
$ curl -s http://127.0.0.1:18080/healthz
14
{"status":"ok"}
15
$ docker logs <container>
16
2026-06-01T12:30:11.482913Z  INFO myapi: listening on http://0.0.0.0:8080

The final image is 36.2 MB — most of which is the distroless base; the binary itself is around 1–3 MB. Compare that to a typical 150–400 MB Node runtime image. Notice the second cargo build finished in 2.04s because the dependency layer was cached.

Note: The -e DATABASE_URL=... flag is required because Config::from_env() treats DATABASE_URL as a mandatory secret and exits non-zero at startup if it is missing — exactly the fail-fast behavior shown earlier. Without it the container would crash on launch and curl would get a connection refused, not {"status":"ok"}.

Tip: For even faster CI, replace the manual dummy-main.rs trick with cargo-chef, which computes a recipe of your dependencies and caches them as a dedicated Docker layer. For statically-linked images on scratch or Alpine, build against x86_64-unknown-linux-musl and copy into FROM scratch — the binary then needs no base OS at all.

Run as non-root and add a health check

The distroless USER nonroot:nonroot line above runs the process unprivileged. Expose a cheap /healthz route (no database call) for liveness and a separate readiness route if you need to gate traffic on dependencies being up. Most platforms poll an HTTP health endpoint; your Dockerfile can also declare one:

1
# Optional: container-level health check (note distroless has no shell,
2
# so use an exec-form check that does not rely on /bin/sh).
3
HEALTHCHECK --interval=30s --timeout=3s --start-period=5s \
4
  CMD ["/usr/local/bin/myapi", "--health-check"]

Note: Distroless images have no shell, so the common CMD curl ... health check (which needs /bin/sh and curl) will not work there. Either add a tiny --health-check subcommand to your binary, switch the runtime base to debian:bookworm-slim (which has a shell), or let the orchestrator do the HTTP probe instead of Docker.

Reverse proxy and TLS termination

In production you usually put a reverse proxy (Nginx, Caddy, Traefik, or your cloud load balancer) in front of Axum. The proxy terminates TLS and forwards plain HTTP to your app on 0.0.0.0:8080. A minimal Nginx server block:

1
server {
2
    listen 443 ssl;
3
    server_name api.example.com;
4

5
    ssl_certificate     /etc/letsencrypt/live/api.example.com/fullchain.pem;
6
    ssl_certificate_key /etc/letsencrypt/live/api.example.com/privkey.pem;
7

8
    location / {
9
        proxy_pass http://127.0.0.1:8080;
10
        proxy_set_header Host $host;
11
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
12
        proxy_set_header X-Forwarded-Proto $scheme;
13
    }
14
}

This is the same pattern you would use in front of Express, and the reasoning is identical: a battle-tested proxy handles TLS, HTTP/2, compression, and rate limiting at the edge while your app speaks plain HTTP behind it.

Tip: When you sit behind a proxy, the client IP arrives in X-Forwarded-For, not on the TCP socket. To read the real client IP in a handler, parse that header (via tower-http’s SetSensitiveHeaders/your own extractor) rather than using ConnectInfo<SocketAddr>, which would give you the proxy’s address. Only trust forwarded headers from a proxy you control.

Axum can terminate TLS itself (e.g. with axum-server + rustls) when there is no proxy — common on Fly.io or a bare VM — but a fronting proxy or platform load balancer is the more common production shape.

Keep secrets out of the image

Never COPY a .env file or bake secrets into a layer — image layers are cacheable and inspectable. Inject secrets at runtime via environment variables (docker run -e, Kubernetes Secret, your platform’s secret store). For local development, the dotenvy crate can load a git-ignored .env, but treat that strictly as a dev convenience.

Real-World Example

A deployment-ready binary that ties the pieces together: environment-driven config that fails fast, a database-pool placeholder in shared state, 0.0.0.0/$PORT binding, request tracing, a per-request timeout, a body-size limit, and graceful shutdown. This compiles and runs as shown above.

1
cargo add axum
2
cargo add tokio --features full
3
cargo add serde --features derive
4
cargo add tower-http --features "trace timeout limit"
5
cargo add tracing
6
cargo add tracing-subscriber --features env-filter

1
use std::{net::SocketAddr, time::Duration};
2

3
use axum::{
4
    extract::State,
5
    http::StatusCode,
6
    routing::get,
7
    Json, Router,
8
};
9
use serde::Serialize;
10
use tokio::signal;
11
use tower_http::{
12
    limit::RequestBodyLimitLayer, timeout::TimeoutLayer, trace::TraceLayer,
13
};
14

15
#[derive(Clone, Debug)]
16
struct Config {
17
    bind_addr: SocketAddr,
18
    database_url: String,
19
    max_body_bytes: usize,
20
}
21

22
impl Config {
23
    fn from_env() -> Result<Self, String> {
24
        let port: u16 = std::env::var("PORT")
25
            .unwrap_or_else(|_| "8080".to_string())
26
            .parse()
27
            .map_err(|_| "PORT must be a number".to_string())?;
28
        let host = std::env::var("HOST").unwrap_or_else(|_| "0.0.0.0".to_string());
29
        let bind_addr: SocketAddr = format!("{host}:{port}")
30
            .parse()
31
            .map_err(|_| "HOST/PORT did not form a valid socket address".to_string())?;
32

33
        let database_url =
34
            std::env::var("DATABASE_URL").map_err(|_| "DATABASE_URL is required".to_string())?;
35

36
        let max_body_bytes: usize = std::env::var("MAX_BODY_BYTES")
37
            .unwrap_or_else(|_| "1048576".to_string()) // 1 MiB default
38
            .parse()
39
            .map_err(|_| "MAX_BODY_BYTES must be a number".to_string())?;
40

41
        Ok(Config { bind_addr, database_url, max_body_bytes })
42
    }
43
}
44

45
#[derive(Clone)]
46
struct AppState {
47
    config: Config,
48
    // In a real app this would hold a `sqlx::PgPool` or similar; see
49
    // ../17-database/README.md. We keep a string here so the example is
50
    // self-contained and compiles without a database crate.
51
    db: String,
52
}
53

54
#[derive(Serialize)]
55
struct Health {
56
    status: &'static str,
57
}
58

59
// Liveness: cheap, no dependencies. Used by orchestrator liveness probes.
60
async fn healthz() -> Json<Health> {
61
    Json(Health { status: "ok" })
62
}
63

64
// Readiness: confirm dependencies are reachable before accepting traffic.
65
async fn readyz(State(state): State<AppState>) -> Result<Json<Health>, StatusCode> {
66
    if state.db.is_empty() {
67
        // 503 tells the load balancer "not ready, do not route to me yet".
68
        return Err(StatusCode::SERVICE_UNAVAILABLE);
69
    }
70
    Ok(Json(Health { status: "ready" }))
71
}
72

73
fn app(state: AppState) -> Router {
74
    let max_body = state.config.max_body_bytes;
75
    Router::new()
76
        .route("/healthz", get(healthz))
77
        .route("/readyz", get(readyz))
78
        .layer(RequestBodyLimitLayer::new(max_body))
79
        .layer(TimeoutLayer::with_status_code(
80
            StatusCode::REQUEST_TIMEOUT,
81
            Duration::from_secs(15),
82
        ))
83
        .layer(TraceLayer::new_for_http())
84
        .with_state(state)
85
}
86

87
async fn shutdown_signal() {
88
    let ctrl_c = async {
89
        signal::ctrl_c().await.expect("failed to install Ctrl-C handler");
90
    };
91

92
    #[cfg(unix)]
93
    let terminate = async {
94
        signal::unix::signal(signal::unix::SignalKind::terminate())
95
            .expect("failed to install SIGTERM handler")
96
            .recv()
97
            .await;
98
    };
99

100
    #[cfg(not(unix))]
101
    let terminate = std::future::pending::<()>();
102

103
    tokio::select! {
104
        _ = ctrl_c => {},
105
        _ = terminate => {},
106
    }
107
    tracing::info!("shutdown signal received, draining connections");
108
}
109

110
#[tokio::main]
111
async fn main() -> Result<(), Box<dyn std::error::Error>> {
112
    tracing_subscriber::fmt()
113
        .with_env_filter(
114
            tracing_subscriber::EnvFilter::try_from_default_env()
115
                .unwrap_or_else(|_| "info,tower_http=debug".into()),
116
        )
117
        .init();
118

119
    let config = Config::from_env().map_err(|e| {
120
        tracing::error!("configuration error: {e}");
121
        e
122
    })?;
123

124
    // Pretend to open a connection pool from config.database_url here.
125
    let state = AppState { db: config.database_url.clone(), config: config.clone() };
126

127
    let listener = tokio::net::TcpListener::bind(config.bind_addr).await?;
128
    tracing::info!("listening on http://{}", listener.local_addr()?);
129

130
    axum::serve(listener, app(state))
131
        .with_graceful_shutdown(shutdown_signal())
132
        .await?;
133
    Ok(())
134
}

This separates liveness (/healthz: am I running?) from readiness (/readyz: are my dependencies up and should I receive traffic?), which is exactly the distinction Kubernetes liveness vs. readiness probes expect. RequestBodyLimitLayer (from tower-http’s limit feature) rejects oversized request bodies before they reach a handler — a cheap, important hardening step for any public API. Swap the db: String placeholder for a real sqlx::PgPool as described in the database section, and pair it with the connection-pool startup pattern from state-management.md.

Exercises

Exercise 1: Read the port from the environment

Difficulty: Beginner

Objective: Make a server deploy-ready by binding 0.0.0.0 and reading PORT from the environment with a sensible default.

Instructions: Start from a hello-world Axum app. Replace any hardcoded 127.0.0.1:3000 bind address with one that reads the PORT environment variable (default 8080) and binds 0.0.0.0. Print the bound address on startup. Verify it works by running it twice: once with PORT unset, once with PORT=9000.

Solution

1
// cargo add axum
2
// cargo add tokio --features full
3
use axum::{routing::get, Router};
4

5
async fn root() -> &'static str {
6
    "hello"
7
}
8

9
#[tokio::main]
10
async fn main() {
11
    let app = Router::new().route("/", get(root));
12

13
    // Default to 8080; many platforms inject the real port via $PORT.
14
    let port = std::env::var("PORT").unwrap_or_else(|_| "8080".to_string());
15
    // Bind 0.0.0.0 so the socket is reachable from outside a container.
16
    let addr = format!("0.0.0.0:{port}");
17

18
    let listener = tokio::net::TcpListener::bind(&addr).await.unwrap();
19
    println!("listening on http://{}", listener.local_addr().unwrap());
20

21
    axum::serve(listener, app).await.unwrap();
22
}

Running it (real output from this code):

1
$ cargo run
2
listening on http://0.0.0.0:8080
3
$ PORT=9000 cargo run
4
listening on http://0.0.0.0:9000

Reading PORT from the environment with a default is the smallest change that makes a Rust web server portable across local runs and managed platforms.

Exercise 2: Add graceful shutdown

Difficulty: Intermediate

Objective: Drain in-flight requests on SIGINT (Ctrl-C) and SIGTERM instead of dropping them.

Instructions: Take the server from Exercise 1 and add a shutdown_signal() async function that resolves on either Ctrl-C or (on Unix) SIGTERM, then pass it to axum::serve(...).with_graceful_shutdown(...). Print a message when the signal arrives. Verify by starting the server and pressing Ctrl-C: it should log the shutdown message and exit cleanly.

Solution

1
// cargo add axum
2
// cargo add tokio --features full
3
use axum::{routing::get, Router};
4
use tokio::signal;
5

6
async fn root() -> &'static str {
7
    "hello"
8
}
9

10
async fn shutdown_signal() {
11
    let ctrl_c = async {
12
        signal::ctrl_c().await.expect("failed to install Ctrl-C handler");
13
    };
14

15
    #[cfg(unix)]
16
    let terminate = async {
17
        signal::unix::signal(signal::unix::SignalKind::terminate())
18
            .expect("failed to install SIGTERM handler")
19
            .recv()
20
            .await;
21
    };
22

23
    #[cfg(not(unix))]
24
    let terminate = std::future::pending::<()>();
25

26
    tokio::select! {
27
        _ = ctrl_c => {},
28
        _ = terminate => {},
29
    }
30
    println!("shutdown signal received, draining connections");
31
}
32

33
#[tokio::main]
34
async fn main() {
35
    let app = Router::new().route("/", get(root));
36
    let listener = tokio::net::TcpListener::bind("0.0.0.0:8080").await.unwrap();
37
    println!("listening on http://{}", listener.local_addr().unwrap());
38

39
    axum::serve(listener, app)
40
        .with_graceful_shutdown(shutdown_signal())
41
        .await
42
        .unwrap();
43
}

tokio::select! races the two signal futures; whichever fires first wins, and the function returns, which tells axum::serve to stop accepting new connections and finish in-flight ones. On non-Unix targets the terminate branch is std::future::pending() — a future that never completes — so only Ctrl-C triggers shutdown.

Exercise 3: Multi-stage Dockerfile with a size-tuned profile

Difficulty: Advanced

Objective: Produce a small, secure container image for an Axum binary, building in a Rust toolchain image and shipping only the binary in a distroless runtime.

Instructions: Write a [profile.release] in Cargo.toml that strips symbols and enables LTO, a .dockerignore that excludes target and .git, and a multi-stage Dockerfile that (1) builds with rust:1.96-slim, caching dependencies via the dummy-main.rs trick, and (2) copies only the release binary into gcr.io/distroless/cc-debian12, running as nonroot, listening on $PORT/0.0.0.0. Build the image and curl a health endpoint to confirm.

Solution

Cargo.toml profile:

1
[profile.release]
2
lto = true
3
codegen-units = 1
4
strip = true

.dockerignore:

1
target
2
.git
3
Dockerfile
4
.dockerignore

Dockerfile:

1
# ---- Stage 1: build ----
2
FROM rust:1.96-slim AS builder
3
WORKDIR /app
4

5
# Dependency cache layer: build a dummy main from the manifests only.
6
COPY Cargo.toml Cargo.lock ./
7
RUN mkdir src && echo "fn main() {}" > src/main.rs \
8
    && cargo build --release \
9
    && rm -rf src
10

11
# Now the real sources; only this layer rebuilds on a code change.
12
COPY src ./src
13
RUN touch src/main.rs && cargo build --release
14

15
# ---- Stage 2: runtime ----
16
FROM gcr.io/distroless/cc-debian12 AS runtime
17
WORKDIR /app
18
COPY --from=builder /app/target/release/myapi /usr/local/bin/myapi
19
ENV PORT=8080
20
EXPOSE 8080
21
USER nonroot:nonroot
22
CMD ["myapi"]

Build and verify (real output from building and running this against the myapi server):

1
$ docker build -t myapi:latest .
2
 => exporting to image ... done
3
$ docker images myapi:latest --format '{{.Size}}'
4
36.2MB
5
# Pass the required DATABASE_URL; the Dockerfile already sets PORT=8080.
6
$ docker run -d -e DATABASE_URL=postgres://localhost/app -p 18080:8080 myapi:latest
7
$ curl -s http://127.0.0.1:18080/healthz
8
{"status":"ok"}

The dependency layer is cached, so editing only src/ rebuilds in seconds rather than recompiling every crate. The distroless runtime has no shell or package manager and runs unprivileged, giving a small image with a minimal attack surface — and the deployed artifact is just your binary, not a runtime plus a dependency tree.

Deploying Axum Applications

Quick Overview

TypeScript/JavaScript Example

Rust Equivalent

Detailed Explanation

Key Differences

Common Pitfalls

Pitfall 1: Binding 127.0.0.1 inside a container

Pitfall 2: Shipping (or worse, deploying) the debug binary

Pitfall 3: Hardcoding the port

Pitfall 4: Forgetting graceful shutdown, then losing requests on every deploy

Pitfall 5: A glibc mismatch between build and runtime images

Best Practices

Shrink the release binary with a profile

Multi-stage Docker build with dependency caching

Run as non-root and add a health check

Reverse proxy and TLS termination

Keep secrets out of the image

Real-World Example

Further Reading

Exercises

Exercise 1: Read the port from the environment

Exercise 2: Add graceful shutdown

Exercise 3: Multi-stage Dockerfile with a size-tuned profile

Pitfall 1: Binding `127.0.0.1` inside a container

Pitfall 5: A `glibc` mismatch between build and runtime images