Graceful Shutdown

19 min read

When an orchestrator like Kubernetes redeploys your service, it sends a signal and gives you a few seconds to clean up before it kills the process. Handling that window correctly is the difference between zero-downtime deploys and a stream of 502s for every in-flight request. This page shows how to catch shutdown signals with Tokio and drain in-flight requests with axum’s with_graceful_shutdown.

Quick Overview

Graceful shutdown means: stop accepting new work, let work already in progress finish, then exit. In a Node.js service you reach for process.on("SIGTERM", ...) and server.close(callback). In Rust with Tokio and axum, you build a future that resolves when a signal arrives and hand it to axum::serve(...).with_graceful_shutdown(...). The server then stops accepting connections and waits for outstanding requests to complete before the await returns.

This matters to a TypeScript/JavaScript developer because the mental model is nearly identical to server.close(), but the mechanics are different: instead of a callback you pass a future, and instead of an event-loop hook you compose async building blocks (tokio::select!, CancellationToken) that the compiler checks for you.

TypeScript/JavaScript Example

A typical Node.js HTTP service that drains on SIGTERM. This is realistic production code: it tracks the server lifecycle, flips a “draining” flag, and enforces a hard deadline so a stuck request can never block the deploy forever.

1
// server.mjs — Node v22, graceful shutdown of an http.Server
2
import http from "node:http";
3

4
let shuttingDown = false;
5

6
const server = http.createServer((req, res) => {
7
  if (req.url === "/slow") {
8
    // An in-flight request that takes a while to finish.
9
    setTimeout(() => {
10
      res.writeHead(200, { "Content-Type": "text/plain" });
11
      res.end("done\n");
12
    }, 1500);
13
    return;
14
  }
15
  // Once draining, fail readiness so the load balancer stops sending traffic.
16
  res.writeHead(shuttingDown ? 503 : 200);
17
  res.end(shuttingDown ? "draining\n" : "ok\n");
18
});
19

20
function shutdown(signal: string) {
21
  console.log(`${signal} received, draining`);
22
  shuttingDown = true;
23

24
  // Stop accepting new connections; the callback fires once existing ones close.
25
  server.close(() => {
26
    console.log("all connections drained, exiting");
27
    process.exit(0);
28
  });
29

30
  // Safety net: never wait forever. `.unref()` lets the process exit early
31
  // if draining finishes first.
32
  setTimeout(() => {
33
    console.error("drain timed out, forcing exit");
34
    process.exit(1);
35
  }, 10_000).unref();
36
}
37

38
process.on("SIGTERM", () => shutdown("SIGTERM"));
39
process.on("SIGINT", () => shutdown("SIGINT"));
40

41
server.listen(3100, () => console.log("listening on :3100"));

Running this, sending SIGTERM while a /slow request is in flight, prints (real Node v22 output):

1
listening on :3100
2
SIGTERM received, draining
3
all connections drained, exiting

The in-flight /slow request still returns done with exit code 0; a brand-new connection opened after SIGTERM is refused. That is exactly the behavior we want to reproduce in Rust.

Key points:

process.on("SIGTERM", ...) registers a signal handler on the event loop.
server.close(cb) stops accepting connections and calls back once existing ones finish.
A setTimeout(...).unref() enforces a hard drain deadline.
A shuttingDown flag lets readiness probes fail so traffic stops arriving.

Rust Equivalent

The same idea in axum 0.8 on Tokio. The current stable toolchain is Rust 1.96.0 on the 2024 edition, and cargo new selects it automatically. Add the dependencies:

1
cargo add axum
2
cargo add tokio --features full

1
use std::time::Duration;
2

3
use axum::{extract::State, routing::get, Router};
4
use tokio::signal;
5

6
#[derive(Clone)]
7
struct AppState {
8
    started_at: std::time::Instant,
9
}
10

11
async fn root() -> &'static str {
12
    "Hello from a graceful server\n"
13
}
14

15
async fn slow(State(state): State<AppState>) -> String {
16
    // Simulate an in-flight request that takes a while to finish.
17
    tokio::time::sleep(Duration::from_secs(3)).await;
18
    format!("done after {:?}\n", state.started_at.elapsed())
19
}
20

21
#[tokio::main]
22
async fn main() {
23
    let state = AppState {
24
        started_at: std::time::Instant::now(),
25
    };
26

27
    let app = Router::new()
28
        .route("/", get(root))
29
        .route("/slow", get(slow))
30
        .with_state(state);
31

32
    let listener = tokio::net::TcpListener::bind("127.0.0.1:3000")
33
        .await
34
        .expect("failed to bind");
35

36
    println!("listening on {}", listener.local_addr().unwrap());
37

38
    axum::serve(listener, app)
39
        .with_graceful_shutdown(shutdown_signal())
40
        .await
41
        .expect("server error");
42

43
    println!("server has shut down cleanly");
44
}
45

46
/// A future that resolves when the process should begin shutting down.
47
async fn shutdown_signal() {
48
    let ctrl_c = async {
49
        signal::ctrl_c()
50
            .await
51
            .expect("failed to install Ctrl+C handler");
52
    };
53

54
    #[cfg(unix)]
55
    let terminate = async {
56
        signal::unix::signal(signal::unix::SignalKind::terminate())
57
            .expect("failed to install SIGTERM handler")
58
            .recv()
59
            .await;
60
    };
61

62
    #[cfg(not(unix))]
63
    let terminate = std::future::pending::<()>();
64

65
    tokio::select! {
66
        _ = ctrl_c => {},
67
        _ = terminate => {},
68
    }
69

70
    println!("signal received, starting graceful shutdown");
71
}

Building and running this server, then sending SIGTERM while a curl http://127.0.0.1:3000/slow is in flight, produces this real output:

1
listening on 127.0.0.1:3000
2
signal received, starting graceful shutdown
3
server has shut down cleanly

Crucially, the in-flight /slow request still completes — the client receives done after 3.04008875s with a success exit code — while a new connection attempted after the signal is refused (the listener is already closed). The await on axum::serve(...) returns only after the last in-flight request has been served.

Key points:

axum::serve(listener, app) is the current API. The old axum::Server::bind(...).serve(...) builder was removed; do not use it.
.with_graceful_shutdown(future) makes the server stop accepting connections as soon as future resolves, then drain.
shutdown_signal() is just an async fn — a future you compose from signal sources with tokio::select!.
#[cfg(unix)] guards the SIGTERM handler so the code still compiles on Windows.

Detailed Explanation

`axum::serve` and the shutdown future

1
// (excerpt)
2
axum::serve(listener, app)
3
    .with_graceful_shutdown(shutdown_signal())
4
    .await
5
    .expect("server error");

axum::serve(listener, app) returns a Serve value — a future that, when awaited, runs the accept loop forever. Calling .with_graceful_shutdown(future) wraps it so the accept loop also watches future. The moment future resolves:

The server stops accepting new connections (the TCP listener is dropped).
Connections with a request currently in flight are allowed to finish.
Once they all complete, the outer .await returns and control falls through to println!("server has shut down cleanly").

This is the direct analog of server.close(callback) in Node, but instead of a callback you supply a future describing when to start closing, and the cleanup code is whatever you write after .await.

Note: The future you pass decides when shutdown begins. It does not have to be a signal — it could resolve on a message from a channel, a CancellationToken, or an admin HTTP endpoint. Signals are just the most common trigger.

Catching signals with Tokio

1
// (excerpt)
2
let ctrl_c = async {
3
    signal::ctrl_c().await.expect("failed to install Ctrl+C handler");
4
};
5

6
#[cfg(unix)]
7
let terminate = async {
8
    signal::unix::signal(signal::unix::SignalKind::terminate())
9
        .expect("failed to install SIGTERM handler")
10
        .recv()
11
        .await;
12
};

tokio::signal::ctrl_c() returns a future that resolves once on the next Ctrl+C (SIGINT). That covers interactive use and docker stop --signal SIGINT.

For Unix, SIGTERM is the signal Kubernetes and most process managers send first, so you must handle it explicitly. signal::unix::signal(SignalKind::terminate()) returns a Signal, which is a stream of signal deliveries, not a one-shot future — you call .recv().await to wait for the next one. (Forgetting .recv() is a real compile error; see Common Pitfalls.)

`tokio::select!` — wait for whichever happens first

1
// (excerpt)
2
tokio::select! {
3
    _ = ctrl_c => {},
4
    _ = terminate => {},
5
}

tokio::select! polls several futures concurrently and completes as soon as any one of them resolves, dropping the rest. This is the async equivalent of registering listeners for both SIGINT and SIGTERM and reacting to whichever fires first. It is conceptually close to Promise.race([...]), but select! works on the spot inside an async block, can bind the winning value with patterns, and cancels the losing futures cleanly. See select and join for a deeper comparison.

Why the `#[cfg(not(unix))]` arm exists

1
// (excerpt)
2
#[cfg(not(unix))]
3
let terminate = std::future::pending::<()>();

SignalKind::terminate() does not exist on Windows, so the SIGTERM block is compiled in only on Unix. On other platforms we substitute std::future::pending::<()>() — a future that never resolves — so the select! still type-checks and simply relies on ctrl_c. Without this arm, the code would fail to compile on Windows. This is the conditional-compilation analog of a runtime if (process.platform !== "win32") guard, but resolved at compile time with zero runtime cost.

Key Differences

Concern	Node.js	Rust (`axum` + Tokio)
Register signal handler	`process.on("SIGTERM", cb)`	`tokio::signal::ctrl_c()` / `signal::unix::signal(...)` futures
Stop accepting connections	`server.close(cb)`	`.with_graceful_shutdown(future)` resolves
”When to start” trigger	a callback fired by an event	a future you compose and pass in
Wait for in-flight work	callback fires after sockets close	the `serve(...).await` returns after draining
React to first of N events	`Promise.race([...])`	`tokio::select! { ... }`
Hard drain deadline	`setTimeout(...).unref()`	`tokio::time::timeout(dur, fut)`
Cancel background tasks	manual flags / `AbortController`	`CancellationToken` + `TaskTracker`
Cross-platform signals	`process.platform` checks	`#[cfg(unix)]` conditional compilation

The deepest conceptual shift: in Node you hook the event loop and write imperative cleanup in a callback. In Rust you describe shutdown declaratively as a future, and the runtime drives it. Because futures are values, you can clone the trigger, hand copies to background tasks, and compose timeouts around them — all checked by the compiler.

Warning: Rust futures are lazy. A future does nothing until it is .awaited or spawned onto a runtime — the opposite of an eager JavaScript Promise, which starts executing the moment you create it. signal::ctrl_c() does not begin listening until its future is polled inside select!.

Common Pitfalls

Awaiting a `Signal` directly

A tokio::signal::unix::Signal is a stream of deliveries, not a future. Awaiting it directly does not compile:

1
use tokio::signal::unix::{signal, SignalKind};
2

3
#[tokio::main]
4
async fn main() {
5
    let mut sigterm = signal(SignalKind::terminate()).unwrap();
6
    // does not compile (error[E0277]: `Signal` is not a future)
7
    sigterm.await;
8
}

The real compiler error:

1
error[E0277]: `Signal` is not a future
2
 --> src/main.rs:7:13
3
  |
4
7 |     sigterm.await;
5
  |             ^^^^^ `Signal` is not a future
6
  |
7
  = help: the trait `Future` is not implemented for `Signal`
8
  = note: Signal must be a future or must implement `IntoFuture` to be awaited
9
help: remove the `.await`
10
  |
11
7 -     sigterm.await;
12
7 +     sigterm;
13
  |

The fix is sigterm.recv().await, which waits for the next delivery and yields Option<()>.

Forgetting `with_graceful_shutdown` entirely

1
// Compiles and runs, but NOT graceful.
2
// axum::serve(listener, app).await.unwrap();

If you omit .with_graceful_shutdown(...), the server has no idea a signal arrived. The default runtime behavior on Ctrl+C is to abort the process immediately, severing every in-flight request mid-response. The code compiles fine — the bug is silent and only shows up as truncated responses during a deploy. Always attach a shutdown future to any service that must deploy without dropping requests.

Doing the slow cleanup before the server drains

A tempting mistake is to run all your cleanup (flush metrics, close the DB pool) inside the shutdown future, before the server has drained:

1
// Anti-pattern: close the DB pool while requests are still in flight.
2
// .with_graceful_shutdown(async move {
3
//     wait_for_signal().await;
4
//     db_pool.close().await; // requests still running will now fail!
5
// })

The shutdown future resolves the instant the signal arrives; draining happens after it returns. So tearing down dependencies inside that future yanks them out from under requests that are still completing. Put dependency teardown after axum::serve(...).await, once draining is done. (See the Real-World Example.)

Blocking the async runtime during shutdown

Calling a blocking function (std::thread::sleep, synchronous file I/O, a blocking DB driver) inside the shutdown path stalls a Tokio worker thread and can wedge the drain. Use the async equivalents (tokio::time::sleep, tokio::fs) or tokio::task::spawn_blocking for unavoidably blocking work. See async vs sync.

Not bounding the drain

If one request hangs forever (a slow upstream, a deadlock), an unbounded drain blocks your deploy indefinitely, and Kubernetes will eventually SIGKILL you anyway — ungracefully. Always wrap the drain in a deadline with tokio::time::timeout, mirroring the Node setTimeout(...).unref() safety net.

Best Practices

Always handle both SIGINT and SIGTERM. SIGTERM is what orchestrators send; SIGINT is Ctrl+C in development. Handle both with tokio::select!.
Guard SIGTERM with #[cfg(unix)] and supply a std::future::pending() fallback so the binary still builds on Windows.
Flip readiness to “not ready” first. When shutdown begins, make your /readyz probe return 503 so the load balancer stops sending new traffic before you stop accepting connections. This closes the small window where new requests arrive at a server that is about to die. (Readiness probes are covered in health checks.)
Bound the drain with tokio::time::timeout. Pick a budget shorter than your orchestrator’s terminationGracePeriodSeconds so you exit cleanly before being force-killed.
Cancel background tasks with a CancellationToken and wait for them with a TaskTracker. Clone the token into every spawned task so a single .cancel() reaches all of them.
Tear down dependencies after the drain, not inside the shutdown future — close DB pools, flush metrics, and finish background jobs only once in-flight HTTP requests have completed.
Emit structured logs around each phase (signal received, draining, complete) so you can confirm graceful shutdown in production. Use tracing and the distributed tracing page.

Real-World Example

A production-flavored service that ties everything together: it flips readiness off on shutdown, gives the load balancer a moment to react, drains in-flight HTTP requests, then cancels a background worker and waits for it with a bounded deadline. Dependencies:

1
cargo add axum
2
cargo add tokio --features full
3
cargo add tokio-util --features rt
4
cargo add tracing
5
cargo add tracing-subscriber

1
use std::sync::atomic::{AtomicBool, Ordering};
2
use std::sync::Arc;
3
use std::time::Duration;
4

5
use axum::extract::State;
6
use axum::http::StatusCode;
7
use axum::{routing::get, Router};
8
use tokio::signal;
9
use tokio_util::sync::CancellationToken;
10
use tokio_util::task::TaskTracker;
11

12
#[derive(Clone)]
13
struct AppState {
14
    /// Flips to `false` the moment shutdown begins so the load balancer
15
    /// stops routing new traffic here while we drain.
16
    ready: Arc<AtomicBool>,
17
}
18

19
#[tokio::main]
20
async fn main() {
21
    tracing_subscriber::fmt()
22
        .with_max_level(tracing::Level::INFO)
23
        .init();
24

25
    let ready = Arc::new(AtomicBool::new(true));
26
    let state = AppState {
27
        ready: ready.clone(),
28
    };
29

30
    let shutdown = CancellationToken::new();
31
    let tracker = TaskTracker::new();
32

33
    // Background worker (e.g. a queue consumer) that drains on cancel.
34
    {
35
        let shutdown = shutdown.clone();
36
        tracker.spawn(async move {
37
            let mut tick = tokio::time::interval(Duration::from_secs(1));
38
            loop {
39
                tokio::select! {
40
                    _ = tick.tick() => tracing::info!("worker heartbeat"),
41
                    _ = shutdown.cancelled() => break,
42
                }
43
            }
44
            tracing::info!("worker drained");
45
        });
46
    }
47

48
    let app = Router::new()
49
        .route("/", get(|| async { "hello\n" }))
50
        .route("/healthz", get(live))
51
        .route("/readyz", get(ready_handler))
52
        .with_state(state);
53

54
    let listener = tokio::net::TcpListener::bind("0.0.0.0:8080").await.unwrap();
55
    tracing::info!(addr = %listener.local_addr().unwrap(), "listening");
56

57
    let shutdown_for_server = shutdown.clone();
58
    axum::serve(listener, app)
59
        .with_graceful_shutdown(async move {
60
            wait_for_signal().await;
61
            tracing::info!("shutdown signal received");
62
            // 1. Mark unready so readiness probes fail and traffic stops.
63
            ready.store(false, Ordering::SeqCst);
64
            // 2. Give the orchestrator a moment to notice before we stop
65
            //    accepting connections (avoids a brief 502 window).
66
            tokio::time::sleep(Duration::from_secs(1)).await;
67
            // 3. Signal background tasks to wind down.
68
            shutdown_for_server.cancel();
69
        })
70
        .await
71
        .unwrap();
72

73
    // The HTTP server has fully drained. Now drain background tasks,
74
    // but never wait forever: cap the drain at a deadline.
75
    tracker.close();
76
    match tokio::time::timeout(Duration::from_secs(15), tracker.wait()).await {
77
        Ok(()) => tracing::info!("graceful shutdown complete"),
78
        Err(_) => tracing::warn!("drain timed out; exiting anyway"),
79
    }
80
}
81

82
async fn live() -> StatusCode {
83
    StatusCode::OK
84
}
85

86
async fn ready_handler(State(state): State<AppState>) -> StatusCode {
87
    if state.ready.load(Ordering::SeqCst) {
88
        StatusCode::OK
89
    } else {
90
        StatusCode::SERVICE_UNAVAILABLE
91
    }
92
}
93

94
async fn wait_for_signal() {
95
    let ctrl_c = async {
96
        signal::ctrl_c().await.expect("ctrl_c handler");
97
    };
98
    #[cfg(unix)]
99
    let terminate = async {
100
        signal::unix::signal(signal::unix::SignalKind::terminate())
101
            .expect("SIGTERM handler")
102
            .recv()
103
            .await;
104
    };
105
    #[cfg(not(unix))]
106
    let terminate = std::future::pending::<()>();
107

108
    tokio::select! {
109
        _ = ctrl_c => {}
110
        _ = terminate => {}
111
    }
112
}

Running this, hitting /readyz, then sending SIGTERM and probing /readyz again during the drain shows the readiness flip in action:

1
readyz BEFORE shutdown: 200
2
readyz DURING drain:    503

And the real structured log over the full lifecycle (ANSI colors stripped):

1
2026-06-02T06:42:36.170473Z  INFO probe2: listening addr=0.0.0.0:8080
2
2026-06-02T06:42:36.171560Z  INFO probe2: worker heartbeat
3
2026-06-02T06:42:37.172777Z  INFO probe2: worker heartbeat
4
2026-06-02T06:42:37.683791Z  INFO probe2: shutdown signal received
5
2026-06-02T06:42:38.172939Z  INFO probe2: worker heartbeat
6
2026-06-02T06:42:38.685163Z  INFO probe2: worker drained
7
2026-06-02T06:42:38.685255Z  INFO probe2: graceful shutdown complete

Notice the ordering: the signal arrives, readiness flips to 503, the worker keeps heartbeating during the one-second grace window, then is cancelled and drains, and only then does shutdown complete. This is the full zero-downtime sequence.

Tip: CancellationToken::cancel() is idempotent and the token is cheap to clone(), so you can hand a clone to every background task and a single .cancel() reaches all of them. TaskTracker::wait() returns once every tracked task has finished — after you call tracker.close() to stop accepting new ones. See spawning tasks and background jobs.

Exercises

Exercise 1: Add an `SIGTERM`-aware health endpoint

Difficulty: Beginner

Objective: Reproduce the readiness flip so a load balancer stops sending traffic the moment shutdown starts.

Instructions: Starting from the first Rust example, add an AppState carrying an Arc<AtomicBool> named ready, initialized to true. Add a /readyz route that returns 200 OK when ready is true and 503 Service Unavailable otherwise. In the shutdown future, set ready to false before the drain begins. Verify with curl that /readyz returns 200 before the signal and 503 after.

Solution

1
use std::sync::atomic::{AtomicBool, Ordering};
2
use std::sync::Arc;
3

4
use axum::extract::State;
5
use axum::http::StatusCode;
6
use axum::{routing::get, Router};
7
use tokio::signal;
8

9
#[derive(Clone)]
10
struct AppState {
11
    ready: Arc<AtomicBool>,
12
}
13

14
#[tokio::main]
15
async fn main() {
16
    let ready = Arc::new(AtomicBool::new(true));
17
    let state = AppState {
18
        ready: ready.clone(),
19
    };
20

21
    let app = Router::new()
22
        .route("/", get(|| async { "ok\n" }))
23
        .route("/readyz", get(readyz))
24
        .with_state(state);
25

26
    let listener = tokio::net::TcpListener::bind("127.0.0.1:3000")
27
        .await
28
        .unwrap();
29
    println!("listening on {}", listener.local_addr().unwrap());
30

31
    axum::serve(listener, app)
32
        .with_graceful_shutdown(async move {
33
            wait_for_signal().await;
34
            // Fail readiness so traffic stops arriving, then drain.
35
            ready.store(false, Ordering::SeqCst);
36
            println!("readiness disabled, draining");
37
        })
38
        .await
39
        .unwrap();
40

41
    println!("shut down cleanly");
42
}
43

44
async fn readyz(State(state): State<AppState>) -> StatusCode {
45
    if state.ready.load(Ordering::SeqCst) {
46
        StatusCode::OK
47
    } else {
48
        StatusCode::SERVICE_UNAVAILABLE
49
    }
50
}
51

52
async fn wait_for_signal() {
53
    let ctrl_c = async {
54
        signal::ctrl_c().await.expect("ctrl_c handler");
55
    };
56
    #[cfg(unix)]
57
    let terminate = async {
58
        signal::unix::signal(signal::unix::SignalKind::terminate())
59
            .expect("SIGTERM handler")
60
            .recv()
61
            .await;
62
    };
63
    #[cfg(not(unix))]
64
    let terminate = std::future::pending::<()>();
65

66
    tokio::select! {
67
        _ = ctrl_c => {}
68
        _ = terminate => {}
69
    }
70
}

Note: the Ordering::SeqCst here is a memory-ordering parameter for the atomic, not connected to HTTP status codes.

Exercise 2: Bound the shutdown with a deadline

Difficulty: Intermediate

Objective: Ensure a hung request can never block the deploy forever.

Instructions: Spawn the axum server as a Tokio task (tokio::spawn) so you can await it separately. After the shutdown signal triggers, wrap the server’s join handle in tokio::time::timeout(Duration::from_secs(10), ...). If the drain completes in time, log “drained cleanly”; if the timeout fires, log “drain timed out; forcing exit”. This mirrors the Node setTimeout(...).unref() safety net.

Solution

1
use std::time::Duration;
2

3
use axum::{routing::get, Router};
4
use tokio::signal;
5
use tokio_util::sync::CancellationToken;
6

7
#[tokio::main]
8
async fn main() {
9
    let shutdown = CancellationToken::new();
10

11
    let app = Router::new().route("/", get(|| async { "ok\n" }));
12
    let listener = tokio::net::TcpListener::bind("127.0.0.1:3000")
13
        .await
14
        .unwrap();
15
    println!("listening on {}", listener.local_addr().unwrap());
16

17
    // Run the server on its own task so we can time the drain.
18
    let server_shutdown = shutdown.clone();
19
    let server = tokio::spawn(async move {
20
        axum::serve(listener, app)
21
            .with_graceful_shutdown(async move {
22
                server_shutdown.cancelled().await;
23
            })
24
            .await
25
            .unwrap();
26
    });
27

28
    // Wait for the OS signal, then ask the server to drain.
29
    wait_for_signal().await;
30
    println!("signal received, draining");
31
    shutdown.cancel();
32

33
    // Never wait forever for the drain to finish.
34
    match tokio::time::timeout(Duration::from_secs(10), server).await {
35
        Ok(Ok(())) => println!("drained cleanly"),
36
        Ok(Err(join_err)) => println!("server task panicked: {join_err}"),
37
        Err(_) => println!("drain timed out; forcing exit"),
38
    }
39
}
40

41
async fn wait_for_signal() {
42
    let ctrl_c = async {
43
        signal::ctrl_c().await.expect("ctrl_c handler");
44
    };
45
    #[cfg(unix)]
46
    let terminate = async {
47
        signal::unix::signal(signal::unix::SignalKind::terminate())
48
            .expect("SIGTERM handler")
49
            .recv()
50
            .await;
51
    };
52
    #[cfg(not(unix))]
53
    let terminate = std::future::pending::<()>();
54

55
    tokio::select! {
56
        _ = ctrl_c => {}
57
        _ = terminate => {}
58
    }
59
}

This requires cargo add tokio-util. The server runs as a spawned task; after the signal we cancel the token (which resolves the server’s shutdown future) and then race the join handle against a 10-second deadline.

Exercise 3: Drain a background worker on shutdown

Difficulty: Advanced

Objective: Cancel a long-running background task cooperatively and wait for it to finish before exiting.

Instructions: Spawn a background worker with TaskTracker::spawn that loops on a tokio::time::interval, doing a unit of work each tick. Inside the loop, use tokio::select! to watch a shared CancellationToken; when it is cancelled, log a message and break. After the HTTP server drains, call tracker.close() and tracker.wait() (wrapped in a tokio::time::timeout) so the process waits for the worker to finish its current unit of work. Add tokio-util with the rt feature.

Solution

1
cargo add tokio --features full
2
cargo add tokio-util --features rt

1
use std::time::Duration;
2

3
use tokio::signal;
4
use tokio_util::sync::CancellationToken;
5
use tokio_util::task::TaskTracker;
6

7
#[tokio::main]
8
async fn main() {
9
    let shutdown = CancellationToken::new();
10
    let tracker = TaskTracker::new();
11

12
    // A background worker that drains cooperatively on cancel.
13
    {
14
        let shutdown = shutdown.clone();
15
        tracker.spawn(async move {
16
            let mut tick = tokio::time::interval(Duration::from_millis(200));
17
            loop {
18
                tokio::select! {
19
                    _ = tick.tick() => println!("worker: doing a unit of work"),
20
                    _ = shutdown.cancelled() => {
21
                        println!("worker: cancellation observed, finishing up");
22
                        break;
23
                    }
24
                }
25
            }
26
            println!("worker: stopped");
27
        });
28
    }
29

30
    // Wait for the OS signal, then tell the worker to wind down.
31
    wait_for_signal().await;
32
    println!("signal received");
33
    shutdown.cancel();
34

35
    // Drain background tasks with a deadline.
36
    tracker.close();
37
    match tokio::time::timeout(Duration::from_secs(10), tracker.wait()).await {
38
        Ok(()) => println!("all background tasks drained cleanly"),
39
        Err(_) => println!("drain deadline exceeded; forcing exit"),
40
    }
41
    println!("bye");
42
}
43

44
async fn wait_for_signal() {
45
    let ctrl_c = async {
46
        signal::ctrl_c().await.expect("ctrl_c handler");
47
    };
48
    #[cfg(unix)]
49
    let terminate = async {
50
        signal::unix::signal(signal::unix::SignalKind::terminate())
51
            .expect("SIGTERM handler")
52
            .recv()
53
            .await;
54
    };
55
    #[cfg(not(unix))]
56
    let terminate = std::future::pending::<()>();
57

58
    tokio::select! {
59
        _ = ctrl_c => {}
60
        _ = terminate => {}
61
    }
62
}

Sending SIGTERM after a couple of seconds produces this real output:

1
worker: doing a unit of work
2
worker: doing a unit of work
3
worker: doing a unit of work
4
signal received
5
worker: cancellation observed, finishing up
6
worker: stopped
7
all background tasks drained cleanly
8
bye

The worker observes the cancellation, finishes cleanly, and the process waits for it before printing bye. In a real service you would combine this with the axum server from the Real-World Example so HTTP and background work both drain together.

Graceful Shutdown

Quick Overview

TypeScript/JavaScript Example

Rust Equivalent

Detailed Explanation

axum::serve and the shutdown future

Catching signals with Tokio

tokio::select! — wait for whichever happens first

Why the #[cfg(not(unix))] arm exists

Key Differences

Common Pitfalls

Awaiting a Signal directly

Forgetting with_graceful_shutdown entirely

Doing the slow cleanup before the server drains

Blocking the async runtime during shutdown

Not bounding the drain

Best Practices

Real-World Example

Further Reading

Exercises

Exercise 1: Add an SIGTERM-aware health endpoint

Exercise 2: Bound the shutdown with a deadline

Exercise 3: Drain a background worker on shutdown

`axum::serve` and the shutdown future

`tokio::select!` — wait for whichever happens first

Why the `#[cfg(not(unix))]` arm exists

Awaiting a `Signal` directly

Forgetting `with_graceful_shutdown` entirely

Exercise 1: Add an `SIGTERM`-aware health endpoint