solgitae

When a Memory Pool Actually Helps in Go Logging

When you build a high-throughput log pipeline in Go, the garbage collector quickly becomes one of your biggest bottlenecks. Every log line means new allocations: buffers, temporary structs, parsed JSON trees, and so on. At some point, you start wondering: is it time to use a memory pool?

In this post I’ll walk through a simple pattern using sync.Pool and explain when it is (and is not) a good idea for log pre-processing.

The basic pattern

For log processing, the most common thing to pool is a reusable byte buffer or struct used per log line.


var logBufferPool = sync.Pool{
    New: func() any {
        buf := make([]byte, 0, 64*1024) // 64KB buffer
        // Store a *[]byte: putting a bare slice into the pool's
        // `any` slot would allocate a new slice header on every Put.
        return &buf
    },
}

func handleLog(raw []byte) {
    // 1. Take a buffer from the pool
    bp := logBufferPool.Get().(*[]byte)
    buf := (*bp)[:0] // reset length, keep capacity

    // 2. Use it for parsing / masking / rewriting
    buf = append(buf, raw...) // do your processing on buf

    // 3. Return it to the pool
    *bp = buf[:0]
    logBufferPool.Put(bp)
}

The key detail is the [:0] reset: it does not allocate a new array. It just resets the slice length to zero while keeping the underlying capacity. Combined with sync.Pool, this lets you reuse the same backing arrays across many log lines instead of calling make([]byte, …) thousands of times per second.
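You can see this directly: after a [:0] reset, the length drops to zero but the capacity (and the backing array) survive. A minimal check:

```go
package main

import "fmt"

func main() {
	buf := make([]byte, 0, 1024)
	buf = append(buf, []byte("2024-01-01 INFO started")...)
	fmt.Println(len(buf), cap(buf)) // 23 1024

	buf = buf[:0] // reset length, keep capacity
	fmt.Println(len(buf), cap(buf)) // 0 1024 — same backing array, no allocation
}
```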

In a log pre-processor that parses, reshapes, and masks JSONL lines, this pattern can eliminate a large portion of per-line heap allocations and directly reduce GC pressure.

When a pool is a good idea

You don’t need a pool everywhere. It shines in a few specific scenarios:

High and steady throughput
If your service is processing tens of thousands of log lines per second, even small per-line allocations add up quickly and trigger frequent GC cycles.

Same shape, repeated many times
You repeatedly allocate the same “shape” of object: for example, []byte buffers of similar capacity, or small structs used during parsing.

Short‑lived scratch objects
The pooled objects are temporary scratch space inside a single request/log handling path, and you fully reset them before reuse.

You’ve profiled, and GC is the problem
Profiling shows a hot path dominated by allocations, and reducing those allocations produces measurable throughput or latency improvement.

A log pre-processing engine is almost a textbook case: each log line goes through the same transformation pipeline, uses similar-sized buffers and temporary objects, and then discards them. A pool lets you recycle that working set instead of constantly asking the runtime for new memory.
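As a sketch of that shape, here is a per-line masking step built on a pooled buffer. The `maskUser` function, the `user=` field convention, and the buffer size are all illustrative, not from any particular library:

```go
package main

import (
	"bytes"
	"fmt"
	"sync"
)

// bufPool hands out reusable scratch buffers for per-line work.
var bufPool = sync.Pool{
	New: func() any {
		b := make([]byte, 0, 4096)
		return &b // pointer avoids allocating a slice header on every Put
	},
}

// maskUser replaces the value of a "user=" field with "***".
// It builds the rewritten line in a pooled buffer, then copies the
// result out so the buffer can go straight back to the pool.
func maskUser(line []byte) []byte {
	bp := bufPool.Get().(*[]byte)
	buf := (*bp)[:0]
	defer func() { *bp = buf[:0]; bufPool.Put(bp) }()

	if i := bytes.Index(line, []byte("user=")); i >= 0 {
		buf = append(buf, line[:i+len("user=")]...)
		buf = append(buf, "***"...)
		// skip the original value up to the next space
		rest := line[i+len("user="):]
		if j := bytes.IndexByte(rest, ' '); j >= 0 {
			buf = append(buf, rest[j:]...)
		}
	} else {
		buf = append(buf, line...)
	}
	out := make([]byte, len(buf))
	copy(out, buf)
	return out
}

func main() {
	fmt.Printf("%s\n", maskUser([]byte("level=info user=alice msg=login")))
	// level=info user=*** msg=login
}
```

The copy at the end is the price of returning the buffer immediately; in a real pipeline you would usually write the masked line to the output sink before releasing the buffer and skip the copy.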

When you should avoid it

There are also clear cases where sync.Pool is not worth the cost:

Low QPS or batch tools
If your program processes a small batch of logs once in a while, the extra complexity of pooling won’t pay off.

Long‑lived objects
Pools are for scratch space, not for objects that live across requests or goroutines.

Highly variable sizes
If your buffers/objects have wildly different sizes, you either waste memory or end up implementing a complex “pool of pools”. At that point, a custom allocator or a simpler design might be better.

You haven’t measured anything
sync.Pool is not a free optimization. If the code is not allocation-bound, you’re just making it harder to read for no gain.

It’s also important to remember that sync.Pool is not a strict cache. The runtime is free to drop items from the pool on any GC cycle, so you cannot use it as a reliable long‑term free list.

Practical tips

A few practical rules that have worked well for log-style workloads:

Start simple, then optimize
First write straightforward code using bytes.Buffer or plain slices, then add a pool only around the clearly hot pieces.

Always reset before reuse
For slices, use buf = buf[:0]. For structs, add a Reset method that clears all fields. This avoids leaking data between log lines.
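For struct-shaped scratch objects, the Reset method makes the reuse explicit. `lineState` and its fields here are illustrative:

```go
package main

import (
	"fmt"
	"sync"
)

// lineState is hypothetical per-line scratch space.
type lineState struct {
	fields map[string]string
	buf    []byte
}

// Reset clears all fields so no data leaks between log lines.
func (s *lineState) Reset() {
	for k := range s.fields {
		delete(s.fields, k)
	}
	s.buf = s.buf[:0]
}

var statePool = sync.Pool{
	New: func() any {
		return &lineState{fields: make(map[string]string, 8)}
	},
}

func main() {
	s := statePool.Get().(*lineState)
	s.fields["user"] = "alice"
	s.buf = append(s.buf, "raw line"...)

	s.Reset()
	fmt.Println(len(s.fields), len(s.buf)) // 0 0
	statePool.Put(s)
}
```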

Return objects promptly
Use defer pool.Put(x) near the Get call when possible, to avoid forgetting to put it back in some branches.
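The defer guarantees the buffer goes back even on early returns. `process` and its validation check are illustrative:

```go
package main

import (
	"fmt"
	"sync"
)

var bufPool = sync.Pool{
	New: func() any { b := make([]byte, 0, 1024); return &b },
}

// process handles one line; every return path puts the buffer back.
func process(raw []byte) (string, bool) {
	bp := bufPool.Get().(*[]byte)
	defer func() {
		*bp = (*bp)[:0] // reset before returning to the pool
		bufPool.Put(bp)
	}()

	if len(raw) == 0 {
		return "", false // early return: the defer still runs
	}
	*bp = append((*bp)[:0], raw...)
	return string(*bp), true
}

func main() {
	out, ok := process([]byte("hello"))
	fmt.Println(out, ok) // hello true
}
```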

Re‑profile after adding the pool
Confirm that the change actually reduced allocations and GC time. If the improvement is negligible, delete the pool and keep the simpler code.

Closing thoughts

Memory pools are not a silver bullet, but in a high-volume logging or log pre-processing system they can be the difference between a GC-bound pipeline and a stable, predictable one. If your log engine spends most of its time allocating the same buffers over and over, pooling those buffers is one of the lowest‑hanging fruits you can pick.
