Webcarbon

Sustainable image delivery with CDNs using smart resizing, caching, and format negotiation

Why image delivery at the CDN matters for sustainability

Images are often the largest part of a web page by bytes. When a content delivery network is used to handle image resizing and format selection at the edge, fewer bytes travel across long networks and origin servers perform less work. That combination reduces bandwidth and server compute, which in turn lowers operational cost and the environmental footprint of serving images.

Primary levers a CDN controls

Resizing at the edge to match the actual client viewport, caching the resulting variants across a wide edge footprint, and negotiating efficient image formats with the client are the three technical levers that most directly reduce waste. Implemented together, they shrink the delivered payload without harming perceived quality.

Edge resizing versus pre-generation: a practical decision framework

There are two common patterns for delivering multiple image sizes. One pattern is to pre-generate a set of static variants and store them in object storage. The other pattern is to generate variants on the fly at the CDN edge when a request arrives. Both are valid. The right choice depends on traffic distribution and update patterns.

  1. Top images and predictable sizes. Prefer pre-generation for high-traffic assets with stable dimensions, such as hero images and product thumbnails. Pre-generated files maximize cache hit ratio and avoid repeated compute at the edge.
  2. Long tail and many arbitrary sizes. Use on-the-fly generation for user-uploaded images and long-tail assets where pre-generating every permutation would inflate storage and build complexity.
  3. Update frequency. If an image changes frequently, on-the-fly generation avoids the complexity of invalidation pipelines. If changes are rare, pre-generation with a clear invalidation or cache-busting mechanism is simpler and more efficient.
  4. Cost and environmental tradeoff. Estimate edge compute cost and storage cost for your traffic mix. When pre-generation greatly increases storage and origin egress for rarely requested sizes, on-the-fly generation paired with a generous edge cache is often preferable.
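The decision framework above can be sketched as a small classifier. The threshold values here are illustrative assumptions for the sake of the example, not recommendations; calibrate them against your own traffic mix.

```python
def choose_strategy(requests_per_day: int, distinct_sizes: int,
                    updates_per_month: float) -> str:
    """Return 'pre-generate' or 'on-the-fly' for an image asset class.
    Thresholds are placeholder assumptions, not tuned values."""
    if updates_per_month > 4:       # frequent changes: avoid purge pipelines
        return "on-the-fly"
    if distinct_sizes > 10:         # too many permutations to pre-generate
        return "on-the-fly"
    if requests_per_day > 10_000:   # hot, stable asset: maximize hit ratio
        return "pre-generate"
    return "on-the-fly"
```

A stable hero image served fifty thousand times a day lands in the pre-generation bucket, while a frequently replaced user upload stays on the fly.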

Cache layering and cache keys that prevent wasted work

An efficient cache strategy reduces origin hits and repeated compute. Two basic principles guide cache key design.

First, include only the request attributes that change the binary output. Typical attributes are width, quality, and output format. Avoid adding volatile headers to the cache key.
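The first principle can be sketched as a key builder. This is a minimal sketch assuming the transform parameters have already been parsed out of the request; the separator and attribute names are arbitrary choices, and the query string is dropped on the assumption that it carries only tracking noise.

```python
from urllib.parse import urlsplit

def cache_key(url: str, width: int, quality: int, fmt: str) -> str:
    """Build a cache key from the canonical path plus only the attributes
    that change the binary output; volatile headers are deliberately excluded."""
    path = urlsplit(url).path  # drop query-string noise from the key
    return f"{path}|w={width}|q={quality}|f={fmt}"
```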

Second, use a layered cache approach. Let the CDN edge hold generated variants with a long time-to-live when possible. Configure the origin to store canonical source images. If a variant is missing at the edge, a single edge node can generate it and populate the cache for subsequent requests across the edge footprint.

Practical cache header patterns

For immutable image variants, use cache headers that enable long-lived caching on both the client and the edge. A common pattern for stable, versioned assets is to return a Cache-Control header such as public, max-age=31536000, immutable. For variants that may change more often, set a shorter max-age and rely on a surrogate key or cache purge workflow at the CDN.

Always include a Vary header when content negotiation affects the response. If the CDN or origin negotiates on the Accept header, include Vary: Accept so that caches store separate entries for different client capabilities.
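The two header patterns can be combined into one small helper. This is a sketch, not a definitive policy: the one-hour TTL for mutable variants is an illustrative assumption, and the header values follow the patterns described above.

```python
def variant_headers(versioned: bool, negotiated: bool) -> dict:
    """Sketch of cache-header selection. Versioned (immutable) variants get
    a year-long TTL; mutable ones get a short TTL (placeholder: one hour)
    plus a purge workflow at the CDN. Negotiated responses add Vary."""
    headers = {}
    if versioned:
        headers["Cache-Control"] = "public, max-age=31536000, immutable"
    else:
        headers["Cache-Control"] = "public, max-age=3600"
    if negotiated:  # the binary depends on the client's Accept header
        headers["Vary"] = "Accept"
    return headers
```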

Format negotiation and how to avoid cache fragmentation

Serving modern formats such as AVIF and WebP when supported reduces payload size. Content negotiation uses the client Accept header to select the best format. Implementing negotiation at the edge yields lower latency and avoids sending full original images when the client can accept a more efficient format.

When the CDN inspects Accept and returns different binaries, the cache must be aware of the variant. If the raw Accept header is used directly in the cache key, the number of cached variants can grow without bound, because clients send many distinct Accept strings. Two practical techniques reduce fragmentation.

One technique is to normalize the Accept header into a small set of capability buckets before it becomes part of the cache key. For example, map a complex Accept header into buckets such as avif-capable, webp-capable, or legacy-only. That keeps the number of variants bounded while still providing format gains.
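The bucketing technique can be sketched in a few lines. The three bucket names are the ones used as examples above; a simple substring check is sufficient here because clients that support a format advertise its MIME type in Accept.

```python
def accept_bucket(accept: str) -> str:
    """Normalize an arbitrary Accept header into one of three capability
    buckets, so the cache key stays bounded regardless of client variety."""
    accept = accept.lower()
    if "image/avif" in accept:
        return "avif"
    if "image/webp" in accept:
        return "webp"
    return "legacy"
```

The bucket value, not the raw header, then goes into the cache key.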

Another technique is to let the CDN perform on-the-fly conversion to a single efficient format and then cache that binary independently of the original Accept header. In that case, use Vary: Accept to prevent serving the converted format to clients that do not accept it. The tradeoff here is between cache size and conversion complexity.

Quality, progressive encoding, and perceived savings

Reducing file size is not only about changing format. Lowering quality settings and using progressive or layered encodings can cut bytes with little perceptible loss. Make quality settings configurable per image class. For example, allow slightly lower quality on small thumbnails while preserving higher quality on full-screen hero images.
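Per-class quality settings can be as simple as a lookup table. The class names and quality values below are hypothetical placeholders; validate any defaults with the perceptual testing described next.

```python
# Hypothetical per-class quality defaults; all values are assumptions
# to be tuned with blind comparisons, not recommendations.
QUALITY_BY_CLASS = {
    "thumbnail": 65,  # small on screen: lower quality is rarely noticed
    "inline": 75,
    "hero": 85,       # full-screen imagery: keep quality headroom
}

def quality_for(image_class: str) -> int:
    """Return the configured quality for a class, with a conservative default."""
    return QUALITY_BY_CLASS.get(image_class, 75)
```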

Run blind A/B tests or visual comparisons before changing defaults. Measuring perceived quality with human raters or automated perceptual metrics prevents regressions in user experience while still capturing bandwidth improvements.

Invalidation and versioning: how to keep caches correct without excess purges

Clear versioning is the simplest way to avoid accidental staleness. Embed a content-derived hash or semantic version in the image path for assets that change. That allows long-lived caching without frequent purges.
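The content-derived hash approach can be sketched as follows. The choice of SHA-256 and a 12-character prefix is an illustrative assumption; any stable digest with enough bits to avoid collisions works.

```python
import hashlib

def versioned_path(path: str, content: bytes) -> str:
    """Embed a short content-derived hash in the path so the variant can be
    cached with a year-long TTL and a new upload naturally gets a new URL."""
    digest = hashlib.sha256(content).hexdigest()[:12]
    stem, dot, ext = path.rpartition(".")
    return f"{stem}.{digest}.{ext}" if dot else f"{path}.{digest}"
```

Because the URL changes whenever the bytes change, no purge is needed for updates.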

When versioning is not possible, use surrogate keys or tags supported by many CDNs. Purging by tag lets you evict related variants without touching unrelated content. Avoid broad wildcard purges where possible because they cause spikes of origin traffic and unnecessary compute.

Measuring impact without guessing

To know whether CDN changes lower environmental impact, instrument and measure the right signals. The essential metrics are bytes served from edge, origin egress volume, number of edge-generated variants, CPU time used for image processing, and client perceived load metrics such as Largest Contentful Paint (LCP) for image-heavy pages.

Collect CDN provider metrics for cache hit ratio and edge compute invocations. Combine those with origin logs and synthetic or real user monitoring. Compare before and after on at least a representative sample of pages and geographic regions to capture differences in cache behavior and client capabilities.

To translate technical savings into emissions you need a transparent and reproducible method. Combine measured bytes transferred and compute hours with published energy intensities for network and compute. Document assumptions and the data sources used so that the estimate can be reviewed or updated.
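The estimation method can be made explicit in a few lines. The two intensity defaults below are placeholder assumptions, not published figures; substitute values from the data sources you cite and document them alongside the result.

```python
def estimate_kwh(bytes_transferred: int, cpu_hours: float,
                 network_kwh_per_gb: float = 0.06,
                 compute_kwh_per_cpu_hour: float = 0.01) -> float:
    """Transparent energy estimate: measured volumes times energy
    intensities. Both intensity defaults are illustrative placeholders;
    replace them with documented, sourced figures."""
    gigabytes = bytes_transferred / 1e9
    return gigabytes * network_kwh_per_gb + cpu_hours * compute_kwh_per_cpu_hour
```

Because the intensities are explicit parameters, the estimate can be re-run whenever better published figures become available, which keeps the method reproducible.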

Operational patterns and governance

Introduce image delivery changes as part of feature toggles or phased rollouts. Start with a small set of high impact pages and verify cache behavior and origin load before broadening the rollout. Provide a fallback path that serves the original image in case of errors during conversion or resizing.

Maintain a small matrix of rules that map image classes to delivery policies. The matrix should state whether an asset class is pre-generated or generated on the fly, default quality and format preferences, TTL expectations, and invalidation procedures. That documentation makes it easier for product teams and operations to choose the right policy for new images.
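Such a matrix can live as a plain data structure checked into the repository. Every class name and value below is an illustrative placeholder for teams to replace with their own policies.

```python
# Hypothetical delivery-policy matrix keyed by image class; all entries
# are example placeholders, not recommendations.
POLICY_MATRIX = {
    "hero": {"generation": "pre-generate", "format": "avif", "quality": 85,
             "ttl_seconds": 31536000, "invalidation": "versioned path"},
    "thumbnail": {"generation": "pre-generate", "format": "webp", "quality": 65,
                  "ttl_seconds": 31536000, "invalidation": "versioned path"},
    "user-upload": {"generation": "on-the-fly", "format": "negotiated",
                    "quality": 75, "ttl_seconds": 86400,
                    "invalidation": "surrogate key purge"},
}
```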

Common pitfalls and how to avoid them

One common pitfall is over-fragmentation of the cache key. Including many client headers or device-specific attributes reduces cache hit ratio and increases origin load. Another pitfall is failing to include Vary when negotiation affects the output; omitting Vary leads to serving the wrong format to some clients.

Finally, watch for conversion errors that can silently degrade images. Add monitoring that checks a sample of converted images for correctness and flags conversion failures as errors rather than silent fallbacks.

Checklist to get started

  1. Map your image inventory and classify assets by traffic, update frequency, and quality needs.
  2. Decide on pre-generation for the most requested sizes and on-the-fly generation for long-tail variants.
  3. Implement cache keys that include only width, quality, and normalized format-bucket values.
  4. Configure Cache-Control and Vary headers appropriate to immutability and negotiation.
  5. Instrument edge compute invocations, origin egress, and cache hit ratio before and after the change.

Where to go next

Test changes on a narrow subset of pages and gather quantitative and perceptual data. Share the measurement approach and assumptions with stakeholders so decisions are reproducible. Over time the combination of smaller payloads and fewer origin hits will reduce cost and the operational footprint of your image pipeline without harming user experience.
