~/gpu/rtx-5070-ti vs rtx-5080

RTX 5070 TivsRTX 5080

Head-to-head for local LLM inference. The honest comparison: VRAM, bandwidth, compute, and which of the 30 catalog models actually fit on each.

The specs.

$ diff specs rtx-5070-ti rtx-5080

Stat

rtx-5070-ti

rtx-5080

VRAM

16 GB

Memory bandwidth

896 GB/s

960 GB/s

+7%

FP16 compute

510 TFLOPS

565 TFLOPS

+11%

Weights budget at 8K ctx

12 GB

Model fit difference.

$ models that change with the card

Fits on both

16of 30

Only on rtx-5070-ti

Only on rtx-5080

// showing 12 of 30 models; differing fits first

Model

rtx-5070-ti

rtx-5080

fitsFP16/BF16

fitsFP16/BF16

fitsQ8_0

overQ4_K_M

overQ4_K_M

fitsQ8_0

overQ4_K_M

Qwen 2.5 Coder 32B32.5B

overQ4_K_M

overQ4_K_M

overQ4_K_M

fitsQ8_0

fitsQ3_K_M

Which one wins for…

$ ./recommend --by-workload

More VRAM headroom

Tied at 16 GB. Choose on bandwidth or price.

Faster decode (bandwidth)

RTX 5080 by +7%.

Faster prefill (compute)

RTX 5080 by +11% TFLOPS.

Catalog models that fit

Tied: 16 of 30 fit on each.

Discussion.

$ gh discussion list

// sign in with github to leave a comment. threads live in the repo's discussions tab.

RTX 5070 Ti manufacturerRTX 5070 TivsRTX 5080 manufacturerRTX 5080

The specs.

Model fit difference.

Which one wins for…

Drill into either card.

Discussion.

RTX 5070 TivsRTX 5080