~/gpu/rtx-5080 vs rtx-5090

RTX 5080vsRTX 5090

Head-to-head for local LLM inference. The honest comparison: VRAM, bandwidth, compute, and which of the 30 catalog models actually fit on each.

The specs.

$ diff specs rtx-5080 rtx-5090

Stat

rtx-5080

rtx-5090

VRAM

16 GB

32 GB

+100%

Memory bandwidth

960 GB/s

1,792 GB/s

+87%

FP16 compute

565 TFLOPS

838 TFLOPS

+48%

Weights budget at 8K ctx

12 GB

25 GB

+108%

Model fit difference.

$ models that change with the card

Fits on both

16of 30

Only on rtx-5080

Only on rtx-5090

// showing 12 of 30 models; differing fits first

Model

rtx-5080

rtx-5090

Qwen 2.5 32B32.5B

overQ4_K_M

fitsQ5_K_M

Qwen 2.5 Coder 32B32.5B

overQ4_K_M

fitsQ5_K_M

Qwen3 30B A3B30.5B

overQ4_K_M

fitsQ5_K_M

Qwen 3.6 35B A3B35B

overQ4_K_M

fitsQ5_K_M

Mixtral 8x7B46.7B

overQ4_K_M

fitsAWQ 4-BIT

Yi 34B34B

overQ4_K_M

fitsQ5_K_M

Llama 3.2 1B1.23B

fitsFP16/BF16

Llama 3.2 3B3.21B

fitsFP16/BF16

Llama 3.1 8B8B

fitsQ8_0

fitsFP16/BF16

Llama 3.3 70B70.6B

overQ4_K_M

Llama 3.1 405B405B

overQ4_K_M

Qwen 2.5 7B7B

fitsQ8_0

fitsFP16/BF16

Which one wins for…

$ ./recommend --by-workload

More VRAM headroom

RTX 5090 has 16 GB more.

Faster decode (bandwidth)

RTX 5090 by +87%.

Faster prefill (compute)

RTX 5090 by +48% TFLOPS.

Catalog models that fit

RTX 5090: 22 fit · RTX 5080: 16.

Discussion.

$ gh discussion list

// sign in with github to leave a comment. threads live in the repo's discussions tab.

RTX 5080 manufacturerRTX 5080vsRTX 5090 manufacturerRTX 5090

The specs.

Model fit difference.

Which one wins for…

Drill into either card.

Discussion.

RTX 5080vsRTX 5090