~/gpu/arc-pro-b60 vs rtx-4070

Arc Pro B60 manufacturerArc Pro B60vsRTX 4070 manufacturerRTX 4070

Head-to-head for local LLM inference. The honest comparison: VRAM, bandwidth, compute, and which of the 30 catalog models actually fit on each.

The specs.

$ diff specs arc-pro-b60 rtx-4070
Stat
arc-pro-b60
rtx-4070
Δ
VRAM
24 GB
12 GB
-50%
Memory bandwidth
456 GB/s
504 GB/s
+11%
FP16 compute
56 TFLOPS
230 TFLOPS
+311%
Weights budget at 8K ctx
18 GB
8.3 GB
-54%

Model fit difference.

$ models that change with the card
Fits on both
12of 30
Only on arc-pro-b60
9
Only on rtx-4070
0

// showing 12 of 30 models; differing fits first

Model
arc-pro-b60
rtx-4070
fitsAWQ 4-BIT
overQ4_K_M
fitsAWQ 4-BIT
overQ4_K_M
fitsQ4_K_M
overQ4_K_M
fitsQ4_K_M
overQ4_K_M
fitsQ3_K_M
overQ4_K_M
fitsQ6_K
overQ4_K_M
fitsQ5_K_M
overQ4_K_M
fitsQ5_K_M
overQ4_K_M
fitsQ3_K_M
overQ4_K_M
fitsFP16/BF16
fitsFP16/BF16
fitsFP16/BF16
fitsFP16/BF16
fitsFP16/BF16
fitsFP8/INT8

Which one wins for…

$ ./recommend --by-workload
More VRAM headroom

Arc Pro B60 has 12 GB more.

Faster decode (bandwidth)

RTX 4070 by +11%.

Faster prefill (compute)

RTX 4070 by +311% TFLOPS.

Catalog models that fit

Arc Pro B60: 21 fit · RTX 4070: 12.

Drill into either card.

$ ./vrambudget --gpu

Discussion.

$ gh discussion list

// sign in with github to leave a comment. threads live in the repo's discussions tab.