Hardware proof

H100 and B200 scale envelopes.

Measured single-GPU scale envelopes by precision tier. These capacity rows are separate from real-embedding Atlas calibration and receipt-specific proof packs.

Scale rows

Measured capacity rows

GPU / tier	max entries	query p50	R@10	query VRAM	compression
H100 fp64	200M	40.40 ms	100%	50.4 GB	11.6× vs fp64
H100 fp32	500M	76.53 ms	100%	73.0 GB	23.3× vs fp64
H100 fp16	1B	38.51 ms	96-100%	72.9 GB	46.5× vs fp64
B200 fp16	2B	60.89 ms	98%	142.0 GB	46.5× vs fp64

Full benchmark page