Hardware proof
H100 and B200 scale envelopes.
Single-GPU scale envelopes, measured per precision tier. These capacity rows are reported separately from the real-embedding Atlas calibration and from receipt-specific proof packs.
Scale rows
Measured capacity rows
| GPU / tier | max entries | query latency (p50) | recall@10 (R@10) | query VRAM | compression |
|---|---|---|---|---|---|
| H100 fp64 | 200M | 40.40 ms | 100% | 50.4 GB | 11.6× vs fp64 |
| H100 fp32 | 500M | 76.53 ms | 100% | 73.0 GB | 23.3× vs fp64 |
| H100 fp16 | 1B | 38.51 ms | 96-100% | 72.9 GB | 46.5× vs fp64 |
| B200 fp16 | 2B | 60.89 ms | 98% | 142.0 GB | 46.5× vs fp64 |
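
As a rough way to compare the rows, the sketch below divides each row's query VRAM by its max entry count to get bytes of query VRAM per entry. It uses only figures from the table above. Two things are assumptions rather than facts from the source: that "GB" means decimal gigabytes (10^9 bytes), and that dividing total query VRAM by entry count is a meaningful footprint estimate. Query VRAM may include fixed overhead that is not per-entry storage, so the results are best read as upper bounds.

```python
# Rows copied from the capacity table: (label, max entries, query VRAM in GB).
ROWS = [
    ("H100 fp64", 200e6, 50.4),
    ("H100 fp32", 500e6, 73.0),
    ("H100 fp16", 1e9, 72.9),
    ("B200 fp16", 2e9, 142.0),
]

# Assumption: decimal gigabytes. The table does not say GB vs GiB.
BYTES_PER_GB = 1e9


def bytes_per_entry(max_entries: float, vram_gb: float) -> float:
    """Query VRAM divided by entry count; an upper bound on per-entry footprint."""
    return vram_gb * BYTES_PER_GB / max_entries


per_entry = {label: bytes_per_entry(n, gb) for label, n, gb in ROWS}

for label, value in per_entry.items():
    print(f"{label}: {value:.1f} bytes of query VRAM per entry")
```

Under these assumptions the per-entry figure roughly halves with each step down in precision on the H100 (about 252, 146, and 73 bytes), and the B200 fp16 row lands close to the H100 fp16 row at about 71 bytes.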