Comparison of computational efficiency between Fast3R and DUSt3R on a single A100 GPU. Each view has a 512×384 resolution.
# Views | Fast3R Time (s) | Fast3R Peak GPU Mem (GiB) | DUSt3R Time (s) | DUSt3R Peak GPU Mem (GiB) |
---|---|---|---|---|
2 | 0.065 | 3.84 | 0.092 | 3.52 |
8 | 0.122 | 6.33 | 8.386 | 24.59 |
32 | 0.509 | 13.25 | 129.0 | 67.61 |
48 | 0.84 | 20.8 | OOM | OOM |
320 | 15.938 | 41.90 | OOM | OOM |
800 | 89.569 | 55.97 | OOM | OOM |
1000 | 137.62 | 63.01 | OOM | OOM |
1500 | 308.85 | 78.59 | OOM | OOM |
Note: "OOM" indicates out of memory. For DUSt3R, from 48 views onward the N² pairwise reconstructions consume all VRAM during global alignment.
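
To make the quadratic growth in the note concrete, the following is a minimal sketch that counts view pairs and computes the DUSt3R-to-Fast3R time ratio for the rows of the table where both methods completed. The helper names `ordered_pairs` and `speedup` are hypothetical, and counting ordered pairs as N(N-1) is an assumption about how the pairwise graph is built; the timings are copied from the table, not re-measured.

```python
# Timings in seconds, copied from the table; None marks an OOM entry.
# Tuple layout: (Fast3R time, DUSt3R time).
timings = {
    2: (0.065, 0.092),
    8: (0.122, 8.386),
    32: (0.509, 129.0),
    48: (0.84, None),
}


def ordered_pairs(n_views: int) -> int:
    """Number of ordered view pairs (i, j) with i != j.

    Assumption: every view is paired with every other view in both
    directions, so the count grows as O(N^2).
    """
    return n_views * (n_views - 1)


def speedup(fast3r_s, dust3r_s):
    """DUSt3R time divided by Fast3R time, or None if DUSt3R hit OOM."""
    if dust3r_s is None:
        return None
    return dust3r_s / fast3r_s


for n, (fast, dust) in timings.items():
    s = speedup(fast, dust)
    label = "OOM" if s is None else f"{s:.1f}x"
    print(f"{n:>3} views: {ordered_pairs(n):>5} ordered pairs, speedup {label}")
```

Under this assumption, 48 views already correspond to 2,256 ordered pairs, which is consistent with the note's explanation of why pairwise processing runs out of memory while a single-pass model does not.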