Description
Based on the inaugural AlgoPerf competition results, we believe we can adjust the per-workload runtime budgets, mostly to reduce the required computational resources without significantly affecting the meaningfulness of the results.
Normalized submission runtimes across workloads
External Tuning
Submission | CRITEO 1TB | FASTMRI | RESNET | VIT | CONFORMER | DEEPSPEECH | OGBG | WMT |
---|---|---|---|---|---|---|---|---|
Amos | inf | 0.33 | inf | 0.65 | 0.71 | 0.57 | 0.60 | 0.68 |
Baseline | 0.94 | 0.23 | inf | 0.91 | 0.90 | 0.65 | 0.42 | 0.86 |
CASPR Adaptive | NaN | 0.13 | inf | 0.58 | inf | 0.75 | 0.12 | 0.67 |
Cyclic LR | 0.67 | 0.25 | inf | 0.81 | 0.94 | 0.70 | 0.38 | 0.49 |
Generalized Adam | 0.83 | 0.18 | 0.97 | 0.84 | inf | 0.68 | 0.31 | 0.63 |
LAWA EMA | 0.69 | 0.29 | inf | 0.80 | inf | inf | 0.57 | 0.89 |
LAWA Queue | inf | 0.22 | inf | 0.66 | inf | inf | 0.25 | 0.56 |
NadamP | 0.80 | 0.22 | inf | 0.88 | 0.94 | 0.60 | 0.43 | 0.80 |
Schedule Free AdamW | 0.67 | 0.13 | inf | 0.57 | 0.92 | 0.78 | 0.29 | 0.33 |
Schedule Free Prodigy | NaN | 0.21 | inf | inf | inf | inf | 0.61 | inf |
PyTorch Distr. Shampoo | 0.65 | 0.15 | inf | 0.43 | 0.78 | 0.62 | 0.18 | 0.80 |
Self-Tuning
Submission | CRITEO 1TB | FASTMRI | RESNET | VIT | CONFORMER | DEEPSPEECH | OGBG | WMT |
---|---|---|---|---|---|---|---|---|
AdamG | inf | inf | inf | inf | inf | inf | inf | inf |
Baseline | 0.75 | 0.22 | inf | 0.95 | 0.94 | 0.65 | 0.46 | 0.84 |
NadamW Sequential | 2.96 | 0.27 | inf | 1.58 | inf | 1.45 | 0.55 | 2.36 |
Schedule Free AdamW | 0.75 | 0.15 | inf | 0.68 | 0.97 | 0.88 | 0.32 | 0.94 |
Sinv6 | NaN | 0.49 | inf | inf | inf | 2.47 | 1.35 | 2.32 |
Sinv6 75 | NaN | 0.45 | inf | inf | inf | 2.21 | 1.50 | 1.82 |
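
One way to read the tables when considering smaller budgets is to look at the slowest finite normalized runtime per workload. The sketch below does this for the External Tuning numbers, copied verbatim from the table above. It assumes that each value is a fraction of the current runtime budget and that `inf` and `NaN` entries carry no finite time-to-target and can be skipped; neither assumption is stated in this issue. The helper name `slowest_finite_runtime` is hypothetical and not part of the AlgoPerf codebase.

```python
import math

WORKLOADS = ["CRITEO 1TB", "FASTMRI", "RESNET", "VIT",
             "CONFORMER", "DEEPSPEECH", "OGBG", "WMT"]

INF, NAN = math.inf, math.nan

# Normalized runtimes copied from the External Tuning table above.
EXTERNAL_TUNING = {
    "Amos": [INF, 0.33, INF, 0.65, 0.71, 0.57, 0.60, 0.68],
    "Baseline": [0.94, 0.23, INF, 0.91, 0.90, 0.65, 0.42, 0.86],
    "CASPR Adaptive": [NAN, 0.13, INF, 0.58, INF, 0.75, 0.12, 0.67],
    "Cyclic LR": [0.67, 0.25, INF, 0.81, 0.94, 0.70, 0.38, 0.49],
    "Generalized Adam": [0.83, 0.18, 0.97, 0.84, INF, 0.68, 0.31, 0.63],
    "LAWA EMA": [0.69, 0.29, INF, 0.80, INF, INF, 0.57, 0.89],
    "LAWA Queue": [INF, 0.22, INF, 0.66, INF, INF, 0.25, 0.56],
    "NadamP": [0.80, 0.22, INF, 0.88, 0.94, 0.60, 0.43, 0.80],
    "Schedule Free AdamW": [0.67, 0.13, INF, 0.57, 0.92, 0.78, 0.29, 0.33],
    "Schedule Free Prodigy": [NAN, 0.21, INF, INF, INF, INF, 0.61, INF],
    "PyTorch Distr. Shampoo": [0.65, 0.15, INF, 0.43, 0.78, 0.62, 0.18, 0.80],
}


def slowest_finite_runtime(table):
    """Return, per workload, the largest finite normalized runtime.

    Assumption: inf and NaN entries are ignored. A workload with no
    finite entry maps to None.
    """
    result = {}
    for col, workload in enumerate(WORKLOADS):
        finite = [row[col] for row in table.values() if math.isfinite(row[col])]
        result[workload] = max(finite) if finite else None
    return result


if __name__ == "__main__":
    for workload, value in slowest_finite_runtime(EXTERNAL_TUNING).items():
        print(f"{workload}: {value}")
```

Under these assumptions, a workload whose slowest finite value sits well below 1.0 (FASTMRI or OGBG in the External Tuning table) is a candidate for a tighter budget, while RESNET, with a single finite entry at 0.97, leaves little room. This is only a reading aid, not a proposal for specific new budgets.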