Crashed runs on Fever
When running the experiments with different fractions of rationales on fever, quite a lot of runs simply crashed.
There were three kinds of crashes:
- Not listed in wandb, and stuck due to Issue 8 (solved)
- Listed in wandb as crashed, maybe just a wandb issue?
- Not listed in wandb (seems to just not have been started) -> run again
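Runs of the third kind could be spotted by checking which expected configurations have no output directory. The following is only a minimal sketch: it assumes the directory layout `outputs/fever_<rationals_fraction>_<generation_seed>/...`, which is inferred from the `output_dir` values in these notes, and the function name `find_unstarted_runs` and the pattern are hypothetical, not part of the actual experiment code.

```python
import re
from pathlib import Path

# Assumed naming scheme (inferred from the output_dir values in these notes):
# outputs/fever_<rationals_fraction>_<generation_seed>/...
RUN_DIR_PATTERN = re.compile(r"^fever_(?P<fraction>\d+(?:\.\d+)?)_(?P<seed>\d+)$")


def find_unstarted_runs(expected, outputs_root="outputs"):
    """Return the expected (fraction, seed) pairs that have no output directory.

    Fractions are compared as strings (e.g. "0.2") to avoid float rounding issues.
    """
    root = Path(outputs_root)
    started = set()
    if root.is_dir():
        for entry in root.iterdir():
            match = RUN_DIR_PATTERN.match(entry.name)
            # Directories that do not follow the assumed scheme are ignored.
            if entry.is_dir() and match:
                started.add((match["fraction"], int(match["seed"])))
    return [pair for pair in expected if pair not in started]
```

Calling `find_unstarted_runs([("0.2", 43), ("0.4", 45)])` from the project root would return those pairs for which no matching directory exists, i.e. the candidates to run again.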
Regarding the second kind (listed in wandb as crashed):
Name | State | Hostname | identifier.epoch | classifier.epoch | output_dir | scored | cls epochs in log | scores_uploaded |
---|---|---|---|---|---|---|---|---|
warm-puddle-152 | crashed | node03 | 9 | 1 | outputs/fever_0.7_27/100/21_09_04_18_14_07 | 1 | | |
breezy-elevator-156 | crashed | node03 | 9 | 1 | outputs/fever_0.5_46/100/21_09_04_18_14_07 | 1 | | |
worthy-water-151 | crashed | node04 | 9 | 1 | outputs/fever_0.7_48/100/21_09_04_18_14_07 | 1 | | |
grateful-silence-212 | crashed | node02 | 5 | | outputs/fever_0.8_49/200/21_09_04_18_22_58 | - | 0 | - |
stoic-deluge-160 | crashed | node05 | 9 | 0 | outputs/fever_0.9_50/100/21_09_04_18_58_58 | 1 | | |
misty-shape-145 | crashed | node06 | 9 | 1 | outputs/fever/100/21_09_04_18_13_48 | 1 | | |
- Not investigated yet
Missing runs (all on fever)
dataset_meta.rationals_fraction | dataset_meta.generation_seed | Cause |
---|---|---|
0.2 | 43 | not created at all? (no output dir) |
0.4 | 45 | not created at all? (no output dir) |
0.6 | 47 | not created at all? (no output dir) |
0.8 | 28 | not created at all? (no output dir) |
and fever on seed (