Issue created Apr 13, 2021 by Maximilian Reimer (@mreimer, Owner)

Crashed runs on Fever

When running the experiments with different amounts of rationales on fever, quite a lot of runs simply crashed.

There were three kinds of crashes:

  1. Not listed in wandb, and stuck due to Issue 8 (solved)
  2. Listed in wandb as crashed, maybe just a wandb issue?
  3. Not listed in wandb (seems to just not have been started) -> run again

To 2.

| Name | State | Hostname | identifier.epoch | classifier.epoch | output_dir | scored | cls epochs in log | scores_uploaded |
|---|---|---|---|---|---|---|---|---|
| warm-puddle-152 | crashed | node03 | 9 | 1 | outputs/fever_0.7_27/100/21_09_04_18_14_07 | ✅ | 1 | ✅ |
| breezy-elevator-156 | crashed | node03 | 9 | 1 | outputs/fever_0.5_46/100/21_09_04_18_14_07 | ✅ | 1 | ✅ |
| worthy-water-151 | crashed | node04 | 9 | 1 | outputs/fever_0.7_48/100/21_09_04_18_14_07 | ✅ | 1 | ✅ |
| grateful-silence-212 | crashed | node02 | 5 | | outputs/fever_0.8_49/200/21_09_04_18_22_58 | - | 0 | - |
| stoic-deluge-160 | crashed | node05 | 9 | 0 | outputs/fever_0.9_50/100/21_09_04_18_58_58 | ✅ | 1 | ✅ |
| misty-shape-145 | crashed | node06 | 9 | 1 | outputs/fever/100/21_09_04_18_13_48 | ✅ | 1 | ✅ |
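One possible way to triage these runs, as a rough sketch: if a run is marked crashed in wandb but was still scored and had its scores uploaded, the crash flag may only be a wandb reporting problem, and only the remaining runs would need a rerun. The `Run` class and `needs_rerun` helper are hypothetical names for illustration, and the rule itself is an assumption, not a confirmed diagnosis.

```python
from dataclasses import dataclass


@dataclass
class Run:
    """Hypothetical record holding the columns relevant for triage."""
    name: str
    scored: bool
    scores_uploaded: bool


def needs_rerun(run: Run) -> bool:
    # Assumption: a run that was scored and uploaded its scores actually
    # finished, even though wandb lists it as crashed.
    return not (run.scored and run.scores_uploaded)


# Values copied from the table of runs listed as crashed in wandb.
runs = [
    Run("warm-puddle-152", True, True),
    Run("breezy-elevator-156", True, True),
    Run("worthy-water-151", True, True),
    Run("grateful-silence-212", False, False),
    Run("stoic-deluge-160", True, True),
    Run("misty-shape-145", True, True),
]

to_rerun = [run.name for run in runs if needs_rerun(run)]
print(to_rerun)
```

Under this rule only `grateful-silence-212` would be flagged for a rerun; the other five look finished despite their wandb state.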
  1. Not investigated yet

Missing runs (all on fever):

| dataset_meta.rationals_fraction | dataset_meta.generation_seed | Cause |
|---|---|---|
| 0.2 | 43 | not created at all? (no output dir) |
| 0.4 | 45 | not created at all? (no output dir) |
| 0.6 | 47 | not created at all? (no output dir) |
| 0.8 | 28 | not created at all? (no output dir) |
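A check for missing runs like the ones above could be automated roughly as follows. This is a minimal sketch that assumes output directories are named `outputs/fever_<fraction>_<seed>/...`, which is only inferred from the paths of the crashed runs; `find_missing_runs` and the example lists are hypothetical.

```python
import re

# Assumption: the directory name encodes rationals_fraction and generation_seed
# as fever_<fraction>_<seed>; this is inferred from the listed paths only.
DIR_PATTERN = re.compile(r"fever_(?P<fraction>\d+(?:\.\d+)?)_(?P<seed>\d+)")


def find_missing_runs(expected, output_dirs):
    """Return the expected (fraction, seed) pairs that have no output dir."""
    seen = set()
    for path in output_dirs:
        match = DIR_PATTERN.search(path)
        if match:
            # Fractions stay strings to avoid float comparison issues.
            seen.add((match.group("fraction"), int(match.group("seed"))))
    return [config for config in expected if config not in seen]


# Illustrative input: two existing directories and one configuration without one.
expected = [("0.2", 43), ("0.7", 27), ("0.5", 46)]
output_dirs = [
    "outputs/fever_0.7_27/100/21_09_04_18_14_07",
    "outputs/fever_0.5_46/100/21_09_04_18_14_07",
]

missing = find_missing_runs(expected, output_dirs)
print(missing)
```

In practice `output_dirs` would come from listing the `outputs/` folder on each node, and directories that do not follow the assumed pattern (such as `outputs/fever/...`) are simply ignored.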

and fever on seed (
