TRAINING_JOB_CREATED | Initial state when a job is first created. Baseten has received the training configuration and persisted it to our records. | ✅ | |
TRAINING_JOB_DEPLOYING | Baseten is deploying the job, including provisioning compute resources and installing dependencies. | ✅ | |
TRAINING_JOB_RUNNING | The training code is actively executing. | ✅ | |
TRAINING_JOB_COMPLETED | The job has successfully finished execution. Any checkpoints or artifacts have been saved and uploaded. | | ✅ |
TRAINING_JOB_DEPLOY_FAILED | The job failed to deploy. This is likely due to a bad image or a resource allocation issue. | | ✅ |
TRAINING_JOB_FAILED | The job encountered an error and could not complete successfully. Check the logs for error details. | | ✅ |
TRAINING_JOB_STOPPED | The job was manually stopped by a user. | | ✅ |