# Functions
AcceptJobLog is used for arena-go-sdk.
BuildTrainingJobInfo returns types.TrainingJobInfo.
CompatibleJobCRD Compatible with training-operator CRD.
Get real job status. When a job has pending pods, the tfJob still shows the Running state; it should be reported as Pending.
Get Event of the Job.
NewDeepSpeedJobTrainer creates a new DeepSpeed job trainer.
NewETJobTrainer.
NewMPIJobTrainer.
NewPyTorchJobTrainer.
NewRayJobTrainer.
Search for the training job by name and training type.
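The status correction described above (a job reported as Running while some of its pods are still pending should be shown as Pending) can be sketched as follows. This is a minimal illustration, not the package's implementation: `PodPhase`, `effectiveStatus`, and the status strings `"RUNNING"` and `"PENDING"` are hypothetical names chosen for the example.

```go
package main

import "fmt"

// PodPhase is a hypothetical stand-in for the pod phase type.
type PodPhase string

const (
	PodPending PodPhase = "Pending"
	PodRunning PodPhase = "Running"
)

// effectiveStatus returns "PENDING" when the reported status is "RUNNING"
// but at least one pod is still pending. Any other reported status is
// returned unchanged. The status strings are assumptions for this sketch.
func effectiveStatus(reported string, pods []PodPhase) string {
	if reported != "RUNNING" {
		return reported
	}
	for _, p := range pods {
		if p == PodPending {
			return "PENDING"
		}
	}
	return reported
}

func main() {
	fmt.Println(effectiveStatus("RUNNING", []PodPhase{PodRunning, PodPending}))
	fmt.Println(effectiveStatus("RUNNING", []PodPhase{PodRunning, PodRunning}))
	fmt.Println(effectiveStatus("SUCCEEDED", []PodPhase{PodPending}))
}
```

Only the Running case is overridden here, so terminal states such as a succeeded or failed job are never masked by a leftover pending pod.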
# Constants
NVIDIAGPUResourceName is the extended name of the GPU resource. Since v1.8, this uses the device plugin mechanism.
TrainingReplicaIndexLabel is the training-operator replica index label.
TrainingReplicaTypeLabel is the training-operator replica type label.
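As a rough illustration of how constants like these are typically used, the sketch below builds a label selector for one replica and sums GPU requests across containers. The constant values shown (`nvidia.com/gpu`, `training.kubeflow.org/replica-type`, `training.kubeflow.org/replica-index`) are assumptions based on common Kubernetes and training-operator conventions, not values confirmed by this document, and `replicaSelector` and `totalGPUs` are hypothetical helpers.

```go
package main

import "fmt"

// Assumed values; check the package source for the actual definitions.
const (
	nvidiaGPUResourceName     = "nvidia.com/gpu"
	trainingReplicaTypeLabel  = "training.kubeflow.org/replica-type"
	trainingReplicaIndexLabel = "training.kubeflow.org/replica-index"
)

// replicaSelector builds a label selector string that matches the pod
// for a given replica type and index.
func replicaSelector(replicaType string, index int) string {
	return fmt.Sprintf("%s=%s,%s=%d",
		trainingReplicaTypeLabel, replicaType,
		trainingReplicaIndexLabel, index)
}

// totalGPUs sums the GPU request of every container. Each map is a
// simplified stand-in for a container's resource requests.
func totalGPUs(containers []map[string]int64) int64 {
	var total int64
	for _, requests := range containers {
		total += requests[nvidiaGPUResourceName]
	}
	return total
}

func main() {
	fmt.Println(replicaSelector("worker", 0))
	fmt.Println(totalGPUs([]map[string]int64{
		{nvidiaGPUResourceName: 2},
		{"cpu": 4},
		{nvidiaGPUResourceName: 1},
	}))
}
```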
# Structs
DeepSpeedJob Information.
DeepSpeedJobTrainer DeepSpeed Job trainer.
ET Job Information.
ET Job trainer.
MPI Job Information.
MPI Job trainer.
PyTorch Job Information.
PyTorch Job trainer.
RayJob Information.
RayJob trainer.
Spark application wrapper.
Spark job trainer.
TensorflowJob implements TrainingJob. TensorFlow Job Information.
TensorFlow Job trainer.
Volcano job wrapper.
Volcano job trainer.