# Functions
No description provided by the author
RegisterDefaults adds defaulter functions to the given scheme.
Resource takes an unqualified resource and returns a group-qualified GroupResource.
No description provided by the author
No description provided by the author
No description provided by the author
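
The `Resource` helper described above can be sketched as follows. This is a minimal illustration, not the package's actual source: `GroupResource` here is a local stand-in for the type of the same name in `k8s.io/apimachinery`, and the `groupName` value and the `"mpijobs"` resource name are assumptions, since this listing shows neither.

```go
package main

import "fmt"

// GroupResource is a local stand-in for schema.GroupResource from
// k8s.io/apimachinery, redefined so the sketch has no dependencies.
type GroupResource struct {
	Group    string
	Resource string
}

// groupName is an assumed value; the real GroupName constant is not
// shown in this listing.
const groupName = "kubeflow.org"

// Resource qualifies a bare resource name with the package's group.
func Resource(resource string) GroupResource {
	return GroupResource{Group: groupName, Resource: resource}
}

func main() {
	gr := Resource("mpijobs") // "mpijobs" is a hypothetical resource name
	fmt.Printf("%s.%s\n", gr.Resource, gr.Group)
}
```

Keeping the group in a single constant means callers only ever pass the short resource name, so the group cannot drift between call sites.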
# Constants
No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author
DefaultLauncherRestartPolicy is the default RestartPolicy for the Launcher Job.
DefaultRestartPolicy is the default RestartPolicy for a ReplicaSpec.
EnvKubeflowNamespace is the environment variable for the Kubeflow namespace specified by the user.
GroupName is the group name used in this package.
GroupVersion is the version.
JobCreated means the job has been accepted by the system, but one or more of the pods/services have not been started.
JobFailed means one or more sub-resources (e.g.
JobRestarting means one or more sub-resources (e.g.
JobRunning means all sub-resources (e.g.
JobSucceeded means all sub-resources (e.g.
JobSuspended means the job has been suspended.
Kind is the kind name.
No description provided by the author
No description provided by the author
MPIReplicaTypeLauncher is the type for launcher replica.
MPIReplicaTypeWorker is the type for worker replicas.
OperatorName is the name of the operator, used as the value of the label common.OperatorLabelName.
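
The job condition constants listed above are typically consumed by comparing a condition's type against them. The sketch below assumes that `JobConditionType` is a string type and that the constants carry the string values shown; neither is confirmed by this listing. The `isTerminal` helper is hypothetical and treats only `JobSucceeded` and `JobFailed` as final states, which is also an assumption.

```go
package main

import "fmt"

// JobConditionType is assumed to be a string-backed type.
type JobConditionType string

// The string values below are assumptions for illustration only.
const (
	JobCreated    JobConditionType = "Created"
	JobRunning    JobConditionType = "Running"
	JobRestarting JobConditionType = "Restarting"
	JobSucceeded  JobConditionType = "Succeeded"
	JobSuspended  JobConditionType = "Suspended"
	JobFailed     JobConditionType = "Failed"
)

// isTerminal is a hypothetical helper: it reports whether a condition
// type is assumed to mean the job will make no further progress.
func isTerminal(t JobConditionType) bool {
	return t == JobSucceeded || t == JobFailed
}

func main() {
	for _, t := range []JobConditionType{JobCreated, JobRunning, JobSucceeded, JobFailed} {
		fmt.Printf("%s terminal=%v\n", t, isTerminal(t))
	}
}
```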
# Variables
No description provided by the author
No description provided by the author
No description provided by the author
No description provided by the author
# Structs
JobCondition describes the state of the job at a certain point.
JobStatus represents the current observed state of the training Job.
No description provided by the author
No description provided by the author
No description provided by the author
ReplicaStatus represents the current observed state of the replica.
RunPolicy encapsulates various runtime policies of the distributed training job, for example how to clean up resources and how long the job can stay active.
SchedulingPolicy encapsulates various scheduling policies of the distributed training job, for example `minAvailable` for gang-scheduling.
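
To illustrate how `JobStatus` and `ReplicaStatus` might fit together, the following sketch keeps one `ReplicaStatus` per replica type and sums the active replicas. The field names (`ReplicaStatuses`, `Active`, `Succeeded`, `Failed`), the string values of the replica type constants, and the `totalActive` helper are all assumptions made for this example; the listing above does not show the real struct layouts.

```go
package main

import "fmt"

// MPIReplicaType is assumed to be a string-backed type.
type MPIReplicaType string

// Assumed string values for the two replica types.
const (
	MPIReplicaTypeLauncher MPIReplicaType = "Launcher"
	MPIReplicaTypeWorker   MPIReplicaType = "Worker"
)

// ReplicaStatus is a simplified stand-in with assumed counter fields.
type ReplicaStatus struct {
	Active    int32
	Succeeded int32
	Failed    int32
}

// JobStatus is a simplified stand-in holding one status per replica type.
type JobStatus struct {
	ReplicaStatuses map[MPIReplicaType]*ReplicaStatus
}

// totalActive is a hypothetical helper that sums active replicas
// across all replica types, skipping nil entries.
func totalActive(s JobStatus) int32 {
	var n int32
	for _, rs := range s.ReplicaStatuses {
		if rs != nil {
			n += rs.Active
		}
	}
	return n
}

func main() {
	status := JobStatus{ReplicaStatuses: map[MPIReplicaType]*ReplicaStatus{
		MPIReplicaTypeLauncher: {Active: 1},
		MPIReplicaTypeWorker:   {Active: 4, Failed: 1},
	}}
	fmt.Println(totalActive(status)) // prints 5
}
```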
# Type aliases
CleanPodPolicy describes how to deal with pods when the job is finished.
JobConditionType defines the possible condition types of a JobStatus.
No description provided by the author
MPIReplicaType is the type for MPIReplica.
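
As a rough illustration of how a `CleanPodPolicy` value could drive cleanup once a job finishes, the sketch below assumes a string-backed type with three values (`All`, `Running`, `None`). These constant names and values, and the `shouldDeletePod` helper, are hypothetical; the listing above does not enumerate the policy's actual values.

```go
package main

import "fmt"

// CleanPodPolicy is assumed to be a string-backed type.
type CleanPodPolicy string

// Hypothetical policy values, for illustration only.
const (
	CleanPodPolicyAll     CleanPodPolicy = "All"
	CleanPodPolicyRunning CleanPodPolicy = "Running"
	CleanPodPolicyNone    CleanPodPolicy = "None"
)

// shouldDeletePod is a hypothetical helper: given the policy and whether
// a pod is still running, it reports whether the pod should be removed
// after the job has finished.
func shouldDeletePod(policy CleanPodPolicy, podRunning bool) bool {
	switch policy {
	case CleanPodPolicyAll:
		return true
	case CleanPodPolicyRunning:
		return podRunning
	default: // CleanPodPolicyNone or an unknown value: keep the pod
		return false
	}
}

func main() {
	fmt.Println(shouldDeletePod(CleanPodPolicyRunning, true))  // prints true
	fmt.Println(shouldDeletePod(CleanPodPolicyRunning, false)) // prints false
}
```

Defaulting unknown values to "keep the pod" is a conservative choice in this sketch: a misconfigured policy then leaves resources behind instead of deleting something unexpectedly.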