github.com/kubeflow/training-operator@v1.7.0/docs/quick-start-v1.md (about)

     1  # Testing v1
     2  
     3  TFJob is currently in v1. The quick start shows an example of TFJob.
     4  For more details please refer to [developer_guide.md](development/developer_guide.md).
     5  
     6  ## Create a TFJob
     7  
     8  Please see the [example](../examples/tensorflow/dist-mnist/README.md) to create a TFJob.
     9  
    10  ## Monitor your job
    11  
    12  To get the status of your job
    13  
    14  ```
    15  kubectl get -o yaml tfjobs $JOB
    16  ```
    17  
    18  Here is sample output for an example job
    19  
    20  ```yaml
    21  apiVersion: kubeflow.org/v1
    22  kind: TFJob
    23  metadata:
    24    creationTimestamp: 2019-03-06T09:50:49Z
    25    generation: 1
    26    name: dist-mnist-for-e2e-test
    27    namespace: kubeflow
    28    resourceVersion: "16575458"
    29    selfLink: /apis/kubeflow.org/v1/namespaces/kubeflow/tfjobs/dist-mnist-for-e2e-test
    30    uid: 526545f8-3ff5-11e9-a818-0016ac101ba4
    31  spec:
    32    cleanPodPolicy: Running
    33    tfReplicaSpecs:
    34      PS:
    35        replicas: 2
    36        restartPolicy: Never
    37        template:
    38          metadata:
    39            creationTimestamp: null
    40          spec:
    41            containers:
    42              - image: kubeflow/tf-dist-mnist-test:1.0
    43                name: tensorflow
    44                ports:
    45                  - containerPort: 2222
    46                    name: tfjob-port
    47                resources: {}
    48      Worker:
    49        replicas: 4
    50        restartPolicy: Never
    51        template:
    52          metadata:
    53            creationTimestamp: null
    54          spec:
    55            containers:
    56              - image: kubeflow/tf-dist-mnist-test:1.0
    57                name: tensorflow
    58                ports:
    59                  - containerPort: 2222
    60                    name: tfjob-port
    61                resources: {}
    62  status:
    63    conditions:
    64      - lastTransitionTime: 2019-03-06T09:50:36Z
    65        lastUpdateTime: 2019-03-06T09:50:36Z
    66        message: TFJob dist-mnist-for-e2e-test is created.
    67        reason: TFJobCreated
    68        status: "True"
    69        type: Created
    70      - lastTransitionTime: 2019-03-06T09:50:57Z
    71        lastUpdateTime: 2019-03-06T09:50:57Z
    72        message: TFJob dist-mnist-for-e2e-test is running.
    73        reason: TFJobRunning
    74        status: "True"
    75        type: Running
    76    replicaStatuses:
    77      PS:
    78        active: 2
    79      Worker:
    80        active: 4
    81    startTime: 2019-03-06T09:50:48Z
    82  ```