# Testing v1

TFJob is currently in v1. The quick start shows an example of TFJob.
For more details please refer to [developer_guide.md](development/developer_guide.md).

## Create a TFJob

Please see the [example](../examples/tensorflow/dist-mnist/README.md) to create a TFJob.

## Monitor your job

To get the status of your job:

```
kubectl get -o yaml tfjobs $JOB
```

Here is sample output for an example job:

```yaml
apiVersion: kubeflow.org/v1
kind: TFJob
metadata:
  creationTimestamp: 2019-03-06T09:50:49Z
  generation: 1
  name: dist-mnist-for-e2e-test
  namespace: kubeflow
  resourceVersion: "16575458"
  selfLink: /apis/kubeflow.org/v1/namespaces/kubeflow/tfjobs/dist-mnist-for-e2e-test
  uid: 526545f8-3ff5-11e9-a818-0016ac101ba4
spec:
  cleanPodPolicy: Running
  tfReplicaSpecs:
    PS:
      replicas: 2
      restartPolicy: Never
      template:
        metadata:
          creationTimestamp: null
        spec:
          containers:
          - image: kubeflow/tf-dist-mnist-test:1.0
            name: tensorflow
            ports:
            - containerPort: 2222
              name: tfjob-port
            resources: {}
    Worker:
      replicas: 4
      restartPolicy: Never
      template:
        metadata:
          creationTimestamp: null
        spec:
          containers:
          - image: kubeflow/tf-dist-mnist-test:1.0
            name: tensorflow
            ports:
            - containerPort: 2222
              name: tfjob-port
            resources: {}
status:
  conditions:
  - lastTransitionTime: 2019-03-06T09:50:36Z
    lastUpdateTime: 2019-03-06T09:50:36Z
    message: TFJob dist-mnist-for-e2e-test is created.
    reason: TFJobCreated
    status: "True"
    type: Created
  - lastTransitionTime: 2019-03-06T09:50:57Z
    lastUpdateTime: 2019-03-06T09:50:57Z
    message: TFJob dist-mnist-for-e2e-test is running.
    reason: TFJobRunning
    status: "True"
    type: Running
  replicaStatuses:
    PS:
      active: 2
    Worker:
      active: 4
  startTime: 2019-03-06T09:50:48Z
```
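
The `status.conditions` list in the output can also be read programmatically. The following is a minimal sketch in Python, assuming the job was fetched as JSON (for example with `kubectl get -o json tfjobs $JOB`). The function name `latest_condition` and the trimmed `SAMPLE` document are illustrative only and are not part of the training operator.

```python
import json
from typing import Optional

# Trimmed copy of the status section from the sample output (illustrative only).
SAMPLE = """
{
  "status": {
    "conditions": [
      {"lastTransitionTime": "2019-03-06T09:50:36Z", "status": "True", "type": "Created"},
      {"lastTransitionTime": "2019-03-06T09:50:57Z", "status": "True", "type": "Running"}
    ],
    "replicaStatuses": {"PS": {"active": 2}, "Worker": {"active": 4}}
  }
}
"""


def latest_condition(tfjob: dict) -> Optional[str]:
    """Return the type of the most recent condition whose status is "True"."""
    latest = None
    for cond in tfjob.get("status", {}).get("conditions", []):
        if cond.get("status") != "True":
            continue
        # Timestamps in this fixed UTC format compare correctly as plain strings;
        # on a tie, the entry that appears later in the list wins.
        if latest is None or cond.get("lastTransitionTime", "") >= latest.get(
            "lastTransitionTime", ""
        ):
            latest = cond
    return latest["type"] if latest else None


job = json.loads(SAMPLE)
print(latest_condition(job))
```

In this sketch the most recent condition is chosen by `lastTransitionTime` instead of by list position, which assumes nothing about the order in which conditions are stored.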