Skip to content
This repository was archived by the owner on Nov 28, 2025. It is now read-only.

Commit 6b892a1

Browse files
bhackjhseu
authored andcommitted
Use universal container (#38)
1 parent 134f622 commit 6b892a1

File tree

2 files changed

+3
-3
lines changed

2 files changed

+3
-3
lines changed

marathon/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
# Running Distributed TensorFlow on Mesos/Marathon
22

33
## Prerequisite
4-
Before you start, you need to set up a Mesos cluster with Marathon installed and Docker Containerizer and Mesos-DNS enabled. It is also preferable to set up some shared storage such as HDFS in the cluster. All of these could be easily installed and configured with the help of [DC/OS](https://dcos.io/docs/1.7/administration/installing/custom/gui/). You need to remember the master target, DNS domain and HDFS namenode which are needed to bring up the TensorFlow cluster.
4+
Before you start, you need to set up a Mesos cluster with Marathon installed and Mesos-DNS enabled. It is also preferable to set up some shared storage such as HDFS in the cluster. All of these could be easily installed and configured with the help of [DC/OS](https://dcos.io/docs/1.9/administration/installing/custom/gui/). You need to remember the master target, DNS domain and HDFS namenode which are needed to bring up the TensorFlow cluster.
55

66
## Write the Training Program
77
This section covers instructions on how to write your training program and build your docker image.

marathon/template.json.jinja

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@
3737
"image": "{{ image }}",
3838
},
3939
"network": "HOST",
40-
"type": "DOCKER"
40+
"type": "MESOS"
4141
},
4242
"args": ["--worker_hosts", "{{ worker_hosts() }}", "--ps_hosts", "{{ ps_hosts() }}", "--job_name", "{{ job }}", "--task_index", "{{ i }}", "--train_dir", "{{ train_dir }}", "--sync_replica", "True", "--train_steps", "2000"],
4343
"cpus": {{ cpu }},
@@ -53,7 +53,7 @@
5353
"image": "{{ image }}",
5454
},
5555
"network": "HOST",
56-
"type": "DOCKER"
56+
"type": "MESOS"
5757
},
5858
"cmd": "tensorboard --logdir {{ train_dir }}",
5959
"cpus": {{ cpu }},

0 commit comments

Comments
 (0)