Discussion:
[jira] [Created] (MAHOUT-1767) Unable to run tests on H2O enigne in distributed mode
Dmitry Yaraev (JIRA)
2015-08-14 06:32:46 UTC
Permalink
Dmitry Yaraev created MAHOUT-1767:
-------------------------------------

Summary: Unable to run tests on H2O enigne in distributed mode
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev


When one follows the instructions located [README.md for H2O module|https://github.com/apache/mahout/tree/master/h2o], test run in the local mode. There are three steps:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one mode worker is started. And it should join to the same cloud as other worked nodes forms. But it does joined them because it has different cloud name (or _masterURL_ in terms of the code). If you look in to the code, you can found the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Dmitry Yaraev (JIRA)
2015-08-14 06:35:45 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitry Yaraev updated MAHOUT-1767:
----------------------------------
Description:
When one follows the instructions located [README.md for H2O module|https://github.com/apache/mahout/tree/master/h2o], test run in the local mode. There are three steps:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has different cloud name (or _masterURL_ in terms of the code). If you look in to the code, you can found the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.

was:
When one follows the instructions located [README.md for H2O module|https://github.com/apache/mahout/tree/master/h2o], test run in the local mode. There are three steps:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one mode worker is started. And it should join to the same cloud as other worked nodes forms. But it does joined them because it has different cloud name (or _masterURL_ in terms of the code). If you look in to the code, you can found the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
We tried to remove generated suffix from the cloud name. After that it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Dmitry Yaraev (JIRA)
2015-08-14 06:39:46 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitry Yaraev updated MAHOUT-1767:
----------------------------------
Description:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has different cloud name (or _masterURL_ in terms of the code). If you look in to the code, you can found the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.

was:
When one follows the instructions located [README.md for H2O module|https://github.com/apache/mahout/tree/master/h2o], test run in the local mode. There are three steps:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has different cloud name (or _masterURL_ in terms of the code). If you look in to the code, you can found the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
We tried to remove generated suffix from the cloud name. After that it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Dmitry Yaraev (JIRA)
2015-08-14 06:44:45 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitry Yaraev updated MAHOUT-1767:
----------------------------------
Description:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps in the instruction:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has different cloud name (or _masterURL_ in terms of the code). If you look in to the code, you can found the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.

was:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has different cloud name (or _masterURL_ in terms of the code). If you look in to the code, you can found the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
We tried to remove generated suffix from the cloud name. After that it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Dmitry Yaraev (JIRA)
2015-08-14 06:45:46 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitry Yaraev updated MAHOUT-1767:
----------------------------------
Description:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps in the instruction:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has a different cloud name (or _masterURL_ in terms of the code). If you look in the code, you can find the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.

was:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps in the instruction:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has a different cloud name (or _masterURL_ in terms of the code). If you look in the code, you can found the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
We tried to remove generated suffix from the cloud name. After that it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Dmitry Yaraev (JIRA)
2015-08-14 06:45:46 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitry Yaraev updated MAHOUT-1767:
----------------------------------
Description:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps in the instruction:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has a different cloud name (or _masterURL_ in terms of the code). If you look in the code, you can found the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.

was:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps in the instruction:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has different cloud name (or _masterURL_ in terms of the code). If you look in to the code, you can found the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
We tried to remove generated suffix from the cloud name. After that it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Dmitry Yaraev (JIRA)
2015-08-14 06:46:45 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitry Yaraev updated MAHOUT-1767:
----------------------------------
Description:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps in the instruction:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has a different cloud name (or _masterURL_ in terms of the code). If you look in the code, you can find the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

After we removed the generated suffix from the cloud name, it started to work.

was:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps in the instruction:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has a different cloud name (or _masterURL_ in terms of the code). If you look in the code, you can find the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

We tried to remove generated suffix from the cloud name. After that it started to work.
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
After we removed the generated suffix from the cloud name, it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Dmitry Yaraev (JIRA)
2015-08-14 06:47:45 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitry Yaraev updated MAHOUT-1767:
----------------------------------
Description:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps in the instruction:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has a different cloud name (or _masterURL_ in terms of the code). If you look in the code, you can find the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

After we removed the code which appends current time to cloud name, it started to work.

was:
When one follows the instructions located in [README.md for H2O module|https://github.com/apache/mahout/blob/master/h2o/README.md] and tries to run tests in the distributed mode, tests run only in the local mode. There are three steps in the instruction:

# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}

First two steps are for executing worker nodes. The last one is for executing tests. According to the instruction, after launching tests one more worker is started. And it should join to the same cloud which other worker nodes forms. But it does joined them because it has a different cloud name (or _masterURL_ in terms of the code). If you look in the code, you can find the following:
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}

After we removed the generated suffix from the cloud name, it started to work.
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
After we removed the code which appends current time to cloud name, it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Suneel Marthi (JIRA)
2016-03-12 23:24:33 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suneel Marthi updated MAHOUT-1767:
----------------------------------
Assignee: Andrew Palumbo
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
Assignee: Andrew Palumbo
Fix For: 0.12.0
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
After we removed the code which appends current time to cloud name, it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Suneel Marthi (JIRA)
2016-03-12 23:24:33 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suneel Marthi updated MAHOUT-1767:
----------------------------------
Fix Version/s: 0.12.0
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
Assignee: Andrew Palumbo
Fix For: 0.12.0
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
After we removed the code which appends current time to cloud name, it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Suneel Marthi (JIRA)
2016-04-08 22:24:25 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suneel Marthi updated MAHOUT-1767:
----------------------------------
Fix Version/s: (was: 0.12.0)
1.0.0
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
Assignee: Andrew Palumbo
Fix For: 1.0.0
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
After we removed the code which appends current time to cloud name, it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Suneel Marthi (JIRA)
2016-12-26 06:45:58 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15777717#comment-15777717 ]

Suneel Marthi commented on MAHOUT-1767:
---------------------------------------

H2O is not being supported now and we have not seen anyone even attempting it, will resolve this as 'Won't Fix'
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
Assignee: Andrew Palumbo
Fix For: 1.0.0
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
After we removed the code which appends current time to cloud name, it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
Suneel Marthi (JIRA)
2016-12-26 06:45:58 UTC
Permalink
[ https://issues.apache.org/jira/browse/MAHOUT-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suneel Marthi resolved MAHOUT-1767.
-----------------------------------
Resolution: Won't Fix
Post by Dmitry Yaraev (JIRA)
Unable to run tests on H2O enigne in distributed mode
-----------------------------------------------------
Key: MAHOUT-1767
URL: https://issues.apache.org/jira/browse/MAHOUT-1767
Project: Mahout
Issue Type: Bug
Components: Documentation
Affects Versions: 0.11.0
Reporter: Dmitry Yaraev
Assignee: Andrew Palumbo
Fix For: 1.0.0
# {code}
host-1:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 1 formed [/W.X.Y.Z:54321]
{code}
# {code}
host-2:~/mahout$ ./bin/mahout h2o-node
...
.. INFO: Cloud of size 2 formed [/A.B.C.D:54322]
{code}
# {code}
host-N:~/mahout/h2o$ mvn test
...
.. INFO: Cloud of size 3 formed [/E.F.G.H:54323]
...
All tests passed.
...
host-N:~/mahout/h2o$
{code}
{code:title=DistributedH2OSuite.scala}
...
mahoutCtx = mahoutH2OContext("mah2out" + System.currentTimeMillis())
...
{code}
After we removed the code which appends current time to cloud name, it started to work.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Loading...