Discussion:
Problem with fuzzy clustering in Mahout
Prakash Poudyal
2016-04-27 01:03:46 UTC
I was trying fuzzy clustering in Mahout, following the notes on this site:

http://tuxdna.in/files/notes/mahout.html

It's quite helpful, but I got an error after running the fuzzy clustering
command.

***@840:/home/prakash/mahout/trunk/bin# ./mahout clusterdump -b 10 -n 10
-dt sequencefile -d
/data/lda/output-seq2sparse-normalized/dictionary.file-* -i
/data/lda/output-kmeans-cosine-clusters/clusters-4-final -o
./kmeans-cosine-dump

MAHOUT_LOCAL is set, so we don't add HADOOP_CONF_DIR to classpath.
MAHOUT_LOCAL is set, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in
[jar:file:/home/prakash/mahout/trunk/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in
[jar:file:/home/prakash/mahout/trunk/examples/target/dependency/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an
explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
16/04/27 01:49:47 INFO common.AbstractJob: Command line arguments:
{--dictionary=[/data/lda/output-seq2sparse-normalized/dictionary.file-0],
--dictionaryType=[sequencefile],
--distanceMeasure=[org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure],
--endPhase=[2147483647],
--input=[/data/lda/output-kmeans-cosine-clusters/clusters-4-final],
--numWords=[10], --output=[./kmeans-cosine-dump], --outputFormat=[TEXT],
--startPhase=[0], --substring=[10], --tempDir=[temp]}
16/04/27 01:49:58 INFO clustering.ClusterDumper: Wrote 0 clusters
16/04/27 01:49:58 INFO driver.MahoutDriver: Program took 10507 ms (Minutes:
0.17511666666666667)
***@840:/home/prakash/mahout/trunk/bin# ./mahout fkmeans -i
/data/lda/output-seq2sparse-normalized/tfidf-vectors -c
/data/lda/output-fkmeans-squared-initialclusters -o
/data/lda/output-fkmeans-squared-clusters -cd 1.0 -k 20 -m 2 -ow -x 20 -dm
org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure
MAHOUT_LOCAL is set, so we don't add HADOOP_CONF_DIR to classpath.
MAHOUT_LOCAL is set, running locally
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in
[jar:file:/home/prakash/mahout/trunk/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in
[jar:file:/home/prakash/mahout/trunk/examples/target/dependency/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an
explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
16/04/27 01:50:12 INFO common.AbstractJob: Command line arguments:
{--clusters=[/data/lda/output-fkmeans-squared-initialclusters],
--convergenceDelta=[1.0],
--distanceMeasure=[org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure],
--emitMostLikely=[true], --endPhase=[2147483647],
--input=[/data/lda/output-seq2sparse-normalized/tfidf-vectors], --m=[2],
--maxIter=[20], --method=[mapreduce], --numClusters=[20],
--output=[/data/lda/output-fkmeans-squared-clusters], --overwrite=null,
--startPhase=[0], --tempDir=[temp], --threshold=[0]}
16/04/27 01:50:22 WARN util.NativeCodeLoader: Unable to load native-hadoop
library for your platform... using builtin-java classes where applicable
16/04/27 01:50:23 INFO compress.CodecPool: Got brand-new compressor
16/04/27 01:50:23 INFO kmeans.RandomSeedGenerator: Wrote 20 Klusters to
/data/lda/output-fkmeans-squared-initialclusters/part-randomSeed
16/04/27 01:50:23 INFO compress.CodecPool: Got brand-new decompressor
Exception in thread "main" java.lang.IllegalStateException: No input
clusters found in
/data/lda/output-fkmeans-squared-initialclusters/part-randomSeed. Check
your -c argument.
at
org.apache.mahout.clustering.fuzzykmeans.FuzzyKMeansDriver.buildClusters(FuzzyKMeansDriver.java:276)
at
org.apache.mahout.clustering.fuzzykmeans.FuzzyKMeansDriver.run(FuzzyKMeansDriver.java:223)
at
org.apache.mahout.clustering.fuzzykmeans.FuzzyKMeansDriver.run(FuzzyKMeansDriver.java:115)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at
org.apache.mahout.clustering.fuzzykmeans.FuzzyKMeansDriver.main(FuzzyKMeansDriver.java:51)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)


It would be great if you could help.
--
Regards
Prakash Poudyal