Java based Mahout's goal is to build scalable machine learning libraries. Are there any equivalent libraries in Python ?
scikits learn is highly recommended http://scikit-learn.sourceforge.net/
sunan
Spark MLlib is recommmended. It is a scalable machine learning lib, can read data from HDFS and of course runs on top of Spark.
You can access it via PySpark (see the Programming Guide's Python examples).
来源:https://stackoverflow.com/questions/4819437/javas-mahout-equivalent-in-python