mahout常用DistanceMeasure一览表:
org.apache.mahout.common.distance.ChebyshevDistanceMeasure; org.apache.mahout.common.distance.CosineDistanceMeasure; org.apache.mahout.common.distance.EuclideanDistanceMeasure; org.apache.mahout.common.distance.MahalanobisDistanceMeasure; org.apache.mahout.common.distance.ManhattanDistanceMeasure; org.apache.mahout.common.distance.MinkowskiDistanceMeasure; org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure; org.apache.mahout.common.distance.TanimotoDistanceMeasure; org.apache.mahout.common.distance.WeightedEuclideanDistanceMeasure; org.apache.mahout.common.distance.WeightedManhattanDistanceMeasure;
比较常用的有两个欧氏距离和余弦
CosineDistanceMeasure:
EuclideanDistanceMeasure:
EuclideanDistanceMeasure和CosineDistanceMeasure区别见图:
其中有两个基于权重的距离计算方法:
org.apache.mahout.common.distance.WeightedEuclideanDistanceMeasure; org.apache.mahout.common.distance.WeightedManhattanDistanceMeasure;
使用方式如下:
WeightedManhattanDistanceMeasure cdm = new WeightedManhattanDistanceMeasure(); cdm.setWeights(new DenseVector(new double[] {1, 2, 3 })); System.out.println(cdm.distance(new DenseVector(new double[] {0.11, 510, 10 }), new DenseVector(new double[] {0.11, 510, 1 })));
设置weight即可
相关推荐
https://github.com/chubbyjiang/MapReduce
mahout_help,mahout的java api帮助文档,可以帮你更轻松掌握mahout
maven_mahout_template-mahout-0.8
MAHOUT实战 MAHOUT IN ACTION
mahoutAlgorithms源码分析 mahout代码解析
mahout-examples-0.11.1 mahout-examples-0.11.1-job mahout-h2o_2.10-0.11.1 mahout-h2o_2.10-0.11.1-dependency-reduced mahout-hdfs-0.11.1 mahout-integration-0.11.1 mahout-math-0.11.1 mahout-math-0.11.1 ...
mahout-integration-0.7mahout-integration-0.7mahout-integration-0.7mahout-integration-0.7
Mahout 官方文档,简单介绍Mahout概念及其安装和简单使用
If you are a Java developer and want to use Mahout and Machine Learning to solve Big Data analytics use-cases then this book is for you. Familiarity with shell-scripts is assumed but no prior ...
mahout-core-0.9.jar+mahout-core-0.8.jar+mahout-core-0.1.jar
Mahout0.8_API 喜欢的人就请下载吧
mahout,朴素贝叶斯分类,中文分词,mahout,朴素贝叶斯分类,中文分词,
spring-mahout-demo-----一个简单的spring-mahout结合的例子,是很好的学习开发思路的例子。
Apache Mahout 简介
Mahout支持K-Means等聚类算法,在此zip包中已经有打好jar包的资源,不需要用户再打jar包,可以直接使用。
数据下载自http://grouplens.org/datasets/movielens/的10m, 下载速度非常慢,放到此处分享
mahout 入门中文材料,是IBM文章汇总,值得一看
Mahout是一个Java的机器学习库。Mahout的完整源代码,基于maven,可以轻易导入工程中
mahout中的贝叶斯算法的拓展开发包,提供了相关接口可以供用户调用,直接即可跑出结果,相关运行方式参考blog《mahout贝叶斯算法开发思路(拓展篇)》
• 1、什么是mahout? • 2、mahout是干啥的 ? • 3、mahout是怎么干的? Apache Mahout 是 Apache Software Foundation (ASF) 开发的一个全新的开源项目,其主要目标是创建一些可伸缩的机器学习算法,供开发人员...