Last updated: 2021-07-09 21:58:27
Cover page
Apache Mahout Clustering Designs
Credits
About the Author
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers, and more
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Chapter 1. Understanding Clustering
The clustering concept
Understanding distance measures
Understanding different clustering techniques
Algorithm support in Mahout
Clustering algorithms in Mahout
Installing Mahout
Preparing data for use with clustering techniques
Summary
Chapter 2. Understanding K-means Clustering
Learning K-means
Visualizing clusters
Chapter 3. Understanding Canopy Clustering
Running Canopy clustering on Mahout
Working with CSV files
Chapter 4. Understanding the Fuzzy K-means Algorithm Using Mahout
Learning Fuzzy K-means clustering
Chapter 5. Understanding Model-based Clustering
Learning model-based clustering
Running LDA using Mahout
Chapter 6. Understanding Streaming K-means
Learning Streaming K-means
Using Mahout for streaming K-means
Chapter 7. Spectral Clustering
Understanding spectral clustering
Mahout implementation of spectral clustering
Chapter 8. Improving Cluster Quality
Evaluating clusters
Using DistanceMeasure interface
Chapter 9. Creating a Cluster Model for Production
Preparing the dataset
Launching the Mahout job on the cluster
Performance tuning for the job
Index