- Building Machine Learning Systems with Python
- Willi Richert Luis Pedro Coelho
- 148字
- 2021-08-13 16:35:47
Tweaking the parameters
So what about all the other parameters? Can we tweak them all to get better results?
Sure. We could, of course, tweak the number of clusters or play with the vectorizer's max_features
parameter (you should try that!). Also, we could play with different cluster center initializations. There are also more exciting alternatives to KMeans itself. There are, for example, clustering approaches that also let you use different similarity measurements such as Cosine similarity, Pearson, or Jaccard. An exciting field for you to play.
But before you go there, you will have to define what you actually mean by "better". Scikit has a complete package dedicated only to this definition. The package is called sklearn.metrics
and also contains a full range of different metrics to measure clustering quality. Maybe that should be the first place to go now, right into the sources of the metrics package.
- HornetQ Messaging Developer’s Guide
- Flask Blueprints
- C語言程序設計(第3版)
- Object-Oriented JavaScript(Second Edition)
- Banana Pi Cookbook
- Python機器學習實戰
- 區塊鏈技術與應用
- Android開發三劍客:UML、模式與測試
- Frank Kane's Taming Big Data with Apache Spark and Python
- Cocos2d-x Game Development Blueprints
- Extending Unity with Editor Scripting
- Deep Learning with R Cookbook
- 計算機應用基礎(第二版)
- SQL Server實例教程(2008版)
- Splunk Essentials