- Building Machine Learning Systems with Python
- Willi Richert Luis Pedro Coelho
- 2021-08-13 16:35:47
Tweaking the parameters
So what about all the other parameters? Can we tweak them all to get better results?
Sure. We could, of course, tweak the number of clusters or play with the vectorizer's max_features parameter (you should try that!). We could also try different cluster center initializations. There are also more exciting alternatives to KMeans itself: some clustering approaches, for example, let you use different similarity measures such as cosine similarity, Pearson correlation, or Jaccard similarity. An exciting field for you to play in.
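As a rough illustration of that kind of tweaking, the following sketch loops over a few values for the number of clusters, max_features, and the initialization strategy. The toy corpus, the helper name cluster_posts, and the specific parameter values are assumptions made up for this example; TfidfVectorizer is used here as one possible vectorizer, not necessarily the one used elsewhere in the book.

```python
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer

# Toy corpus, invented for illustration only.
posts = [
    "How to format my hard disk",
    "Hard disk format problems",
    "Imaging databases store images",
    "Imaging databases can get huge",
    "Most imaging databases save images permanently",
    "Disk drive failure after format",
]


def cluster_posts(texts, num_clusters, max_features, init="k-means++", seed=3):
    """Vectorize the texts and cluster them (hypothetical helper).

    Returns the cluster label of each text and the KMeans inertia,
    i.e. the sum of squared distances to the closest cluster center.
    """
    vectorizer = TfidfVectorizer(stop_words="english", max_features=max_features)
    vectors = vectorizer.fit_transform(texts)
    km = KMeans(n_clusters=num_clusters, init=init, n_init=10, random_state=seed)
    labels = km.fit_predict(vectors)
    return labels, km.inertia_


# Try every combination of a few candidate settings.
for num_clusters in (2, 3):
    for max_features in (5, None):  # None keeps the full vocabulary
        for init in ("k-means++", "random"):
            labels, inertia = cluster_posts(posts, num_clusters, max_features, init)
            print(num_clusters, max_features, init, labels.tolist(), round(inertia, 3))
```

Note that inertia is not comparable across different max_features values, because the feature space itself changes, which is one reason a proper definition of "better" is needed before comparing runs.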
But before you go there, you will have to define what you actually mean by "better". scikit-learn has a complete package dedicated only to this definition. It is called sklearn.metrics and contains a full range of metrics for measuring clustering quality. Maybe that should be the first place to go now: right into the sources of the metrics package.
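To make "better" a little more concrete, here is a minimal sketch that scores KMeans runs with three functions from sklearn.metrics: silhouette_score, which needs no ground truth, and adjusted_rand_score and v_measure_score, which compare against known labels. The toy corpus, the hand-assigned true_labels, and the helper name score_clustering are assumptions made up for this example.

```python
from sklearn import metrics
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer

# Toy corpus and hand-assigned topic labels, invented for illustration only.
posts = [
    "How to format my hard disk",
    "Hard disk format problems",
    "Imaging databases store images",
    "Imaging databases can get huge",
    "Most imaging databases save images permanently",
    "Disk drive failure after format",
]
true_labels = [0, 0, 1, 1, 1, 0]  # 0 = disk posts, 1 = database posts


def score_clustering(vectors, predicted, truth):
    """Collect a few clustering quality scores (hypothetical helper)."""
    return {
        # Internal measure: uses only the data and the predicted labels.
        "silhouette": metrics.silhouette_score(vectors, predicted),
        # External measures: compare predicted labels with the ground truth.
        "adjusted_rand": metrics.adjusted_rand_score(truth, predicted),
        "v_measure": metrics.v_measure_score(truth, predicted),
    }


vectors = TfidfVectorizer(stop_words="english").fit_transform(posts)

for num_clusters in (2, 3, 4):
    km = KMeans(n_clusters=num_clusters, n_init=10, random_state=3).fit(vectors)
    scores = score_clustering(vectors, km.labels_, true_labels)
    print(num_clusters, {name: round(value, 3) for name, value in scores.items()})
```

Which score to optimize depends on whether labeled data is available: the silhouette coefficient can always be computed, while the adjusted Rand index and V-measure only make sense when some ground truth exists.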