Document clustering – a short intro
Clustering is super important in all systems that deal with any kind of information. In information retrieval systems like digital libraries and search engines they are used to group the documents into clusters. These are all documents that share similarities.
This can get really really complex very quickly, and there are loads of different clustering methods happening at all different stages to produce sufficiently exact results. Here I’m sharing with you a presentation on the topic which isn’t too involved and is quite high level. There are some maths but you can ignore them if you like, you won’t completely lose out or anything.