This is a fantastic presentation which takes you through all sorts of commonly used algorithms in text mining and for processing web data. Concepts and methods from machine learning are presented and there’s even some little stick men, so it’s gotta be good.
Who is this for? Those of you who are interested in understanding how text and language can be manipulated and abstracted by computers. This will allow you to understand far more about how machines deal with language and why we use particular techniques. If this were a sensationalist blog the title would be “Understand Google better!” or something like that


I finally got around to spending some quality time with the presentation. It’s probably the most simplified reading that I’ve seen to date that full encompasses the IR, NLP, and Data Mining concepts.
I really wish I had something like that as a primer when I first started down this path. It would have saved me days and weeks trying to understand the over complicated explanations on Wikipedia and the sort.
Thanks for sharing!
You’re welcome!
i read every single of your post sometimes i read it over and over again ..you really inspired many people here..wish you a good luck in everything you do..
Aw, thank you Gina, that’s really nice to know.
nice tutorial CJ
Thank you!
Thanks for this! Really quite useful!
lol
Hey!
I was just having giant pleasure reading your site. It was great time for me indeed. If there would be more sites with so much usefull informations like this one, then my knowledge wouldn’t be so painful to get for me. I can assume that there would be no necessery to spare so much time on searching informations. So in conclusion i just wanted to show you how i am grateful for your effort to make this site.