I’m a huge fan of Daniel Tunkelang and his blog “The noisy channel“. Recently he spoke at the Text Analytics Summit and was kind enough to share his slides with us all. Check out the other available presentations as well, they’re very good. Enabling Exploration Through Text A[...]
Archive for the ‘Information/text analysis’ Category
Enabling Exploration Through Text Analytics
Shiny new dataset
I thought I would do a little dance a give a little w00t for the new data repository that we now have access to. All researchers will be delighted to know that you can now access raw data from the Data.gov repository. I’m also pleased because we can suggest new datasets which means this has [.[...]
How does a search engine know what words mean?
Word sense disambiguation (WSD) belongs to the field of computational linguistics. It’s the research area dedicated to finding ways for machines to understand the meaning of words. More precisely, it’s about determining the word sense of a particular word in a context. This[...]
Text mine: analyse your content
You might (rightly) ask why should you should analyse your content and what can you can gain from this. Many SEO specialists have used methods such as “keyword density”, “readability”, “word frequency”, and in some cases even LSA to find similarities between web[...]



