Archive for the ‘Information/text analysis’ Category

June 8th, 2009 - 11:14 pm § in Information/text analysis

Enabling Exploration Through Text Analytics

I’m a huge fan of Daniel Tunkelang and his blog “The noisy channel“. Recently he spoke at the Text Analytics Summit and was kind enough to share his slides with us all. Check out the other available presentations as well, they’re very good. Enabling Exploration Through Text A[...]

May 23rd, 2009 - 6:26 pm § in Information/text analysis

Shiny new dataset

I thought I would do a little dance a give a little w00t for the new data repository that we now have access to. All researchers will be delighted to know that you can now access raw data from the Data.gov repository. I’m also pleased because we can suggest new datasets which means this has [.[...]

April 30th, 2009 - 6:20 pm § in Information/text analysis, Search engines

Semantic similarity revisited

I’m a big word fan, a pattern spotter, and an enthusiast of this paper: “Measuring Semantic Similarity between Words Using Web Search Engines” by Bollegala, Matsuo, Ishizuka (Univeristy Tokyo & AIST). It’s a way to meaure how closely relate words are using web search engi[...]

February 12th, 2009 - 2:30 pm § in Information/text analysis, Tools

Text mine: analyse your content

You might (rightly) ask why should you should analyse your content and what can you can gain from this.  Many SEO specialists have used methods such as “keyword density”, “readability”, “word frequency”, and in some cases even LSA to find similarities between web[...]

January 28th, 2009 - 3:28 pm § in Information/text analysis, Uncategorized

Information credibility analysis

I wanted to draw a little attention to a Japanese project called the “Information Credibility Criteria Project“.  The NICT (National Institute of Information and Communications Technology) started it in 2006.  This project is all about looking at how information sources are not all eq[...]

January 23rd, 2009 - 11:36 am § in Information/text analysis, Uncategorized

G patent: identifying similar passages in text

The patent entitled “Identifying and Linking Similar Passages in a Digital Text Corpus” was published on the 22nd of January and filed on the 20th July 2007. It’s a really interesting one, not just because it covers a topic I’m particularly interested in but because it descr[...]





© 2009-2010 Science for SEO All Rights Reserved -- Copyright notice by Blog Copyright

SEO Powered by Platinum SEO from Techblissonline

Twitter links powered by Tweet This v1.6.1, a WordPress plugin for Twitter.