<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Science for SEO &#187; Information/text analysis</title>
	<atom:link href="http://www.scienceforseo.com/category/informationtext-analysis/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.scienceforseo.com</link>
	<description>a bridge between worlds</description>
	<lastBuildDate>Sun, 08 Aug 2010 02:43:14 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Enabling Exploration Through Text Analytics</title>
		<link>http://www.scienceforseo.com/informationtext-analysis/enabling-exploration-through-text-analytics/</link>
		<comments>http://www.scienceforseo.com/informationtext-analysis/enabling-exploration-through-text-analytics/#comments</comments>
		<pubDate>Mon, 08 Jun 2009 13:14:55 +0000</pubDate>
		<dc:creator>CJ</dc:creator>
				<category><![CDATA[Information/text analysis]]></category>
		<category><![CDATA[Daniel Tunkelang]]></category>
		<category><![CDATA[text analytics summit]]></category>

		<guid isPermaLink="false">http://www.scienceforseo.com/?p=1016</guid>
		<description><![CDATA[I&#8217;m a huge fan of Daniel Tunkelang and his blog &#8220;The noisy channel&#8221;. Recently he spoke at the Text Analytics Summit and was kind enough to share his slides with us all. Check out the other available presentations as well; they&#8217;re very good. [...]]]></description>
		<wfw:commentRss>http://www.scienceforseo.com/informationtext-analysis/enabling-exploration-through-text-analytics/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Shiny new dataset</title>
		<link>http://www.scienceforseo.com/informationtext-analysis/shiny-new-dataset/</link>
		<comments>http://www.scienceforseo.com/informationtext-analysis/shiny-new-dataset/#comments</comments>
		<pubDate>Sat, 23 May 2009 08:26:52 +0000</pubDate>
		<dc:creator>CJ</dc:creator>
				<category><![CDATA[Information/text analysis]]></category>
		<category><![CDATA[computing]]></category>
		<category><![CDATA[data mining;]]></category>
		<category><![CDATA[data.gov]]></category>
		<category><![CDATA[datamining]]></category>
		<category><![CDATA[dataset]]></category>
		<category><![CDATA[Executive]]></category>
		<category><![CDATA[Federal Government]]></category>

		<guid isPermaLink="false">http://www.scienceforseo.com/?p=959</guid>
		<description><![CDATA[I thought I would do a little dance and give a little w00t for the new data repository that we now have access to. All researchers will be delighted to know that you can now access raw data from the Data.gov repository. I&#8217;m also pleased because we can suggest new datasets, which means this has [...]]]></description>
		<wfw:commentRss>http://www.scienceforseo.com/informationtext-analysis/shiny-new-dataset/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Semantic similarity revisited</title>
		<link>http://www.scienceforseo.com/informationtext-analysis/semantic-similarity-revisited/</link>
		<comments>http://www.scienceforseo.com/informationtext-analysis/semantic-similarity-revisited/#comments</comments>
		<pubDate>Thu, 30 Apr 2009 08:20:31 +0000</pubDate>
		<dc:creator>CJ</dc:creator>
				<category><![CDATA[Information/text analysis]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[car brand]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Measuring Semantic Similarity]]></category>
		<category><![CDATA[operating system]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[support vector machine]]></category>
		<category><![CDATA[svm]]></category>
		<category><![CDATA[Tokyo]]></category>
		<category><![CDATA[Univeristy Tokyo & AIST]]></category>
		<category><![CDATA[Web Search Engines;]]></category>

		<guid isPermaLink="false">http://www.scienceforseo.com/?p=829</guid>
		<description><![CDATA[I&#8217;m a big word fan, a pattern spotter, and an enthusiast of this paper: &#8220;Measuring Semantic Similarity between Words Using Web Search Engines&#8221; by Bollegala, Matsuo, Ishizuka (University of Tokyo &#38; AIST). It&#8217;s a way to measure how closely related words are using web search engines. The method uses a search engine, exploiting page counts and [...]]]></description>
		<wfw:commentRss>http://www.scienceforseo.com/informationtext-analysis/semantic-similarity-revisited/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>How does a search engine know what words mean?</title>
		<link>http://www.scienceforseo.com/informationtext-analysis/how-does-a-search-engine-know-what-words-mean/</link>
		<comments>http://www.scienceforseo.com/informationtext-analysis/how-does-a-search-engine-know-what-words-mean/#comments</comments>
		<pubDate>Tue, 17 Mar 2009 00:46:18 +0000</pubDate>
		<dc:creator>CJ</dc:creator>
				<category><![CDATA[Information retrieval]]></category>
		<category><![CDATA[Information/text analysis]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[Tutorials]]></category>
		<category><![CDATA[Adam Kilgarriff]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[Chair]]></category>
		<category><![CDATA[computing]]></category>
		<category><![CDATA[dialogue systems]]></category>
		<category><![CDATA[Lesk algorithm]]></category>
		<category><![CDATA[machine learning;]]></category>
		<category><![CDATA[machine translation systems]]></category>
		<category><![CDATA[machine translation;]]></category>
		<category><![CDATA[Mark A. Greenwood]]></category>
		<category><![CDATA[Mark Sanderson]]></category>
		<category><![CDATA[Mark Stevenson]]></category>
		<category><![CDATA[Microsoft;]]></category>
		<category><![CDATA[natural language processing techniques;]]></category>
		<category><![CDATA[natural language processing;]]></category>
		<category><![CDATA[possible solutions;]]></category>
		<category><![CDATA[precise solution]]></category>
		<category><![CDATA[RDF;]]></category>
		<category><![CDATA[Roberto Navigli]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[search space]]></category>
		<category><![CDATA[semantic network;]]></category>
		<category><![CDATA[Semantic web]]></category>
		<category><![CDATA[Sheffield University;]]></category>
		<category><![CDATA[speech systems]]></category>
		<category><![CDATA[Steven Pinker]]></category>
		<category><![CDATA[Ted Pederson]]></category>
		<category><![CDATA[Texas;]]></category>
		<category><![CDATA[University of Brighton]]></category>
		<category><![CDATA[University of Glasgow]]></category>
		<category><![CDATA[University of North Texas;]]></category>
		<category><![CDATA[University of Rome]]></category>
		<category><![CDATA[web ranges]]></category>
		<category><![CDATA[well known linguist]]></category>

		<guid isPermaLink="false">http://www.scienceforseo.com/?p=605</guid>
		<description><![CDATA[Word sense disambiguation (WSD) belongs to the field of computational linguistics. It&#8217;s the research area dedicated to finding ways for machines to understand the meaning of words. More precisely, it&#8217;s about determining the word sense of a particular word in a context. This is really important as without this, it&#8217;s difficult for search engines, machine [...]]]></description>
		<wfw:commentRss>http://www.scienceforseo.com/informationtext-analysis/how-does-a-search-engine-know-what-words-mean/feed/</wfw:commentRss>
		<slash:comments>24</slash:comments>
		</item>
		<item>
		<title>Text mine: analyse your content</title>
		<link>http://www.scienceforseo.com/informationtext-analysis/text-mine-analyse-your-content/</link>
		<comments>http://www.scienceforseo.com/informationtext-analysis/text-mine-analyse-your-content/#comments</comments>
		<pubDate>Thu, 12 Feb 2009 04:30:39 +0000</pubDate>
		<dc:creator>CJ</dc:creator>
				<category><![CDATA[Information/text analysis]]></category>
		<category><![CDATA[Tools]]></category>
		<category><![CDATA[concordance software;]]></category>
		<category><![CDATA[text-mining;]]></category>

		<guid isPermaLink="false">http://www.scienceforseo.com/?p=282</guid>
		<description><![CDATA[You might (rightly) ask why you should analyse your content and what you can gain from this. Many SEO specialists have used methods such as &#8220;keyword density&#8221;, &#8220;readability&#8221;, &#8220;word frequency&#8221;, and in some cases even LSA to find similarities between webpages. These methods are not wrong but used in isolation they are [...]]]></description>
		<wfw:commentRss>http://www.scienceforseo.com/informationtext-analysis/text-mine-analyse-your-content/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Information credibility analysis</title>
		<link>http://www.scienceforseo.com/uncategorized/information-credibility-analysis/</link>
		<comments>http://www.scienceforseo.com/uncategorized/information-credibility-analysis/#comments</comments>
		<pubDate>Wed, 28 Jan 2009 05:28:00 +0000</pubDate>
		<dc:creator>CJ</dc:creator>
				<category><![CDATA[Information/text analysis]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Author]]></category>
		<category><![CDATA[cancer treatment;]]></category>
		<category><![CDATA[cancer;]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[National Institute of Information and Communications Technology;]]></category>
		<category><![CDATA[Opinion mining]]></category>
		<category><![CDATA[Reading]]></category>
		<category><![CDATA[researcher]]></category>
		<category><![CDATA[Semantic web]]></category>
		<category><![CDATA[web docs;]]></category>

		<guid isPermaLink="false">http://www.scienceforseo.com/wordpress/?p=192</guid>
		<description><![CDATA[I wanted to draw a little attention to a Japanese project called the &#8220;Information Credibility Criteria Project&#8221;. The NICT (National Institute of Information and Communications Technology) started it in 2006. This project is all about looking at how information sources are not all equal in that they are written by different people who&#8230;are also not [...]]]></description>
		<wfw:commentRss>http://www.scienceforseo.com/uncategorized/information-credibility-analysis/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>G patent: identifying similar passages in text</title>
		<link>http://www.scienceforseo.com/uncategorized/g-patent-identifying-similar-passages-in-text/</link>
		<comments>http://www.scienceforseo.com/uncategorized/g-patent-identifying-similar-passages-in-text/#comments</comments>
		<pubDate>Fri, 23 Jan 2009 01:36:00 +0000</pubDate>
		<dc:creator>CJ</dc:creator>
				<category><![CDATA[Information/text analysis]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Author]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[passage mining engine;]]></category>
		<category><![CDATA[ranking algorithm]]></category>
		<category><![CDATA[Search engines]]></category>
		<category><![CDATA[shingles;]]></category>
		<category><![CDATA[web browsing;]]></category>

		<guid isPermaLink="false">http://www.scienceforseo.com/wordpress/?p=188</guid>
		<description><![CDATA[The patent entitled &#8220;Identifying and Linking Similar Passages in a Digital Text Corpus&#8221; was published on the 22nd of January and filed on the 20th of July 2007. It&#8217;s a really interesting one, not just because it covers a topic I&#8217;m particularly interested in but because it describes a very useful method for digital libraries in [...]]]></description>
		<wfw:commentRss>http://www.scienceforseo.com/uncategorized/g-patent-identifying-similar-passages-in-text/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

