Sentiment analysis (also opinion retrieval/mining) is a very useful area of research as once fully functional it would enable us to determine the overall sentiment in text. We could for example determine automatically if product reviews are negative or positive, if a blog post is in agreement or d[...]
Posts Tagged ‘idf;’
Blogosphere vs Web – ranking issues
I came across a very cool paper from SIGKDD 2008 called “Blogosphere: Research Issues, Tools, and Applications” by Nitin Agarwal and Huan Liu from the University of Arizona. It’s an easy but long read, for the geek, but can also be quite happily understood by the layman. I̵[...]
Another statistical method for IR
Miles Efron published a paper entitled “An Approach to Information Retrieval Based on Statistical Model Selection” this August. He proposes to use statistical model selection for information retrieval. “The proposed approach offers two main contributions. First, we posit the [...]
Lemur toolkit
The Lemur toolkit is a natural language processing and information retrieval toolkit. Having a go on this is a nice way of seeing some IR technologies functioning first hand, rather than guessing on a major SE to observe the phenomenon. It supports all major languages, performs stemming using Porter[...]

