Researches who study webspam are limited by the lack of corpus available. There is one that gets used quite often called “WEBSPAM-UK2007“, released by Yahoo. There’s also the 2006 version. It’s really useful but as they say, it was generated to aid the researchers so it[...]

