Following are the slides of my paper presentation at COLING 2010 Workshop on The People’s Web Meets NLP: Collaboratively Constructed Semantic Resources
The paper deals with the Identification and Ranking of Topic Clusters in the blogosphere. Topic clusters represent in this paper the concept of grouping together blogs sharing a common interest i.e. topic. The algorithm takes into account both the hyperlinked social network of blogs along with the content in the blog posts. Topic-specific ranks are assigned to each blog in the cluster using a metric called “Topic Discussion Rank,” that helps in identifying the most influential blog for a specific topic. Experiments show that the presented method reaches a high level of accuracy.
This method proposed suggests a hybrid approach: first, content words of the blog posts are used to determine the relevance of a blog for a given concept, and thus the assignment to a "cluster". Second the links originating from the blog are used to rank the relevance of the blogs within the "cluster".
In the age of Web 2.0 the blogosphere has assumed a very significant role and it serves as an opinion dissemination medium; as such this research is part of a long-term project on blogosphere research and we would like to invite students/researchers who are interested in this area to collaborate with us. I recommend such people to contact me through email at email@example.com or firstname.lastname@example.org; or they can contact my colleague at email@example.com or firstname.lastname@example.org.
The full text of the paper can be downloaded from this link: paper.
Additionally a discussion on the workshop: collaboratively constructed semantic resources can be found here and your comments are also welcome.