A quick note: so far we have had about 185 dataset downloads!
The ICWSM-09 Data Challenge Workshop will be held May 20, 2009 in San Jose, California in conjunction with the Third AAAI International Conference on Weblogs and Social Media. Submissions for the workshop are due March 1, 2009. Please see http://www.icwsm.org/2009/
The ICWSM-09 Spinn3r blog dataset is a collection of 44 million blog posts made between August 1st and October 1st, 2008, and collected by Spinn3r.com. This dataset is freely available to researchers under a liberal data usage agreement.
Authors are invited to submit papers to a data challenge workshop to be held on the last day of ICWSM-09. This workshop will feature research papers as well as a wide-ranging discussion of data issues facing the social media research community. Good research topics might include:
* link analysis;
* social network extraction;
* clustering and topic identification;
* tracing the evolution of news;
* blog search and filtering;
* psychological, sociological, ethnographic, or personality-based studies;
* analysis of influence among bloggers;
* blog summarization and discourse analysis.
You should feel free to explore any aspect of the data that you feel would be of interest to the ICWSM community. An award will be presented at ICWSM for the best paper using the dataset.
Papers may be submitted online at http://www.easychair.org/conferences/?conf=icwsm09dcw.
Submissions may be up to 8 pages in length, must be in PDF format, and must follow the ICWSM formatting guidelines.
Best regards,
Ian Soboroff (NIST)
Akshay Java (Live Labs, Microsoft)
ICWSM-09 Data Chairs