In the 4th ICWSM 2023 data challenge, we invite papers that model and contribute to understanding the temporal and social dynamics of the social tasks in the provided datasets, or identify other important related dimensions to study the temporal effect on those datasets.
We welcome submissions on various topics that address the temporal shift of data. The data challenge includes three tracks, 1) Time-aware models and social trends, 2 ) Temporal dataset, and 3) Non- archival track.
Track 1: Time-Aware Models and social trends ( archival)
Task1 : Leveraging the evolution in time for specific task.
We encourage submissions that characterize the incorporation of temporal data in different social-based tasks, both at data and model level. Tasks may include, but are not limited to, modeling temporal characteristics of hate speech, using different versions of edited Wikipedia articles, enhanced tone detection, the effectiveness of temporal data in detecting the veracity of rumors, and stance detection dynamics. Submitted papers may focus on evaluating models' performance when considering the time variable, studying the evolution of specific phenomena, examining the distribution shift between the training data and live update data, or focusing on specific concept shifts.
Task2: Studying trends and social change.
We especially welcome contributions that examine the transitions in community or content based on social data (i.e., temporal networks or content) to analyze some social phenomena. For example, studying context or static embedding to analyze content across time and communities where the main goal could be to study changes in descriptions of genders and ethnic groups, representation of people using contextualized semantics, analyzing factors that cause the formation and persistence of trends, the dynamics of sentiment and topics, or the manifestation of different social phenomena across communities.
For this data challenge, we ask authors to use datasets of their choice, selected from those released as ICWSM Dataset Papers between 2009 and 2022. You can find a list of these papers/ datasets categorized by topic in the table below. The work should be in compliance with The FAIR Data Principles, as submitted work can use one of these datasets as baseline and collect the temporal data using open resources tools such as archive.org, or any other archival repositories.
Participants are also welcome to use their own or other open datasets, such as pushshift Reddit (2005-2019), Wikipedia historical archive, and Diachronic Language Models from Twitter (2019-2022). There are other possible alternatives to construct a temporal dataset for your topic of interest using one of the available features provided by various platforms' APIs. In general, the datasets need to comply with the platform API regulations and follow the general principle that guarantees the transparency of datasets, such as datasheets for datasets.
Access to the set of proposed datasets for this data challenge from [here].
Track 2: Temporal Dataset (archival)
Participants are also welcome to submit their own temporal dataset and will be part of the full proceedings. The work should be in compliance with The FAIR Data Principles,
Datasets and metadata must be published using a dataset sharing service (e.g. Zenodo , datorium , dataverse , or any other dataset sharing services that index your dataset and metadata and increase the re-findability of the data) that provides a DOI for the dataset, which should be included in the dataset paper submission. Ethical considerations must be discussed and datasets need to comply with the platform API regulations and follow the general principle that guarantees the transparency of datasets, such as datasheets for datasets.
Track 3: Non-Archival
The non-archival track seeks recently accepted/published work as well as work-in-progress. It does not need to be anonymized and will not go through the review process. The submission should clearly indicate the original venue and will be accepted. Non-archival submissions won't be included in the proceeding, rather it going to be given a (presentation or poster slot) based on the organizers' evaluation and the alignment of the work with the data challenge's main theme.
In case the work has been published/or is undersubmission, then submit the pdf file version of the work (i.e. no need to resubmit in the data challenge format). Follow the submission website and make sure you select a non-archival track to upload your pdf file.
The data challenge is open to everyone.
Submission should be made via EasyChair and must follow the formatting guidelines for ICWSM-2023, all archival submissions will follow we will follow double-blind review process.
All submissions must be anonymous and conform to AAAI standards for double-blind review. Both short papers (4 pages including references) and posters (2 pages including references) that adhere to the 2-column AAAI format will be considered for review. All papers must be submitted as high-resolution PDF files, formatted in AAAI two-column, camera-ready style, for US Letter (8.5" x 11") paper, using Type 1 or TrueType fonts (available templates: AAAI 2023 Author Kit on Overleaf or AAAI 2023 Author Kit.zip [Word | LaTeX])
Submission Website: https://easychair.org/conferences/?conf=icwsm-dc2023