Corpora and Data

Corpus of Job Offers in LIS

Description: 1,020 job offers collected from the website of a Spanish employment agency (documentacion.com.es), covering the period between 2006 and 2008.

Topic: Library and Information Science job offers

Date: Retrieved 31 March, 2008

Source: http://www.documentacion.com.es (the site ceased operating in 2008)

Languages: Spanish and Catalan

[We thank the documentacion.com.es webmasters, Javier Leiva and María José Sola, for authorizing its publication for research purposes.]

Download (compiled by J. Morato, J. A. Moreiro & S. Sanchez-Cuadrado)

EIREX: Information Retrieval Education through EXperimentation

The Information Retrieval Education through EXperimentation (EIREX) project consists of a series of experiments designed to foster new Information Retrieval (IR) education methodologies and resources, with the specific goal of teaching undergraduate IR courses from an experimental perspective. In short, the EIREX project studies how to adapt the techniques and methodologies used in large-scale research and industrial IR evaluation settings to the limited resources and scope of an undergraduate course. These experiments started in 2010, in the context of the IR course taught at the University Carlos III of Madrid.

The EIREX series has the following goals:

    • To help students get a view of the Information Retrieval process as they would find it in a real-world scenario, either industrial or academic.

    • To make students realize the importance of laboratory experiments in Computer Science, and to introduce them to running and analyzing such experiments.

    • To create a public repository of resources to teach Information Retrieval courses.

    • To seek the collaboration and active participation of other Universities in this endeavor.

More information on the motivation behind the EIREX experiments is available in the paper J. Urbano, M. Marrero, D. Martín and J. Morato, “Bringing Undergraduate Students Closer to a Real-World Information Retrieval Setting: Methodology and Resources”, ACM SIGCSE Annual Conference on Innovation and Technology in Computer Science Education, 2011.

EIREX Overview Reports

This website contains information on every EIREX edition, including all the data and the reports detailing each edition's development and results.

EIREX 2010 Track: Computing

Description: The first Information Retrieval Education through EXperimentation track, EIREX 2010, was run at the University Carlos III of Madrid. Twenty topics were created, which resulted in a total of 9,769 documents after crawling. Twelve systems were used to form the pools of documents to judge for each topic, and two people independently made judgments for most topics. The resulting biased collection contained 1,976 documents, and 24 systems from 8 student groups were evaluated with it.
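
The pooling step mentioned in the description can be illustrated with a short sketch. It shows depth-k pooling, a common way of forming judgment pools in IR evaluation: the top documents retrieved by each system for a topic are merged, and only the merged set is judged. The page does not state which pooling method or depth EIREX used, so the function name, the depth and the sample runs below are hypothetical.

```python
def build_pool(runs, depth):
    """Return the union of the top-`depth` documents of each system's ranking.

    `runs` maps a system name to its ranked list of document IDs for one topic.
    """
    pool = set()
    for ranking in runs.values():
        pool.update(ranking[:depth])
    return pool


# Hypothetical ranked lists for a single topic (document IDs are made up).
runs = {
    "system_a": ["d3", "d1", "d7", "d2"],
    "system_b": ["d1", "d4", "d3", "d9"],
    "system_c": ["d8", "d3", "d1", "d5"],
}

pool = build_pool(runs, depth=2)
print(sorted(pool))  # documents that would be sent to the assessors
```

Documents retrieved by several systems enter the pool only once, which is why the judged set is much smaller than the crawled collection.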

Access to the EIREX datasets is password protected. You may submit a request to receive access.

Refer to the data reference page for a description of the data and the format they follow.

Topic: Computing

Date: 2010 Spring Semester

Source:

Languages: English

Download

EIREX 2011 Track: Crowdsourcing

Description: The second Information Retrieval Education through EXperimentation track, EIREX 2011, was run at the University Carlos III of Madrid during the 2011 Spring Semester. The theme chosen for the test collection was Crowdsourcing. Twenty-three topics were created, which resulted in a total of 13,245 documents after crawling. Twelve systems were used to form the pools of documents to judge for each topic. The resulting biased collection contained 2,088 documents, and 15 systems from 5 student groups were evaluated with it.

Access to the EIREX datasets is password protected. You may submit a request to receive access.

Refer to the data reference page for a description of the data and the format they follow.

Topic: Crowdsourcing

Date: 2011 Spring Semester

Source:

Languages: English

License: http://creativecommons.org/licenses/by/3.0/

Download

EIREX 2012 Track: Social Media

Description: The third Information Retrieval Education through EXperimentation track, EIREX 2012, was run at the University Carlos III of Madrid during the 2012 Spring Semester. The theme chosen for the test collection was Social Media. Thirty-five topics were created, which resulted in a total of 22,402 documents after crawling. Twelve systems were used to form the pools of documents to judge for each topic. The resulting biased collection contained 3,738 documents, and 56 systems from 19 student groups were evaluated with it.

Access to the EIREX datasets is password protected. You may submit a request to receive access.

Refer to the data reference page for a description of the data and the format they follow.

Topic: Social Media

Date: 2012 Spring Semester

Source:

Languages: English

License: http://creativecommons.org/licenses/by/3.0/

Download