Voz Project

Voz is a system that explores techniques for automatic extraction of narrative information from text. Voz combines off-the-shelf NLP tools, common sense knowledge databases and domain knowledge to extract a symbolic representation of a text and compute features related to narrative information.

Architecture

Voz implements an NLP pipeline reusing several components from open source, readily available NLP toolkits and knowledge bases. Voz is implemented in Java and Python (preview deployment, old version, unstable). Voz relies on several open source NLP toolkits (Parser Services) made available via a webservice (preview deployment, limited old version) available for download as a turnkey solution for Google App Engine (Webapp2).

Publications

For additional information or if you use any component from the system, please cite either of the papers below.

J. Valls-Vargas, J. Zhu, S. Ontañón (2015). Narrative Hermeneutic Circle: Improving Character Role Identification from Natural Language Text via Feedback Loops. IJCAI 2015. [PDF]

@inproceedings{valls2015ijcai,
Author = {Valls-Vargas, Josep and Onta{\~n}{\'o}n, Santiago and Zhu, Jichen},

Title = {Narrative Hermeneutic Circle: Improving Character Role Identification from Natural Language Text via Feedback Loops},

Year = {2015},

Booktitle = {Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence},

Pages = {2517--2523},
}

J. Valls-Vargas, J. Zhu, S. Ontañón (2014). Toward Automatic Role Identification in Unannotated Folk Tales. AIIDE 2014. [PDF]

@inproceedings{valls2014aiide,
Author = {Valls-Vargas, Josep and Zhu, Jichen and Onta{\~n}{\'o}n, Santiago},

Booktitle = {Proceedings of the Tenth Artificial Intelligence and Interactive Digital Entertainment Conference},
Title = {Toward Automatic Role Identification in Unannotated Folk Tales},
Year = {2014},
}

J. Valls-Vargas, S. Ontañón, J. Zhu (2014). Toward Automatic Character Identification in Unannotated Narrative Text. INT 7 at ELO 2014. [PDF]

@inproceedings{valls2014int,
Author = {Valls-Vargas, Josep and Onta{\~n}{\'o}n, Santiago and Zhu, Jichen},
Booktitle = {Proceedings of the Seventh Workshop in Intelligent Narrative Technologies},
Title = {Toward Automatic Character Identification in Unannotated Narrative Text},
Year = {2014},
}

Downloads

The system is currently under active development. Any updates will be posted in this page.

Voz
- ~~Online demo available:~~ ~~http://gameailab.appspot.com/~~ ~~(older version)~~
- ~~Source code:~~ ~~https://bitbucket.org/josepvalls/voz~~ ~~(older version)~~
- ~~Source code:~~ ~~https://bitbucket.org/josepvalls/voz2~~
- Source code: https://github.com/josepvalls/voz2b
Weka package implementing the continuous (or generalized) Jaccard distance [ZIP]
- How to install? In the package manager, select unofficial [PNG]
- How to use? Select it in an algorithm that uses a distance measure, i.e., IBk [PNG]
Parser Services: Webservice for Stanford Parser, Stanford CoreNLP, Apache OpenNLP and Berkeley Parser
- ~~Online demo available:~~ ~~http://129.25.12.216:8888~~
- ~~Source code:~~ ~~https://bitbucket.org/josepvalls/parserservices~~
- Source code: https://github.com/josepvalls/parserservices

Data

The following packages contain the datasets used in our publications.

Dataset used in our paper at INT 2014.
Dataset used in our paper at AIIDE 2014.
Dataset used in our paper at IJCAI 2015 (dataset available on request josep@valls.name).
Dataset used for our experiments in our user study (dataset available on request josep@valls.name).

Please note the dataset currently does not contain the full text of the stories.

Other

This is the link to our user study. We collected data on October 31st 2017. Responses after this date may not be considered. ~~http://129.25.12.216/~~

Page updated

Google Sites

Report abuse