PersonaBank

If you use this data in your research, please refer to and cite: Lukin, Stephanie M., Bowden, Kevin, Barackman, Casey, Walker, Marilyn A. A Corpus of Personal Narratives and Their Story Intention Graphs. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC), Portorož, Slovenia, 2016.

Overview: We present a new corpus, PersonaBank, consisting of 108 personal stories from weblogs that have been annotated with their Story Intention Graphs, a deep representation of the fabula of a story. We describe the topics of the stories and the basis of the Story Intention Graph representation, as well as the process of annotating the stories to produce the Story Intention Graphs and the challenges of adapting the tool to this new personal narrative domain. We also discuss how the corpus can be used as a content planner or in applications that retell a story in different styles or as co-tellings.

The Data: The download includes

  • 108 blog stories and their corresponding .VGL and Scheherazade realization files
  • Tutorial for blog annotation in Scheherazade
  • Spreadsheet describing the stories, their topics, polarity, and an excerpt of the story
  • Spreadsheet describing the topic distribution
  • Readme file
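
After unpacking the download, a quick sanity check is to count the files by extension and confirm that 108 .VGL files are present. The following is only a minimal sketch: the function names and the folder name in the usage note are hypothetical, and the actual directory layout of the download is not described on this page, so the code simply walks every subfolder.

```python
from collections import Counter
from pathlib import Path


def count_by_extension(root):
    """Count all files under `root` (recursively) by lowercased extension."""
    counts = Counter()
    for path in Path(root).rglob("*"):
        if path.is_file():
            counts[path.suffix.lower()] += 1
    return counts


def has_expected_vgl_files(root, expected=108):
    """Return True if the number of .vgl files under `root` equals `expected`.

    The default of 108 comes from the story count stated on this page;
    matching is case-insensitive, so .VGL and .vgl are both counted.
    """
    return count_by_extension(root)[".vgl"] == expected
```

For example, `has_expected_vgl_files("PersonaBank")` would check a folder named PersonaBank in the current directory, assuming that is where the download was unpacked.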

Related Works:

Works that use this corpus:

Download: Fill out the following form to download the PersonaBank Corpus.

Contact: Please direct questions to Stephanie Lukin: slukin [at] soe [dot] ucsc [dot] edu

Version 1.0 May 17, 2016. Website last updated May 26, 2016.
