Finding Good Conversations Online: The Yahoo News Annotated Comments Corpus

Publication
Apr 3, 2017
Abstract

This work presents a dataset and annotation scheme for the new task of identifying "good" conversations that occur online, which we call ERICs: Engaging, Respectful, and/or Informative Conversations. We develop a taxonomy to reflect features of entire threads and individual comments which we believe contribute to identifying ERICs; code a novel dataset of Yahoo News comment threads (2.4k threads and 10k comments) and 1k threads from the Internet Argument Corpus; and analyze the features characteristic of ERICs. This is one of the largest annotated corpora of online human dialogues, with the most detailed set of annotations. It will be valuable for identifying ERICs and other aspects of argumentation, dialogue, and discourse.

  • Linguistic Annotation Workshop (LAW 2017)
  • Conference/Workshop Paper
