A Corpus of Sentence-level Revisions in Academic Writing: A Step towards Understanding Statement Strength in Communication

Chenhao Tan, Lillian Lee
In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'2014) (short paper)

The strength with which a statement is made can have a significant impact on the audience. For example, international relations can be strained by how the media in one country describes an event in another; and papers can be rejected because they overstate or understate their findings. It is thus important to understand the effects of statement strength. A first step is to be able to distinguish between strong and weak statements. However, even this problem is understudied, partly due to a lack of data. Since strength is inherently relative, {\em revisions} of texts that make claims are a natural source of data on strength differences. In this paper, we introduce a corpus of sentence-level revisions from academic writing. We also describe insights gained from our annotation efforts for this task.

[Data(README)][PDF][Slides]

@inproceedings{tan+lee:14,
     author = {Chenhao Tan and Lillian Lee},
     title = {A Corpus of Sentence-level Revisions in Academic Writing: A Step towards Understanding Statement Strength in Communication},
     year = {2014},
     booktitle = {Proceedings of ACL (short paper)}
}

This work was supported in part by NSF grant IIS-0910664 and a Google Research Grant. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation or other sponsors.