UW ACL data v1.0 (released April 2017) Distributed together with: Friendships, Rivalries, and Trysts: Characterizing Relations between Ideas in Texts Chenhao Tan, Dallas Card, Noah A. Smith In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL'2017) The paper, data, and associated materials can be found at: http://chenhaot.com/pages/idea-relations.html If you use this data, please cite: @inproceedings{tan+card+smith:17, author = {Chenhao Tan and Dallas Card and Noah A. Smith}, title = {Friendships, Rivalries, and Trysts: Characterizing Relations between Ideas in Texts}, year = {2017}, booktitle = {Proceedings of ACL} } and @inproceedings{Radev&al.09a, author = {Radev, Dragomir R. and Muthukrishnan, Pradeep and Qazvinian, Vahed}, title = {The {ACL} Anthology Network Corpus}, year = {2009}, booktitle = {Proceedings, ACL Workshop on Natural Language Processing and Information Retrieval for Digital Libraries} } The size of the compressed file is 92M. It includes 2 files in addition to this README: acl.jsonlist.gz This file contains the papers from the following conferences/journals: ACL, NAACL, EMNLP and TACL from 1979 to 2014. They were extracted from ACL Anthology Networks (http://clair.eecs.umich.edu/aan/index.php). acl_processed.jsonlist.gz This file is the same as acl.jsonlist.gz except that the 'text' field for each document has been tokenized and lemmatized. This file can be used to replicate the experiment in our paper. Please email any questions to: chenhao@chenhaot.com