RecipeRef Corpus for Modeling Anaphora Resolution from the Procedural Text of Recipes

Published:

Biaoyan Fang, Christian Druckenbrodt, Saber A. Akhondi, Camilo Thorne, Timothy Baldwin, Karin Verspoor (2022) RecipeRef Corpus for Modeling Anaphora Resolution from the Procedural Text of Recipes. Mendeley Data

@misc{reciperefdataset,
    author = {Fang, Biaoyan and Druckenbrodt, Christian and Akhondi, Saber A.\ and Thorne, Camilo and Baldwin, Timothy and Verspoor, Karin},
    title = {RecipeRef Corpus for Modeling Anaphora Resolution from the Procedural Text of Recipes,
    url = {https://doi.org/10.17632/rcyskfvdv7},
    howpublished = {Mendeley Data}}
}

Abstract

Current research on anaphora resolution mostly focuses on declarative text, e.g. journals, newswires, and dialogues. Procedural text, however, as one important type of text, contains rich anaphora phenomena but has only received limited attention. For procedural text comprehension, it is critical to resolving anaphora phenomena - resolution of anaphora in these texts is required to determine what action applies to which entity. To enrich the anaphora research in the procedural text, we investigate the anaphora phenomena in the procedural text of recipes. More specifically, we detailedly investigate the similarities and differences between recipes and chemical patents. We propose a generalized annotation guideline for the procedural text of recipes, adopting from chemical patents with domain adaptation. Based on the proposed guideline, we further create the RecipeRef corpus, in the hope that it will facilitate the research on anaphora resolution in procedural text.