The ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents

Published:

Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zenan Zhai, Zubair Afzal, Trevor Cohn, Timothy Baldwin and Karin Verspoor (2022) The ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents. In Proceedings of the 44rd European Conference on Information Retrieval (ECIR 2022), Stavanger, Norway.


@inproceedings{10.1007/978-3-030-99739-7_50,
author = {Li, Yuan and Fang, Biaoyan and He, Jiayuan and Yoshikawa, Hiyori and Akhondi, Saber A. and Druckenbrodt, Christian and Thorne, Camilo and Zhai, Zenan and Afzal, Zubair and Cohn, Trevor and Baldwin, Timothy and Verspoor, Karin},
title = {The ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents},
year = {2022},
isbn = {978-3-030-99738-0},
publisher = {Springer-Verlag},
address = {Berlin, Heidelberg},
url = {https://doi.org/10.1007/978-3-030-99739-7_50},
doi = {10.1007/978-3-030-99739-7_50},
abstract = {The discovery of new chemical compounds is a key driver of the chemistry and pharmaceutical industries, and many other industrial sectors. Patents serve as a critical source of information about new chemical compounds. The ChEMU (Cheminformatics Elsevier Melbourne Universities) lab addresses information extraction over chemical patents and aims to advance the state of the art on this topic. ChEMU lab 2022, as part of the 13th Conference and Labs of the Evaluation Forum (CLEF-2022), will be the third ChEMU lab. The ChEMU 2020 lab provided two information extraction tasks, named entity recognition and event extraction. The ChEMU 2021 lab introduced two more tasks, chemical reaction reference resolution and anaphora resolution. For ChEMU 2022, we plan to re-run all the four tasks with a new task on semantic classification for tables as the fifth one. In this paper, we introduce ChEMU 2022, including its motivation, goals, tasks, resources, and evaluation framework.},
booktitle = {Advances in Information Retrieval: 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II},
pages = {400–407},
numpages = {8},
keywords = {Reaction reference resolution, Anaphora resolution, Table classification, Chemical patents, Named entity recognition, Text mining, Event extraction},
location = {Stavanger, Norway}
}


Abstract

The discovery of new chemical compounds is a key driver of the chemistry and pharmaceutical industries, and many other industrial sectors. Patents serve as a critical source of information about new chemical compounds. The ChEMU (Cheminformatics Elsevier Melbourne Universities) lab addresses information extraction over chemical patents and aims to advance the state of the art on this topic. ChEMU lab 2022, as part of the 13th Conference and Labs of the Evaluation Forum (CLEF-2022), will be the third ChEMU lab. The ChEMU 2020 lab provided two information extraction tasks, named entity recognition and event extraction. The ChEMU 2021 lab introduced two more tasks, chemical reaction reference resolution and anaphora resolution. For ChEMU 2022, we plan to re-run all the four tasks with a new task on semantic classification for tables as the fifth one. In this paper, we introduce ChEMU 2022, including its motivation, goals, tasks, resources, and evaluation framework.