New paper and resources to support anatomical entity recognition at literature scale
2013-10-28
We are pleased to announce that a new paper has been published, presenting a number of new tools and resources supporting the recognition of anatomical entities:
- AnatomyTagger, a new machine learning-based system for anatomical entity mention recognition that has been applied to annotate to automatically annotate the entire Open Access scientific domain literature.
- AnatEM: a corpus of 1200 documents manually annotated for 13,700 anatomical entity mentions
- Results of tagging all of the 600,000 PMC OA full-text documents, identifying 48M anatomical entity mentions
Paper
Sampo Pyysalo and Sophia Ananiadou (2013). Anatomical Entity Mention Recognition at Literature Scale. Bioinformatics.
Abstract
Motivation
Anatomical entities ranging from sub-cellular structures to organ systems are central to biomedical science, and mentions of these entities are essential to understanding the scientific literature. Despite extensive efforts to automatically analyse various aspects of biomedical text, there have been only few studies focusing on anatomical entities, and no dedicated methods for learning to automatically recognize anatomical entity mentions in free-form text have been introduced.
Results
We present AnatomyTagger, a machine learning-based system for anatomical entity mention recognition. The system incorporates a broad array of approaches proposed to benefit tagging, including the use of UMLS- and OBO-based lexical resources, word representations induced from unlabelled text, statistical truecasing, and non-local features. We train and evaluate the system on a newly introduced corpus that substantially extends on previously available resources, and apply the resulting tagger to automatically annotate the entire Open Access scientific domain literature. The resulting analyses have been applied to extend services provided by the Europe PMC literature database.
Availability
All tools and resources introduced in this work are available from http://nactem.ac.uk/anatomytagger
Previous item | Next item |
Back to news summary page |
Featured News
- ELLIS Workshop on Misinformation Detection - 16th June 2025
- 1st Workshop on Misinformation Detection in the Era of LLMs (MisD)- 23rd June 2025
- Prof. Sophia Ananiadou accepted as an ELLIS fellow
- Invited talk at the 15th Marbach Castle Drug-Drug Interaction Workshop
- BioNLP 2025 and Shared Tasks accepted for co-location at ACL 2025
- Prof. Junichi Tsujii honoured as Person of Cultural Merit in Japan
- Participation in panel at Cyber Greece 2024 Conference, Athens
- New Named Entity Corpus for Occupational Substance Exposure Assessment
Other News & Events
- CL4Health @ NAACL 2025 - Extended submission deadline - 04/02/2025
- Shared Task on Financial Misinformation Detection at FinNLP-FNP-LLMFinLegal
- FinNLP-FNP-LLMFinLegal @ COLING-2025 - Call for papers
- Keynote talk at Manchester Law and Technology Conference
- Keynote talk at ACM Summer School on Data Science, Athens