Anatomical entity mention recognition
This is the home page for AnatomyTagger, an open-source entity mention tagger for anatomical entities, the AnatEM anatomical entity mention corpus, and related open data resources presented in
- Sampo Pyysalo and Sophia Ananiadou (2013). Anatomical Entity Mention Recognition at Literature Scale. Bioinformatics.
if you use any of the tools and resources available from this page, please cite this paper.
Overview
The following tools and resources are provided:
- AnatEM corpus: 1212 documents manually annotated for anatomical entity mentions
- AnatomyTagger: Python implementation of anatomical entity mention tagger
AnatEM corpus

The extended Anatomical Entity Mention corpus (AnatEM) consists of 1212 documents (approx. 250,000 words) manually annotated to identify over 13,000 mentions of anatomical entities. Each annotation is assigned one of 12 granularity-based types such as Cellular component, Tissue and Organ, defined with reference to the Common Anatomy Reference Ontology. The corpus builds in part on two previously introduced resources, AnEM and MLEE. The corpus annotations were created using the brat annotation tool.
Download
-
AnatEM corpus version 1.0.2: AnatEM-1.0.2.tar.gz (8.6MB) (CC BY-SA license)>
The corpus distribution contains the annotations in the standoff format used by the brat tool, the column-based CoNLL format, and the NERsuite format.
AnatomyTagger
AnatomyTagger is a Python-based entity mention tagger implemented using the NERsuite named entity recognition toolkit. The tagger is provided with various lexical and corpus resources for anatomical entity tagging and is simple to train and to apply.
Download
-
AnatomyTagger version 0.9.0: anatomytagger-0.9.0.tar.gz (87MB) (MIT license)
References
These tools and resources were introduced in
- Sampo Pyysalo and Sophia Ananiadou. (2013). Anatomical Entity Mention Recognition at Literature Scale. Bioinformatics.
Please cite this paper if you use any of the tools and resources available from this page.
Featured News
- Invited talk at BioASQ 2023
- Prof. Ananiadou appointed as Senior Area Chair for ACL 2023 and IJCNLP-AACL 2023
- New Knowledge Transfer Partnership with 10BE5
- Panellist at Digital Trust and Society Forum 2023
- Chinese Government AwardAward for PhD student Tianlin Zhang
- Advances in Data Science and AI Conference 2023
- Keynote talk at EMBL-EBI industry club Machine Learning for Text Mining
- Talk at Open Data Science Conference (ODSC)
- BioLaySumm 2023 - Shared Task @ BioNLP 2023
- Prof. Ananiadou gives talk as distinguished speaker in the Women in AI speaker series
- Junichi Tsujii awarded Order of the Sacred Treasure, Gold Rays with Neck Ribbon
Other News & Events
- Keynote Talk at the Festival of AI
- Recent funding successes for Prof. Sophia Ananiadou
- New article on using neural architectures to aggregate sequence labels from multiple annnotators
- New article on improving biomedical extractive summarisation using domain knowledge
- New article on automated detection and analysis of depression and stress in social media data