New homepage for the GENIA project and biomedical annotated corpora
2011-12-22
We are pleased to announce a new website for the GENIA project: http://www.nactem.ac.uk/genia/.
The GENIA project has been running since 1998, and the new website contains information about the following:
- The GENIA corpus - the primary resource created by the GENIA project. The corpus is intended to support the development and evaluation of information extraction and text mining systems for the domain of molecular biology. It consists of 1,999 MEDLINE abstracts, which have been annotated with several levels of linguistic and semantic information: parts of speech, syntax, terms, events, relations and coreference. The corpus can be downloaded from the website.
- Shared tasks - The GENIA project initiated the BioNLP Shared Task series and has organised a number of tasks across three shared task events: the BioNLP/JNLPBA Shared Task 2004, and the BioNLP Shared Tasks of 2009 and 2011.
- Other GENIA project corpora - A number of additional corpora have been annotated using extensions of the GENIA/BioNLP Shared Task event representation. These consist of event corpora of protein post-translational modifications (PTM), Type IV secretion systems, DNA methylation, mTOR pathways and "Exhaustive PTM".
- Related efforts - These include the meta-knowledge corpus, an extension of the GENIA event corpus that adds annotation about how events are to be interpreted according to their textual context.
Information about tools developed to perform automatic annotation, through training on the GENIA corpus, will be added to the site shortly.