MultiCardioNER Shared Task Homepage
The MultiCardioNER Track is organized by the Barcelona Supercomputing Center’s NLP for Biomedical Information Analysis group and promoted by Spanish and European projects such as DataTools4Heart, AI4HF, BARITONE and AI4ProfHealth.
What is MultiCardioNER?
MultiCardioNER is a shared task and set of resources focused on the multilingual adaptation of clinical NER systems to the cardiology domain.
For more information about the MultiCardioNER task, check the Task Info tab, which includes the Motivation, Subtasks, Schedule, Registration and Submission pages.
To learn more about the MultiCardioNER corpus and how it was annotated, check the Data tab, including the Corpus Description, Annotation Guidelines and Download pages.
MultiCardioNER will be held as part of the BioASQ Workshop at the CLEF 2024 conference. For more information about the workshop and the conference, check the Workshop tab.
Important information
- Training and development sets for Tracks 1 and 2 are now out! https://zenodo.org/records/10948355
- The DrugTEMIST Annotation Guidelines are now live! https://zenodo.org/records/11065433
- The test sets for both Tracks are now out! https://zenodo.org/doi/10.5281/zenodo.10948354 Importantly, the test data is mixed with a background set, so participants must create predictions for all documents, although they will only be evaluated on the documents belonging to the test set. After the evaluation period is over, we will release the test set and its Gold Standard annotations separately, as well as provide more details about the background set documents.
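
To illustrate what the background set means in practice, here is a minimal sketch of scoring restricted to test-set documents. It is not the official evaluation script: the annotation tuple format, the `DISEASE` label, the document IDs, the function name `score_on_test_subset`, and the use of strict-match micro-averaged precision, recall and F1 are all assumptions made for the example.

```python
from typing import Iterable, Set, Tuple

# Hypothetical annotation format: (document_id, start_offset, end_offset, label).
Annotation = Tuple[str, int, int, str]


def score_on_test_subset(
    predictions: Iterable[Annotation],
    gold: Iterable[Annotation],
    test_doc_ids: Set[str],
) -> Tuple[float, float, float]:
    """Return (precision, recall, f1) using only test-set documents.

    Predictions on background documents are discarded before scoring,
    so they neither help nor hurt the result.
    """
    scored_preds = {p for p in predictions if p[0] in test_doc_ids}
    scored_gold = {g for g in gold if g[0] in test_doc_ids}

    # Strict matching (assumed): document, offsets and label must all agree.
    true_positives = len(scored_preds & scored_gold)
    precision = true_positives / len(scored_preds) if scored_preds else 0.0
    recall = true_positives / len(scored_gold) if scored_gold else 0.0
    denominator = precision + recall
    f1 = 2 * precision * recall / denominator if denominator else 0.0
    return precision, recall, f1


# Toy data: "bg7" stands for a background document and is ignored.
predictions = [
    ("doc1", 0, 8, "DISEASE"),
    ("doc1", 10, 15, "DISEASE"),
    ("bg7", 3, 9, "DISEASE"),
]
gold = [
    ("doc1", 0, 8, "DISEASE"),
    ("doc2", 4, 9, "DISEASE"),
]
precision, recall, f1 = score_on_test_subset(predictions, gold, {"doc1", "doc2"})
print(precision, recall, f1)
```

In this toy case the prediction on `bg7` is dropped, leaving two scored predictions, one of which matches the gold annotations. Participants cannot tell test and background documents apart during the evaluation period, which is why predictions are needed for every document.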
Related resources
At the NLP for Biomedical Information Analysis group (formerly Text Mining Unit), one of our missions is the open publication of datasets to train and benchmark biomedical information extraction, normalization and indexing systems. For that reason, we have released multiple datasets as part of shared tasks over the years. If you are interested in MultiCardioNER, you might want to take a look at some of our resources and competitions about:
- Clinical content extraction: DisTEMIST (diseases), MedProcNER/ProcTEMIST (clinical procedures), SympTEMIST (signs and findings), CANTEMIST (tumour morphology), CodiEsp (coding to ICD), PharmaCoNER (chemicals and proteins), LivingNER (species and humans)
- Sociodemographic content extraction: MEDDOPLACE (locations and more), MEDDOCAN (sensitive data), MEDDOPROF (occupations)
- Information extraction in social media: SocialDisNER (diseases), ProfNER (occupations)
- Linguistic aspects: BARR1 and BARR2 (abbreviation resolution)
- Machine Translation: ClinSpEn (EN<->ES clinical content translation)
Schedule
Event | Date (Midnight CEST) |
---|---|
MultiCardioNER Train+Dev Set Release | April 9th, 2024 |
DrugTEMIST Annotation Guidelines Release | April 25th, 2024 |
MultiCardioNER Test Set Texts Release | May 2nd, 2024 |
Participant Test Predictions Deadline | May 15th, 2024 |
Participant Evaluation Result Release | May 19th, 2024 |
Submission of Participant Papers Deadline | May 31st, 2024 |
Notification of Acceptance of Participant Papers | June 24th, 2024 |
Submission of Camera-ready Participant Papers Deadline | July 8th, 2024 |
BioASQ @ CLEF2024 | September 9th-12th, 2024 |