MultiClinAI Shared Task Homepage
The MultiClinAI Track is organized by the Barcelona Supercomputing Center’s NLP for Biomedical Information Analysis group and promoted by European projects such as DataTools4Heart and AI4HF.
What is MultiClinAI?
MultiClinAI is a shared task focused on the creation of comparable multilingual corpora via annotation projection, as well as the multilingual extraction of clinical concepts.
For more information about the MultiClinAI task, check the Task Info tab, which includes the Motivation, Subtasks, Schedule and Registration, as well as the Evaluation & Submission tab.
To learn more about the MultiClinAI corpora and how they were annotated, check the Data tab.
MultiClinAI will be held as part of the #SMM4H-HeaRD Workshop in the ACL 2026 conference. For more information about them, check the Workshop tab.
MultiClinAI is organized by the Barcelona Supercomputing Center’s NLP for Biomedical Information Analysis group (formerly Text Mining Unit).
Registration
https://forms.gle/oE9gfaNxFw2f6gyX6
Schedule
| Event | Date (Midnight CET) |
|---|---|
| MultiClinNER subtask training set release | February 6, 2026 |
| MultiClinCorpus subtask training set release | February 6, 2026 |
| MultiClinNER test set release (only texts) | March 18, 2026 |
| MultiClinNER test set prediction submissions (CodaBench) | March 25, 2026 |
| MultiClinCorpus test set release (only texts) | March 27, 2026 |
| MultiClinCorpus test set prediction submissions (CodaBench) | April 9, 2026 |
| Result / evaluation returned to teams | April 14, 2026 |
| Participant proceedings due | April 24, 2026 |
| Notification of acceptance and participant proceedings reviews | May 15, 2026 |
| Camera-ready papers due | May 25, 2026 |
| ACL Proceedings due (hard deadline) | June 1, 2026 |
| Workshop | July 2–3, 2026 |
Related resources
At the NLP for Biomedical Information Analysis group (formerly Text Mining Unit), one of our missions is the open publication of datasets to train and benchmark biomedical information extraction, normalization and indexing systems. For that reason, we have released multiple datasets as part of shared tasks over the years. If you are interested in MultiClinAI, you might want to take a look at some of our resources and competitions about:
- Clinical content extraction: DisTEMIST (diseases), MedProcNER/ProcTEMIST (clinical procedures), SympTEMIST (signs and findings), CANTEMIST (tumour morphology), CodiEsp (coding to ICD), PharmaCoNER (chemicals and proteins), LivingNER (species and humans), MultiCardioNER (diseases and medications, includes the DrugTEMIST corpus as well as cardiology-specific data)
- Socio-demographic / Social Determinants of Health content extraction: MEDDOPLACE (locations and more) MEDDOCAN (sensitive data), MEDDOPROF (occupations), ToxHabits (extraction of substance use-related content)
- Information extraction in social media: SocialDisNER (diseases), ProfNER (occupations)
- Linguistic aspects: BARR1 and BARR2 (abbreviation resolution)
- Machine Translation: ClinSpEn (EN<->ES clinical content translation)
- Summarization: MultiClinSUM (multilingual summarization of clinical content)
Contact
- Salvador Lima-López, Barcelona Supercomputing Center (BSC), Spain: salvador.limalopez@gmail.com
- Fernando Gallego-Donoso, Barcelona Supercomputing Center (BSC), Spain: fgallegodonoso@gmail.com
Google Group: TBA