SympTEMIST Shared Task Homepage
The SympTEMIST Track is organized by the Barcelona Supercomputing Center’s NLP for Biomedical Information Analysis group and promoted by Spanish and European projects such as DataTools4Heart, AI4HF, BARITONE and AI4ProfHealth.
What is SympTEMIST?
SympTEMIST stands for SYMPtoms, signs and findings TExt MIning Shared Task . It is a shared task and set of resources focused on the detection and normalization of symptoms, signs and findings in medical documents in Spanish.
For more information about the SympTEMIST task, check the Task Info tab, which includes the Motivation, Subtasks, Schedule, Registration and Submission pages.
To learn more about the SympTEMIST corpus and how it was annotated, check the Data tab, including the Corpus Description, Annotation Guidelines (with example annotations and screenshots) and Download pages.
SympTEMIST will be held as part of BioCreative 2023. For more information about it, check the Workshop tab.
SympTEMIST is organized by the Barcelona Supercomputing Center’s NLP for Biomedical Information Analysis group (formerly Text Mining Unit).
Important information
- The corpus’ training set, multilingual silver standard and gazetteer, as well as the test set for all subtasks are now available on Zenodo.
- Submission information now available on the Submission page.
Related resources
At the NLP for Biomedical Information Analysis group (formerly Text Mining Unit), one of our missions is the open publication of datasets to train and benchmark biomedical information extraction, normalization and indexing systems. For that reason, we have released multiple datasets as part of shared tasks over the years. If you are interested in SympTEMIST, you might want to take a look at some of our resources and competitions about:
- Clinical content extraction: DisTEMIST (diseases), MedProcNER/ProcTEMIST (clinical procedures), CANTEMIST (tumour morphology), CodiEsp (coding to ICD), PharmaCoNER (chemicals and proteins)
- Sociodemographic content extraction: MEDDOPLACE (locations and clinical departments) MEDDOCAN (sensitive data), MEDDOPROF (occupations)
- Information extraction in social media: SocialDisNER (diseases), ProfNER (occupations)
- Linguistic aspects: BARR1 and BARR2 (abbreviation resolution)
- Machine Translation: ClinSpEn (EN<->ES clinical content translation)
Schedule
Event | Date (Midnight CEST) |
---|---|
SympTEMIST Annotation Guidelines Release | August 8th 2023 |
SympTEMIST Train Set Subtask 1 (NER) Release | August 8th, 2023 |
SympTEMIST Train Set Subtask 2 + 3 (Linking + Multilingual) Release | September 12th, 2023 |
SympTEMIST Gazetteer Release | September 12th, 2023 |
SympTEMIST Subtask 1 (NER) Test Set Release | September 30th 2023 |
Subtask 1 (NER) Test Predictions Deadline | October 6th 2023 |
Subtask 1 (NER) Evaluation Results Release | October 10th 2023 |
SympTEMIST Subtask 2 + 3 (Linking + Multilingual) Test Set Release | October 6th 2023 |
Subtask 2 + 3 (Linking + Multilingual) Test Predictions Deadline | October 13th 2023 |
Subtask 2 + 3 (Linking + Multilingual) Evaluation Release | October 17th 2023 |
Submission of Participant Papers Deadline | October 22nd 2023 |
Notification of Acceptance Participant Papers | October 30th 2023 |
Submission of Camera-ready Participant Papers Deadline | November 3rd 2023 |
BioCreative VIII @ AMIA 2023 | November 11-15 2023 |