ToxHabits Shared Task Homepage
The ToxHabits Track is organized by the Barcelona Supercomputing Center’s NLP for Biomedical Information Analysis group and promoted by Spanish and European projects such as DataTools4Heart, AI4HF and BARITONE.
To participate please see Registration page. If you are already registered, see the Data page to get the data for the task.
What is ToxHabits?
ToxHabits is a shared task and set of resources focused on the detection and characterization of substance use and abuse in clinical texts.
For more information about the ToxHabits task, check the Task Info tab, which includes the Motivation, Subtasks, Schedule, Registration and Submission pages.
To learn more about the ToxHabits corpus and how it was annotated, check the Data tab.
ToxHabits will be held as part of the BioCreative Workshop in the IJCAI 2025 conference. For more information about them, check the Workshop tab.
ToxHabits is organized by the Barcelona Supercomputing Center’s NLP for Biomedical Information Analysis group (formerly Text Mining Unit).
Related resources
At the NLP for Biomedical Information Analysis group (formerly Text Mining Unit), one of our missions is the open publication of datasets to train and benchmark biomedical information extraction, normalization and indexing systems. For that reason, we have released multiple datasets as part of shared tasks over the years. If you are interested in ToxHabits, you might want to take a look at some of our resources and competitions about:
- Clinical content extraction: DisTEMIST (diseases), MedProcNER/ProcTEMIST (clinical procedures), SympTEMIST (signs and findings), CANTEMIST (tumour morphology), CodiEsp (coding to ICD), PharmaCoNER (chemicals and proteins), LivingNER (species and humans), MultiCardioNER (diseases and medications, includes the DrugTEMIST corpus as well as cardiology-specific data)
- Sociodemographic content extraction: MEDDOPLACE (locations and more) MEDDOCAN (sensitive data), MEDDOPROF (occupations)
- Information extraction in social media: SocialDisNER (diseases), ProfNER (occupations)
- Linguistic aspects: BARR1 and BARR2 (abbreviation resolution)
- Machine Translation: ClinSpEn (EN<->ES clinical content translation)
- Clinical Text Summarization: MultiClinSum (new this year!)
Schedule
Event | Date (Midnight CEST) |
---|---|
ToxHabits registration | May 26th |
ToxHabits Train set release (see Data) | April 30th |
Annotation Guidelines release | TBA |
ToxHabits Test set release | May 26th |
ToxHabits Test Prediction submission | June 1st (Anywhere on Earth) |
ToxHabits results release | June 5th |
IJCAI’25 Workshop Team Invitation | June 6th |
BioCreative IX proceedings paper submission | June 16th |
BioCreative IX proceedings paper reviews | July 1st |
IJCAI 2025 | August 16th |
If you have any questions please contact the main organizers:
- Gabriel Vayá Abad (gvaya.bsc [at] gmail [dot] com)
- Wesam Alnabki (wesam.alnabki.bsc [at] gmail [dot] com)
- Martin Krallinger (krallinger.martin [at] gmail [dot] com)