Submission
Submissions will be evaluated against a manually annotated data set created specifically for this task.
Submission Format
Both subtasks, ToxNER (Subtask 1) and ToxUse (Subtask 2), share the same submission format.
Participating teams must generate a TSV file for each subtask containing the annotations detected in the test documents, following the column structure below:
- File name (filename): Name of the file from which the entity mention has been extracted
- Mention label (label): The category of the entity.
  - For ToxNER (Subtask 1), the possible labels are: Tobacco, Alcohol, Drug, Cannabis
  - For ToxUse (Subtask 2), the possible labels are: Amount, Type, Frequency, Duration, Method, History
- Starting character offset (off0): Character index where the detected mention starts.
- Ending character offset (off1): Character index where the detected mention ends.
- Mention span (span): Mention extracted from the text (i.e., the text of the entity)
Example (submission.tsv):
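As a minimal sketch of the column structure above, the following Python snippet builds the contents of one run file. The document name, the sentence, and the mentions are hypothetical and invented purely for illustration; they are not taken from the task data. The snippet also assumes that off1 is an exclusive end offset (off0 plus the mention length), which the description above does not specify, so check the official guidelines before relying on it.

```python
import csv
import io

# Column names as listed in the submission format.
COLUMNS = ["filename", "label", "off0", "off1", "span"]

# Hypothetical document, invented for illustration only.
doc_name = "example_doc.txt"
doc_text = "Patient reports daily tobacco and alcohol use."


def make_row(filename, label, text, mention):
    """Build one annotation row for the first occurrence of `mention` in `text`."""
    off0 = text.index(mention)
    # Assumption: the end offset is exclusive (start + length).
    off1 = off0 + len(mention)
    return {"filename": filename, "label": label,
            "off0": off0, "off1": off1, "span": mention}


def to_tsv(rows):
    """Serialize rows as tab-separated text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS,
                            delimiter="\t", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = [
    make_row(doc_name, "Tobacco", doc_text, "tobacco"),
    make_row(doc_name, "Alcohol", doc_text, "alcohol"),
]
submission_tsv = to_tsv(rows)
print(submission_tsv)
```

Writing the returned string to a .tsv file gives one row per detected mention, preceded by the header row.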

Submission Procedure
The submissions must be sent via email to < wesam.alnabki.bsc [at] gmail.com >.
All submissions must follow the instructions below:
- Submission Limit:
  - Each team may submit up to 5 runs per subtask (ToxNER and/or ToxUse).
- Submission Format:
  - Submit one single ZIP file containing your runs with the following structure:
    - A subdirectory for each subtask you are participating in (e.g., ToxNER/, ToxUse/).
    - A README.txt file in the root directory including the following:
      - Team name (same as registered)
      - Team contact person (name and email)
      - Concise description of each run
      - Any extra resources used (e.g., dictionaries, external data, pre-trained models)
- Naming Runs:
  - If you submit multiple runs for a subtask, name the TSV files clearly and sequentially:
    - Example: 1-baseline.tsv, 2-bert-model.tsv, etc.
- ZIP File Naming Convention:
  - Your ZIP file name must include:
    - Team name (as used in registration)
    - Subtask(s) covered
    - Submission date as a timestamp (YYYYMMDD)
    - Example:
      - toxhabits_teamname_toxner_20240528.zip
      - toxhabits_teamname_toxuse_20240528.zip
- TSV File Requirements:
  - Submissions must be in tab-separated files (.tsv) with the following:
    - One row per detected mention
    - The required columns: filename, label, off0, off1, span
    - A header row with column names
    - One file per system/run
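The packaging rules above can be sketched in Python as a pre-submission check. This is an illustrative helper, not an official validator: the function names check_tsv, zip_name, and build_zip are hypothetical, the ZIP name pattern is inferred from the two example names, and the column order is assumed to match the order in which the columns are listed.

```python
import csv
import io
import re
import zipfile

REQUIRED_COLUMNS = ["filename", "label", "off0", "off1", "span"]
LABELS = {
    "ToxNER": {"Tobacco", "Alcohol", "Drug", "Cannabis"},
    "ToxUse": {"Amount", "Type", "Frequency", "Duration", "Method", "History"},
}


def check_tsv(tsv_text, subtask):
    """Return a list of problems found in one run file (empty if none)."""
    rows = list(csv.reader(io.StringIO(tsv_text), delimiter="\t"))
    # Assumption: the header lists the columns in the documented order.
    if not rows or rows[0] != REQUIRED_COLUMNS:
        return ["missing or unexpected header row"]
    problems = []
    for number, row in enumerate(rows[1:], start=2):
        if len(row) != len(REQUIRED_COLUMNS):
            problems.append(f"line {number}: expected 5 columns, got {len(row)}")
            continue
        _, label, off0, off1, _ = row
        if label not in LABELS[subtask]:
            problems.append(f"line {number}: unknown label {label!r}")
        if not (off0.isdigit() and off1.isdigit()):
            problems.append(f"line {number}: offsets must be non-negative integers")
    return problems


def zip_name(team, subtask, date):
    """Build a ZIP name; the pattern is inferred from the example names."""
    if not re.fullmatch(r"\d{8}", date):
        raise ValueError("date must be a YYYYMMDD timestamp")
    return f"toxhabits_{team}_{subtask.lower()}_{date}.zip"


def build_zip(runs, readme_text):
    """Pack runs given as {subtask: {run_file_name: tsv_text}} into ZIP bytes."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        archive.writestr("README.txt", readme_text)  # README in the root
        for subtask, files in runs.items():
            if len(files) > 5:
                raise ValueError(f"{subtask}: at most 5 runs per subtask")
            for name, tsv_text in files.items():
                problems = check_tsv(tsv_text, subtask)
                if problems:
                    raise ValueError(f"{subtask}/{name}: {problems}")
                archive.writestr(f"{subtask}/{name}", tsv_text)
    return buffer.getvalue()
```

A team would call build_zip with its run files and README text, then write the returned bytes to the file name produced by zip_name before emailing the archive.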