The evaluation of the competition will be done against a manually annotated data set purposely created for this task.
Submission format
Sub-track 1 DisTEMIST-Entities
Participating teams will have to generate a tsv file with the annotations detected for each of the test documents according to the following column structure:
- filename: Name of the file from which the disease mention has been extracted.
- mention label: In this case it will always be ENFERMEDAD
- starting character offset: Character number where the detected mention starts.
- ending character offset: Character number where the detected mention ends.
- mention span: Mention extracted from text
filename label off0 off1 span
es-S0004-06142007000600016-2 ENFERMEDAD 330 374 lesión nodular sólida en cabeza de epidídimo
es-S0004-06142007000600016-2 ENFERMEDAD 436 467 tumor adenomatoide de epidídimo
es-S0004-06142007000600016-5 ENFERMEDAD 1136 1167 tumor adenomatoide de epidídimo
Sub-track 2 Distemist-Linking
Participating teams will have to generate a tsv file with the annotations detected for each of the test documents according to the following column structure:
- filename: Name of the file from which the disease mention has been extracted.
- mention label: It will always be ENFERMEDAD
- starting character offset: Character number where the detected mention starts.
- ending character offset: Character number where the detected mention ends.
- mention span: Mention extracted from text
- code: List of Snomed-CT concept codes linked to the mention. If there is more than one code associated with a mention, they will be concatenated by the symbol “+”
filename label off0 off1 span code
es-S0004-06142007000600016-2 ENFERMEDAD 330 374 lagunaas coriorretinianas 302893000
es-S0004-06142007000600016-2 ENFERMEDAD 436 467 anomalías de la migración neuroral 253146009
es-S0004-06142007000600016-5 ENFERMEDAD 1136 1167 afectación macular 312999006
To evaluate your systems AFTER June 2022, do it in Codalab: https://codalab.lisn.upsaclay.fr/competitions/7162
The following sections are kept here for historical reasons. They are not relevant AFTER June 2022
Evaluation dates (OLD: shared task)
- 10 May 09:00 GMT – 15 May 09:00 GMT
Submission instructions (OLD: shared task)
Download here the submission tutorial
Download here a submission ZIP example.
To submit your results, (1) register at BioASQ website and (2), follow the following instructions:
A. 5 submissions per sub-track will be allowed.
B. You must submit ONE SINGLE ZIP file with the following structure:
- One subdirectory per subtask in which you are participating.
- In addition, in the parent directory, you must add a README.txt file with your contact details and a really short explanation of your system.
- If you have more than one system, you can include their predictions and we will evaluate them (up to 5 prediction runs).
- For each prediction run:
- You must include the tab-separated file with your predictions.
- One single file with all the predictions.
- With a .tsv file extension.
- If you have more than one system, include one tab-separated file for each system.
- If you have more than one system, name the tab-separated files with numbers and a more or less recognizable name. For example, 1-systemDL.tsv and 2-systemlookup.tsv
C. With this ZIP file, go to the participants area of BioASQ (http://participants-area.bioasq.org/Tasks/distemist/) and submit there your file.