Evaluation Script

Additional Datasets

  • Spanish abstracts with ICD10 codes. Download it from Zenodo.
    It is a collection of abstracts in Spanish with ICD10 codes.
    It can be used to improve system training.
    For more information, see the Datasets page or Zenodo webpage.
  • Translated to Spanish corpus with ICD10 codes. Download it from Zenodo.
    It is a collection of abstracts in Spanish (machine-translated from English) with ICD10 codes.
    It can be used to improve system training.
  • English, Spanish and Portuguese clinical case reports (639 for each language) annotated with medical diagnostic codes from the ICD10-CM. Download from GitHub.
    It is a collection of abstracts in 3 languages with ICD10 codes.
    It can be used to improve system training.

Linguistic Resources

  • CUTEXT. See it on GitHub.
    Medical term extraction tool.
    It can be used to extract relevant medical terms from clinical cases.
  • SPACCC POS Tagger. See it on Zenodo.
    Part Of Speech Tagger for Spanish medical domain corpus.
    It can be used as a component of your system.
  • NegEx-MES. See it on Zenodo.
    A system for negation detection in Spanish clinical texts based on NegEx algorithm.
    It can be used as a component of your system.
  • AbreMES-X. See it on Zenodo.
    Software used to generate the Spanish Medical Abbreviation DataBase.
  • AbreMES-DB. See it on Zenodo.
    Spanish Medical Abbreviation DataBase.
    It can be used to fine-tune your system.
  • MeSpEn Glossaries. See it on Zenodo.
    Repository of bilingual medical glossaries made by professional translators.
    It can be used to fine-tune your system.

Terminological Resources

  • Valid codes for the task. Download it here.
  • Spanish Ministry of Health official CIE10 browser. See the documentation page for the 2018 version.
    ICD10 terminology in Spanish is named CIE10. It is managed by the Ministry of Health.
  • Diseases and symptoms lexicon with MESH and ICD10 codes. List of disease and symptoms terms extracted from Spanish clinical texts. They are mapped to MESH and from MESH to ICD10.
    It can be used as gazetteers or dictionaries.
    Download it here.
  • More soon published.

Established Coding Systems

Other Relevant Systems

  • Live demo of a NER for drug/chemical/gene in Spanish clinical texts, here.
  • NER for drug/chemical/gene with BERT on Spanish clinical texts, here.
  • Alternative NER for drug/chemical/gene on Spanish clinical documents, here.