Programme

Proceedings: LREC 2018 – MultilingualBIO

14:00 – 14:10 – Welcome and presentation
Maite Melero

14:10 – 14:40 – Introductory talk
Aurélie Névéol: A scoping review of the literature on clinical Natural Language Processing in languages other than English. [Slides]

Abstract: Natural language processing applied to clinical text or aimed at a clinical outcome has been thriving in recent years. This presentation offers a broad overview of clinical Natural Language Processing (NLP) for languages other than English. It aims to provide insights to NLP researchers faced with establishing clinical text processing in a language other than English, as well as clinical informatics researchers and practitioners looking for resources in their languages in order to apply NLP techniques and tools to clinical practice or investigation. Recent studies are summarized to offer insights and outline opportunities in this area. Studies are classified into three groups: (i) studies describing the development of new NLP systems or components de novo, (ii) studies describing the adaptation of NLP architectures developed for English to another language, and (iii) studies focusing on a particular clinical application. The advantages and drawbacks of each method are discussed in light of the application context. Finally, the presentation will highlight major challenges and opportunities that will affect the impact of NLP on clinical practice and public health studies in a context that encompasses English as well as other languages.

Session 1
14:40 – 15:00
Salvador Medina and Jordi Turmo: Building a Spanish/Catalan Health Records Corpus with Very Sparse Protected Information Labelled. [PDF] [Slides]

15:00 – 15:20
Lingyi Tang, Jun Xu, Xinyue Hu, Qiang Wei and Hua Xu: Building a Biomedical Chinese-English Parallel Corpus from MEDLINE. [PDF] [Slides]

15:20 – 15:40
Marta Villegas, Ander Intxaurrondo, Aitor Gonzalez-Agirre, Montserrat Marimon and Martin Krallinger: The MeSpEN Resource for English-Spanish Medical Machine Translation and Terminologies: Census of Parallel Corpora, Glossaries and Term Translations. [PDF] [Slides]

15:40 – 16:00
Roland Roller, Madeleine Kittner, Dirk Weissenborn and Ulf Leser: Cross-lingual Candidate Search for Biomedical Concept Normalization. [PDF] [Slides]

16:00 – 16:30 – Coffee break

Session 2
16:30 – 16:50
Marta R. Costa-jussà, Noe Casas and Maite Melero: English-Catalan Neural Machine Translation in the Biomedical Domain through the cascade approach. [PDF] [Slides]

16:50 – 17:10
Olatz Perez-de-Viñaspre, Maite Oronoz and Natalia Elvira: KabiTermICD: Nested Term Based Translation of the ICD-10-CM into a Minor Language (Basque). [PDF] [Slides]

17:10 – 17:25
Martin Krallinger: Presentation of the 2nd Edition of the Biomedical Abbreviation Recognition and Resolution task (BARR2) at the forthcoming IberEval 2018. [Slides]

17:25 – 18:00 – Discussion and Closing