{"id":3932,"date":"2019-09-19T10:23:41","date_gmt":"2019-09-19T10:23:41","guid":{"rendered":"http:\/\/temu.bsc.es\/meddocan\/?p=3932"},"modified":"2021-02-28T20:11:45","modified_gmt":"2021-02-28T20:11:45","slug":"annotation-guidelines","status":"publish","type":"post","link":"https:\/\/temu.bsc.es\/smm4h-spanish\/?p=3932","title":{"rendered":"Annotation Guidelines"},"content":{"rendered":"\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\"><p><a rel=\"noreferrer noopener\" href=\"https:\/\/doi.org\/10.5281\/zenodo.4309356\" target=\"_blank\">Training and validation (annotated), test and background (unannotated) datasets<\/a><\/p><p><a rel=\"noreferrer noopener\" href=\"https:\/\/doi.org\/10.5281\/zenodo.4306016\" target=\"_blank\">Guidelines<\/a><\/p><\/blockquote>\n\n\n\n<p>The SMM4H-Spanish corpus was manually annotated by expert linguists following the SMM4H-Spanish guidelines. These guidelines contain rules for annotating professions, employment statuses and work-related activities in health-related tweets in Spanish. They also include some considerations regarding the codification of the annotations to the ESCO and SNOMED-CT taxonomies.<\/p>\n\n\n\n<p>The guidelines were created de novo in three phases:<\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>First, a <em>zero version<\/em> of the guidelines was developed after annotating an initial batch of ~200 tweets and outlining the main problems and difficulties of the data.<\/li><li>Second, a <em>stable version<\/em> of the guidelines was reached by iteratively annotating sample sets of the ProfNER corpus until quality control was satisfactory.<\/li><li>Third, the guidelines are iteratively <em>refined<\/em> as manual annotation continues.
<\/li><\/ol>\n\n\n\n<p>The annotation guidelines are available in <a href=\"https:\/\/doi.org\/10.5281\/zenodo.4306016\" target=\"_blank\" rel=\"noreferrer noopener\">Spanish here<\/a> and in <a href=\"https:\/\/doi.org\/10.5281\/zenodo.4479740\" target=\"_blank\" rel=\"noreferrer noopener\">English here<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Training and validation (annotated), test and background (unannotated) datasets Guidelines The SMM4H-Spanish corpus was manually annotated by expert linguists following the SMM4H-Spanish guidelines. These guidelines contain rules for annotating professions, employment statuses and work-related activities in health-related tweets in Spanish. They also include some considerations regarding the codification of the annotations to the ESCO [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[],"class_list":["post-3932","post","type-post","status-publish","format-standard","hentry","category-data"],"_links":{"self":[{"href":"https:\/\/temu.bsc.es\/smm4h-spanish\/index.php?rest_route=\/wp\/v2\/posts\/3932","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/temu.bsc.es\/smm4h-spanish\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/temu.bsc.es\/smm4h-spanish\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/temu.bsc.es\/smm4h-spanish\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/temu.bsc.es\/smm4h-spanish\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=3932"}],"version-history":[{"count":11,"href":"https:\/\/temu.bsc.es\/smm4h-spanish\/index.php?rest_route=\/wp\/v2\/posts\/3932\/revisions"}],"predecessor-version":[{"id":4546,"href":"https:\/\/temu.bsc.es\/smm4h-spanish\/index.php?rest_route=\/wp\/v2\/po
sts\/3932\/revisions\/4546"}],"wp:attachment":[{"href":"https:\/\/temu.bsc.es\/smm4h-spanish\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=3932"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/temu.bsc.es\/smm4h-spanish\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=3932"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/temu.bsc.es\/smm4h-spanish\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=3932"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}