{"id":3966,"date":"2019-09-19T12:27:20","date_gmt":"2019-09-19T12:27:20","guid":{"rendered":"http:\/\/temu.bsc.es\/meddocan\/?p=3966"},"modified":"2019-09-19T12:27:20","modified_gmt":"2019-09-19T12:27:20","slug":"examples","status":"publish","type":"post","link":"https:\/\/temu.bsc.es\/meddocan\/index.php\/examples\/","title":{"rendered":"Examples"},"content":{"rendered":"\n<p>In this section, we show examples of how the evaluation will be carried out to make it clearer. For the following examples, we will consider that this set of MEDDOCAN tags is our Gold Standard (GS):<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"1024\" height=\"146\" src=\"http:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-tags-1024x146.png\" alt=\"\" class=\"wp-image-3281\" srcset=\"https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-tags-1024x146.png 1024w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-tags-300x43.png 300w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-tags-768x110.png 768w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-tags.png 1066w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><figcaption>Example of tags in a Gold Standard file<br><\/figcaption><\/figure>\n\n\n\n<p>This GS file is in <em>i2b2<\/em> format. We have a <em>DATE<\/em> entity, a <em>CONTACT<\/em> entity, four <em>LOCATION<\/em> entities, and a <em>NAME<\/em> entity. Each entity tag is composed of the ID, the <em>START<\/em> and <em>END<\/em> offsets, the text snippet between the previous offsets, the entity <em>TYPE<\/em> and a <em>COMMENT<\/em>.<\/p>\n\n\n\n<p><strong>NOTE:<\/strong> The <em>ID<\/em>, <em>TEXT<\/em>, and <em>COMMENT<\/em> fields are not used for any of the evaluation metrics.  
The number in the <em>ID<\/em> field and the content of the <em>COMMENT<\/em> field are arbitrary, and the evaluation of the <em>TEXT<\/em> field is implicit in the offset evaluation, as the text is the same for the GS and the systems.<\/p>\n\n\n\n<h2><strong>Sub-track 1: NER offset<\/strong><\/h2>\n\n\n\n<p>In this sub-track the goal is to match exactly the <strong>beginning<\/strong> and <strong>end<\/strong> offsets of each PHI entity tag, and to detect the annotation <strong>type<\/strong> correctly. The following system annotations will be accepted by the evaluation script even though the numbers in the ID fields differ and additional comments are included.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"1016\" height=\"102\" src=\"http:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddoca-ex2.png\" alt=\"\" class=\"wp-image-3672\" srcset=\"https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddoca-ex2.png 1016w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddoca-ex2-300x30.png 300w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddoca-ex2-768x77.png 768w\" sizes=\"(max-width: 1016px) 100vw, 1016px\" \/><figcaption>Example of tags in a system submission file<\/figcaption><\/figure>\n\n\n\n<p>For this example, the system obtains the following scores:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"223\" height=\"56\" src=\"http:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex2.png\" alt=\"\" class=\"wp-image-3288\"\/><\/figure>\n\n\n\n<p><strong><em>Precision<\/em><\/strong> is computed by dividing the true positives (4) by the sum of true positives and false positives (0), scoring <em><strong>4\/(4+0) = 1.0<\/strong><\/em>. 
<strong><em>Recall<\/em><\/strong> is computed by dividing the true positives by the sum of true positives and false negatives (3: the tags with IDs T1, T5, and T7 in the GS), scoring <em><strong>4\/(4+3) = 0.5714<\/strong><\/em>. Finally, <strong><em>F1<\/em><\/strong> is the harmonic mean of precision and recall, scoring <em><strong>2*((1*0.5714)\/(1+0.5714)) = 2*(0.5714\/1.5714) = 0.7273<\/strong><\/em>.<\/p>\n\n\n\n<p><strong>NOTE:<\/strong> This is just an example. We are aware that achieving a precision score of 1.0 is quite a difficult task.<\/p>\n\n\n\n<h2><strong>Sub-track 2: Sensitive span detection<\/strong><\/h2>\n\n\n\n<p>For this second sub-track the goal is&nbsp;only to identify sensitive data. Thus, we will consider a span-based evaluation, regardless of the actual type of entity or the correct offset identification of multi-token sensitive phrase mentions.<\/p>\n\n\n\n<h4><strong>Strict span evaluation<\/strong><\/h4>\n\n\n\n<p>In the strict evaluation&nbsp;metric, the evaluation script will accept as correct those annotations in the submission that <strong>match exactly the start and end offsets<\/strong> of the annotations in the GS file. 
For instance, it will accept these tags as correct:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"1016\" height=\"153\" src=\"http:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddocan-ex3-1.png\" alt=\"\" class=\"wp-image-3674\" srcset=\"https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddocan-ex3-1.png 1016w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddocan-ex3-1-300x45.png 300w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddocan-ex3-1-768x116.png 768w\" sizes=\"(max-width: 1016px) 100vw, 1016px\" \/><figcaption>Example of tags in a system submission file<\/figcaption><\/figure>\n\n\n\n<p>It will also accept the following ones, even though the <strong>entity tag and type are not specified<\/strong> (the system has submitted <em>PHI<\/em> and <em>OTHER<\/em> as <em>ENTITY<\/em> and <em>TYPE<\/em>, respectively):<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"1023\" height=\"152\" src=\"http:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddocan-ex4.png\" alt=\"\" class=\"wp-image-3675\" srcset=\"https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddocan-ex4.png 1023w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddocan-ex4-300x45.png 300w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/03\/meddocan-ex4-768x114.png 768w\" sizes=\"(max-width: 1023px) 100vw, 1023px\" \/><figcaption>Example of a system submission file with unspecified tags<\/figcaption><\/figure>\n\n\n\n<h4><strong>Merged spans evaluation<\/strong><\/h4>\n\n\n\n<p>For this sub-track we will also compute <strong>an additional evaluation<\/strong> in which we <strong>merge the spans of PHI connected by non-alphanumerical characters<\/strong>. 
For instance, consider <strong>this tag in a GS file<\/strong>:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"1024\" height=\"52\" src=\"http:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex5-1024x52.png\" alt=\"\" class=\"wp-image-3311\" srcset=\"https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex5-1024x52.png 1024w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex5-300x15.png 300w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex5-768x39.png 768w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex5.png 1036w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><figcaption>Example of merged span evaluation: GS tag<\/figcaption><\/figure>\n\n\n\n<p>In the merged spans evaluation, the script will accept the following annotation as correct for this tag (it is also accepted by the strict span evaluation metric):<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"829\" height=\"51\" src=\"http:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex6.png\" alt=\"\" class=\"wp-image-3312\" srcset=\"https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex6.png 829w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex6-300x18.png 300w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex6-768x47.png 768w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex6-825x51.png 825w\" sizes=\"(max-width: 829px) 100vw, 829px\" \/><figcaption>Example of merged span evaluation: system annotation with an unspecified tag<br><br><\/figcaption><\/figure>\n\n\n\n<p>It will also accept the following one, because the script merges the spans of adjacent annotations if there are only non-alphanumerical characters between them:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" width=\"743\" height=\"74\" 
src=\"http:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex7.png\" alt=\"\" class=\"wp-image-3313\" srcset=\"https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex7.png 743w, https:\/\/temu.bsc.es\/meddocan\/wp-content\/uploads\/2019\/02\/meddocan-ex7-300x30.png 300w\" sizes=\"(max-width: 743px) 100vw, 743px\" \/><figcaption>Example of merged span evaluation: system annotations with unspecified tags matching the GS annotation using merging of spans<br><\/figcaption><\/figure>\n\n\n\n<p>This merging process is carried out both in the GS file and in the system submission file. Therefore, the result is symmetric even if the larger span is given by the system (&#8220;Navarro Cu\u00e9llar&#8221; and &#8220;Ignacio&#8221; spans in the GS file and &#8220;Navarro Cu\u00e9llar, Ignacio&#8221; in the system file).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In this section, we show examples of how the evaluation will be carried out to make it clearer. For the following examples, we will consider that this set of MEDDOCAN tags is our Gold Standard (GS): This GS file is in i2b2 format. 
We have a DATE entity, a CONTACT entity, four LOCATION entities, and [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[5],"tags":[],"_links":{"self":[{"href":"https:\/\/temu.bsc.es\/meddocan\/index.php\/wp-json\/wp\/v2\/posts\/3966"}],"collection":[{"href":"https:\/\/temu.bsc.es\/meddocan\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/temu.bsc.es\/meddocan\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/temu.bsc.es\/meddocan\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/temu.bsc.es\/meddocan\/index.php\/wp-json\/wp\/v2\/comments?post=3966"}],"version-history":[{"count":1,"href":"https:\/\/temu.bsc.es\/meddocan\/index.php\/wp-json\/wp\/v2\/posts\/3966\/revisions"}],"predecessor-version":[{"id":3967,"href":"https:\/\/temu.bsc.es\/meddocan\/index.php\/wp-json\/wp\/v2\/posts\/3966\/revisions\/3967"}],"wp:attachment":[{"href":"https:\/\/temu.bsc.es\/meddocan\/index.php\/wp-json\/wp\/v2\/media?parent=3966"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/temu.bsc.es\/meddocan\/index.php\/wp-json\/wp\/v2\/categories?post=3966"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/temu.bsc.es\/meddocan\/index.php\/wp-json\/wp\/v2\/tags?post=3966"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}