The PharmaCoNER corpus has been randomly sampled into three
Sample set
The sample set is composed of 15 clinical cases extracted from the training set. This sample set is also included in the evaluation script (see Resources). Download the sample set from here.
Train set
The train set is composed of 500 clinical cases. Download the train set from here.
Development set
The Development set is composed of 250 clinical cases. Download the development set from here.
Background set
The background set is composed of 2,751 clinical cases. It is distributed in plain text format. Download the background set from here.
Test set with Gold Standard annotations
The Test set is with Gold Standard annotations is composed of 250 clinical cases. Download the Test set with Gold Standard annotations from here.