These files contain the medical articles of MedlinePlus in both English and Spanish. These articles are classified in the 4 sections present in MedlinePlus: - Drugs and supplements (druginfo). - Drugs (meds). - Herbs and Supplements (natural). - Medical Encyclopedia (ency). - Articles (articles). - Patient instructions (patientinstructions). - Health Topics (health_topics). - Laboratory test information (labtests). We can find 2 files for each article in both languages: - TXT file with article's clean raw text. - XML file with the article's content in TEI format. ---------------------------- TXT files Clean raw text files are structured the following way: T: title of the section P: paragraph of the article t: title of the subsection -: element of a list ---------------------------- TEI files XML files are structured the following way: - each "
" node is a section or subsection of the article (check variable "type") - "" nodes store the title of the section or subsection - "

" nodes store paragraphs - "" nodes store elements of lists