By the Text Mining Unit at the Barcelona Supercomputing Center
The input should be a plain text file with one short text per line.
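
As an illustration of the expected input format, here is a minimal Python sketch that loads such a file into a list of texts. The function name `read_texts`, the UTF-8 encoding, and the choice to skip blank lines are assumptions made for this example; they are not part of the tool itself.

```python
def read_texts(path, encoding="utf-8"):
    """Read a plain text file and return one short text per line.

    Hypothetical helper: the encoding and the skipping of blank
    lines are assumptions, not documented behaviour of the tool.
    """
    texts = []
    with open(path, encoding=encoding) as handle:
        for line in handle:
            text = line.strip()  # drop the trailing newline and outer whitespace
            if text:             # ignore empty lines
                texts.append(text)
    return texts
```

For example, a file containing three lines would yield a list of three strings, in the same order as they appear in the file.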