This course studies data quality problems and solutions in the context of text and web mining, which is the exploration of vast amounts of digitized text for use in knowledge discovery or more particularly drug discovery in the biomedical field.