Data Cleaning and Information Extraction