Multi-Modal Information Extraction from Text, Semi-Structured, and Tabular Data on the Web