An approach to extracting topic-guided views from the sources of a data lake