Text this: Multimodal learning using heterogeneous data /