Pretrain Vision and Large Language Models in Python