Running 231 FineVision: Open Data is All You Need π 231 A new open-source dataset for training VLMs