Intermediate checkpoints for HF model

by przvl - opened May 11, 2024

Discussion

przvl

May 11, 2024

edited May 11, 2024

Thanks to the whole team for the great work on the OLMo models!

On the model card you state:

We are releasing many checkpoints for these models, for every 1000 training steps.
These have not yet been converted into Hugging Face Transformers format, but are available in allenai/OLMo-7B.

Are you still planning to convert the checkpoints to HF format? That would be really helpful for easily comparing different checkpoints with transformers (also for the 1B model).

ahmedsqrd

May 22, 2024

+1, I would like to follow up on this, as I would also like to use the HF-format models!

shanearora

May 23, 2024

As of today, we have released almost all the checkpoints of the newer allenai/OLMo-1.7-7B-hf model. The original 1B model will probably be next.

If there are particular intermediate checkpoints you are interested in, one option is to convert them to HF format yourself (it takes roughly 5-10 minutes per checkpoint). The instructions are in Checkpoints.md: find the official checkpoint you want in https://github.com/allenai/OLMo/blob/main/checkpoints/official, then use convert_olmo_to_hf_new.py to convert it to HF format.
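
For the "find the official checkpoint you want" step, here is a rough sketch of how one might pick a single training step out of a checkpoint list. It assumes each line of the list is a URL or path containing `step<N>`, which may not match the actual files under checkpoints/official. The helper name `find_checkpoint` and the example entries are hypothetical, made up for illustration.

```python
import re
from typing import Iterable, Optional

# Assumption: each checkpoint entry contains "step<N>" somewhere in it.
_STEP_RE = re.compile(r"step(\d+)")


def find_checkpoint(lines: Iterable[str], step: int) -> Optional[str]:
    """Return the first entry whose step number equals `step`, else None."""
    for line in lines:
        entry = line.strip()
        if not entry:
            continue  # skip blank lines
        match = _STEP_RE.search(entry)
        if match and int(match.group(1)) == step:
            return entry
    return None


# Hypothetical entries for illustration only, not real checkpoint locations.
example_lines = [
    "https://example.invalid/olmo/step1000-unsharded/",
    "https://example.invalid/olmo/step2000-unsharded/",
    "",
]

selected = find_checkpoint(example_lines, 2000)
print(selected)
```

The selected entry is what you would then download and pass to convert_olmo_to_hf_new.py, following the exact arguments described in Checkpoints.md.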
