olmo-eval: An evaluation workbench for the model development loop | Endigest