Change the repository type filter
All
Repositories list
2 repositories
langtest
PublicDeliver safe & effective language modelshelm
PublicHolistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.