Pacific AI

2 repositories

langtest
Public
Deliver safe & effective language models
nlp artificial-intelligence benchmarks benchmark-framework model-assessment ai-safety mlops responsible-ai ml-safety trustworthy-ai
Python • Apache License 2.0 • 50 • 545 • 4 • 0 • Updated Oct 25, 2025
helm
Public
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.
Python • Apache License 2.0 • 340 • 0 • 0 • 0 • Updated Sep 24, 2025