feat(第12章): 添加 Universal LLMJudge 和 Universal Win Rate 评估示例#99
Open
WHQAQ11 wants to merge 1 commit intodatawhalechina:mainfrom
Open
feat(第12章): 添加 Universal LLMJudge 和 Universal Win Rate 评估示例#99WHQAQ11 wants to merge 1 commit intodatawhalechina:mainfrom
WHQAQ11 wants to merge 1 commit intodatawhalechina:mainfrom
Conversation
新增文件: - 10_Universal_llm_judge.py: 展示使用 UniversalLLMJudgeEvaluator 进行代码质量评估的案例,包含自定义代码模板 - 11_Universal_win_rate.py: 展示使用 UniversalWinRateEvaluator 对比生成数学题和参考题质量的案例 文档更新: - 更新第12章文档:添加 Universal LLMJudge 和 Win Rate 模块的完整使用指南,包括两层级 API 设计、内置模板详解、自定义维度创建、字段映射最佳实践等内容 这些改进展示了通用模块字段映射和自定义评估维度的用法。
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
新增文件:
文档更新:
这些改进展示了通用模块字段映射和自定义评估维度的用法。