Skip to content

Conversation

@gabrielmbmb
Copy link
Contributor

Description

This PR adds the new RewardModelScore step which uses transformers to load a reward model to assign and score to an instruction-response or a conversation.

@gabrielmbmb gabrielmbmb added the enhancement New feature or request label Jul 29, 2024
@gabrielmbmb gabrielmbmb added this to the 1.3.0 milestone Jul 29, 2024
@gabrielmbmb gabrielmbmb self-assigned this Jul 29, 2024
@github-actions
Copy link

Documentation for this PR has been built. You can view it at: https://distilabel.argilla.io/pr-840/

@codspeed-hq
Copy link

codspeed-hq bot commented Jul 29, 2024

CodSpeed Performance Report

Merging #840 will not alter performance

Comparing reward-model-step (62ec8a5) with develop (974b45e)

Summary

✅ 1 untouched benchmarks

@gabrielmbmb gabrielmbmb merged commit 20bd1e3 into develop Jul 30, 2024
@gabrielmbmb gabrielmbmb deleted the reward-model-step branch July 30, 2024 07:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

2 participants