
Add logits_processor option to generate_step function #983

Merged
awni merged 8 commits into ml-explore:main from nathanrchn:logits_processor on Sep 28, 2024

Conversation

@nathanrchn
Contributor

This update introduces token masking capabilities to the generate_step function via a new logits_processor parameter. This enhancement supports constrained decoding scenarios that require token masking prior to sampling.

Updates include:

  1. New logits_processor parameter in generate_step function
  2. Token masking logic implemented within _step function
  3. Updated docstring for generate_step to describe logits_processor
  4. Created an mx.array of all tokens, including the prompt

Usage example:

def logits_processor(input_ids: mx.array, logits: mx.array) -> mx.array:
    return grammar_processor(input_ids, logits)

Here, grammar_processor could represent a custom constrained decoding approach.
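As a sketch of what such a grammar_processor might do (all names here are hypothetical, and numpy stands in for mx.array so the snippet is self-contained): set the logit of every token the grammar currently disallows to -inf, so sampling can only pick an allowed continuation.

```python
import numpy as np

# Hypothetical grammar state: only these token ids may come next.
allowed_next = {2, 7, 11}

def grammar_processor(input_ids: np.ndarray, logits: np.ndarray) -> np.ndarray:
    """Mask every disallowed token to -inf before sampling."""
    vocab = np.arange(logits.shape[-1])
    return np.where(np.isin(vocab, list(allowed_next)), logits, -np.inf)

logits = np.random.randn(16).astype(np.float32)
out = grammar_processor(np.array([1, 2, 3]), logits)
# only the allowed ids remain finite, so sampling cannot pick a disallowed token
```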

@awni
Member

awni commented Sep 27, 2024

Sorry for the delayed review. This is cool! And I think we can include it, but I want to clarify something first. We have the logit_bias argument already, which can do some of what the logit_processor does but in a less flexible way. Is that insufficient for your use cases? If so, could you explain?

In addition, I don't think we need to support both arguments as it's a bit messy and redundant. Perhaps we can remove the logit_bias and keep the more flexible logit_processor if needed.

@nathanrchn
Contributor Author

I can indeed use the logit_bias argument, but since my project primarily uses the transformers library, it would be nice to have the same logits_processor interface. Additionally, I would need to rewrite the stream_generate or generate method, because the mask changes with each forward pass. Moreover, when masking tokens for constrained decoding, the majority of tokens are typically masked, so creating the logit_bias dictionary could slow down generation slightly, as it needs to cover thousands of tokens.

If you agree, I can remove the logit_bias from the arguments.

@awni
Member

awni commented Sep 28, 2024

Sounds good, thanks for clarifying. Let's remove logit_bias in favor of logit_processor then. Thanks!

else:
    y, logprobs = sample(logits)

tokens_ids = mx.concat([tokens_ids, y], axis=0)
Member
Is this a bug? Shouldn't it be tokens?

Contributor Author
Yes of course.

@awni awni left a comment
Member

Thanks for the addition!

@awni
Member

awni commented Sep 28, 2024

I added logit_bias back because it's part of the OpenAI API spec and I don't want to break compatibility with that.

As a follow-up, a nice thing to do would be to refactor logit_bias and repetition_penalty out of generate_step, since they can both be expressed via the logit_processor. That would simplify generate_step quite nicely.
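That refactor might look roughly like this (a sketch with hypothetical factory names, using numpy in place of mx.array): each existing argument becomes a factory that returns a logits processor with the standard (input_ids, logits) signature.

```python
import numpy as np

def make_logit_bias_processor(logit_bias: dict):
    """Wrap an OpenAI-style {token_id: bias} dict as a logits processor."""
    ids = np.array(list(logit_bias.keys()), dtype=np.int64)
    vals = np.array(list(logit_bias.values()), dtype=np.float32)

    def processor(input_ids: np.ndarray, logits: np.ndarray) -> np.ndarray:
        out = logits.copy()
        out[ids] += vals  # one vectorized update instead of a special-case branch
        return out

    return processor

def make_repetition_penalty_processor(penalty: float):
    """One common formulation: damp logits of tokens already generated."""
    def processor(input_ids: np.ndarray, logits: np.ndarray) -> np.ndarray:
        out = logits.copy()
        seen = np.unique(input_ids).astype(np.int64)
        out[seen] = np.where(out[seen] > 0, out[seen] / penalty, out[seen] * penalty)
        return out
    return processor
```

generate_step would then only need to thread a processor (or list of processors) through its sampling loop, with no dedicated logit_bias or repetition-penalty branches.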

@awni awni merged commit ace2bb5 into ml-explore:main Sep 28, 2024
@nathanrchn
Contributor Author

Do you mean adding logit_bias and repetition_penalty as arguments to generate and stream_generate, and creating a logits_processor to handle the biases and the repetition penalty?

If so, it might be beneficial to modify the logits_processor type from a single function to a list of functions to apply sequentially.
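Applying a list of processors sequentially could be as simple as the following toy sketch (the lambdas exist only to make the ordering visible):

```python
import numpy as np

def apply_processors(processors, input_ids, logits):
    """Run each logits processor in order; later ones see earlier edits."""
    for proc in processors:
        logits = proc(input_ids, logits)
    return logits

# Toy processors to show ordering: add 1, then double.
procs = [
    lambda ids, logits: logits + 1.0,
    lambda ids, logits: logits * 2.0,
]
result = apply_processors(procs, np.array([]), np.zeros(3))
# (0 + 1) * 2 = 2 for every position
```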

@awni
Member

awni commented Sep 28, 2024

I hadn't thought it through too carefully, but yes, what you're describing is more or less what I had in mind.

If so, it might be beneficial to modify the logits_processor type from a single function to a list of functions to apply sequentially.

Yea, that may be cleaner.
