Conversation

@magicheng0816
Collaborator

No description provided.

magicheng0816 force-pushed the add_constrained_decoding branch from cd1fdce to e0d7b6a on December 3, 2025 at 08:29

// Input generated_token_list: [sequence_num][generated_token_ids]
// Output: mask tensor[sequence_num,vocab_size]
virtual torch::Tensor generate_mask(
Member

There'll be a data copy when calling this function, right? If it's heavy, it should be avoided.

Collaborator Author

There is no heavy data copying. First, an initialized tensor is created on the device side, sized by the vocab size. Then a set of valid token indices is computed dynamically on the host side from the already-generated tokens, copied to the device, and used to modify the initialized tensor in place to form the mask. Only the small index set crosses the host-to-device boundary, not a full [sequence_num, vocab_size] tensor.
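The scheme described above can be sketched as follows. This is a minimal, illustrative Python version, not the PR's C++ implementation: the function name mirrors the declaration in the diff, but the "validity" rule (allow every token not yet emitted) is a hypothetical stand-in for whatever constraint the real decoder enforces, and plain lists stand in for device tensors.

```python
NEG_INF = float("-inf")

def generate_mask(generated_token_list, vocab_size):
    """generated_token_list: [sequence_num][generated_token_ids].
    Returns a [sequence_num][vocab_size] mask: 0.0 where a token is
    allowed, -inf where it is disallowed (added to logits downstream)."""
    # Step 1: tensor initialized "on the device" from the vocab size,
    # everything disallowed by default.
    mask = [[NEG_INF] * vocab_size for _ in generated_token_list]
    for seq_idx, generated in enumerate(generated_token_list):
        # Step 2 (host side): derive the valid-token index set from the
        # tokens already generated. Hypothetical rule for illustration:
        # any token not yet emitted is valid.
        valid = set(range(vocab_size)) - set(generated)
        # Step 3 (device side): in-place update of the initialized
        # tensor; only `valid` needed copying to the device.
        for token_id in valid:
            mask[seq_idx][token_id] = 0.0
    return mask
```

In the real implementation this last step would be a single scatter-style in-place write rather than a Python loop; the point is that the host-to-device transfer is limited to the index set.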

