Use templates as parameters #588

seanmor5 · 2024-07-23T15:50:28Z

Rather than requiring shape/type we can just use templates. The benefit is that this work supports using any Nx.Container as a model parameter. For right now we still support the old style under the param function, but this defers to the new parameter which requires templates.

There are certain places where using a container/composite parameter in place of a regular parameter makes sense. For example, if we want to initialize quantized models then we can create a quantized tensor container which represents a quantized parameter that can be converted back to a regular parameter.

In the future, I plan to unify the currently separated shape calculation and initialization process for a parameter. Realistically each parameter should just take input templates, do the shape calculation, and then initialize directly without any intermediate steps

seanmor5 added 2 commits July 23, 2024 11:38

Use templates as parameters

7dec0a1

Uncomment deps

18c77ef

josevalim approved these changes Jul 23, 2024

View reviewed changes

seanmor5 merged commit c4d33e5 into main Jul 24, 2024

seanmor5 deleted the template-params branch July 24, 2024 11:36

polvalente mentioned this pull request Sep 24, 2024

fix: loss scale assertions #597

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use templates as parameters #588

Use templates as parameters #588

Uh oh!

seanmor5 commented Jul 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Use templates as parameters #588

Use templates as parameters #588

Uh oh!

Conversation

seanmor5 commented Jul 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants