[Bugfix] [Config] Retune ptpc fmoe deepseek-r1 for MI308 #1418
base: main
Signed-off-by: tjtanaavllm <[email protected]>
valarLip
left a comment
LGTM
I fixed a tuning problem with the CK solutions of fmoe in #1405. You can retune the shapes with this procedure:
1. Clean `untuned_fmoe.csv` and add the shapes you want to tune.
2. Run `AITER_REBUILD=1 python3 hsa/gfx942/fmoe_2stages/tune.py --all` at the root of the repository. It will update the tuned shapes in `tuned_fmoe.csv`.
OK. Let me try and get back to you.
@yzhou103 I can't reproduce this problem. Have you updated your code base (ck submodule) and cleaned the build?
Motivation
This also addresses issue #1417.
Commit a7f63e3 only adds preshuffle for mxfp4, so for gfx942, which does not support fp4, the `bpreshuffle` argument has been set to `False`.
The fused MoE has been retuned because the original configuration has an accuracy issue: we are getting an lm_eval score of 0.
Please let me know if the kernel usage or tuning procedure is not correct, as the generated tuning file has only 1 kernel entry.
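One way to check the "only 1 kernel entry" concern is to compare how many shapes were requested with how many rows the tuner wrote. The snippet below is a minimal sketch, not part of the repository: `read_rows` and `compare_entry_counts` are hypothetical helper names, and it assumes both CSV files start with a header row and hold one row per shape, which has not been verified against the actual file layout.

```python
import csv
from pathlib import Path


def read_rows(csv_path):
    """Return (header, data_rows) for a CSV file, skipping blank lines."""
    with Path(csv_path).open(newline="") as f:
        rows = [row for row in csv.reader(f) if any(cell.strip() for cell in row)]
    if not rows:
        return [], []
    return rows[0], rows[1:]


def compare_entry_counts(untuned_path, tuned_path):
    """Compare the number of requested shapes with the number of tuned rows.

    Assumption (hypothetical): one data row per shape in each file.
    """
    _, untuned_rows = read_rows(untuned_path)
    _, tuned_rows = read_rows(tuned_path)
    return {
        "untuned": len(untuned_rows),
        "tuned": len(tuned_rows),
        "missing": max(len(untuned_rows) - len(tuned_rows), 0),
    }
```

Calling `compare_entry_counts(...)` with the paths of `untuned_fmoe.csv` and `tuned_fmoe.csv` in your checkout returns the two counts. A non-zero `missing` value would suggest that some requested shapes were not written to the tuned file.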
Technical Details
Retuning procedure that we executed:
1. Clean `untuned_fmoe.csv` and `tuned_fmoe.csv`.
2. Add the following entries into `untuned_fmoe.csv`.
3. Run `AITER_REBUILD=1 python3 hsa/gfx942/fmoe_2stages/tune.py` at the root of the repository.
Test Plan
Run E2E lmeval test for ptpc deepseek-r1
Test Result
Submission Checklist