Mixture of Experts model #210
deepsaia
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Thanks @karpathy for the amazing repo.
Translating the inspiration into a working prototype now.
Built a mixture of experts model with somewhat modular architecture:
https://github.com/deepsaia/moe_llama
Beta Was this translation helpful? Give feedback.
All reactions