Skip to content

Refactor DeepSeek model by extracting basic functions#4611

Closed
fzyzcjy wants to merge 0 commit intosgl-project:mainfrom
fzyzcjy:feat/deepseek_refactor_atomic
Closed

Refactor DeepSeek model by extracting basic functions#4611
fzyzcjy wants to merge 0 commit intosgl-project:mainfrom
fzyzcjy:feat/deepseek_refactor_atomic

Conversation

@fzyzcjy
Copy link
Copy Markdown
Collaborator

@fzyzcjy fzyzcjy commented Mar 20, 2025

Motivation

In order to support #4068, this PR extracts some small functions from deep seek model and deep ep dispatcher.

The code diff needs to subtract change in #4610, i.e. please view fzyzcjy/sglang@feat/deepseek_async...feat/deepseek_refactor_atomic

Modifications

Checklist

@fzyzcjy fzyzcjy marked this pull request as ready for review March 20, 2025 13:13
@fzyzcjy
Copy link
Copy Markdown
Collaborator Author

fzyzcjy commented Apr 1, 2025

Ping me when this PR is to be merged - currently I only resolve conflicts in #4068, and will port the resolve code back here when pinged.

@fzyzcjy fzyzcjy closed this Apr 4, 2025
@fzyzcjy fzyzcjy force-pushed the feat/deepseek_refactor_atomic branch from f4a83ae to 8e10fec Compare April 4, 2025 03:00
@fzyzcjy
Copy link
Copy Markdown
Collaborator Author

fzyzcjy commented Apr 4, 2025

will reopen by extracting new code (temporarily reset to main branch)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant