Skip to content

Commit a7ffb59

Browse files
committed
added abstraction for target model backend
1 parent a05d76e commit a7ffb59

33 files changed

+1948
-2095
lines changed

examples/run_llama3_eagle3_online.sh

Lines changed: 10 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,15 +8,22 @@ NUM_GPUS=${1:-8}
88
torchrun \
99
--standalone \
1010
--nproc_per_node $NUM_GPUS \
11-
$ROOT_DIR/scripts/train_eagle3_online.py \
11+
$ROOT_DIR/scripts/train_eagle3.py \
1212
--target-model-path meta-llama/Meta-Llama-3.1-8B-Instruct \
1313
--draft-model-config $ROOT_DIR/configs/llama3-8B-eagle3.json \
14-
--train-data-path $ROOT_DIR/cache/dataset/sharegpt.jsonl \
14+
--train-data-path $ROOT_DIR/cache/dataset/sharegpt_train.jsonl \
1515
--output-dir $ROOT_DIR/outputs/llama3-8b-eagle3 \
1616
--num-epochs 2 \
1717
--batch-size 2 \
1818
--learning-rate 1e-4 \
1919
--max-length 2048 \
2020
--chat-template llama3 \
2121
--cache-dir $ROOT_DIR/cache \
22-
--attention-backend flex_attention
22+
--attention-backend flex_attention \
23+
--target-model-backend sglang \
24+
--report-to wandb \
25+
--log-steps 10
26+
27+
28+
29+

requirements.txt

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@ pre-commit
22
torch==2.8.0
33
torchaudio==2.8.0
44
torchvision==0.23.0
5-
transformers==4.55.2
5+
transformers==4.57.1
66
qwen-vl-utils==0.0.11
77
datasets
88
setuptools
@@ -12,5 +12,5 @@ psutil
1212
numpy
1313
accelerate
1414
pydantic
15-
sglang[all]==0.5.1
15+
sglang==0.5.4
1616
openai-harmony

0 commit comments

Comments
 (0)