Skip to content

Commit d7cdd68

Browse files
committed
fix ruff check of cicd
fix ruff check of cicd rename fix tp/pp input_ids transfer fix fix unset path update golden values modify pre-commit for ci debug fix format format
1 parent db1acfa commit d7cdd68

File tree

15 files changed

+151
-14
lines changed

15 files changed

+151
-14
lines changed

examples/qwen3/conf/train/engram.yaml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ model:
5959

6060
# engram args =================
6161
use_engram: true
62-
engram_tokenizer_name_or_path: tokenizers/Qwen/Qwen2.5-7B-Instruct
62+
engram_tokenizer_name_or_path: xxx
6363
engram_vocab_size: [759680, 759680]
6464
max_ngram_size: 3
6565
n_embed_per_ngram: 512
@@ -101,12 +101,12 @@ model:
101101
data:
102102
reset_position_ids: True
103103
reset_attention_mask: True
104-
data_path: /share/project/lizhiyu/hetero_data/HQ_wo_fim/Nemotron-CC-high-actual-actual-high_text_document
104+
data_path: xxx
105105
split: 1
106106
no_mmap_bin_files: true
107107
tokenizer:
108108
legacy_tokenizer: true
109109
tokenizer_type: Qwen2TokenizerFS
110-
tokenizer_path: tokenizers/Qwen/Qwen2.5-7B-Instruct
110+
tokenizer_path: xxx
111111
vocab_size: 151936
112112
make_vocab_size_divisible_by: 64

examples/qwen3/conf/train_engram.yaml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ experiment:
77
seed: 42
88
save_steps: 50
99
load: None
10-
exp_dir: /share/project/lixianduo/scale_gems_cx/${experiment.exp_name}
10+
exp_dir: xxx
1111
ckpt_format: torch
1212
task:
1313
type: train
@@ -17,10 +17,10 @@ experiment:
1717
per_node_task: false
1818
no_shared_fs: false
1919
rdzv_backend: static
20-
hostfile: /share/project/lixianduo/scale_gems_cx/host_single
21-
ssh_port: 7878
20+
hostfile: xxx
21+
ssh_port: xxx
2222
cmds:
23-
before_start: ulimit -n 1048576 && source /root/miniconda3/bin/activate /share/project/lixianduo/envs/flagscale-train-copy
23+
before_start: ulimit -n 1048576 && source /root/miniconda3/bin/activate flagscale-train
2424
envs:
2525
LOGLEVEL: "INFO"
2626
CUDA_VISIBLE_DEVICES: "0,1,2,3,4,5,6,7"
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.

flagscale/models/deepseek/engram_transformer_layer.py renamed to flagscale/models/megatron/engram/engram_transformer_layer.py

File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.

0 commit comments

Comments
 (0)