You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
CNN accepts the word sequence of the embedding table, then process the data by convolution and pooling, and finally outputs a semantic vector.
@@ -240,12 +240,12 @@ The example of this format is as follows.
240
240
241
241
## Training
242
242
243
-
We use `python train.py -y 0 --model_arch 0` with the data in `./data/classification` to train a DSSM model for classification. The paremeters to execute the script `train.py` can be found by execution `python infer.py --help`. Some important parameters are:
243
+
We use `python train.py -y 0 --model_arch 0 --class_num 2` with the data in `./data/classification` to train a DSSM model for classification. The paremeters to execute the script `train.py` can be found by execution `python infer.py --help`. Some important parameters are:
244
244
245
245
-`train_data_path` Training data path
246
246
-`test_data_path` Test data path, optional
247
247
-`source_dic_path` Source dictionary path
248
-
-`target_dic_path`目Target dictionary path
248
+
-`target_dic_path`Target dictionary path
249
249
-`model_type` The type of loss function of the model: classification 0, sort 1, regression 2
250
250
-`model_arch` Model structure: FC 0,CNN 1, RNN 2
251
251
-`dnn_dims` The dimension of each layer of the model is set, the default is `256,128,64,32`,with 4 layers.
0 commit comments