dcn on Criteo #335
Merged
Commits
56 commits
07f6ed3 add dcn files. (jiangyzy)
8882312 add README.md (jiangyzy)
e953d85 update readme.md, requirements.txt, train.sh. pretrained models cover… (jiangyzy)
d241dbf deleted files (jiangyzy)
e2d76dd deleted files (jiangyzy)
a632cb5 auto format by CI (oneflow-ci-bot)
62b98d0 deleted .gitignore (jiangyzy)
543a1bb deleted .gitignore (jiangyzy)
6e3a757 updated files (jiangyzy)
68e8e49 modified nn.init.zeros_ and nn.init.xavier_normal_ in crossnet. (jiangyzy)
06dfe97 fix change form /scripts/swin_dataloader_compare_speed_with_pytorch.py (jiangyzy)
c672dc4 add processing frappe from csv to parqurt format files: tools/frap… (jiangyzy)
03c5763 modified frappe download link in README.md (jiangyzy)
d9dfd92 delete tools dir (jiangyzy)
249b794 add tools dir (jiangyzy)
b87dbab update dcn_graph_train_eval files (jiangyzy)
f14901b Merge branch 'main' of https://github.com/Oneflow-Inc/models into dcn… (jiangyzy)
bc1d6d6 update fuxi dcn graph train and eval files , new dataset make tool ba… (jiangyzy)
472645f modified train.sh table_size_array (jiangyzy)
91a88a4 fix some erroe in fuxi_data_util when save csv (jiangyzy)
900cab4 Merge remote-tracking branch 'origin/dcn_fuxi_train_eval' into main (jiangyzy)
09f6501 Criteo dcn related files (jiangyzy)
bb364fb modified README.md (jiangyzy)
6d477b8 modified dcn_train_eval.py some arguments name (jiangyzy)
3c9201b create graph when lr_decay (jiangyzy)
e987402 deleted fm_persistent (jiangyzy)
2e50a5b update dcn_train_eval.py (jiangyzy)
bd77d0f formated file by (jiangyzy)
329f789 new tool dir , and modified dcn_train_eval.py/sh fake path (jiangyzy)
a589398 add feature_map_json argment (jiangyzy)
572d969 delete unnecessary and useless code (jiangyzy)
24fe34d add cast in make_criteo_parquet.py, modified dcn_train_eval.py (jiangyzy)
d29f5ee delete useless (jiangyzy)
6297862 add throughput (jiangyzy)
e14f763 Merge branch 'main' of https://github.com/Oneflow-Inc/models into cri… (jiangyzy)
b2dc7d6 add valid test samples arg (jiangyzy)
9bae48b fix batch_size and train_batch_size mismatched problem (jiangyzy)
1960bd7 delete uesless print code (jiangyzy)
ed02f35 add a blank line in the bottom of dataset_config.yaml (jiangyzy)
8bfb01b add requirements.txt, update README.md (jiangyzy)
5ea1f41 move loss=loss.numpy() to improve efficiency (jiangyzy)
22a0477 delete fuxi code in dcn_train_eval.py, add scala related files, upda… (jiangyzy)
dd87ca6 update README (jiangyzy)
062680b remove RecommenderSystems/dcn/tools/make_criteo_parquet.py and Recom… (jiangyzy)
3adeb3b simplified DNN module, modified test eval process and related READEM… (jiangyzy)
cc9b8c4 add Crossnet fuxi quote, modified directory description in Readme an… (jiangyzy)
1cef2f6 name auc loglogg in eval process as val_auc val_logloss, add pandas … (jiangyzy)
314831f simplified train.sh and related README contents (jiangyzy)
2bb71e4 simplified L2,3,4 in train.sh (jiangyzy)
b2574fb set size_factor default=3 (jiangyzy)
fffafce add dcn structure image (jiangyzy)
60f8505 update Crossnet implementation in README (jiangyzy)
b1f9403 update Crossnet implementation in README (jiangyzy)
104127f update Crossnet implementation in README (jiangyzy)
8eac6b3 update Crossnet implementation in README (jiangyzy)
ee21320 update README (jiangyzy)
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,21 @@ | ||
| # Deep&Cross | ||
| [Deep & Cross Network](https://dl.acm.org/doi/10.1145/3124749.3124754) (DCN) keeps the advantages of a DNN model while learning bounded-degree feature crosses more efficiently. In particular, DCN explicitly applies feature crossing at each layer without manual feature engineering, and the added complexity is almost negligible compared with a plain DNN model. | ||
| ## Directory description | ||
| ``` | ||
| . | ||
| |-- tools | ||
| |-- criteo.py # fuxi data preprocessing | ||
| |-- csv_2_h5.py # fuxi file used to transform data from csv to h5 | ||
| |-- dataset_config.yaml # dataset config file | ||
| |-- fuxi_features.py # fuxi file | ||
| |-- | ||
| |--criteo1t_parquet.py # Read Criteo1T data and export it as parquet data format | ||
| |-- dcn_train_eval.py # OneFlow DCN training and evaluation script with OneEmbedding module | ||
| |-- requirements.txt # python package configuration file | ||
| └── README.md # Documentation | ||
| ``` | ||
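
The per-layer feature crossing mentioned in the README introduction can be illustrated with a small sketch. It follows the cross-layer formula from the cited DCN paper, `x_{l+1} = x_0 * (x_l · w_l) + b_l + x_l`, in plain Python. It is not the CrossNet module from this PR, which is written with OneFlow; the names `cross_layer` and `cross_network` and the toy weights are assumptions made for illustration only.

```python
from typing import List

Vector = List[float]


def cross_layer(x0: Vector, xl: Vector, w: Vector, b: Vector) -> Vector:
    """One cross layer (hypothetical helper): x_{l+1} = x0 * (xl . w) + b + xl."""
    # Project the current layer's output onto the weight vector (a scalar).
    s = sum(xl_i * w_i for xl_i, w_i in zip(xl, w))
    # Scale the original input by that scalar, then add the bias and the residual.
    return [x0_i * s + b_i + xl_i for x0_i, b_i, xl_i in zip(x0, b, xl)]


def cross_network(x0: Vector, weights: List[Vector], biases: List[Vector]) -> Vector:
    """Stack cross layers; every layer crosses its input with the original x0."""
    xl = x0
    for w, b in zip(weights, biases):
        xl = cross_layer(x0, xl, w, b)
    return xl


# Toy two-layer example with hand-picked weights (illustrative values only).
x0 = [1.0, 2.0]
out = cross_network(
    x0,
    weights=[[1.0, 0.0], [0.0, 1.0]],
    biases=[[0.0, 0.0], [0.0, 0.0]],
)
print(out)  # [6.0, 12.0]
```

Each layer adds only one weight vector and one bias vector of the input dimension, which is why the extra cost over the DNN part is small, and each additional layer raises the highest polynomial degree of the learned crosses by one.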