Merged
298 commits
c79de72
[NPU] Add group norm (#35937)
Ray2020BD Sep 29, 2021
2b8fd70
fix bug of top_k npu op (#36175)
pangyoki Sep 29, 2021
83578cf
[npu] add box coder (#36171)
Ray2020BD Sep 29, 2021
79bd5f9
add slot record dataset (#36200)
yaoxuefeng6 Sep 29, 2021
a9ea41c
Spinlock (#36030)
liutiexing Sep 29, 2021
1f93582
Add functional autograd API:hessian (#36108)
levi131 Sep 29, 2021
3eb5071
fix cusparse compile problem, test=develop (#36199)
Liu-xiandong Sep 29, 2021
69eed34
add optest for adamw (#36148)
zhaoyinglia Sep 29, 2021
21b93c3
Add basic support for CUDA Graph (#36190)
sneaxiy Sep 29, 2021
8af939f
fix the undefined variable bug in dist_transformer file (#36211)
youth123 Sep 30, 2021
5e0f199
Fix raw optim (#36176)
youth123 Sep 30, 2021
a66b9fb
[NPU] modify transpose2 and index_select_grad kernels for model xlnet…
Aganlengzi Sep 30, 2021
56b04bc
add test_hessian time out (#36234)
levi131 Sep 30, 2021
c12176e
fix yolo (#36240)
b3602sss Sep 30, 2021
0a3dbe8
add slotrecord datafeed (#36099)
yaoxuefeng6 Sep 30, 2021
2cee0ea
added Piotr to authors.md and updated Intel-related paddle authors im…
jakpiase Oct 4, 2021
dc4d571
Added concat BF16/FP32 BWD OneDNN kernel (#35889)
jakpiase Oct 5, 2021
e928834
[OneDNN] Conv op refactor. (#36252)
arogowie-intel Oct 7, 2021
730dcaf
fix bugs in HybridParallelClipGrad of hybrid_parallel_optimizer (#36237)
haohongxiang Oct 7, 2021
9814f89
fix cast cuda implementation (#36266)
sneaxiy Oct 8, 2021
1bd9cfe
Added oneDNN BF16 relu (#36265)
arlesniak Oct 8, 2021
a29ff4c
add python interface of sub_graph (#36120)
huangxu96 Oct 8, 2021
7cb19f5
[NPU] BatchNorm support layout of NCL and NLC, test=develop (#35668)
qili93 Oct 8, 2021
ca16e8f
add fs list_files_info (#36224)
yaoxuefeng6 Oct 8, 2021
f9591bb
Support CUDA Graph on ParallelExecutor (#36250)
sneaxiy Oct 8, 2021
57e8cbe
Fix for oneDNN conv op (#36284)
jakpiase Oct 8, 2021
d8887af
fix hasattr(paddle.fluid.ir.PassDesc.OP, '__name__') error (#36229)
Avin0323 Oct 9, 2021
2fd8dee
C++ support register pass via PassDesc (#36095)
Avin0323 Oct 9, 2021
623df42
support ClipGradByGlobalNorm in sharding (#36012)
zhaoyinglia Oct 9, 2021
c8a0101
update fft api path (#36219)
zhiboniu Oct 9, 2021
62e4115
fill_diagonal op fix border cross caused by offset (#36212)
zhiboniu Oct 9, 2021
21dc7f4
Add new API 'tensordot' (#36273)
From00 Oct 9, 2021
cb620ca
Add const for OpDesc::id() and VarDesc::id() (#36298)
sneaxiy Oct 9, 2021
9111927
Enhance OpTest for bfloat16. (#36079)
Xreki Oct 9, 2021
7e6c0ce
Implement Fused BN + Add + Relu with cudnnFusedOps API. (#35955)
ZzSean Oct 9, 2021
9b987b3
Add skip case for conv2d convert test (#36301)
JZZ-NOTE Oct 11, 2021
5690666
Add use_cinn Flag and RunFromCinn in PE (#36107)
zhhsplendid Oct 11, 2021
34bd18f
add skip case in trt converter ut (#36287)
baoachun Oct 11, 2021
2b7b752
add mish trt plugin (#34123)
wangxinxin08 Oct 11, 2021
ea76457
fix the hidden method in paddle.distributed.utils file (#36210)
youth123 Oct 11, 2021
2bf82e7
fix fft axis (#36321)
Oct 11, 2021
642aaa2
use unified external error message for cufft api (#36114)
cxxly Oct 11, 2021
64d08c0
fix bug of upload third party to bos (#36311)
betterpig Oct 11, 2021
1106132
[NPU] fix softmax_with_cross_entropy in dygraph, test=develop (#36297)
qili93 Oct 11, 2021
83541fd
[NPU] fix set_value, test=develop (#36272)
qili93 Oct 11, 2021
7850f7c
[NPU] fix matmul_v2 and utils.run_check, test=develop (#36164)
qili93 Oct 11, 2021
71cb3ff
enhance yolobox trt plugin (#34128)
wangxinxin08 Oct 11, 2021
414c252
Fix, test=document_fix (#36336)
tianshuo78520a Oct 11, 2021
7a724dd
fix multi-node (#36329)
yaoxuefeng6 Oct 11, 2021
c38b048
add reshard module (#35779)
Caozhou1995 Oct 11, 2021
00245cf
[Paddle-ASP] Revise 4d tensor sparsity mask pattern for conv2d sparsi…
zlsh80826 Oct 11, 2021
1026052
fix_dp_grad_merge_with_grad_clip_by_global_norm (#36334)
FeixLiu Oct 11, 2021
fc5415d
change exit code of pip install dependencies to 5 (#36016)
betterpig Oct 11, 2021
eaeeb88
fix bug of clear third_party cache every 10 days (#36332)
betterpig Oct 11, 2021
830debc
Add functor_primitives.h for kernel primtive api (#36203)
AnnaTrainingG Oct 11, 2021
a679fcb
Add more tests and fix bugs for cudnn_norm_conv_test and cudnn_bn_and…
ZzSean Oct 11, 2021
1439387
added missing bf16 ops (#36291)
jakpiase Oct 11, 2021
85b7723
Add nn.functional.sparse_attention and some test cases, test=develop …
Liu-xiandong Oct 11, 2021
7b45a46
Add FLAGS_allreduce_record_one_event to remove event waiting number (…
sneaxiy Oct 11, 2021
339cb19
fix for matmul_v2 6D x 2D (#36342)
jakpiase Oct 11, 2021
e5b4dd7
[heterps] add fuse_allreduce (#35131)
danleifeng Oct 11, 2021
6d353aa
refine auto_growth allocator (#35732)
zhiqiu Oct 11, 2021
2a75b44
Fix stop_gradient in RunProgramOp (#36339)
Aurelius84 Oct 12, 2021
0594d2a
Revert "refine case when thread_num = 1 (#36201)" (#36347)
sneaxiy Oct 12, 2021
ec148ca
fft: modify sample code result (#36325)
lijiaqi0612 Oct 12, 2021
d247cf1
fix bugs in mp_layers、pp_layers and HybridParallelClipGrad (#36144)
haohongxiang Oct 12, 2021
e275e42
Add pool2d test convert (#36338)
JZZ-NOTE Oct 12, 2021
8cc7146
[NPU] add int64 kernel for slice, test=develop (#36328)
qili93 Oct 12, 2021
1d660eb
Fix the bug when axis is specified and weight is provided
HydrogenSulfate Oct 11, 2021
8c2fbc3
Update loss.py
HydrogenSulfate Oct 11, 2021
53dc014
Update loss.py
HydrogenSulfate Oct 11, 2021
3675f25
Update loss.py
HydrogenSulfate Oct 11, 2021
6cd41ce
Update loss.py
HydrogenSulfate Oct 11, 2021
a4246b9
Update test_cross_entropy_loss.py
HydrogenSulfate Oct 11, 2021
59841e6
Update test_cross_entropy_loss.py
HydrogenSulfate Oct 11, 2021
f77083b
Update loss.py
HydrogenSulfate Oct 11, 2021
b3f6eed
refine LarsOptimizer (#36351)
sneaxiy Oct 12, 2021
09778f4
[NPU] fix elementwise_mul to support broadcast, test=develop (#36258)
qili93 Oct 12, 2021
1e1aa19
[Autograd.functional] VJP and JVP (#36020)
tongxin Oct 12, 2021
021add6
remove not needed log (#36348)
jczaja Oct 12, 2021
40cfe7b
delete remove_static_file() function in error.py (#36153)
0x45f Oct 12, 2021
6920afe
fix windows bug that python virtual env can't find python executable …
zhwesky2010 Oct 12, 2021
5f1eb83
[NPU] concat supports dtype int64 for model deepfm (#36327)
Aganlengzi Oct 12, 2021
fba355f
change the paddle.mm to matmul_v2 (#35770)
wawltor Oct 12, 2021
3e2dec5
Change the input param of fusion op interface from pointer to tensor …
ZzSean Oct 12, 2021
033a73c
Revert "refine LarsOptimizer (#36351)" (#36369)
sneaxiy Oct 12, 2021
3c2bdaa
unify usage of tuple and list (#36368)
levi131 Oct 13, 2021
90457d8
Set NIGHTLY tag for 'tensordot' UT (#36354)
From00 Oct 13, 2021
caa2003
[PaddlePaddle Hackathon] add AlexNet (#36058)
fuqianya Oct 13, 2021
d7858c9
[PaddleInference] Pass: add int8 flag for op (#36042)
Wangzheee Oct 13, 2021
2c44ee7
[New Feature] Support triple grad in Paddle (#36187)
JiabinYang Oct 13, 2021
e051bba
Remove RunFromCinn in PE because We Will Call CinnRunner in Compute o…
zhhsplendid Oct 13, 2021
59e425c
[Amp] refine code of amp level (#36362)
zhiqiu Oct 13, 2021
bf748f2
Implemented LRU based cache clearing (#36290)
jczaja Oct 13, 2021
192e08c
pool fix (#36388)
b3602sss Oct 13, 2021
817f9ef
fix pp comm init bug (#36377)
Caozhou1995 Oct 13, 2021
85bb1a8
support auto parallel data shard (#36055)
GuoxiaWang Oct 13, 2021
3a869cc
Add fp16 for clip_by_norm & clip_by_global_norm (#36198)
zhangbo9674 Oct 13, 2021
9a9953d
[AMP] add attr is_distributed for layer.to (#36221)
zhangbo9674 Oct 13, 2021
2441847
Verify the correctness of graph rewrited by GeneratePass (#36116)
Avin0323 Oct 13, 2021
0c31579
Merge lars op (#35476)
JamesLim-sy Oct 13, 2021
d7064f0
[PaddlePaddle hackathon] + ADD CELU (#36088)
JunnYu Oct 13, 2021
8fd1b6a
fix BatchNorm for fp16 (#36376)
GuoxiaWang Oct 13, 2021
7f5128f
clean inference logs when config.DisableGlogInfo is triggered (#36356)
Oct 14, 2021
b857d75
Sparsity support (#36413)
JZZ-NOTE Oct 14, 2021
03d8304
[hybrid enhance] add flag to control the avg position for grad merge …
FeixLiu Oct 14, 2021
fb68ea6
Add static memory analysis module (#36408)
sneaxiy Oct 14, 2021
cb5bf58
fix import bug for assign (#36406)
levi131 Oct 14, 2021
693b1aa
reduce some unittest's parallel number to avoding timeout failure (#3…
betterpig Oct 14, 2021
8ffcc7c
[HybridParallel]Rebuild code for pipeline (#36396)
ForFishes Oct 14, 2021
eb722e3
refine lars (#36409)
sneaxiy Oct 14, 2021
f4eda86
Merge momentum ops/kernels (#36380)
sneaxiy Oct 14, 2021
5d18967
Revert "Implemented LRU based cache clearing (#36290)" (#36426)
lidanqing-vv Oct 14, 2021
bed4fb2
[NPU] Add density_prior_box (#36361)
Ray2020BD Oct 14, 2021
12e6dbb
Add the complete code and related files of resnet_unit_op (#36366)
ZzSean Oct 14, 2021
3e6d9db
inference support bert when exists matmul_v2 (#36424)
jiweibo Oct 14, 2021
63fd7d6
refine merge lars (#36428)
sneaxiy Oct 14, 2021
3cf5764
enable 3rd order test case (#36427)
levi131 Oct 14, 2021
8256f6f
fix lars (#36431)
sneaxiy Oct 14, 2021
66c58fa
optimize-offload support adamw op type (#36432)
firestonelib Oct 14, 2021
6ccc2a4
add sparse_embedding doc (#36283)
Yanxing-Shi Oct 14, 2021
8566cc9
close some check on CI-OP-Benchmark, test=develop (#36442)
Avin0323 Oct 15, 2021
4dda18a
fix momentum ops (#36452)
sneaxiy Oct 15, 2021
808be65
[New Feature] Support tanh triple grad (#36225)
JiabinYang Oct 15, 2021
b3f02c5
Add BuildCinnPass (#36345)
thisjiang Oct 15, 2021
f45e6cf
dynamic load mkl as a fft backend when it is avaialble and requested …
Oct 15, 2021
37257d6
fix no_grad context error in train mode when using save/load (#36434)
0x45f Oct 15, 2021
277c9a5
add resnext (#36070)
SigureMo Oct 15, 2021
2de0b58
feat: Add TRT support for 3D(batch_norm_op and elementwise_add_op) (#…
Oct 15, 2021
12882b2
Add ResNetUnit Python API (#35426)
ZzSean Oct 15, 2021
e703a2e
fix opt-offload save bug (#36433)
firestonelib Oct 15, 2021
adb8049
Remove wrong __restrict__ of CUDA LarsMomentumOpKernel (#36460)
sneaxiy Oct 15, 2021
0452f27
fix the initializer of resnet unit op (#36483)
ZzSean Oct 16, 2021
314cc49
Revert "fix the initializer of resnet unit op (#36483)" (#36487)
sneaxiy Oct 17, 2021
4e036fa
refine rescale_grad (#36490)
sneaxiy Oct 17, 2021
e496d1e
modify ut of cond (#36475)
haohongxiang Oct 18, 2021
79dbbcc
[autograd.functional] Fix a bug on handling v=None in vjp and jvp (#3…
tongxin Oct 18, 2021
d3c9394
Fix conv2d op_teller error (#36474)
JZZ-NOTE Oct 18, 2021
d19a9b3
[XPU AMP] 1. xpu support gradient acc 2. xpu support create tensor in…
taixiurong Oct 18, 2021
623e36b
add IPluginV2Layer: AddPluginV2Ext (#36493)
Wangzheee Oct 18, 2021
051544b
quant support matmul_v2 (#36469)
ceci3 Oct 18, 2021
3845aff
Add operators for async read & async write (#36333)
DesmonDay Oct 18, 2021
8757fc5
[NPU] fix dtype for arg_max, test=develop (#36457)
qili93 Oct 18, 2021
cbd15f7
[NPU] add kernels for elementwise_add gather_nd tile, test=develop (#…
qili93 Oct 18, 2021
b7f7664
Add quant axis (#36467)
xiaoxiaohehe001 Oct 18, 2021
4c0ad77
Lml/vhp (#36146)
levi131 Oct 18, 2021
bdac9ff
Added softplus FP32 FWD OneDNN kernel (#36382)
jakpiase Oct 18, 2021
10f0a0f
[HybridParallel]Support fp16 in dygraph hybrid parallel (#36420)
haohongxiang Oct 18, 2021
305b99a
Add pow2_decay_with_linear_warmup op (#36421)
sneaxiy Oct 19, 2021
a7830a2
bug fix for DeserializeSelectedRows. test=develop (#36520)
zmxdream Oct 19, 2021
77f4597
fix out of range for area interp, test=develop (#36466)
tink2123 Oct 19, 2021
1d5746b
add rocm support for fft api (#36415)
cxxly Oct 19, 2021
a573a7e
Add auto parallel cost model and unittests (#36363)
YipZLF Oct 19, 2021
34d785c
[paddle.linalg.qr] Add the Qr Operator (#35742)
aoyulong Oct 19, 2021
7edcc4f
catch the generatorfunction and intercept it. (#35369)
2742195759 Oct 19, 2021
d89a759
fix replicate pad when input size is 0 (#36510)
littletomatodonkey Oct 19, 2021
8cc8e41
[hybrid] static model parallel dropout support deterministic RandomSe…
wangxicoding Oct 19, 2021
7b67f39
add nearest_interp_v2 trt plugin (#34126)
wangxinxin08 Oct 19, 2021
6cdc5a4
Optimize the subgraph generated by BuildCinnPass (#36503)
thisjiang Oct 19, 2021
be6a833
Inference add type check in copy_from_cpu (#36429)
jiweibo Oct 19, 2021
9e49447
[heterps]edit shrink and unseenday logit for pslib (#36194)
danleifeng Oct 19, 2021
49d7bd3
[NPU] update inference cmake, test=develop (#36505)
qili93 Oct 19, 2021
f261246
fix op_flops not define. test=develop (#36489)
heavengate Oct 19, 2021
999242e
[NPU] Add iou_similarity op (#36412)
Ray2020BD Oct 19, 2021
51c97d9
Support elementwise_add triple grad Kernel (#36508)
veyron95 Oct 19, 2021
fe01ba6
remove no_value using var.name (#36513)
0x45f Oct 20, 2021
127488b
Add kQueueSync.synchronize_run_ logic (#36546)
Aurelius84 Oct 20, 2021
797bd40
[Auto Parallel] Generalization for Partition and Completion (#35735)
JZ-LIANG Oct 20, 2021
c285c71
[FIX] Extend time for test_activation_nn_grad to avoid its timeout is…
JiabinYang Oct 20, 2021
4bd1977
fix (#36557)
b3602sss Oct 20, 2021
6524fa8
Add CINN Compile Option (#36292)
zhhsplendid Oct 20, 2021
8ca5206
fix SerializeSelectedRows (#36543)
zmxdream Oct 20, 2021
06bd348
update for trt convert ut. (#36549)
jiweibo Oct 20, 2021
7325c9f
add unittest (#36371)
jiweibo Oct 20, 2021
605e7f0
fix pow2 decay (#36559)
sneaxiy Oct 20, 2021
873ee4e
adapt to cann5.0.3_alpha3. (#36106)
wuhuachaocoding Oct 20, 2021
3f2d6a3
Add FasterTokenizer Operator (#34491)
Steffy-zxf Oct 20, 2021
fc5db55
fix fc fuse proble (#36568)
jiweibo Oct 20, 2021
6a572a1
[NPU] Add kldiv_loss_op for npu (#36494)
ronny1996 Oct 20, 2021
17b4dd7
Fix global gather and global scatter operators (#36517)
youth123 Oct 20, 2021
6a3941e
fix bugs of ClipGradByGlobalNorm in HybridParallel (#36555)
haohongxiang Oct 20, 2021
ded3e70
[heterps]fix heterps pipeline training (#36512)
danleifeng Oct 20, 2021
e82c3a5
Support No DataTransform From GetKernelTypeForVar (#36571)
Aurelius84 Oct 21, 2021
1d38a01
refine comments for GradScaler state_dict (#36522)
zhangbo9674 Oct 21, 2021
f698577
Fixed unit test for auto parallel cost model (#36574)
YipZLF Oct 21, 2021
7253398
Fix flame graph (#36578)
liutiexing Oct 21, 2021
d64f7b3
add ctr table depends (#36465)
zhaocaibei123 Oct 21, 2021
5eb640c
Graph engine4 (#36587)
seemingwang Oct 21, 2021
921c091
Fix a bug in ReadData, ReadDataBc and ReadDataReduce when NX != 1 (#3…
AnnaTrainingG Oct 21, 2021
b6e7f8e
User specified backend (#35745)
2742195759 Oct 21, 2021
7bf2aa3
add fill_any_like/flatten ops to train ssd on kunlun (#36550)
tangzhiyi11 Oct 21, 2021
66f4b29
fix hdfs download_dir (#36590)
danleifeng Oct 21, 2021
6072aec
Add viterbi decode (#35778)
joey12300 Oct 21, 2021
0ca2807
[NPU] Add sync_batch_norm and sync_batch_norm_grad NPU Kernel (#36320)
windstamp Oct 21, 2021
856cb9c
Added matmul_v2+transpose+reshape fuse pass (#36481)
jakpiase Oct 21, 2021
7eab0fa
add swish_op for npu (#36579)
ronny1996 Oct 21, 2021
ed478a3
[NPU] Add p_norm_grad (#36497)
Ray2020BD Oct 21, 2021
ff06df6
[PaddlePaddle Hackathon] add InceptionV3 (#36064)
SigureMo Oct 22, 2021
f46311b
support lite xpu choose device id (#36610)
jiweibo Oct 22, 2021
6580ad1
【Bug Fixes】Elementwise_add triple grad, fixed an input uninitialized …
veyron95 Oct 22, 2021
08248db
[hapi] support dygraph amp O2 (#36441)
zhiqiu Oct 22, 2021
d490621
Fused attention op forward (#35905)
limin2021 Oct 22, 2021
1962d3a
add fp16 kernel for clip_op (#36577)
zhangbo9674 Oct 22, 2021
5e88084
correct slice serialize data (#36588)
b3602sss Oct 22, 2021
ab73288
New Paddle-CINN Compile PR (#36584)
zhhsplendid Oct 23, 2021
3cb6f65
Add transformer of paddle desc and cinn desc (#36100)
thisjiang Oct 23, 2021
425db7c
add file exists check (#36628)
jiweibo Oct 23, 2021
f6d8252
fix interpolate mkldnn op error (#36623)
baoachun Oct 23, 2021
99e396f
disable padding if dynamic shape (#36648)
b3602sss Oct 23, 2021
bbd4bd7
add cinn graph symbolization (#36417)
thisjiang Oct 23, 2021
e2173b6
Add the macro `-DPADDLE_WITH_CINN`. (#36660)
wzzju Oct 23, 2021
087c3ab
fix pool2d convert case (#36667)
JZZ-NOTE Oct 25, 2021
b18cbfb
add op: fused_feedforward(forward) (#35843)
Oct 25, 2021
eff3ee5
Fix grid sampler while input size is [1] (#36183)
wanghaoshuang Oct 25, 2021
cdb9bfa
[new-exec] Add events waiter (#36480)
liutiexing Oct 25, 2021
50778ad
add some ops to train ssd on kunlun (#36407)
tangzhiyi11 Oct 25, 2021
59d8b8c
[HybridParallel]fix bug of check_inf in fleet_base.py (#36651)
haohongxiang Oct 25, 2021
4c46037
Create CinnCompiler class for compiling subgraphs found by build_cinn…
wzzju Oct 25, 2021
e16fe48
CI build PR and dev whl (#36532)
tianshuo78520a Oct 25, 2021
39f1912
Add bincount op (#36317)
smallv0221 Oct 25, 2021
2dd0a46
add op: fused_feedforward(backward) (#35611)
Oct 25, 2021
19b02d9
[NPU] modifications for model ernie-1.0 (#36642)
Aganlengzi Oct 25, 2021
cea1ba8
add ctr accessor (#36601)
zhaocaibei123 Oct 25, 2021
229bae8
Pool3d 2.0 (#36545)
Oct 26, 2021
db633af
Fix conv2d convert case (#36699)
JZZ-NOTE Oct 26, 2021
eca78a9
Support various length support for SelectedRows in GLOO::AllGather (#…
2742195759 Oct 26, 2021
290ded7
Optimize FasterTokenizer (#36701)
joey12300 Oct 26, 2021
93c591e
[Paddle-Inference]Add MatmulV2ToMatmul convert Pass, fix (matmul_v2, …
Wangzheee Oct 26, 2021
21bece3
enable flags_benchmark for dygraph (#36686)
zhiqiu Oct 26, 2021
81e0c1b
move fft and signal files, move signal APIs (#36540)
Oct 26, 2021
28bab07
Fix the null ptr bug in build_cinn_pass. (#36698)
wzzju Oct 26, 2021
43dcf23
fix wrong trt dim when input dim is 2 (#36614)
baoachun Oct 26, 2021
3523bbe
[NPU] fix argsort op, test=develop (#36576)
qili93 Oct 26, 2021
9aeca2f
Move fused_attention and fused_feedforward functional api path to inc…
limin2021 Oct 26, 2021
eb9ef88
Modify paddle.static.nn.cond doc (#36694)
zhhsplendid Oct 26, 2021
fe6dbdd
[new-exec] Add cancel for thread pool (#36688)
liutiexing Oct 26, 2021
87fbbd3
[new-exec] cache exception in child thread (#36692)
zhiqiu Oct 26, 2021
236ed94
Add roi_align grad (#36724)
Ray2020BD Oct 26, 2021
7b1e30f
roll_op: support Tensor as input for shifts (#36727)
Oct 26, 2021
5119428
Add fused attention op backward and python layer. (#36498)
limin2021 Oct 26, 2021
63f1e6b
Remove additional warnning in layer.to (#36700)
JiabinYang Oct 26, 2021
63f3ae0
show paddle traceback after last user code traceback (#36741)
0x45f Oct 27, 2021
542ba21
Fix inverse in fake quant (#36762)
wanghaoshuang Oct 27, 2021
9f9ed3a
add paddle.linalg.eigvalsh API (#35615)
huangjun12 Oct 27, 2021
8c1c72a
enable trt test check and fix trt ut error(3/3) (#36581)
jiweibo Oct 27, 2021
6838a18
add fp16 unittests for kl2 (#36583)
taixiurong Oct 27, 2021
34b6860
fix fftshift/ifftshift on static mode (#36748)
Oct 27, 2021
d5245a3
add matmul_v2 to v1 CPU pass and fix matmul dim error (#36731)
baoachun Oct 27, 2021
e625315
bugfix: only check backend when mode == Collecive (#36758)
2742195759 Oct 27, 2021
9f3613f
Fused transformer encoder layer and fused feedforward layer (#36604)
Oct 27, 2021
14 changes: 8 additions & 6 deletions AUTHORS.md
@@ -3,7 +3,7 @@
| abhinavarora | Abhinav Arora |
| andreazanetti | Andrea Zanetti |
| arlesniak | Artur Lesniak |
-| arogowie-intel | Adam Osewski |
+| [arogowie-intel](https://raw.githubusercontent.com/jakpiase/Paddle/new_paddle_intel_authors/img/img.jpg) | Adam Osewski |
| backyes | Yan-Fei Wang |
| baiyfbupt | Yi-Fan Bai |
| beckett1124 | Bin Qi |
@@ -25,8 +25,8 @@
| hedaoyuan | Dao-Yuan He |
| helinwang | He-Lin Wang |
| jacquesqiao | Long-Fei Qiao |
-| jakpiase | Jakub Piasecki |
-| [jczaja](https://raw.githubusercontent.com/jczaja/Paddle/paddle-poland-team/doc/images/paddle_poland_team.jpg) | Jacek Czaja |
+| [jakpiase](https://raw.githubusercontent.com/jakpiase/Paddle/new_paddle_intel_authors/img/img.jpg) | Jakub Piasecki |
+| [jczaja](https://raw.githubusercontent.com/jakpiase/Paddle/new_paddle_intel_authors/img/img.jpg) | Jacek Czaja |
| JiayiFeng | Jia-Yi Feng |
| kbinias | Krzysztof Binias |
| kexinzhao | Ke-Xin Zhao |
@@ -47,28 +47,30 @@
| pakchoi | Chuan-Jiang Song |
| panyx0718 | Xin Pan |
| pengli09 | Peng Li |
-| pmajchrzak |Piotr Majchrzak |
+| [piotrekobiIntel](https://raw.githubusercontent.com/jakpiase/Paddle/new_paddle_intel_authors/img/img.jpg) | Piotr Paturej |
+| [pmajchrzak](https://raw.githubusercontent.com/jakpiase/Paddle/new_paddle_intel_authors/img/img.jpg) | Piotr Majchrzak |
| pkuyym | Ya-Ming Yang |
| pzelazko-intel | Pawel Zelazko |
| [pawelpiotrowicz](https://raw.githubusercontent.com/jczaja/Paddle/paddle-poland-team/doc/images/paddle_poland_team.jpg) | Pawel Piotrowicz |
| QiJune | Jun Qi |
| qingqing01 | Qing-Qing Dang |
| reyoung | Yang Yu |
| [Sand3r-](https://raw.githubusercontent.com/jczaja/Paddle/paddle-poland-team/doc/images/paddle_poland_team.jpg)| Michal Gallus |
-| [sfraczek](https://raw.githubusercontent.com/jczaja/Paddle/paddle-poland-team/doc/images/paddle_poland_team.jpg)| Sylwester Fraczek |
+| [sfraczek](https://raw.githubusercontent.com/jakpiase/Paddle/new_paddle_intel_authors/img/img.jpg)| Sylwester Fraczek |
| sneaxiy | Jin-Le Zeng |
| Superjom | Chun-Wei Yan |
| tensor-tang | Jian Tang |
| tianbingsz | Tian-Bing Xu |
| tpatejko | Tomasz Patejko |
+| [tsocha](https://raw.githubusercontent.com/jakpiase/Paddle/new_paddle_intel_authors/img/img.jpg) | Tomasz Socha |
| typhoonzero | Yi Wu |
| velconia | Qi-Yang Min |
| wanghaoshuang | Hao-Shuang Wang |
| wangyang59 | Yang Wang |
| wangzhen-nlp | Zhen Wang |
| wen-bo-yang | Wen-Bo Yang |
| wojtuss | Wojciech Uss |
-| wozna | Joanna Wozna |
+| [wozna](https://raw.githubusercontent.com/jakpiase/Paddle/new_paddle_intel_authors/img/img.jpg)| Joanna Wozna |
| wwhu | Wei-Wei Hu |
| xinghai-sun | Xing-Hai Sun |
| Xreki | Yi-Qun Liu |
1 change: 1 addition & 0 deletions CMakeLists.txt
@@ -214,6 +214,7 @@ option(PY_VERSION "Compile PaddlePaddle with python3 support" ${PY_VER
option(WITH_DGC "Use DGC(Deep Gradient Compression) or not" ${WITH_DISTRIBUTE})
option(SANITIZER_TYPE "Choose the type of sanitizer, options are: Address, Leak, Memory, Thread, Undefined" OFF)
option(WITH_LITE "Compile Paddle Fluid with Lite Engine" OFF)
+option(WITH_CINN "Compile PaddlePaddle with CINN" OFF)
option(WITH_NCCL "Compile PaddlePaddle with NCCL support" ON)
option(WITH_RCCL "Compile PaddlePaddle with RCCL support" ON)
option(WITH_XPU_BKCL "Compile PaddlePaddle with BAIDU KUNLUN XPU BKCL" OFF)
34 changes: 34 additions & 0 deletions cmake/external/ascend.cmake
@@ -85,5 +85,39 @@ if(WITH_ASCEND_CL)
ADD_LIBRARY(acl_op_compiler SHARED IMPORTED GLOBAL)
SET_PROPERTY(TARGET acl_op_compiler PROPERTY IMPORTED_LOCATION ${acl_op_compiler_lib})
add_custom_target(extern_ascend_cl DEPENDS ascendcl acl_op_compiler)
endif()

+if (WITH_ASCEND_CL)
+macro(find_ascend_toolkit_version ascend_toolkit_version_info)
+file(READ ${ascend_toolkit_version_info} ASCEND_TOOLKIT_VERSION_CONTENTS)
+string(REGEX MATCH "version=([0-9]+\.[0-9]+\.[0-9]+\.[a-z]*[0-9]*)" ASCEND_TOOLKIT_VERSION "${ASCEND_TOOLKIT_VERSION_CONTENTS}")
+string(REGEX REPLACE "version=([0-9]+\.[0-9]+\.[0-9]+\.[a-z]*[0-9]*)" "\\1" ASCEND_TOOLKIT_VERSION "${ASCEND_TOOLKIT_VERSION}")
+string(REGEX REPLACE "[a-z|\.]" "" CANN_VERSION ${ASCEND_TOOLKIT_VERSION})
+add_definitions("-DCANN_VERSION_CODE=${CANN_VERSION}")
+if(NOT ASCEND_TOOLKIT_VERSION)
+set(ASCEND_TOOLKIT_VERSION "???")
+else()
+message(STATUS "Current Ascend Toolkit version is ${ASCEND_TOOLKIT_VERSION}")
+endif()
+endmacro()
+
+macro(find_ascend_driver_version ascend_driver_version_info)
+file(READ ${ascend_driver_version_info} ASCEND_DRIVER_VERSION_CONTENTS)
+string(REGEX MATCH "Version=([0-9]+\.[0-9]+\.[0-9]+)" ASCEND_DRIVER_VERSION "${ASCEND_DRIVER_VERSION_CONTENTS}")
+string(REGEX REPLACE "Version=([0-9]+\.[0-9]+\.[0-9]+)" "\\1" ASCEND_DRIVER_VERSION "${ASCEND_DRIVER_VERSION}")
+if(NOT ASCEND_DRIVER_VERSION)
+set(ASCEND_DRIVER_VERSION "???")
+else()
+message(STATUS "Current Ascend Driver version is ${ASCEND_DRIVER_VERSION}")
+endif()
+endmacro()
+
+if (WITH_ARM)
+set(ASCEND_TOOLKIT_DIR ${ASCEND_DIR}/ascend-toolkit/latest/arm64-linux)
+else()
+set(ASCEND_TOOLKIT_DIR ${ASCEND_DIR}/ascend-toolkit/latest/x86_64-linux)
+endif()
+
+find_ascend_toolkit_version(${ASCEND_TOOLKIT_DIR}/ascend_toolkit_install.info)
+find_ascend_driver_version(${ASCEND_DIR}/driver/version.info)
+endif()
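
The two macros above pull a version string out of an info file with a regular expression, and the toolkit macro then strips letters and dots to build the value passed as `CANN_VERSION_CODE`. A minimal Python sketch of that logic follows, assuming each `\.` in the patterns is meant to match a literal dot. The function names and the sample file contents are hypothetical and are not taken from a real Ascend installation.

```python
import re

# Patterns mirror the ones in ascend.cmake, reading "\." as a literal dot (assumption).
TOOLKIT_RE = re.compile(r"version=([0-9]+\.[0-9]+\.[0-9]+\.[a-z]*[0-9]*)")
DRIVER_RE = re.compile(r"Version=([0-9]+\.[0-9]+\.[0-9]+)")


def parse_toolkit_version(contents):
    """Return (version, version_code); ("???", None) when nothing matches."""
    match = TOOLKIT_RE.search(contents)
    if match is None:
        return "???", None
    version = match.group(1)
    # Same character class as the CMake REGEX REPLACE: drop a-z, '|' and '.'.
    code = re.sub(r"[a-z|.]", "", version)
    return version, code


def parse_driver_version(contents):
    """Return the driver version, or "???" when nothing matches."""
    match = DRIVER_RE.search(contents)
    return match.group(1) if match else "???"


if __name__ == "__main__":
    # Hypothetical file contents, for illustration only.
    print(parse_toolkit_version("version=5.0.3.alpha003\n"))
    print(parse_driver_version("Version=21.0.2\n"))
```

Returning `None` for the code when nothing matches is a simplification of this sketch; it does not model how the CMake macro behaves for an unmatched file.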
82 changes: 82 additions & 0 deletions cmake/external/cinn.cmake
@@ -0,0 +1,82 @@
# Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

if (NOT WITH_CINN)
return()
endif()

# TODO(zhhsplendid): CINN has lots of warnings during early development.
# They will be treated as errors under paddle. We set no-error now and we will
# clean the code in the future.
add_definitions(-w)

######################################
# Build CINN from Git External Project
######################################
include(ExternalProject)
set(CINN_SOURCE_DIR ${THIRD_PARTY_PATH}/CINN)
# TODO(zhhsplendid): Modify git tag after we have release tag
set(CINN_GIT_TAG e422c01b7875301996a2baf67a14ba61b0e6192a)
set(CINN_OPTIONAL_ARGS -DPY_VERSION=${PY_VERSION} -DWITH_CUDA=${WITH_GPU} -DWITH_CUDNN=${WITH_GPU} -DPUBLISH_LIBS=ON -DWITH_TESTING=ON)
set(CINN_BUILD_COMMAND $(MAKE) cinnapi -j)
ExternalProject_Add(
external_cinn
${EXTERNAL_PROJECT_LOG_ARGS}
GIT_REPOSITORY "${GIT_URL}/PaddlePaddle/CINN.git"
GIT_TAG ${CINN_GIT_TAG}
PREFIX ${CINN_SOURCE_DIR}
BUILD_COMMAND ${CINN_BUILD_COMMAND}
INSTALL_COMMAND ""
CMAKE_ARGS ${CINN_OPTIONAL_ARGS})



ExternalProject_Get_property(external_cinn BINARY_DIR)
ExternalProject_Get_property(external_cinn SOURCE_DIR)
set(CINN_BINARY_DIR ${BINARY_DIR})
set(CINN_SOURCE_DIR ${SOURCE_DIR})

message(STATUS "CINN BINARY_DIR: ${CINN_BINARY_DIR}")
message(STATUS "CINN SOURCE_DIR: ${CINN_SOURCE_DIR}")


######################################
# Add CINN's dependencies header files
######################################

# Add absl
set(ABSL_INCLUDE_DIR "${CINN_BINARY_DIR}/dist/third_party/absl/include")
include_directories(${ABSL_INCLUDE_DIR})

# Add isl
set(ISL_INCLUDE_DIR "${CINN_BINARY_DIR}/dist/third_party/isl/include")
include_directories(${ISL_INCLUDE_DIR})

# Add LLVM
set(LLVM_INCLUDE_DIR "${CINN_BINARY_DIR}/dist/third_party/llvm/include")
include_directories(${LLVM_INCLUDE_DIR})

######################################################
# Put external_cinn and dependencies together as a lib
######################################################

set(CINN_LIB_NAME "libcinnapi.so")
set(CINN_LIB_LOCATION "${CINN_BINARY_DIR}/dist/cinn/lib")
set(CINN_INCLUDE_DIR "${CINN_BINARY_DIR}/dist/cinn/include")

add_library(cinn SHARED IMPORTED GLOBAL)
set_target_properties(cinn PROPERTIES IMPORTED_LOCATION "${CINN_LIB_LOCATION}/${CINN_LIB_NAME}")
include_directories(${CINN_INCLUDE_DIR})
add_dependencies(cinn external_cinn)

2 changes: 1 addition & 1 deletion cmake/external/dlpack.cmake
@@ -18,7 +18,7 @@ set(DLPACK_PREFIX_DIR ${THIRD_PARTY_PATH}/dlpack)
set(DLPACK_SOURCE_DIR ${THIRD_PARTY_PATH}/dlpack/src/extern_dlpack)

set(DLPACK_REPOSITORY ${GIT_URL}/dmlc/dlpack.git)
-set(DLPACK_TAG v0.2)
+set(DLPACK_TAG v0.4)

cache_third_party(extern_dlpack
REPOSITORY ${DLPACK_REPOSITORY}
2 changes: 1 addition & 1 deletion cmake/external/lite.cmake
@@ -134,7 +134,7 @@ if (NOT LITE_SOURCE_DIR OR NOT LITE_BINARY_DIR)
GIT_TAG ${LITE_GIT_TAG}
PREFIX ${LITE_SOURCES_DIR}
UPDATE_COMMAND ""
-PATCH_COMMAND sed -i "s?NNadapter_bridges_path = os.path.abspath('..')+\"\/lite\/kernels\/nnadapter\/bridges\/paddle_use_bridges.h\"?NNadapter_bridges_path = os.path.abspath(\'..\')+\"\/extern_lite\/lite\/kernels\/nnadapter\/bridges\/paddle_use_bridges.h\"?" ${LITE_SOURCES_DIR}/src/extern_lite//lite/tools/cmake_tools/record_supported_kernel_op.py && sed -i "/general::ssa::ConvertToSSA(cpp_prog)$<SEMICOLON>/d" ${LITE_SOURCES_DIR}/src/extern_lite/lite/model_parser/model_parser.cc
+PATCH_COMMAND sed -i "s?NNadapter_bridges_path = os.path.abspath('..')+\"\/lite\/kernels\/nnadapter\/bridges\/paddle_use_bridges.h\"?NNadapter_bridges_path = os.path.abspath(\'..\')+\"\/extern_lite\/lite\/kernels\/nnadapter\/bridges\/paddle_use_bridges.h\"?" ${LITE_SOURCES_DIR}/src/extern_lite//lite/tools/cmake_tools/record_supported_kernel_op.py
BUILD_COMMAND ${LITE_BUILD_COMMAND}
INSTALL_COMMAND ""
CMAKE_ARGS -DCMAKE_CXX_COMPILER=${CMAKE_CXX_COMPILER}
51 changes: 51 additions & 0 deletions cmake/external/utf8proc.cmake
@@ -0,0 +1,51 @@
# Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

INCLUDE(ExternalProject)

SET(UTF8PROC_PREFIX_DIR ${THIRD_PARTY_PATH}/utf8proc)
SET(UTF8PROC_INSTALL_DIR ${THIRD_PARTY_PATH}/install/utf8proc)
# As we add extra features for utf8proc, we use the non-official repo
SET(UTF8PROC_REPOSITORY ${GIT_URL}/JuliaStrings/utf8proc.git)
SET(UTF8PROC_TAG v2.6.1)

IF(WIN32)
SET(UTF8PROC_LIBRARIES "${UTF8PROC_INSTALL_DIR}/lib/utf8proc_static.lib")
add_definitions(-DUTF8PROC_STATIC)
ELSE(WIN32)
SET(UTF8PROC_LIBRARIES "${UTF8PROC_INSTALL_DIR}/lib/libutf8proc.a")
ENDIF(WIN32)

INCLUDE_DIRECTORIES(${UTF8PROC_INSTALL_DIR}/include)

ExternalProject_Add(
extern_utf8proc
${EXTERNAL_PROJECT_LOG_ARGS}
${SHALLOW_CLONE}
GIT_REPOSITORY ${UTF8PROC_REPOSITORY}
GIT_TAG ${UTF8PROC_TAG}
PREFIX ${UTF8PROC_PREFIX_DIR}
UPDATE_COMMAND ""
CMAKE_ARGS -DCMAKE_C_FLAGS=${CMAKE_C_FLAGS}
-DBUILD_SHARED=ON
-DBUILD_STATIC=ON
-DCMAKE_CXX_FLAGS=${CMAKE_CXX_FLAGS}
-DCMAKE_INSTALL_PREFIX:PATH=${UTF8PROC_INSTALL_DIR}
-DCMAKE_BUILD_TYPE:STRING=${CMAKE_BUILD_TYPE}
BUILD_BYPRODUCTS ${UTF8PROC_LIBRARIES}
)

ADD_LIBRARY(utf8proc STATIC IMPORTED GLOBAL)
SET_PROPERTY(TARGET utf8proc PROPERTY IMPORTED_LOCATION ${UTF8PROC_LIBRARIES})
ADD_DEPENDENCIES(utf8proc extern_utf8proc)
2 changes: 1 addition & 1 deletion cmake/external/xpu.cmake
@@ -35,7 +35,7 @@ ELSE ()
ENDIF()

SET(XPU_BASE_URL_WITHOUT_DATE "https://baidu-kunlun-product.cdn.bcebos.com/KL-SDK/klsdk-dev")
-SET(XPU_BASE_URL "${XPU_BASE_URL_WITHOUT_DATE}/20210921")
+SET(XPU_BASE_URL "${XPU_BASE_URL_WITHOUT_DATE}/20211020")
SET(XPU_XRE_URL "${XPU_BASE_URL}/${XPU_XRE_DIR_NAME}.tar.gz" CACHE STRING "" FORCE)
SET(XPU_XDNN_URL "${XPU_BASE_URL}/${XPU_XDNN_DIR_NAME}.tar.gz" CACHE STRING "" FORCE)
SET(XPU_XCCL_URL "${XPU_BASE_URL_WITHOUT_DATE}/20210623/${XPU_XCCL_DIR_NAME}.tar.gz" CACHE STRING "" FORCE)
14 changes: 13 additions & 1 deletion cmake/inference_lib.cmake
@@ -124,6 +124,11 @@ function(copy_part_of_thrid_party TARGET DST)
SRCS ${GLOG_INCLUDE_DIR} ${GLOG_LIBRARIES}
DSTS ${dst_dir} ${dst_dir}/lib)

+set(dst_dir "${DST}/third_party/install/utf8proc")
+copy(${TARGET}
+SRCS ${UTF8PROC_INSTALL_DIR}/include ${UTF8PROC_LIBRARIES}
+DSTS ${dst_dir} ${dst_dir}/lib)
+
if (WITH_CRYPTO)
set(dst_dir "${DST}/third_party/install/cryptopp")
copy(${TARGET}
@@ -353,7 +358,9 @@ function(version version_file)
"WITH_MKL: ${WITH_MKL}\n"
"WITH_MKLDNN: ${WITH_MKLDNN}\n"
"WITH_GPU: ${WITH_GPU}\n"
-"WITH_ROCM: ${WITH_ROCM}\n")
+"WITH_ROCM: ${WITH_ROCM}\n"
+"WITH_ASCEND_CL: ${WITH_ASCEND_CL}\n"
+"WITH_ASCEND_CXX11: ${WITH_ASCEND_CXX11}\n")
if(WITH_GPU)
file(APPEND ${version_file}
"CUDA version: ${CUDA_VERSION}\n"
@@ -364,6 +371,11 @@ function(version version_file)
"HIP version: ${HIP_VERSION}\n"
"MIOpen version: v${MIOPEN_MAJOR_VERSION}.${MIOPEN_MINOR_VERSION}\n")
endif()
+if(WITH_ASCEND_CL)
+file(APPEND ${version_file}
+"Ascend Toolkit version: ${ASCEND_TOOLKIT_VERSION}\n"
+"Ascend Driver version: ${ASCEND_DRIVER_VERSION}\n")
+endif()
file(APPEND ${version_file} "CXX compiler version: ${CMAKE_CXX_COMPILER_VERSION}\n")
if(TENSORRT_FOUND)
file(APPEND ${version_file}
2 changes: 0 additions & 2 deletions cmake/miopen.cmake
@@ -15,8 +15,6 @@ find_path(MIOPEN_INCLUDE_DIR "miopen/miopen.h"
NO_DEFAULT_PATH
)

-get_filename_component(__libpath_hist ${CUDA_CUDART_LIBRARY} PATH)
-
find_library(MIOPEN_LIBRARY NAMES "libMIOpen.so"
PATHS ${MIOPEN_ROOT} ${MIOPEN_ROOT}/lib ${MIOPEN_ROOT}/lib64 ${__libpath_hist}
$ENV{MIOPEN_ROOT} $ENV{MIOPEN_ROOT}/lib $ENV{MIOPEN_ROOT}/lib64
9 changes: 6 additions & 3 deletions cmake/operators.cmake
@@ -185,6 +185,8 @@ function(op_library TARGET)
list(REMOVE_ITEM hip_srcs "cholesky_op.cu")
list(REMOVE_ITEM hip_srcs "matrix_rank_op.cu")
list(REMOVE_ITEM hip_srcs "svd_op.cu")
+list(REMOVE_ITEM hip_srcs "eigvalsh_op.cu")
+list(REMOVE_ITEM hip_srcs "qr_op.cu")
list(REMOVE_ITEM hip_srcs "eigh_op.cu")
list(REMOVE_ITEM hip_srcs "multinomial_op.cu")
list(REMOVE_ITEM hip_srcs "decode_jpeg_op.cu")
@@ -214,9 +216,10 @@ function(op_library TARGET)
foreach(manual_pybind_op "compare_all_op" "compare_op" "logical_op" "bitwise_op" "nccl_op"
"tensor_array_read_write_op" "tensorrt_engine_op" "conv_fusion_op"
"fusion_transpose_flatten_concat_op" "fusion_conv_inception_op"
-"sync_batch_norm_op" "dgc_op" "fused_fc_elementwise_layernorm_op"
+"sync_batch_norm_op" "sparse_attention_op" "dgc_op" "fused_fc_elementwise_layernorm_op"
"skip_layernorm_op" "multihead_matmul_op" "fusion_group_op" "fused_bn_activation_op" "fused_embedding_eltwise_layernorm_op" "fusion_gru_op" "fusion_lstm_op"
-"fused_bn_add_activation_op")
+"fused_bn_add_activation_op" "fused_attention_op" "resnet_unit_op" "fused_feedforward_op")
+
if ("${TARGET}" STREQUAL "${manual_pybind_op}")
set(pybind_flag 1)
endif()
@@ -297,7 +300,7 @@ function(op_library TARGET)
file(APPEND ${pybind_file} "USE_OP_DEVICE_KERNEL(${TARGET}, CUDNN);\n")
endif()

-if (WITH_XPU AND ${xpu_cc_srcs_len} GREATER 0)
+if (WITH_XPU AND ${pybind_flag} EQUAL 0 AND ${xpu_cc_srcs_len} GREATER 0)
file(APPEND ${pybind_file} "USE_OP_DEVICE_KERNEL(${TARGET}, XPU);\n")
endif()

14 changes: 12 additions & 2 deletions cmake/third_party.cmake
@@ -210,6 +210,10 @@ include(external/threadpool)# download threadpool
include(external/dlpack) # download dlpack
include(external/xxhash) # download, build, install xxhash
include(external/warpctc) # download, build, install warpctc
+include(external/utf8proc) # download, build, install utf8proc
+
+list(APPEND third_party_deps extern_eigen3 extern_gflags extern_glog extern_boost extern_xxhash)
+list(APPEND third_party_deps extern_zlib extern_dlpack extern_warpctc extern_threadpool extern_utf8proc)
include(external/lapack) # download, build, install lapack

list(APPEND third_party_deps extern_eigen3 extern_gflags extern_glog extern_boost extern_xxhash)
@@ -251,8 +255,8 @@ if(WITH_GPU)
include(external/cub) # download cub
list(APPEND third_party_deps extern_cub)
endif()
-set(URL "https://paddlepaddledeps.bj.bcebos.com/externalErrorMsg.tar.gz" CACHE STRING "" FORCE)
-file_download_and_uncompress(${URL} "externalError" MD5 c0749523ebb536eb7382487d645d9cd4) # download file externalErrorMsg.tar.gz
+set(URL "https://paddlepaddledeps.bj.bcebos.com/externalErrorMsg_20210928.tar.gz" CACHE STRING "" FORCE)
+file_download_and_uncompress(${URL} "externalError" MD5 a712a49384e77ca216ad866712f7cafa) # download file externalErrorMsg.tar.gz
if(WITH_TESTING)
# copy externalErrorMsg.pb, just for unittest can get error message correctly.
set(SRC_DIR ${THIRD_PARTY_PATH}/externalError/data)
@@ -356,6 +360,12 @@ if (WITH_LITE)
include(external/lite)
endif (WITH_LITE)

+if (WITH_CINN)
+message(STATUS "Compile Paddle with CINN.")
+include(external/cinn)
+add_definitions(-DPADDLE_WITH_CINN)
+endif (WITH_CINN)
+
if (WITH_CRYPTO)
include(external/cryptopp) # download, build, install cryptopp
list(APPEND third_party_deps extern_cryptopp)
Binary file removed log