Avoid Rational in activation function gradients #399
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Could you fix the definition of `deriv_hardswish`? For example, here is my suggestion:

```julia
deriv_hardswish(x) = ifelse(x < -3, oftf(x, 0), ifelse(x > 3, oftf(x, 1), x / 3 + oftf(x, 1 / 2)))
```
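As a sanity check on the suggestion (assuming `oftf(x, y) = oftype(float(x), y)`, mirroring NNlib's internal helper of that name), the three branches give the expected hardswish gradient values and all stay in the input's float type:

```julia
# Assumption: oftf mirrors NNlib's internal helper of the same name.
oftf(x, y) = oftype(float(x), y)

deriv_hardswish(x) =
    ifelse(x < -3, oftf(x, 0), ifelse(x > 3, oftf(x, 1), x / 3 + oftf(x, 1 / 2)))

deriv_hardswish(-4f0)  # 0.0f0  (flat region)
deriv_hardswish(0f0)   # 0.5f0  (linear region: x/3 + 1/2)
deriv_hardswish(4f0)   # 1.0f0  (identity region)
```

Every branch is `Float32` for `Float32` input, so broadcasting this definition stays concretely typed with no `Rational` promotion.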
Maybe I got all of them this time...
```diff
 end
+@testset verbose=true "NNlib.jl" begin
-    if CUDA.functional()
+    if get(ENV, "NNLIB_TEST_CUDA", "false") == "true"
```
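The new condition reads an opt-in environment variable with a default, so the CUDA tests are skipped unless explicitly requested. A minimal sketch of the pattern:

```julia
# get(ENV, key, default) falls back to the default when the variable is
# unset, so the CUDA testset only runs on explicit opt-in, e.g. (assumed
# invocation, not from this diff):
#   NNLIB_TEST_CUDA=true julia --project -e 'using Pkg; Pkg.test()'
run_cuda_tests = get(ENV, "NNLIB_TEST_CUDA", "false") == "true"
```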
BTW all this mess is because I thought it was sometimes failing first on another test, and hence not showing the problem. So I added an overall testset. But it turns out this doesn't matter, because NNlibCUDA already has such a testset, and that's where both problems were.
Anyway, so it's unrelated, but perhaps a good idea. I also pulled the CUDA tests first. Since these aren't always run, it's nice to find out immediately whether they are going to be run at all.
But easy to revert if someone thinks it ought not to be in this PR.
GPU test failures: the Nightly failure is still #396.
This avoids using rational numbers in some activation function gradients, as that causes problems on GPU.
Closes #398, closes #400.
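A minimal sketch of the underlying issue, using hypothetical `bad_deriv`/`good_deriv` names (the original gradient definitions are not shown in this excerpt): when one `ifelse` branch is a `Rational` literal and the other a float, the return type is a `Union`, which is type-unstable and problematic for GPU broadcast kernels.

```julia
# Old style (assumed): the two branches have different types,
# Rational{Int} vs Float32, so the result type is a small Union.
bad_deriv(x)  = ifelse(x > 3, 1 // 1, x / 3 + 1 // 2)

# New style: oftf (assumption: mirrors NNlib's helper) keeps every
# branch in the input's float type, so the result is concretely typed.
oftf(x, y) = oftype(float(x), y)
good_deriv(x) = ifelse(x > 3, oftf(x, 1), x / 3 + oftf(x, 1 / 2))

typeof(bad_deriv(5f0))   # Rational{Int64} on 64-bit systems
typeof(good_deriv(5f0))  # Float32
```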