
activation intrinsics for neural networks #860

Open. jalvesz wants to merge 48 commits into base: master
Conversation

@jalvesz (Contributor) commented Aug 13, 2024

@jalvesz jalvesz mentioned this pull request Aug 17, 2024
@jalvesz (Contributor, Author) commented Aug 23, 2024

Before moving forward, any opinions on:

  • putting these functions within the "specialfunctions" category?
  • naming the derivative/gradient version <name>_grad? (see the sketch below)
  • any other remarks?
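
For concreteness, the <name>/<name>_grad pairing could look like the following (a hypothetical naming sketch, not this PR's code; the gaussian activation is only used as an example):

! Hypothetical naming sketch: each activation paired with a <name>_grad derivative.
module example_activations
    use stdlib_kinds, only: sp
    implicit none
contains
    !> gaussian activation g(x) = exp(-x**2)
    elemental function gaussian(x) result(y)
        real(sp), intent(in) :: x
        real(sp) :: y
        y = exp(-x*x)
    end function gaussian

    !> its derivative g'(x) = -2*x*exp(-x**2)
    elemental function gaussian_grad(x) result(y)
        real(sp), intent(in) :: x
        real(sp) :: y
        y = -2._sp*x*exp(-x*x)
    end function gaussian_grad
end module example_activations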

@jalvesz jalvesz reopened this Dec 23, 2024
@jalvesz jalvesz added the "topic: mathematics" label (linear algebra, sparse matrices, special functions, FFT, random numbers, statistics, ...) Dec 23, 2024
@jalvesz jalvesz marked this pull request as ready for review December 23, 2024 19:44
@perazz (Member) left a comment
Thank you for this implementation @jalvesz; I have left my comments. I am no machine learning expert, so I focused on the code rather than on the actual math of each function.

interface tanh_grad
!! Version: experimental
!!
!! Gradient of the hyperbolic tangent function
@perazz (Member) commented:
Please add a specs description of this function.

!!
!! Fast approximation of the tanh function
!! Source: https://fortran-lang.discourse.group/t/fastgpt-faster-than-pytorch-in-300-lines-of-fortran/5385/31
#:for rk, rt in REAL_KINDS_TYPES
@perazz (Member) commented:
Please add a specs description for ftanh and ferf in the docs.

@perazz (Member) commented:
Also: I believe a discussion is needed for fast, approximate versions of intrinsic functions. Unlike gamma (which returns accuracy similar to the intrinsics), these are fast approximations. I'm fine with the f prefix (or fast_ would also be OK). I believe that should be standardized somewhere, for example in the library style guide?
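
For reference, a fast tanh in the spirit of the linked discussion could be a clamped low-order rational approximation; the sketch below uses a [3/2] Pade approximant and is only illustrative, so the form and coefficients in this PR may differ.

! Hypothetical sketch of a fast tanh, not the implementation in this PR.
module example_fast_tanh
    use stdlib_kinds, only: sp
    implicit none
contains
    elemental function ftanh_sketch(x) result(y)
        real(sp), intent(in) :: x
        real(sp) :: y, x2
        x2 = x*x
        ! [3/2] Pade approximant of tanh(x) around x = 0
        y = x*(15._sp + x2)/(15._sp + 6._sp*x2)
        ! clamp to the range of tanh so large |x| saturates at +-1
        y = max(-1._sp, min(1._sp, y))
    end function ftanh_sketch
end module example_fast_tanh

A form like this trades some accuracy away from the origin for cheap, branch-light evaluation, which is the usual motivation for an f/fast_ variant.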

integer, parameter :: n = 10
real(sp) :: x(n), y(n), y_ref(n)

y_ref = [-0.0455002784729 , -0.093188509345055, -0.148066952824593,&
@perazz (Member) commented:
Should the tests be run for all accuracies?
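
If so, one option is to generate the same check for every real kind with fypp, mirroring the implementation file. The snippet below is a minimal sketch under that assumption (ftanh is assumed elemental, the tolerance is illustrative, and the check/error pattern follows the test-drive style used elsewhere in stdlib's tests):

#! Hypothetical test body, not the PR's test: loop the check over all real kinds.
#:for rk, rt in REAL_KINDS_TYPES
block
    integer, parameter :: n = 10
    ${rt}$, parameter :: tol = 1e-3_${rk}$   ! bound set by the approximation error, not machine epsilon
    ${rt}$ :: x(n), y(n)
    integer :: i
    x = [(0.1_${rk}$*i, i = 1, n)]           ! sample points (illustrative)
    y = ftanh(x)                             ! function under test
    call check(error, all(abs(y - tanh(x)) < tol), "ftanh mismatch for kind ${rk}$")
    if (allocated(error)) return
end block
#:endfor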

@jalvesz (Contributor, Author) commented Apr 7, 2025

Thanks @perazz for your review! Regarding tanh and erf, I wonder if I should actually remove the reference to the intrinsic names and simply keep a reference to the fast approximation, as that is what makes sense for NNs. Also, should these functions stay here or be moved to the intrinsics module?

Regarding

I'm fine with the f prefix (or fast_ would also be OK). I believe that should be standardized somewhere, for example in the library style guide?

I agree with you and also wonder which one would be preferable; I don't have strong opinions on that.

Labels: idea (Proposition of an idea and opening an issue to discuss it), topic: mathematics (linear algebra, sparse matrices, special functions, FFT, random numbers, statistics, ...)