Made blur faster by using Gaussian decomposition property. #2315
Conversation
…rflow#1767) * Use the mark for the gpu tests. * Skip if gpu and tf.float16.
* added test for normalize_data_format in keras_utils * change after review
* add sample_weight support to FScore metrics, generate test data with https://colab.research.google.com/drive/1ymB5iOj9YCeBQ-g-8eMGpQhiHr7QUBM4
* Remove `sequential_update` from AverageWrapper: in TF2.0, sequential_update is redundant. This allows the removal of `tf.control_dependencies` from average_wrapper and its subclasses: moving_average and stochastic_weight_averaging.
* Revert "Remove `sequential_update` from AverageWrapper": this reverts commit 7cf4201.
* Remove `tf.control_dependencies` from AverageWrapper: add deprecation warning for `sequential_update`.
* Set type of sequential_update to Optional[bool]: `sequential_update` is no longer part of the optimizer's configuration. Loading an older configuration throws the DeprecationWarning.
* black format
* README fixes and consistency improvements
* Added echo state network (ESN) recurrent cell
* enable half and double for resampler * register GPU kernels * specialize half and double kernels
* remove internal modules * clean up * refactor * remove six * remove losses * refactor get_config
…ll (tensorflow#1861) Some RNN cells such as GRUCell and SimpleRNNCell do not return the same state structure in get_initial_state and in the call method.
* Incorporate low-rank techniques into DCN. It supports a low-rank kernel W = U·V, where U ∈ R^{last_dim × projection_dim} and V ∈ R^{projection_dim × last_dim}. Introduces a flag diag_scale that increases the diagonal of kernel W by diag_scale.
tensorflow#1872) * Fix AttentionWrapper type annotation for multiple attention mechanisms * Revise attention_layer_size type
* Make cutout compatible with keras layer
* Support unknown rank image
* Fix sparse_image_warp unknown batch size * More tests
We found a Contributor License Agreement for you (the sender of this pull request), but were unable to find agreements for all the commit author(s) or Co-authors. If you authored these, maybe you used a different email address in the git commits than was used to sign the CLA (login here to double check)? If these were authored by someone else, then they will need to sign a CLA as well, and confirm that they're okay with these being contributed to Google. ℹ️ Googlers: Go here for more info.
You are owners of some files modified in this pull request.
@googlebot I fixed it.
Check #1450 and, more generally, tensorflow/tensorflow#39050.
I guess running a single 2D convolution instead of two 1D convolutions takes less time on GPU due to parallel computation, so earlier I left it as a 2D convolution only.
"I guess running 2D in Convolution in-stead of 2 1D convolutions takes lesser time in GPU due to parallel computations. So, earlier I left it as 2D convolutions only." The main problem is that the 2D convolution is quadratic in the kernel size. Earlier, I used a Gaussian blur with a sigma of 3, for which a proper kernel is 17x17 (around 3 sigma). The difference is doing 1 convolution with 298 values or two convolutions with 17 values. Parallelism will not help much here. |
Please check both reference tickets that I've mentioned in the previous comment.
Did I close this? I was having trouble with git... Anyway, according to some local testing on GPU, for small kernels (< 3) the separable version is about 20% slower than the 2D version. Note that blurring generally requires somewhat larger kernels (you want a kernel of 7 for a sigma of 1). For larger kernels (> 15), the separable version is about twice as fast. On CPU, the differences are negligible for small kernels, while for a kernel of size 10 the separable version is almost 5x faster. On TPUs I don't know what will happen. So let me know whether this is worthwhile, or whether you would prefer to wait until the depthwise convolution gets updated to support separable kernels.
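For anyone who wants to reproduce those timings, a rough micro-benchmark along these lines should work (a sketch under assumed shapes and a flat 17-tap kernel; this is not the script used for the numbers above, and results will vary by device):

import time
import numpy as np
import tensorflow as tf

img = tf.random.normal([8, 256, 256, 1])
k = 17
g = np.ones(k, np.float32) / k  # placeholder taps; a Gaussian costs the same

full = tf.constant(np.outer(g, g).reshape(k, k, 1, 1))  # k*k weights
row = tf.constant(g.reshape(1, k, 1, 1))                # k weights
col = tf.constant(g.reshape(k, 1, 1, 1))                # k weights

@tf.function
def conv_full(x):
    return tf.nn.conv2d(x, full, strides=1, padding="SAME")

@tf.function
def conv_sep(x):
    y = tf.nn.conv2d(x, row, strides=1, padding="SAME")
    return tf.nn.conv2d(y, col, strides=1, padding="SAME")

for name, fn in (("full_2d", conv_full), ("separable", conv_sep)):
    fn(img)  # warm-up / trace once
    start = time.perf_counter()
    for _ in range(50):
        out = fn(img)
    _ = out.numpy()  # force execution before stopping the clock
    print(name, time.perf_counter() - start)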
Description
Image blurring is now linear in the kernel size instead of quadratic, by exploiting the separability of the Gaussian kernel: the 2D Gaussian convolution decomposes into two 1D convolutions.
For large kernels this results in especially large speed-ups.
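Concretely, the property used here is the standard separability of the Gaussian: the 2D kernel factors as G_sigma(x, y) = g_sigma(x) * g_sigma(y), so convolving an image with G_sigma is equivalent to convolving its rows with g_sigma and then its columns with g_sigma. For a kernel of width k, this costs 2k multiply-adds per pixel instead of k^2 (e.g. 34 instead of 289 for k = 17).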
Type of change
More efficient implementation of gaussian_filter2d (a minimal sketch of the separable approach is shown below).
(Note that mean_filter2d could also be implemented with two 1D filters.)
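A minimal sketch of the separable approach, assuming an NHWC float input. This is illustrative only: the function name, default parameters, and zero ("SAME") padding here are assumptions rather than the actual gaussian_filter2d code, which also handles other padding modes and dtypes.

import numpy as np
import tensorflow as tf

def separable_gaussian_blur(image, sigma=3.0, radius=8):
    # image: float tensor of shape [batch, height, width, channels].
    x = np.arange(-radius, radius + 1, dtype=np.float32)
    g = np.exp(-0.5 * (x / sigma) ** 2)
    g /= g.sum()  # normalize so the taps sum to 1
    channels = image.shape[-1]
    # Depthwise filters have shape [height, width, in_channels, channel_multiplier];
    # tile the same 1D taps across every channel.
    g_row = tf.constant(np.tile(g.reshape(1, -1, 1, 1), [1, 1, channels, 1]))
    g_col = tf.constant(np.tile(g.reshape(-1, 1, 1, 1), [1, 1, channels, 1]))
    blurred = tf.nn.depthwise_conv2d(image, g_row, strides=[1, 1, 1, 1], padding="SAME")
    return tf.nn.depthwise_conv2d(blurred, g_col, strides=[1, 1, 1, 1], padding="SAME")

out = separable_gaussian_blur(tf.random.normal([1, 64, 64, 3]))
print(out.shape)  # (1, 64, 64, 3)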
How Has This Been Tested?
Using standard testing; the original (unmodified) tests still pass:
python3 -m pytest tensorflow_addons/image/tests/filters_test.py
======================================= test session starts ========================================
platform linux -- Python 3.8.6, pytest-5.4.3, py-1.10.0, pluggy-0.13.1
rootdir: ~/home/jrru/git_repositories/addons_mine, inifile: pytest.ini
plugins: extra-durations-0.1.3, xdist-1.34.0, forked-1.3.0, typeguard-2.10.0
collected 116 items
tensorflow_addons/image/tests/filters_test.py .............................................. [ 39%]
...................................................................... [100%]
==================================== sum of all tests durations ====================================
11.08s
======================================= 116 passed in 11.19s =======================================