-
Notifications
You must be signed in to change notification settings - Fork 10
Distributed standalone driver #1102
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
jcanton
wants to merge
556
commits into
main
Choose a base branch
from
distributed_standalone_driver
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+646
−199
Draft
Changes from all commits
Commits
Show all changes
556 commits
Select commit
Hold shift + click to select a range
c8bfbc6
Fix import for ToZeroBasedIndexTransformation
msimberg 86da004
Remove unused import
msimberg 814814f
Rename single/multi-node to single/multi-rank
msimberg f7dae6e
Add missing word
msimberg 7abfc3d
Remove todos
msimberg 3973bfb
Remove todo
msimberg 85b091d
Remove unused logger
msimberg 8e85390
Small refactor in halo.py
msimberg 9e3f0fc
Revert horizontal_size/horizontal_config rename
msimberg 2bd04a7
Fix typo
msimberg 2bf0d2a
Small renaming related to index offset/transformation
msimberg e72803b
Small simplification
msimberg aa54fea
Replace prints with logging in distributed tests
msimberg 1c1247a
Update model/common/tests/common/grid/mpi_tests/test_parallel_grid_ma…
msimberg 5248993
Update model/common/tests/common/decomposition/unit_tests/test_halo.py
msimberg c6ef577
Update model/common/tests/common/grid/mpi_tests/test_parallel_grid_ma…
msimberg cea742c
Fix typo
msimberg e7428e6
Fix typo
msimberg 9d7e88a
Remove model/common/tests/common/grid/fixtures.py
msimberg fde16aa
Fix distributed metrics test
msimberg 0f06c6b
Revert horizontal_size/config rename
msimberg fcf7821
Fix some inconsistent renames
msimberg 6f8fcd4
Update model/common/src/icon4py/model/common/decomposition/halo.py
msimberg 66abfe1
Update model/common/src/icon4py/model/common/decomposition/halo.py
msimberg fec9a29
Clean up assignment
msimberg de988bc
Prefix private member functions with _
msimberg eae8bde
Remove MPI import for type checking
msimberg 1288824
Minor cleanup
msimberg 1fcee72
Use field_type_aliases in compute_cell_2_vertex_interpolation.py
msimberg 0c51fa2
Merge remote-tracking branch 'origin/main' into distributed-tests-dac…
msimberg 3d9f018
Simplify skipping mpi tests
msimberg d41ea85
Remove is not None
msimberg cccbe97
Simplify return
msimberg 2b6be6d
Revert "Remove is not None"
msimberg b5115c2
Minor consistency renaming
msimberg 1370754
Small cleanup in grid refinement
msimberg 822549e
Add distributed property to GridConfig
msimberg d01fd13
CI for standalone_driver
nfarabullini f3c69ff
Merge branch 'main' into ci_for_standalone_driver
nfarabullini 4bf1e15
edits to yamla nd toml files
nfarabullini de83aeb
further edits and fixes
nfarabullini cf19a0f
Small reordering of fields
msimberg 54ba6b5
Add more fields to be tested in test_parallel_grid_manager.py
msimberg bac1faf
Change computational domain for some geometry fields
msimberg 789a7ff
Sort geometry fields
msimberg b40ff92
Add more interpolation fields to parallel grid manager test
msimberg 67e054b
Test more metrics fields
msimberg f864ec6
Remove unused fixture
msimberg 0fbfb62
Fix figure reference
msimberg 2bc143e
Merge remote-tracking branch 'origin/main' into halo_construction
msimberg 4fb0936
Remove deleted field from test
msimberg 203fcf9
inital conditions edits and tests
nfarabullini ca8d67c
Merge branch 'main' into ci_for_standalone_driver
nfarabullini dfdcfcb
further edits to remove serialized constants
nfarabullini bdde616
Merge branch 'ci_for_standalone_driver' of https://github.com/C2SM/ic…
nfarabullini 7234c6b
Merge branch 'main' into ci_for_standalone_driver
nfarabullini 5e5a6ef
Merge branch 'main' into ci_for_standalone_driver
nfarabullini 7290b81
Update model/common/src/icon4py/model/common/decomposition/definition…
msimberg 135172d
Update model/common/src/icon4py/model/common/decomposition/definition…
msimberg 171d23f
Update model/common/src/icon4py/model/common/decomposition/halo.py
msimberg 7fa119a
Update model/common/src/icon4py/model/common/decomposition/definition…
msimberg 5b6ddf8
Merge remote-tracking branch 'origin/main' into distributed-tests-dac…
msimberg 18ebc10
Merge remote-tracking branch 'origin/main' into halo_construction
msimberg f47e5f3
Work on failing test_parallel_grid_geometry.py fields
msimberg 20e4a8b
Comment one more field out for now
msimberg 46014f6
further edits
nfarabullini 1061e37
Merge branch 'main' into ci_for_standalone_driver
nfarabullini 02a9650
Add assertions for halo construction consistency
msimberg bb9ebf1
lowered tolerance in standalone driver test and ran pre-commit
nfarabullini 5b8a448
Merge branch 'main' into ci_for_standalone_driver
nfarabullini cd8ef1c
update from upstream, edits to types, and ran pre-commit
nfarabullini 63a2520
Change implementation of array_ns_from_array
msimberg 3eea114
some edits
nfarabullini c1217d5
small edit
nfarabullini 4bf3733
Merge branch 'main' into ci_for_standalone_driver
nfarabullini 23d7c83
Fix inverse dual edge length and c_lin_e fields
msimberg cc30a62
small fix
nfarabullini 6b1bb5f
small edit to fixture
nfarabullini 930e167
commented out parametrization
nfarabullini e86051a
Merge branch 'main' into ci_for_standalone_driver
nfarabullini 22338bf
removed w_1
nfarabullini fc982cb
small test edits
nfarabullini 5424978
Merge branch 'ci_for_standalone_driver' of https://github.com/C2SM/ic…
nfarabullini 3fc3dbf
edited one tolerance
nfarabullini 55836c3
further fixes for w_1
nfarabullini 69c8ae6
Merge branch 'main' into ci_for_standalone_driver
nfarabullini cfabcd3
Update diffusion.py
nfarabullini 6cbcbad
merge with upstream
nfarabullini 5724bc6
edits to icon4py test
nfarabullini 961f4cd
small type hint edit
nfarabullini 3db70c3
further edits
nfarabullini 68e85a4
standalone driver parametrization edits
nfarabullini f0e14d2
Merge branch 'main' into ci_for_standalone_driver
nfarabullini fa07307
adjusted standalone tolerances
nfarabullini f415370
Merge branch 'main' into distributed-tests-dace-gpu
nfarabullini ad485d0
"Fix" more parallel tests
msimberg d2335c1
Merge branch 'main' into ci_for_standalone_driver
nfarabullini 09e99d2
ran pre-commit
nfarabullini 88ffb2f
Merge branch 'ci_for_standalone_driver' of https://github.com/C2SM/ic…
nfarabullini abcc29c
grid description fixture and tiny grid
jcanton 8c83811
Add tests to check consistency of owner masks and domain bounds
msimberg 20aed6e
change config of metric factory for JW
OngChia 895b5b4
Fix vertex/edge decomposition ownership masks
msimberg 88b83ba
Update compute bounds in factories now that ownership bug is fixed
msimberg 44e3708
Add tests to check that halo levels are sorted
msimberg 82466c1
Clean up test_halo.py and fix a few parameters
msimberg 38be5e5
Add missing is_sorted helper function
msimberg 7cac885
Remove todo
msimberg eda26ef
Clean up and enable more tests in test_parallel_grid_manager.py
msimberg a397e0a
Add specialized mask_prog_halo_c test
msimberg d0acad4
Merge remote-tracking branch 'origin/halo_construction' into halo_con…
msimberg d949e44
Merge remote-tracking branch 'origin/main' into halo_construction
msimberg d2d8faf
Update array_ns_from_array after merge
msimberg 95a676f
Update tested fields in test_parallel_grid_manager.py after merge
msimberg 47d0e63
Merge branch 'main' into distributed-tests-dace-gpu
nfarabullini ce21e8f
modified np strict references with broader array_ns
nfarabullini 214fbe9
Simplify test_halo.py
msimberg 0d33ee4
Remove empty pytest file
msimberg 1f632ab
Disable r01b01 grid for testing for now
msimberg a62e6e3
Fix formatting
msimberg c367428
Ignore commented out code
msimberg 1975443
Small cleanup refactoring in halo construction
msimberg 19abd25
Update decomposition info with_dimension call to set_dimension
msimberg 4bf0618
Merge branch 'main' into ci_for_standalone_driver
nfarabullini 0d10792
Explicitly convert np.bool_ to bool
msimberg cecea11
adjusted tolerances
nfarabullini 3013e78
Merge branch 'ci_for_standalone_driver' of https://github.com/C2SM/ic…
nfarabullini 74938f3
Merge branch 'main' into distributed-tests-dace-gpu
nfarabullini f6fd95f
Remove unnecessary test
msimberg 03fb537
replaced a few more np with array_ns
nfarabullini 878db70
Update interpolation_fields.py
nfarabullini c449030
ran pre-commit
nfarabullini 9460369
removed additional but unused return val
nfarabullini d96956a
edit for scipy func to flexible gpu/cpu func
nfarabullini c343334
Constrain xfail a bit more
msimberg c3606ae
Update interpolation_fields.py
nfarabullini 890166a
replaced gt4py field operator within regular function to regular pyth…
nfarabullini 81375ca
ran pre-commit
nfarabullini da60b19
Add domain sizes
msimberg 5fb0b0e
small fix to bounds
nfarabullini 6362e62
small fix to tuple
nfarabullini 726fd50
fix to index
nfarabullini 356f889
introduced array_ns instead of strict numpy computation in init_w
nfarabullini fd0c7f0
Remove rbf interpolation todo
msimberg b8fa94e
Merge remote-tracking branch 'origin/main' into halo_construction
msimberg b47aa03
edit from asnumpy() to ndarray
nfarabullini 1869d03
additional edit from asnumpy() to ndarray
nfarabullini 80f1fa9
Fix second halo level cell computation
msimberg e9fc3fd
Merge remote-tracking branch 'origin/main' into halo_construction
msimberg 7b3e18a
potential fix
nfarabullini 5305e2f
Merge branch 'main' into ci_for_standalone_driver
nfarabullini 182a843
Increase distributed CI pipeline timelimits
msimberg 79d65f2
Allow None for halo_levels
msimberg 5a6cc8e
Add more tests for test_compute_domain_bounds_for_global_grid
msimberg 8efe35c
Enable grid data download in distributed pipeline
msimberg be614a0
potential fix
nfarabullini 5fc92d8
Merge branch 'ci_for_standalone_driver' of https://github.com/C2SM/ic…
nfarabullini b6314b6
more edits
nfarabullini 3b95f9b
more edits
nfarabullini b1386d2
removed serialized exit point from internal code
nfarabullini 50ebb90
small edit to equation to see if this works with gpu backend
nfarabullini 6642aa2
new for loop implementaed instead of array_ns.sum
nfarabullini 26fd7db
edits to array_ns and ran pre-commit
nfarabullini 53a2585
edits for offset declaration potential issue
nfarabullini d6d9f30
small edit to change args position
nfarabullini 56f9cef
Add xfails for wrong halo, interior, local indices with torus grid
msimberg c6cf66b
Check INTERIOR indices with icon4py domain decomposition
msimberg 301c041
Update ci/distributed.yml
msimberg d77f5d9
Less verbose xfail
msimberg 37b196b
Download r01b01 grid in ci
msimberg 39703b3
Enable download again
msimberg c958a6e
small fix to index
nfarabullini 6df6af6
Disable download again
msimberg 08cc652
small edits to offset provider spec
nfarabullini 115aec8
Explicitly set ICON4PY_TEST_DATA_PATH in github workflows
msimberg f204644
More explicit path
msimberg 0279054
added cpu only marker
nfarabullini dc339dc
Enable grid download for grid download step
msimberg fd6034a
ran pre-commit
nfarabullini de2f7a5
edited ci time
nfarabullini 7a8118a
removed backend customization
nfarabullini 83de5f4
ran pre-commit
nfarabullini fc0a445
Remove todos
msimberg accedb7
Merge branch 'halo_construction' into distributed_standalone_driver
jcanton 125e26f
remove these two
jcanton 2e52f15
adjusted ci time and removed cp only constraint
nfarabullini 3c7e718
increased ci time
nfarabullini 6e900f9
Merge branch 'main' into ci_for_standalone_driver
nfarabullini 8b5a614
increased ci time
nfarabullini 2a71be6
Merge branch 'main' into distributed_standalone_driver
jcanton b36637f
Merge branch 'ci_for_standalone_driver' of https://github.com/C2SM/ic…
nfarabullini 7b097e2
Merge branch 'ci_for_standalone_driver' into distributed_standalone_d…
jcanton 2b9c823
Merge branch 'ci_for_standalone_driver' into distributed_standalone_d…
jcanton 9f9765f
Merge branch 'distributed-tests-dace-gpu' into distributed_standalone…
jcanton cae3aba
these changes don't belong here but why not
jcanton b3cab13
code refactoring
nfarabullini 71ccba5
further code refactoring
nfarabullini 0ed42b7
Merge remote-tracking branch 'origin/main' into distributed-tests-dac…
msimberg fbd9775
wip
jcanton bf47258
move check_local_global_fields to parallel_helpers
jcanton e154538
small fix
nfarabullini e9fd355
fix type warning
jcanton 77e0fc4
parallel standalone driver?
jcanton 2158b61
small fix
nfarabullini 9ab2b09
small fix
nfarabullini f17d6ab
ran pre-commit
nfarabullini 82ebba2
wip2
jcanton dacc75d
Merge branch 'ci_for_standalone_driver' into distributed_standalone_d…
jcanton 51eab0c
Merge branch 'distributed-tests-dace-gpu' into distributed_standalone…
jcanton a25e6f7
gt4py hack
jcanton 3ee4e96
comment init_w
jcanton acd2217
comment tracer adv
jcanton 79e6a41
update test
jcanton b564899
more verbose testing
jcanton 9e2dbd7
no default exchange
jcanton 3a5f51d
no single rank defaults
jcanton 437c6c7
exchange to IC
jcanton 1874a54
no need for dim prints
jcanton fd9f493
Add halo exchanges for jw initial condition
msimberg 888c1ae
Remove unnecessary functools.partial
msimberg 054b7ba
Add parallel jablonowski williamson initial condition test
msimberg f25e3e5
Slight cleanup
msimberg 720e48f
add SingleNodeExchange warnings
jcanton 87a23c5
remove unnecessary array_ns, we have the backend and xp from that
jcanton 86e600b
fix typer
jcanton a38072b
update warning message
jcanton 4f24d9b
forgot to remove from here
jcanton d1e5076
cleanup and wip on distributed serdata test
jcanton bab3d58
Add back assertions to local/global field checks
msimberg 6a65619
add wip multi-rank ser_data test
jcanton 552c118
wip2
jcanton f295368
I need this back
jcanton b4b20cb
add UV to IC test
jcanton 45eb7f2
Fix halo exchanges for diagnostic state
msimberg a57d8f4
pick from #1012
jcanton 5c15dd6
Merge branch 'main' into distributed_standalone_driver
jcanton db1dc99
restore some from main
jcanton 758b96b
Try to restore exchanges to distributed standalone driver
msimberg 5b126f7
Add missing exchange
msimberg 3e1bb55
Comment out part of initial conditions for now
msimberg 76a426e
Allow passing atol to check_local_global_fields
msimberg 5107aa4
Use ghex release
msimberg e440a53
cupy/numpy compat
msimberg cfe8147
put back SingleNodeExchange defaults
jcanton 88e97da
back to main for diffusion.py
jcanton 81e3a1f
a couple more cleanups
jcanton f62de4a
put this back too
jcanton a4921e9
Merge branch 'main' into distributed_standalone_driver
jcanton 29ea0b3
update uv.lock
jcanton 9fc348b
small edit to fix init_w issue and ran pre-commit
nfarabullini File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
TODO: remove before merging? or disable somehow? talk with @msimberg