@@ -7,9 +7,32 @@ This file is used to list changes made in each version of the AWS ParallelCluste
-----

**ENHANCEMENTS**
-
+ - Add support for EFA on Centos 7, Amazon Linux and Ubuntu 1604
- Add support for Ubuntu in China region `cn-northwest-1`

+ **CHANGES**
+ - SGE: changed following parameters in global configuration
+   - `max_unheard 00:03:00`: allows a faster reaction in case of faulty nodes
+   - `reschedule_unknown 00:00:30`: enables rescheduling of jobs running on failing nodes
+   - `qmaster_params ENABLE_FORCED_QDEL_IF_UNKNOWN`: forces job deletion on unresponsive nodes
+   - `qmaster_params ENABLE_RESCHEDULE_KILL`: forces rescheduling or killing of jobs running on failing nodes
+ - Slurm: decrease SlurmdTimeout to 120 seconds to speed up replacement of faulty nodes
+ - Always use full master FQDN when mounting NFS on compute nodes. This solves some issues occurring with some networking
+   setups and custom DNS configurations
+ - Set soft and hard ulimit on open files to 10000 for all supported OSs
+ - Pin python `supervisor` version to 3.4.0
+ - Remove unused `compute_instance_type` from jobwatcher.cfg
+ - Removed unused `max_queue_size` from sqswatcher.cfg
+ - Remove double quoting of the post_install args
+
+ **BUG FIXES**
+ - Fix issue that was preventing Torque from being used on Centos 7
+ - Start node daemons at the end of instance initialization. The time spent for post-install script and node
+   initialization is not counted as part of node idletime anymore.
+ - Fix issue which was causing an additional and invalid EBS mount point to be added in case of multiple EBS
+ - Install Slurm libpmpi/libpmpi2 that is distributed in a separate package since Slurm 17
+
+
2.3.1
-----
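
One of the CHANGES entries in the hunk above sets the soft and hard ulimit on open files to 10000. A minimal sketch of how that could be checked from a process on a node, assuming Python 3 on a Unix-like OS. The names `EXPECTED_NOFILE`, `open_file_limits` and `limits_match` are hypothetical and introduced only for illustration; they are not part of ParallelCluster.

```python
import resource

# Value stated in the changelog entry; treated here as an assumption about the node.
EXPECTED_NOFILE = 10000


def open_file_limits():
    """Return the (soft, hard) limit on open file descriptors for this process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def limits_match(limits, expected=EXPECTED_NOFILE):
    """Return True when both the soft and the hard limit equal the expected value."""
    soft, hard = limits
    return soft == expected and hard == expected


if __name__ == "__main__":
    soft, hard = open_file_limits()
    print(f"soft={soft} hard={hard} matches={limits_match((soft, hard))}")
```

The check reads the limits of the calling process only, so it reflects how that process was started (login shell, scheduler daemon, and so on) and may differ from what another service on the same node sees.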