@@ -3,6 +3,31 @@ aws-parallelcluster-node CHANGELOG
This file is used to list changes made in each version of the aws-parallelcluster-node package.
+ 2.4.1
+ -----
+
+ **ENHANCEMENTS**
+ - Torque:
+   - process nodes added to or removed from the cluster in batches in order to speed up cluster scaling
+   - scale up only if required slots/nodes can be satisfied
+   - scale down if pending jobs have unsatisfiable CPU/nodes requirements
+   - add support for jobs in hold/suspended state (this includes job dependencies)
+   - automatically terminate and replace faulty or unresponsive compute nodes
+   - add retries in case of failures when adding or removing nodes
+   - add support for ncpus reservation and multi-node resource allocation (e.g. -l nodes=2:ppn=3+3:ppn=6)
+
+ **CHANGES**
+ - Drop support for Python 2. Node daemons now support Python >= 3.5.
+ - Torque: trigger a scheduling cycle every minute when there are pending jobs in the queue. This is done in order
+   to speed up job scheduling with a dynamic cluster size.
+
+ **BUG FIXES**
+ - Restore logic that automatically added the identity of compute nodes to the known_hosts file.
+ - Slurm: fix issue that was causing the daemons to fail when the cluster is stopped and an empty compute nodes file
+   is imported in the Slurm config.
+ - Torque: fix command to disable hosts in the scheduler before termination.
+
+
2.4.0
-----