Skip to content

Commit 65df975

Browse files
committed
YARN-1265. Fair Scheduler chokes on unhealthy node reconnect (Sandy Ryza)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/trunk@1531146 13f79535-47bb-0310-9956-ffa450edef68
1 parent f22ee33 commit 65df975

File tree

2 files changed

+6
-0
lines changed
  • hadoop-yarn-project
    • hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair

2 files changed

+6
-0
lines changed

hadoop-yarn-project/CHANGES.txt

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -98,6 +98,8 @@ Release 2.2.1 - UNRELEASED
9898
YARN-879. Fixed tests w.r.t o.a.h.y.server.resourcemanager.Application.
9999
(Junping Du via devaraj)
100100

101+
YARN-1265. Fair Scheduler chokes on unhealthy node reconnect (Sandy Ryza)
102+
101103
Release 2.2.0 - 2013-10-13
102104

103105
INCOMPATIBLE CHANGES

hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair/FairScheduler.java

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -788,6 +788,10 @@ private synchronized void addNode(RMNode node) {
788788

789789
private synchronized void removeNode(RMNode rmNode) {
790790
FSSchedulerNode node = nodes.get(rmNode.getNodeID());
791+
// This can occur when an UNHEALTHY node reconnects
792+
if (node == null) {
793+
return;
794+
}
791795
Resources.subtractFrom(clusterCapacity, rmNode.getTotalCapability());
792796
updateRootQueueMetrics();
793797

0 commit comments

Comments
 (0)