Commit 93d08b8

change cross entropy to entropy, fix typo
The term added to the loss is entropy, not cross-entropy. Also fixed a typo.
1 parent 3eb470c commit 93d08b8

1 file changed: 2 additions, 2 deletions

PolicyGradient/a3c/estimators.py

Lines changed: 2 additions & 2 deletions
@@ -4,7 +4,7 @@
 def build_shared_network(X, add_summaries=False):
   """
   Builds a 3-layer network conv -> conv -> fc as described
-  in the A3C paper. This network is shared by bother the policy and value net.
+  in the A3C paper. This network is shared by both the policy and value net.

   Args:
     X: Inputs
@@ -75,7 +75,7 @@ def __init__(self, num_outputs, reuse=False, trainable=True):
       "probs": self.probs
     }

-    # We add cross-entropy to the loss to encourage exploration
+    # We add entropy to the loss to encourage exploration
     self.cross_entropy = -tf.reduce_sum(self.probs * tf.log(self.probs), 1, name="cross_entropy")
     self.cross_entropy_mean = tf.reduce_mean(self.cross_entropy, name="cross_entropy_mean")
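
The corrected comment matches the expression it describes: `-sum(probs * log(probs))` is the entropy of the policy's own action distribution, whereas cross-entropy compares one distribution against a second one. The sketch below illustrates that distinction in plain Python. It is not part of the commit; the function names `entropy` and `cross_entropy` and the example distributions are hypothetical, chosen only for illustration.

```python
import math


def entropy(probs):
    """Entropy of one distribution: -sum(p * log p).

    Zero-probability entries are skipped, since p * log p tends to 0 there.
    """
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def cross_entropy(target, probs):
    """Cross-entropy of `probs` relative to `target`: -sum(t * log q)."""
    return -sum(t * math.log(q) for t, q in zip(target, probs) if t > 0.0)


# Hypothetical action distributions over four actions.
uniform = [0.25, 0.25, 0.25, 0.25]   # maximally exploratory policy
peaked = [0.97, 0.01, 0.01, 0.01]    # nearly deterministic policy
one_hot = [1.0, 0.0, 0.0, 0.0]       # a separate target distribution

print("entropy(uniform):", entropy(uniform))   # equals log(4)
print("entropy(peaked): ", entropy(peaked))    # smaller than entropy(uniform)
print("cross_entropy(one_hot, peaked):", cross_entropy(one_hot, peaked))
```

Entropy is largest for the uniform distribution and shrinks as the policy becomes more deterministic, which is why an entropy term in the loss can encourage exploration. Cross-entropy reduces to entropy only when both arguments are the same distribution.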
