1 parent 838990b commit cfeef6d
transformer_lm/main.py
@@ -189,7 +189,7 @@ def step(inputs, targets):
"--num_iters", type=int, default=100000, help="Iterations to train for."
)
parser.add_argument(
- "--learning_rate", type=float, default=3e-4, help="SGD learning rate."
+ "--learning_rate", type=float, default=3e-4, help="AdamW learning rate."
)
parser.add_argument(
"--weight_decay", type=float, default=1e-5, help="Set the weight decay"
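The change above only touches help text, but it matters because the two flags are read together. As a rough sketch of how they might be parsed and handed to an optimizer, assuming a standalone parser rather than the one in `transformer_lm/main.py`: `build_parser` and `optimizer_kwargs` are hypothetical names introduced here for illustration, and the actual optimizer construction is not shown in this diff.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical standalone parser holding only the two flags from the diff.
    parser = argparse.ArgumentParser(description="Sketch of the optimizer flags.")
    parser.add_argument(
        "--learning_rate", type=float, default=3e-4, help="AdamW learning rate."
    )
    parser.add_argument(
        "--weight_decay", type=float, default=1e-5, help="Set the weight decay"
    )
    return parser


def optimizer_kwargs(args: argparse.Namespace) -> dict:
    # Hypothetical helper: collects the values an AdamW-style optimizer
    # would typically take. The real call site is an assumption.
    return {
        "learning_rate": args.learning_rate,
        "weight_decay": args.weight_decay,
    }


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(optimizer_kwargs(args))
```

Running the sketch with `--help` shows the corrected "AdamW learning rate." description next to the weight decay flag, which is the user-visible effect of this commit.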