Main LSTM Weight Initialization

Hi,

I was looking at the weight initialization and it looks like you use xavier_uniform_ for the main LSTM input-to-hidden-weights 

https://github.com/jihunchoi/hyperlstm/blob/7f50825b95c58035cfee5d8d024339522328d35d/models/hyperlstm.py#L203

While in the paper they define that both the input-to-hidden-weights and the hidden-to-hidden-weights should use Orthogonal initialization. On page 20 Section A.2.3

> Orthogonal initialization is applied to the Wh and Wx

Although I am not sure if the tensorflow implementation follows the paper or not, could you elaborate on why you decided to use xavier uniform or was that just a copy of the tensorflow implementation of the model, or possibly an error in the code?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Main LSTM Weight Initialization #3

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Main LSTM Weight Initialization #3

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions