- Reinforcement Learning with TensorFlow
- Sayon Dutta
Long Short Term Memory Networks
RNNs practically fail to handle long-term dependencies. As the gap between the relevant input data point in the input sequence and the output data point in the output sequence increases, RNNs fail to connect the information between the two. This usually happens in text-based tasks such as machine translation and audio-to-text transcription, where the sequences are long.
Long Short Term Memory networks, also known as LSTMs (introduced by Hochreiter and Schmidhuber), are capable of handling these long-term dependencies.

The key feature of the LSTM is the cell state, $C_t$, which lets information flow along the sequence unchanged. We start with the forget gate layer, $f_t$, which takes the concatenation of the last hidden state, $h_{t-1}$, and the current input, $x_t$, and trains a neural network that outputs a number between 0 and 1 for each number in the last cell state, $C_{t-1}$, where 1 means keep the value and 0 means forget it. Thus, this layer identifies what information to forget from the past and what to retain:

$$f_t = \sigma\left(W_f \cdot [h_{t-1}, x_t] + b_f\right)$$

Next we come to the input gate layer, $i_t$, and the tanh layer, $\tilde{C}_t$, whose task is to identify what new information to add to the information received from the past, that is, to update the cell state. The tanh layer creates a vector of new candidate values, while the input gate layer identifies which of those values to use for the update:

$$i_t = \sigma\left(W_i \cdot [h_{t-1}, x_t] + b_i\right)$$

$$\tilde{C}_t = \tanh\left(W_C \cdot [h_{t-1}, x_t] + b_C\right)$$

We combine this new information, $i_t \ast \tilde{C}_t$, with the information retained by the forget gate layer, $f_t \ast C_{t-1}$, to update our information, that is, the cell state, $C_t$.
Thus, the new cell state is:

$$C_t = f_t \ast C_{t-1} + i_t \ast \tilde{C}_t$$
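To make the gating arithmetic concrete, here is a toy example with hand-picked gate values (purely illustrative; in a trained LSTM the gates output values strictly between 0 and 1):

```python
import numpy as np

f_t = np.array([1.0, 0.0])       # forget gate: keep the first value, drop the second
C_prev = np.array([2.0, 5.0])    # last cell state C_{t-1}
i_t = np.array([0.0, 1.0])       # input gate: write new information only to the second slot
C_tilde = np.array([0.5, -0.3])  # candidate values from the tanh layer

C_t = f_t * C_prev + i_t * C_tilde
print(C_t)  # [ 2.  -0.3]
```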
Finally, a neural network is trained at the output gate layer, $o_t$, which decides which values of the cell state, $C_t$, to output as the hidden state, $h_t$:

$$o_t = \sigma\left(W_o \cdot [h_{t-1}, x_t] + b_o\right)$$

$$h_t = o_t \ast \tanh(C_t)$$
Thus, an LSTM cell takes the last cell state, $C_{t-1}$, the last hidden state, $h_{t-1}$, and the current time step input, $x_t$, and outputs the updated cell state, $C_t$, and the current hidden state, $h_t$.
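As a minimal sketch of these equations (this is not the book's own implementation; the function name `lstm_step`, the parameter names `W_f`, `b_f`, and so on, and the weight shapes are illustrative assumptions), one forward step of an LSTM cell can be written directly in NumPy:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, C_prev, W_f, b_f, W_i, b_i, W_C, b_C, W_o, b_o):
    # Concatenate the last hidden state h_{t-1} and the current input x_t
    concat = np.concatenate([h_prev, x_t])

    # Forget gate: a value in (0, 1) per entry of the last cell state
    f_t = sigmoid(W_f @ concat + b_f)
    # Input gate: which candidate values to write into the cell state
    i_t = sigmoid(W_i @ concat + b_i)
    # tanh layer: vector of new candidate values
    C_tilde = np.tanh(W_C @ concat + b_C)

    # Updated cell state: retained old information plus selected new information
    C_t = f_t * C_prev + i_t * C_tilde

    # Output gate: which values of the cell state to expose as the hidden state
    o_t = sigmoid(W_o @ concat + b_o)
    h_t = o_t * np.tanh(C_t)
    return h_t, C_t

# Usage with random parameters, just to show the shapes involved
rng = np.random.default_rng(0)
input_dim, hidden_dim = 4, 3
weights = [rng.normal(size=(hidden_dim, hidden_dim + input_dim)) * 0.1 for _ in range(4)]
biases = [np.zeros(hidden_dim) for _ in range(4)]
params = [p for pair in zip(weights, biases) for p in pair]  # W_f, b_f, W_i, b_i, ...
x_t = rng.normal(size=input_dim)
h_t, C_t = lstm_step(x_t, np.zeros(hidden_dim), np.zeros(hidden_dim), *params)
print(h_t.shape, C_t.shape)  # (3,) (3,)
```

Each gate has its own weight matrix acting on the same concatenated input $[h_{t-1}, x_t]$, which is why the four matrices above share the shape (hidden_dim, hidden_dim + input_dim).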