Long Short-Term Memory

From Psyc 40 Wiki
Revision as of 23:39, 21 October 2022 by User

By Alphonso Bradham

```Note: This page is currently a work in progress```

Long Short-Term Memory (LSTM) is a type of recurrent neural network architecture used for classification and regression tasks on long sequences and time-series data. LSTMs were developed to counter the [vanishing gradient problem]. The key feature of an LSTM network is its "cell state" vector, which allows the network to keep track of long-range relationships in the data that other recurrent models would "forget".
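To make the role of the cell state concrete, here is a minimal sketch of a single LSTM time step in plain Python. It assumes the commonly used formulation with forget, input, and output gates; this page does not specify a particular variant. The function names (`lstm_step`, `affine`, `sigmoid`) and the parameter keys (`Wf`, `Wi`, `Wo`, `Wg`, and the matching biases) are hypothetical and chosen only for illustration.

```python
import math


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def affine(W, v, b):
    """Return W @ v + b for a list-of-lists matrix W and list vectors v, b."""
    return [sum(w * x for w, x in zip(row, v)) + b_k for row, b_k in zip(W, b)]


def lstm_step(x, h_prev, c_prev, params):
    """Run one LSTM time step and return the new (hidden state, cell state).

    x, h_prev, c_prev are lists of floats. params is a dict of illustrative
    (assumed) weight matrices and bias vectors, one pair per gate.
    """
    z = x + h_prev  # concatenate the current input and previous hidden state

    f = [sigmoid(a) for a in affine(params["Wf"], z, params["bf"])]    # forget gate
    i = [sigmoid(a) for a in affine(params["Wi"], z, params["bi"])]    # input gate
    o = [sigmoid(a) for a in affine(params["Wo"], z, params["bo"])]    # output gate
    g = [math.tanh(a) for a in affine(params["Wg"], z, params["bg"])]  # candidate values

    # Cell state update: keep a gated share of the old state, add gated new content.
    c = [f_k * c_k + i_k * g_k for f_k, c_k, i_k, g_k in zip(f, c_prev, i, g)]
    # Hidden state: a gated, squashed view of the cell state.
    h = [o_k * math.tanh(c_k) for o_k, c_k in zip(o, c)]
    return h, c
```

In this sketch, when the forget gate is close to 1 and the input gate is close to 0, the cell state is carried forward almost unchanged from one step to the next. That additive path is what lets information persist across many time steps.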

Motivation

LSTMs were developed to