Deep Learning: Recurrent Neural Networks in Python
LSTM, GRU, and more RNN machine learning architectures in Python and Theano
By: The LazyProgrammer (https://lazyprogrammer.me)
As a result, recurrent neural networks are more expressive and more powerful than anything we've seen before, on tasks where we hadn't made progress in decades. In the first section of the book we are going to add time to our neural networks. I'll introduce you to the Simple Recurrent Unit, also known as the Elman unit. The Simple Recurrent Unit will help you understand the basics of recurrent neural networks - the types of tasks they can be used for, how to construct the objective functions for these tasks, and backpropagation through time. We are going to revisit a classical neural network problem - the XOR problem - but we're going to extend it so that it becomes the parity problem. You'll see that regular feedforward neural networks have trouble solving this problem, but recurrent networks will work, because the key is to treat the input as a sequence. In the next section of the course, we are going to revisit one of the most popular applications of recurrent neural networks - language modeling, which plays a large role in natural language processing, or NLP.
Another popular application of neural networks for language is word vectors, or word embeddings. The most common technique for this is called Word2Vec, but I'll show you how recurrent neural networks can also be used for creating word vectors. In the section after, we'll look at the very popular LSTM, or long short-term memory unit, and the more modern and efficient GRU, or gated recurrent unit, which has been proven to yield comparable performance. We'll apply these to some more practical problems, such as learning a language model from Wikipedia data and visualizing the word embeddings we get as a result. One tip for getting through this book: understand the mechanics first, worry about the meaning later. When we talk about LSTMs, we're going to discuss their ability to remember and forget things - keep in mind these are just convenient names by way of analogy.
We're not actually building something that's remembering or forgetting - these are just mathematical formulas. So worry about the math, and let the meaning come naturally to you. What you especially don't want to do is the opposite - try to understand the meaning without understanding the mechanics. When you do that, the result is usually a sensationalist media article or a pop-science book - this book is the opposite of that. We want to understand on a technical level what's happening - explaining things in layman's terms or thinking of real-life analogies is icing on the cake, and only if you understand the technicalities. All the code for this class is hosted on GitHub, and you can get it from https://github.com/lazyprogrammer/machine_learning_examples in the folder rnn_class.
Git is a version control system that allows me to push updates and keep a history of all the changes. You should always do a git pull to make sure you have the latest version.
Why? Because the inputs will look slightly different from what we're used to. Recall that our input data X is usually represented with an N x D matrix. That's N samples and D features. But that's when we weren't working with sequences. Well, let's suppose we did have a sequence. How many dimensions would that require? Call the length of the sequence T.
If each observation is a D-dimensional vector, and we have T of them, then one sequence of observations is a T x D matrix. If we have N training samples, that gives an N x T x D matrix - a 3-dimensional object. Sometimes our sequences are not of equal length. This can happen when we're talking about sentences, which are obviously of arbitrary length, or perhaps sounds and music, or someone's credit history. How can we handle this? Instead of a 3-D matrix, we'll have a length-N list, where each element is a 2-D observation sequence of size T(n) x D. Since Python lists can contain any object as an element, this is fine.
In general, we can represent our sequence as: x(1), x(2), ..., x(t), ..., x(T-1), x(T)
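As a concrete sketch of these two representations (plain NumPy with illustrative names of my own, not code from the book):

import numpy as np

N, T, D = 100, 20, 5                      # samples, sequence length, features

# Equal-length sequences: one 3-dimensional array of shape N x T x D.
X_equal = np.random.randn(N, T, D)

# Variable-length sequences: a length-N Python list whose n-th element is a
# T(n) x D array, where T(n) can differ from sequence to sequence.
lengths = np.random.randint(5, 30, size=N)
X_variable = [np.random.randn(t_n, D) for t_n in lengths]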
So the input into h(t) is now not just x(t), but h(t-1) also. In math, we can represent h(t) as (assuming row vectors): h(t) = f(x(t) W_i + h(t-1) W_h + b_h). What is the size of W_h? Just like the other layers, we connect everything to everything. Let M = the number of dimensions of h(t). So if there are M hidden units, the first hidden unit connects back to all M hidden units, the second hidden unit connects back to all M hidden units, and so on. So in total there will be M x M hidden-to-hidden weights. Another way to think of this is that h(t) and h(t-1) must be the same size; therefore, W_h must be of size M x M for the equation to be valid.
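Here is a minimal sketch of that recurrence in plain NumPy (the function and variable names are mine, for illustration only):

import numpy as np

def rnn_forward(X, Wi, Wh, bh, h0):
    # X is one sequence of shape (T, D); returns all hidden states, shape (T, M).
    h_prev = h0
    hidden_states = []
    for x_t in X:
        # h(t) = f(x(t) W_i + h(t-1) W_h + b_h), with f = tanh here
        h_t = np.tanh(x_t.dot(Wi) + h_prev.dot(Wh) + bh)
        hidden_states.append(h_t)
        h_prev = h_t
    return np.array(hidden_states)

D, M = 5, 10
Wi = np.random.randn(D, M) / np.sqrt(D)   # input-to-hidden weights, D x M
Wh = np.random.randn(M, M) / np.sqrt(M)   # hidden-to-hidden weights, M x M
bh = np.zeros(M)
h0 = np.zeros(M)                          # initial hidden state h(0)

H = rnn_forward(np.random.randn(20, D), Wi, Wh, bh, h0)   # H has shape (20, M)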
Notice that the f function can be any of the usual hidden layer nonlinearities - usually sigmoid, tanh, or relu. It's a hyperparameter, just like with other types of neural networks. Question for the reader to think about: why does the RNN not make the Markov assumption? Due to the recursive formulation, h(t) needs an initial state, h(0) - that is, assuming each sequence starts at index 1. Sometimes researchers just set this to 0, and other times they treat it as a parameter that we can update with backpropagation. Since Theano automatically differentiates things for us, we'll treat it as an updatable parameter.
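For example, one way to express this recurrence in Theano, with h0 as a shared (and therefore updatable) variable, looks roughly like this - a sketch, not the book's exact code:

import numpy as np
import theano
import theano.tensor as T

D, M = 5, 10
X = T.matrix('X')  # one input sequence, shape (T, D)

Wi = theano.shared(np.random.randn(D, M) / np.sqrt(D))
Wh = theano.shared(np.random.randn(M, M) / np.sqrt(M))
bh = theano.shared(np.zeros(M))
h0 = theano.shared(np.zeros(M))  # initial hidden state, treated as a parameter

def recurrence(x_t, h_t1):
    # one step of the simple recurrent unit
    return T.tanh(x_t.dot(Wi) + h_t1.dot(Wh) + bh)

h, _ = theano.scan(
    fn=recurrence,
    sequences=X,
    outputs_info=[h0],
    n_steps=X.shape[0],
)

get_hidden_states = theano.function(inputs=[X], outputs=h)

Because h0 is a shared variable, it can be passed to T.grad along with Wi, Wh, and bh and updated by gradient descent like any other weight.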
Note that you may add multiple recurrent layers to your recurrent neural network, just like with feedforward neural networks. The number of recurrent layers is a hyperparameter, just like the number of hidden layers in a regular feedforward neural network. How many is best depends on your specific problem. We can do this layering for all of the recurrent units we'll see - the Elman unit, the GRU, and the LSTM.

        _      _
       / \    / \
       \ /    \ /
o-------o------o------o
x(t)  h1(t)  h2(t)  y(t)

And that's all there is to it! Just by adding that one recurrent connection, we've already created a recurrent neural network.
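To make the layering concrete, the hidden-state sequence of one recurrent layer simply becomes the input sequence of the next. Continuing the NumPy sketch from above (this assumes numpy as np and the hypothetical rnn_forward are already defined):

# Layer 1 maps D-dimensional inputs to M1-dimensional hidden states;
# layer 2 maps those to M2-dimensional hidden states, which feed the output y(t).
D, M1, M2 = 5, 10, 8
params1 = (np.random.randn(D, M1) / np.sqrt(D),
           np.random.randn(M1, M1) / np.sqrt(M1),
           np.zeros(M1), np.zeros(M1))
params2 = (np.random.randn(M1, M2) / np.sqrt(M1),
           np.random.randn(M2, M2) / np.sqrt(M2),
           np.zeros(M2), np.zeros(M2))

X = np.random.randn(20, D)        # one input sequence of length 20
H1 = rnn_forward(X, *params1)     # shape (20, M1)
H2 = rnn_forward(H1, *params2)    # shape (20, M2)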