Slide 2
Pachshenko Galina Nikolaevna
Associate Professor of the Information Systems Department,
Candidate of Technical Sciences
Slide 4. Topics
Single-layer neural networks
Multi-layer neural networks
Single perceptron
Multi-layer perceptron
Hebbian Learning Rule
Backpropagation
Delta-rule
Weight adjustment
Cost Function
Classification

(Independent Work)
Slide 5. Single-layer neural networks

Slide 7. Single perceptron
The perceptron computes a single output from multiple real-valued inputs by forming a linear combination according to its input weights and then, possibly, passing the result through an activation function.
Slide 8. Single perceptron
Mathematically this can be written as

y = f(w1*x1 + w2*x2 + ... + wn*xn + b),

where x1, ..., xn are the inputs, w1, ..., wn are the corresponding weights, b is the bias, and f is the activation function.
Slide 10. Task 1
Write a program that computes the output of a single perceptron.
Note:
Use bias.

The bias shifts the decision boundary away from the origin and does not depend on any input value.
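A minimal sketch in Python, assuming a Heaviside step activation and illustrative inputs, weights, and bias (none of these values are fixed by the task):

```python
import numpy as np

def step(u):
    """Heaviside step activation: 1 if u >= 0, else 0."""
    return 1 if u >= 0 else 0

def perceptron_output(x, w, b):
    """Output of a single perceptron: activation of the weighted sum of inputs plus bias."""
    u = np.dot(w, x) + b             # linear combination shifted by the bias
    return step(u)

# Illustrative values
x = np.array([1.0, 0.5])             # inputs
w = np.array([0.4, -0.7])            # weights
b = 0.3                              # bias
print(perceptron_output(x, w, b))    # -> 1, since 0.4*1.0 - 0.7*0.5 + 0.3 = 0.35 >= 0
```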
Slide 11. Multilayer perceptron
A multilayer perceptron (MLP) is a class of feedforward artificial neural network.

Slide 13. Structure
• Nodes that are not the target of any connection are called input neurons.
Slide 14
• Nodes that are not the source of any connection are called output neurons.
An MLP can have more than one output neuron.
The number of output neurons depends on the way the target values (desired values) of the training patterns are described.
Slide 15
• All nodes that are neither input neurons nor output neurons are called hidden neurons.
• All neurons can be organized into layers, with the set of input neurons forming the first layer.
Slide 16
Rosenblatt's original perceptron used the Heaviside step function as the activation function.
Slide 17
Nowadays, in multilayer networks, the activation function is often chosen to be the sigmoid function.
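For reference, the sigmoid function is S(x) = 1 / (1 + e^(-x)); it maps any real input into the interval (0, 1).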
Slide 20
These functions are used because they are mathematically convenient.

Slide 21
An MLP consists of at least three layers of nodes. Except for the input nodes, each node is a neuron that uses a nonlinear activation function.
Slide 22
An MLP utilizes a supervised learning technique called backpropagation for training.

Slide 23
Hebbian Learning Rule
Delta rule
Backpropagation algorithm

Slide 24
Hebbian Learning Rule
(Hebb's rule)
The Hebbian Learning Rule (1949) is a learning rule that specifies how much the weight of the connection between two units should be increased or decreased in proportion to the product of their activations.
Slide 25
Hebbian Learning Rule
(Hebb's rule)
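A common way to write Hebb's rule, with a learning rate eta (the slides do not fix a value): the weight of the connection from unit i to unit j changes by

Δw_ij = eta * x_i * y_j,

i.e. in proportion to the product of the two units' activations. A minimal sketch in Python (the activation values below are illustrative):

```python
import numpy as np

def hebbian_update(w, x, y, eta=0.1):
    """One Hebbian step: each weight w[i] grows in proportion to the
    product of pre-synaptic activation x[i] and post-synaptic activation y."""
    return w + eta * y * x

# Illustrative values
w = np.array([0.0, 0.0])           # initial weights
x = np.array([1.0, 0.5])           # pre-synaptic activations
y = 1.0                            # post-synaptic activation
print(hebbian_update(w, x, y))     # -> [0.1  0.05]
```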

Slide 28
The backpropagation algorithm was originally introduced in the 1970s, but its importance wasn't fully appreciated until a famous 1986 paper by David Rumelhart, Geoffrey Hinton, and Ronald Williams.
Slide 29
That paper describes several neural networks in which backpropagation works far faster than earlier approaches to learning, making it possible to use neural nets to solve problems which had previously been insoluble.
Slide 30
Supervised backpropagation: the mechanism of backward error transmission (the delta learning rule) is used to modify the weights of the internal (hidden) and output layers.
Slide 31. Backpropagation
The backpropagation learning algorithm uses the delta rule.
It computes the deltas (local gradients) of each neuron, starting from the output neurons and going backwards until it reaches the input layer.
Slide 32
The delta rule is derived by attempting to minimize the error in the output of the neural network through gradient descent.
Slide 33
To compute the deltas of the output neurons, though, we first have to obtain the error of each output neuron.
Slide 34
That's pretty simple: since the multilayer perceptron is trained with supervision, the error is the difference between the network's output and the desired output.
e_j(n) = d_j(n) - o_j(n)
where e(n) is the error vector, d(n) is the desired output vector and o(n) is the actual output vector.
Slide 35
Now to compute the deltas:
delta_j(L)(n) = e_j(L)(n) * f'(u_j(L)(n)), for neuron j in the output layer L,
where f'(u_j(L)(n)) is the derivative of the activation function of the jth neuron of layer L, evaluated at its net input u_j(L)(n).
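In the standard backpropagation formulation, the deltas of a hidden layer l are then obtained by propagating the next layer's deltas backwards through the weights:

delta_j(l)(n) = f'(u_j(l)(n)) * sum_k [ delta_k(l+1)(n) * w_kj(l+1)(n) ],

where the sum runs over all neurons k of layer l+1 that neuron j feeds into.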
Slide 37. Weight adjustment
Having calculated the deltas for all the neurons, we are now ready for the third and final pass over the network, this time to adjust the weights according to the generalized delta rule:
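In its usual form, with learning rate eta and y_i(n) denoting the output of neuron i that feeds into neuron j, the generalized delta rule is

w_ji(n+1) = w_ji(n) + eta * delta_j(n) * y_i(n)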
Slide 40
Note: for the sigmoid activation function, the derivative of the function is
S'(x) = S(x)*(1 - S(x))
Slide 42
Cost Function
We need a function of the parameters that we can minimize over our dataset. One common function that is often used is the mean squared error.
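Using the error notation of Slide 34, for N training samples the mean squared error can be written as

MSE = (1/N) * sum_n sum_j ( d_j(n) - o_j(n) )^2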
Slide 43
Squared error is a cost function that we can minimize using gradient descent.
A cost function is something you want to minimize. For example, your cost function might be the sum of squared errors over your training set. Gradient descent is a method for finding the minimum of a function of multiple variables, so you can use gradient descent to minimize your cost function.
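Each gradient descent step moves every weight a small amount in the direction that decreases the cost E, scaled by a learning rate eta:

w_ji := w_ji - eta * dE/dw_ji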
Slide 44
Backpropagation is gradient descent over the entire network's weight vector.
In practice, it often works well, though it typically has to be run many times over the training data. It minimizes the error over all training samples.
Slide 45. Task 2
Write a program that can update the weights of a neural network using backpropagation.
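A minimal sketch in Python of one backpropagation update for a small MLP with one hidden layer, sigmoid activations and a squared-error cost; the layer sizes, learning rate and training example below are illustrative assumptions:

```python
import numpy as np

def sigmoid(x):
    """Logistic sigmoid S(x) = 1 / (1 + e^(-x))."""
    return 1.0 / (1.0 + np.exp(-x))

def backprop_step(x, d, W1, b1, W2, b2, eta=0.5):
    """One backpropagation update for a 2-layer MLP (hidden + output layer).
    x: input vector, d: desired output vector, eta: learning rate.
    Returns the adjusted weights and biases."""
    # Forward pass
    u1 = W1 @ x + b1                 # net input of the hidden layer
    y1 = sigmoid(u1)                 # hidden activations
    u2 = W2 @ y1 + b2                # net input of the output layer
    o  = sigmoid(u2)                 # network output

    # Backward pass (delta rule)
    e      = d - o                                   # error of each output neuron: e = d - o
    delta2 = e * o * (1.0 - o)                       # output deltas: e * S'(u2), with S' = S*(1 - S)
    delta1 = (W2.T @ delta2) * y1 * (1.0 - y1)       # hidden deltas, propagated backwards

    # Weight adjustment (generalized delta rule: w += eta * delta * input)
    W2 = W2 + eta * np.outer(delta2, y1)
    b2 = b2 + eta * delta2
    W1 = W1 + eta * np.outer(delta1, x)
    b1 = b1 + eta * delta1
    return W1, b1, W2, b2

# Illustrative network: 2 inputs, 2 hidden neurons, 1 output
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(2, 2)), np.zeros(2)
W2, b2 = rng.normal(size=(1, 2)), np.zeros(1)
x, d = np.array([0.5, -0.2]), np.array([1.0])

for _ in range(100):                                 # repeated updates drive the error down
    W1, b1, W2, b2 = backprop_step(x, d, W1, b1, W2, b2)

print(sigmoid(W2 @ sigmoid(W1 @ x + b1) + b2))       # output moves towards the target 1.0
```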