Feedforward neural networks
Recapitulation of supervised learning systems studied thus far
Neural network with one hidden layer
The choice of non-linearity
- Today, the most commonly used non-linearity is ReLU (Rectified Linear Unit) or one of its variants.
- Other useful non-linearities include the sigmoid and the hyperbolic tangent (tanh).
- Advantages of ReLU over other non-linearities include:
- it does not “saturate” for positive inputs, so its gradient does not shrink toward zero there
- it is cheap to compute: a “thresholding” operation at zero, with no exponentiation
- it leads the network to learn a sparse representation, because every negative input is mapped to exactly zero
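The three advantages listed above can be checked numerically. The following is a minimal sketch in plain Python (standard library only). The function names and the sample pre-activation values are illustrative assumptions, not part of the lecture material.

```python
import math

def relu(x):
    """ReLU: threshold at zero, no exponentiation needed."""
    return max(0.0, x)

def sigmoid(x):
    """Logistic sigmoid, which requires an exponentiation."""
    return 1.0 / (1.0 + math.exp(-x))

def relu_grad(x):
    """Derivative of ReLU: 1 for positive inputs, 0 otherwise."""
    return 1.0 if x > 0 else 0.0

def sigmoid_grad(x):
    """Derivative of the sigmoid: s(x) * (1 - s(x))."""
    s = sigmoid(x)
    return s * (1.0 - s)

def tanh_grad(x):
    """Derivative of tanh: 1 - tanh(x)^2."""
    return 1.0 - math.tanh(x) ** 2

# Hypothetical pre-activation values, chosen only for illustration.
pre_activations = [-3.0, -0.5, 0.0, 2.0, 10.0]

# Sparsity: every non-positive input becomes exactly zero.
relu_out = [relu(z) for z in pre_activations]
sparsity = sum(1 for a in relu_out if a == 0.0) / len(relu_out)

# Saturation: for a large positive input the sigmoid and tanh gradients
# are close to zero, while the ReLU gradient is still 1.
for z in pre_activations:
    print(f"z={z:6.1f}  relu'={relu_grad(z):.1f}  "
          f"sigmoid'={sigmoid_grad(z):.2e}  tanh'={tanh_grad(z):.2e}")
print("fraction of ReLU outputs that are zero:", sparsity)
```

Note that ReLU has a zero gradient for negative inputs, which is one motivation for the variants mentioned earlier.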
Feedforward neural networks
- In the previous sections we talked about a neural network with one hidden layer.
- However, we can have more hidden layers. In fact, we can have as many as necessary (how do you know how many are necessary?).
- The resulting computational architecture is known as a feedforward neural network.
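To make the idea of stacking hidden layers concrete, here is a minimal sketch of a forward pass in plain Python (standard library only). The layer sizes, the random initialization, and the helper names `dense`, `init_layer`, and `forward` are assumptions made for illustration. The output layer is left linear, which is one common choice and not something the slides specify.

```python
import random

def relu(x):
    return max(0.0, x)

def dense(inputs, weights, biases):
    """One fully connected layer; weights[j] holds the incoming weights of unit j."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]

def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases (an arbitrary illustrative choice)."""
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases

def forward(x, layers):
    """Information flows in one direction only: input -> hidden layers -> output."""
    activation = x
    for i, (weights, biases) in enumerate(layers):
        activation = dense(activation, weights, biases)
        if i < len(layers) - 1:  # ReLU after every hidden layer, not after the output
            activation = [relu(a) for a in activation]
    return activation

rng = random.Random(0)
layer_sizes = [4, 8, 8, 3]  # hypothetical: 4 inputs, two hidden layers of 8 units, 3 outputs
layers = [init_layer(n_in, n_out, rng)
          for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:])]

output = forward([0.5, -1.2, 3.0, 0.1], layers)
print(len(layers) - 1, "hidden layers,", len(output), "outputs")
```

Adding or removing entries in `layer_sizes` changes the number of hidden layers without touching `forward`, which is why the depth can be treated as a design choice.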
Homework 7b (read below for details)
- Read one of the following papers:
- Next class you will have 5 minutes to (in this order):
- Show us the figure in the paper with the model architecture.
- Tell us the name of the model.
- Tell us what type of model it is.
- In two or three sentences, tell us what the model does.
- In two or three sentences, explain what the inputs and outputs of the model are.
- Walk us through the model architecture (i.e. what does each block, arrow, color, etc. in the figure represent?).
- Create a post on our course subreddit with the information that you will share in class.
- IMPORTANT: in your post/presentation you may only use the figure in the paper with the model architecture.
Homework 7c (read below for details)
- Next class we will have a guest speaker
- Julia Wilkins (PhD student at NYU) will talk to us about her research.
- She will present her paper: VocalSet
- Read Julia’s paper in detail, but focus on the part that covers the model architecture and results.
- On our course subreddit, there is a thread to discuss Julia’s paper.
- You must post two questions you have after reading the paper.
- Do not repeat questions that have already been asked by others in the thread.
- The sooner you post, the less likely it will be that somebody else already asked your question.
- You must also answer at least one of the questions asked by your peers.
All three due Mar 22nd at 11:59PM (Eastern Standard Time)
© Iran R. Roman 2022