hello, andersbll:
Thanks for your code; it has been very useful to me.
While reading it, I have a question about line 68 in layers.py:
self.dW = np.dot(self.last_input.T, output_grad)/n - self.weight_decay*self.W
For L2 regularization, I think this needs to be modified to
self.dW = np.dot(self.last_input.T, output_grad)/n + self.weight_decay*self.W
Could you explain why you use "- self.weight_decay*self.W"?
B.R.
heibanke
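For context, here is a minimal sketch of the usual L2 (weight decay) convention; the function name and variables below are illustrative and are not taken from layers.py:

```python
import numpy as np

def l2_regularized_grad(last_input, output_grad, W, weight_decay):
    """Gradient of data_loss(W) + 0.5 * weight_decay * ||W||^2 with respect to W."""
    n = last_input.shape[0]
    data_grad = np.dot(last_input.T, output_grad) / n
    # The penalty 0.5 * weight_decay * ||W||^2 differentiates to +weight_decay * W,
    # so the regularization term enters with a "+" when dW is the gradient of the
    # loss and the parameter update is W -= learning_rate * dW.
    return data_grad + weight_decay * W
```

The sign only has meaning relative to how dW is applied: assuming the standard update W -= learning_rate * dW, the penalty term must enter with "+"; a "-" there would make the weights grow instead of decay.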
One of the reasons tanh and sigmoid are popular as activation functions is that their derivatives can be computed from the output of the forward pass without evaluating an expensive function again, but that is not done here.
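To illustrate that remark, a small sketch (not taken from this repository) of activation layers whose backward pass reuses the cached forward output:

```python
import numpy as np

class Tanh:
    def forward(self, x):
        self.last_output = np.tanh(x)      # cache the activation
        return self.last_output

    def backward(self, output_grad):
        # d/dx tanh(x) = 1 - tanh(x)^2, computed from the cached output
        return output_grad * (1.0 - self.last_output ** 2)

class Sigmoid:
    def forward(self, x):
        self.last_output = 1.0 / (1.0 + np.exp(-x))   # cache the activation
        return self.last_output

    def backward(self, output_grad):
        # d/dx sigmoid(x) = sigmoid(x) * (1 - sigmoid(x))
        return output_grad * self.last_output * (1.0 - self.last_output)
```

Caching self.last_output in the forward pass means backpropagation needs only cheap elementwise arithmetic, with no second call to np.tanh or np.exp.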