Implementation quick question #17

macrocredit · 2023-07-17T01:34:55Z

Hi - Great work!

I have one question:

loss, jvp = fc.jvp(f, (tuple(params),), (v_params,))

Do you know why jvp is a scalar? I would have thought that this is a matrix. Also, is there a reason why we are calling tuple(params) instead of params?

Thank you.

FITZET · 2023-10-26T07:33:29Z

i also found that the jvp is a scalar, and i'm not sure how the jvp is calculated. i want a formula to show the computing flow layer by layer, do you know how to get the formula?

inikishev · 2024-12-31T20:34:20Z

Hi - Great work!

I have one question:

loss, jvp = fc.jvp(f, (tuple(params),), (v_params,))

Do you know why jvp is a scalar? I would have thought that this is a matrix. Also, is there a reason why we are calling tuple(params) instead of params?

Thank you.

its dot product of jacobian with a random vector, which is the same as directional derivative in the direction of that vector. Params are a tuple because I believe jvp in pytorch requires a tuple

macrocredit changed the title ~~Quick implementation question~~ Implementation quick question Jul 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementation quick question #17

Implementation quick question #17

macrocredit commented Jul 17, 2023

FITZET commented Oct 26, 2023

inikishev commented Dec 31, 2024 •

edited

Loading

Implementation quick question #17

Implementation quick question #17

Comments

macrocredit commented Jul 17, 2023

FITZET commented Oct 26, 2023

inikishev commented Dec 31, 2024 • edited Loading

inikishev commented Dec 31, 2024 •

edited

Loading