Skip to content

Add mlp_encoder_model (MLPs are used to process input data)#20

Open
lgray wants to merge 3 commits intomainfrom
mlp-encoder-model
Open

Add mlp_encoder_model (MLPs are used to process input data)#20
lgray wants to merge 3 commits intomainfrom
mlp-encoder-model

Conversation

@lgray
Copy link
Contributor

@lgray lgray commented Jun 11, 2025

This model should probably only be used with "slim" 3 output variable training (regress x, y, beta) and "diagonal" 8 variable output training (x,y,alpha,beta + errors).

@lgray
Copy link
Contributor Author

lgray commented Jun 11, 2025

@GiuseppeDiGuglielmo FYI

@lgray
Copy link
Contributor Author

lgray commented Jun 11, 2025

@davidgjiang @ArghyaDas112358 please have a look here.

For this model specifically we'll need two different loss configurations:

  • "diagonal": this is a mixture density network where the covariance matrix is only the diagonal, so just the diagonal of the cholesky decomposition too
  • "slim": this is a standard MSE regression to the x, y, cotbeta so no need to initialize a gaussian or anything

I am not confident that this model can handle the full covariance training.

Please feel free to add to this PR with your edits!

@lgray
Copy link
Contributor Author

lgray commented Jun 11, 2025

@GiuseppeDiGuglielmo Please give a try to synthesis to see how it goes for 2 timeslices!

@lgray
Copy link
Contributor Author

lgray commented Jun 20, 2025

@davidgjiang @ArghyaDas112358 I had a typo and forgot to actually quantize the MLP encoder. Please update and retrain!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant