Trainer: fixing the gaussian_multinomial_diffusion.py file #58
Conversation
I'm going to have to trust you a bit on the documentation fidelity in a lot of places, since I don't know the code nearly as well as you do 🙂. There were a few places where I wanted to clarify my understanding of the documentation. So definitely tell me if I'm off base anywhere.
Some other fairly minor comments throughout.
self,
batch_size: int,
device: torch.device,
method: Literal["uniform", "importance"] = "uniform",
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe we can just make this a local enum here?
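For concreteness, a rough sketch of what I had in mind (the enum and function names here are placeholders, and the body is dummy, not the real sampler):

from enum import Enum

import torch


class TimestepSamplingMethod(str, Enum):
    # Inheriting from str keeps comparisons against the raw strings working,
    # so existing call sites passing "uniform" wouldn't break immediately.
    UNIFORM = "uniform"
    IMPORTANCE = "importance"


def sample_timesteps(
    batch_size: int,
    device: torch.device,
    method: TimestepSamplingMethod = TimestepSamplingMethod.UNIFORM,
) -> torch.Tensor:
    # Dummy body for the sketch: uniform sampling over an arbitrary 1000 steps.
    if method is not TimestepSamplingMethod.UNIFORM:
        raise NotImplementedError(method)
    return torch.randint(0, 1000, (batch_size,), device=device)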
DIRECT = "direct" | ||
|
||
|
||
class ConditioningFunction(Protocol): |
More for my own learning, but do you mind explaining why it might be advantageous for this to be a class inheriting from Protocol rather than a Callable? It's an easier typing annotation and also adds some documentation, which is nice. Perhaps that's enough justification, but I also thought perhaps there was something deeper?
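From what I understand, beyond readability, a Protocol can express things a plain Callable can't: parameter names (so callers can use keywords), defaults, and overloads. A toy comparison, with a made-up signature since I don't know the real one:

from typing import Callable, Protocol

import torch


class ConditioningFunction(Protocol):
    # Named parameters with a docstring; keyword calls type-check.
    def __call__(self, features: torch.Tensor, timestep: int) -> torch.Tensor: ...


# The Callable spelling loses the parameter names and the documentation:
ConditioningCallable = Callable[[torch.Tensor, int], torch.Tensor]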
prevent singularities.
Args:
    num_diffusion_timesteps: The number of betas to produce.
This documentation is technically true, but also doesn't really capture what the variable actually represents? (I know you didn't write it 😂)
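Maybe something along these lines would be closer (my wording, so feel free to correct it):

num_diffusion_timesteps: The total number of steps T in the diffusion
    process; one beta is produced for each timestep.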
    produces the cumulative product of (1-beta) up to that
    part of the diffusion process.
max_beta: The maximum beta to use; use values lower than 1 to
    prevent singularities.
Given the documentation here, should we assert that max_beta is, in fact, lower than 1... to prevent singularities... 😂
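Something like this at the top of the function, assuming it's the usual betas_for_alpha_bar-style helper (name guessed from the docstring):

def betas_for_alpha_bar(num_diffusion_timesteps: int, alpha_bar, max_beta: float = 0.999):
    # Fail fast instead of silently producing a singular schedule.
    assert max_beta < 1.0, f"max_beta must be < 1 to prevent singularities, got {max_beta}"
    ...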
""" | ||
if device is None: | ||
device = torch.device("cpu") | ||
|
The super call below is a bit of an old-school style, isn't it?
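i.e. I'd expect the explicit two-argument form could just be the bare call (class name guessed from the file name):

# Python 2 style; still works in Python 3 but redundant:
super(GaussianMultinomialDiffusion, self).__init__()

# Modern equivalent:
super().__init__()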
Args:
    log_x_start: The log probability of the initial input.
    log_x_t: The log probability of the features.
Similar comments about the log prob here.
Args:
    model_out: The model output.
    log_x: The log probability of the features.
Same comment here about probabilities. Also, are we sure we're predicting probabilities of the model output?
categorical_features = features[:, self.num_numerical_features :]

numerical_features_ts = numerical_features
log_categrocial_features_ts = categorical_features
This and the line above are a bit of weird indirection, tying values together. We keep numerical_features and categorical_features around only to check their shapes. Rather than creating new tensors here (without copying), why not just use the original tensors?
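i.e. something like this sketch, keeping the existing names and dropping the aliases:

numerical_features = features[:, : self.num_numerical_features]
categorical_features = features[:, self.num_numerical_features :]
# ...shape checks on numerical_features / categorical_features...
# then use them directly instead of re-binding to the *_ts names.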
b = num_samples
z_norm = torch.randn((b, self.num_numerical_features), device=self.device)
batch_size = num_samples
Why not just call the num_samples argument batch_size here? It doesn't look like num_samples is used elsewhere anyway.
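i.e. a sketch of what I mean (method name and surrounding signature guessed):

def sample(self, batch_size: int) -> torch.Tensor:
    z_norm = torch.randn((batch_size, self.num_numerical_features), device=self.device)
    ...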
PR Type
Fix

Short Description
Clickup Ticket(s): https://app.clickup.com/t/868fuke6e

General improvements in the midst_toolkit/models/clavaddpm/gaussian_multinomial_diffusion.py file:
- __init__ function

Tests Added
Only minor adjustments, the functionality does not change.