If you find our code or paper helpful, please consider starring our repository and citing:
```bibtex
@article{pinyoanuntapong2024controlmm,
  title={ControlMM: Controllable Masked Motion Generation},
  author={Pinyoanuntapong, Ekkasit and Saleem, Muhammad Usama and Karunratanakul, Korrawe and Wang, Pu and Xue, Hongfei and Chen, Chen and Guo, Chuan and Cao, Junli and Ren, Jian and Tulyakov, Sergey},
  journal={arXiv preprint arXiv:2410.10780},
  year={2024}
}
```
- Joint Control (GMD, OmniControl, and MMM Evaluation)
- ProgMoGen Evaluation
- STMC Evaluation
- Joint Control
- Obstacle Avoidance
- Body Part Timeline Control
- Retrain MoMask with Cross Entropy for All Positions
- Add Logits Regularizer
Our code is built on top of MoMask. If you encounter any issues, please refer to the MoMask repository for setup and troubleshooting instructions.
```bash
conda env create -f environment.yml
conda activate ControlMM
pip install git+https://github.com/openai/CLIP.git
pip install -r requirements.txt
```

```bash
bash prepare/download_models.sh
```

For evaluation only:

```bash
bash prepare/download_evaluator.sh
bash prepare/download_glove.sh
```
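A quick sanity check that the environment resolved correctly (a minimal check; it only verifies that CLIP and PyTorch import):

```bash
python -c "import clip, torch; print(torch.__version__, 'CUDA:', torch.cuda.is_available())"
```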
You have two options here:
- Skip getting data if you just want to generate motions from your own text descriptions.
- Get the full data if you want to re-train and evaluate the model.
(a). Full data (text + motion)
HumanML3D - Follow the instructions in the HumanML3D repository, then copy the resulting dataset into our repository:

```bash
cp -r ../HumanML3D/HumanML3D ./dataset/HumanML3D
```
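After copying, you can sanity-check the layout. The expected contents below are an assumption based on the standard HumanML3D release and may differ slightly:

```bash
ls ./dataset/HumanML3D
# Expected (standard HumanML3D release):
# new_joint_vecs/  new_joints/  texts/  Mean.npy  Std.npy  train.txt  val.txt  test.txt
```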
Evaluate pelvis (trajectory) control:

```bash
python eval_t2m_trans_res.py \
    --res_name tres_nlayer8_ld384_ff1024_rvq6ns_cdp0.2_sw \
    --dataset_name t2m \
    --ctrl_name 'z2024-08-23-01-27-51_CtrlNet_randCond1-196_l1.1XEnt.9TTT__fixRandCond' \
    --gpu_id 0 \
    --ext 0_each100Last600CtrnNet \
    --control trajectory \
    --density -1 \
    --each_iter 100 \
    --last_iter 600 \
    --ctrl_net T
```
Evaluate cross-combination joint control:

```bash
python eval_t2m_trans_res.py \
    --res_name tres_nlayer8_ld384_ff1024_rvq6ns_cdp0.2_sw \
    --dataset_name t2m \
    --ctrl_name 'z2024-08-27-21-07-55_CtrlNet_randCond1-196_l1.5XEnt.5TTT__cross' \
    --gpu_id 4 \
    --ext 0_each100_last600_ctrlNetT \
    --control cross \
    --density -1 \
    --each_iter 100 \
    --last_iter 600 \
    --ctrl_net T
```
The following joints can be controlled:

`[pelvis, left_foot, right_foot, head, left_wrist, right_wrist]`
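For orientation, here is a minimal sketch of how these names map to joint indices on the 22-joint HumanML3D skeleton. The indices are assumptions based on the standard HumanML3D/SMPL joint ordering (as used by OmniControl), not values taken from this repository:

```python
# Hypothetical name-to-index map on the standard 22-joint HumanML3D skeleton;
# verify against this repository's own joint definitions before relying on it.
CONTROL_JOINTS = {
    "pelvis": 0,
    "left_foot": 10,
    "right_foot": 11,
    "head": 15,
    "left_wrist": 20,
    "right_wrist": 21,
}
```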
| Argument | Description |
|---|---|
| `--res_name` | Name of the residual transformer |
| `--ctrl_name` | Name of the control transformer (the VQ and Masked Transformer are also saved under this name) |
| `--gpu_id` | GPU ID to use |
| `--ext` | Log name used for saving results, stored in `checkpoints/t2m/{ctrl_name}/eval/{ext}` |
| `--control` | Type of random joint control:<br>• `trajectory` – pelvis only<br>• `random` – uniform random joints<br>• `cross` – random combinations, see section [A.11 CROSS COMBINATION]<br>• any single joint: `pelvis`, `l_foot`, `r_foot`, `head`, `left_wrist`, `right_wrist`, `lower`<br>• `all` – all joints |
| `--density` | Number of control frames:<br>• `1`, `2`, `5` – exact number of control frames<br>• `49` – 25% of the ground-truth length<br>• `196` – 100% of the ground-truth length<br>(if the ground-truth length is less than 196, `49`/`196` are converted proportionally) |
| `--each_iter` | Number of logits-optimization iterations at each unmasking step |
| `--last_iter` | Number of logits-optimization iterations at the last unmasking step |
| `--ctrl_net` | Enable ControlNet with the Logits Regularizer: `T` or `F` |
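As a concrete reading of the `--density` rule, here is a minimal sketch (a hypothetical helper, not code from this repository) of how a density value could map to a number of control frames; the `-1` used in the commands above is not covered by the table and is left aside:

```python
def num_control_frames(density: int, gt_length: int) -> int:
    """Hypothetical helper illustrating the --density table above.

    1, 2, 5 are exact frame counts; 49 and 196 mean 25% and 100% of the
    ground-truth length, scaled proportionally when gt_length < 196.
    """
    if density in (49, 196):
        return max(1, round(gt_length * density / 196))
    return density  # exact frame count (e.g., 1, 2, or 5)
```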
```bash
python -m generation.control_joint --path_name ./output/control1 --iter_each 100 --iter_last 600
```
| Argument | Type | Default | Description |
|---|---|---|---|
| `--path_name` | str | `./output/test` | Output directory to save the optimization results. |
| `--iter_each` | int | `100` | Number of logits-optimization steps at each unmasking step. |
| `--iter_last` | int | `600` | Number of logits-optimization steps at the final unmasking step. |
| `--show` | flag | `False` | If set, automatically opens the resulting HTML visualization after execution. |
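For example, to run the same generation as above and open the HTML visualization automatically when it finishes, add the `--show` flag:

```bash
python -m generation.control_joint --path_name ./output/control1 --iter_each 100 --iter_last 600 --show
```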
We sincerely thank the authors of the following open-source works, on which our code is built: MoMask, OmniControl, GMD, MMM, TLControl, STMC, ProgMoGen, TEMOS, and BAMM.
This code is distributed under a CC BY-NC-ND 4.0 license.
Note that our code depends on other libraries, including SMPL, SMPL-X, and PyTorch3D, and uses datasets, each of which has its own license that must also be followed.