[2025-10-18]: The code and models have been released! We have improved the training resolution of the models and achieved better results compared to the original paper. Both old weights (for ...
The overall workflow of the task-oriented data synthesis framework (TODSynth) consists of three stages: (a) Training stage using an MM-DiT generative model conditioned on text and mask. (b) Sampling ...