Skip to content

Add Hunyuan 3D 2.1 Support #8714

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 17 commits into
base: master
Choose a base branch
from

Conversation

yousef-rafat
Copy link

I have created the reimplementation of the VAE. The main contributions are in rewriting/avoiding the use of unnecessary libraries and modularizing the code by removing code redundancy and better structuring of the code.

The implemented VAE is faster by approximately 5 seconds in the encoding process and 2 seconds in the decoding process.

The fps.py file contains the implementation of Farthest Point Sampling, avoiding using torch_cluster. The preprocess.py contains the Volume Decoder, Surface Extractor, and Trimesh exporting. The point_attention.py contains the code for the encoder. The transformer.py contains the code for the Transformer model that is used in the geo decoder of the VAE and the Self-Attention in the encoder. The vae.py contains the VAE code along with a test example.

fixed not returning y in the forward method of MoEBlock
@yousef-rafat yousef-rafat changed the title Implementation of VAE for Hunyuan 3D 2.1 Add Hunyuan 3D 2.1 Support Jul 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant