Speech2Mesh is an end-to-end pipeline that transforms spoken language into 3D-printable mesh models, with no CAD tools or technical design skills needed. It combines generative AI and 3D reconstruction to make 3D modeling accessible and intuitive.

Repository contents:

- `speech2mesh.ipynb` – Main Jupyter notebook demonstrating the complete pipeline.
- `report/DeepRobProjectReport.pdf` – Project report (PDF) with a literature review.

Features:

- Converts speech into text using OpenAI’s Whisper.
- Generates multi-view images using Stable Diffusion.
- Reconstructs meshes using InstantMesh.
- Optional comparison with Point-E + Point2Mesh.
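
The stages listed above can be sketched as a simple chain of three callables. This is a minimal illustration of the data flow only, not the notebook's actual code: `run_pipeline`, `PipelineResult`, and the three stage signatures are hypothetical names, and in practice each stage would wrap Whisper, Stable Diffusion, and InstantMesh respectively.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

# Hypothetical stage signatures (assumptions, not the notebook's API).
Transcriber = Callable[[str], str]                   # audio path -> prompt text
ViewGenerator = Callable[[str], Sequence[str]]       # prompt -> multi-view image paths
MeshReconstructor = Callable[[Sequence[str]], str]   # image paths -> mesh file path


@dataclass
class PipelineResult:
    prompt: str
    image_paths: List[str]
    mesh_path: str


def run_pipeline(
    audio_path: str,
    transcribe: Transcriber,
    generate_views: ViewGenerator,
    reconstruct: MeshReconstructor,
) -> PipelineResult:
    """Run speech -> text -> multi-view images -> mesh, failing early on empty output."""
    prompt = transcribe(audio_path).strip()
    if not prompt:
        raise ValueError("Transcription was empty; nothing to generate from.")

    image_paths = list(generate_views(prompt))
    if not image_paths:
        raise ValueError("No images were generated; cannot reconstruct a mesh.")

    mesh_path = reconstruct(image_paths)
    return PipelineResult(prompt=prompt, image_paths=image_paths, mesh_path=mesh_path)


if __name__ == "__main__":
    # Stub stages so the sketch runs without any models installed.
    result = run_pipeline(
        "chair.wav",
        transcribe=lambda path: " a wooden chair ",
        generate_views=lambda prompt: [f"view_{i}.png" for i in range(4)],
        reconstruct=lambda images: "chair.obj",
    )
    print(result)
```

Passing the stages in as callables keeps the sketch runnable without GPUs or model weights, and it would also make it straightforward to swap the reconstruction stage for the optional Point-E + Point2Mesh comparison.
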

Installation and usage:

    git clone https://github.com/trushant05/speech2mesh.git
    cd speech2mesh
    python -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
    pip install jupyterlab
    jupyter lab speech2mesh.ipynb

Authors:

- Trushant Adeshara – @trushantadeshara
- Pannaga Sudarshan
- Kajal Awasthi
- Saket Pradhan

This project is open source and available under the MIT License.

Acknowledgments:

- Whisper by OpenAI
- Stable Diffusion by Stability AI
- InstantMesh
- Point-E, Point2Mesh