Skip to content

trushant05/speech2mesh

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Speech2Mesh

Speech2Mesh is an end-to-end pipeline that transforms spoken language into 3D printable mesh models—no CAD tools or technical design skills needed. It leverages the power of generative AI and 3D reconstruction to make 3D modeling accessible and intuitive.


📦 Project Structure

  • speech2mesh.ipynb – Main Jupyter notebook demonstrating the complete pipeline.
  • report/DeepRobProjectReport.pdf - Report PDF file with literature review.

🛠️ Features

  • Converts speech into text using OpenAI’s Whisper.
  • Generates multi-view images using Stable Diffusion.
  • Reconstructs meshes using InstantMesh.
  • Optional comparison with Point-E + Point2Mesh.

🚀 Getting Started

1. Clone the Repository

git clone [https://github.com/yourusername/speech2mesh.git](https://github.com/trushant05/speech2mesh.git)
cd speech2mesh

2. Create a Virtual Environment (optional but recommended)

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

3. Install JupyterLab

pip install jupyterlab

4. Launch the Notebook

jupyter lab speech2mesh.ipynb

✍️ Authors

  • Trushant Adeshara – @trushantadeshara
  • Pannaga Sudarshan
  • Kajal Awasthi
  • Saket Pradhan

📜 License

This project is open source and available under the MIT License.


📫 Acknowledgements

  • Whisper by OpenAI
  • Stable Diffusion by StabilityAI
  • InstantMesh
  • Point-E, Point2Mesh

About

Generate 3D printable mesh using speech prompts

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors