
Structured Light "Slit" Scanner using ESP32-CAM with IoT capabilities.

3D Reconstruction of Objects on Client Application.

Tech stack: OpenCV, OpenGL, Arduino C++, Windows, Python

Machine Setup (Physical System)

Picture Of Scanner


Project Navigation:

CORE Software Features:

ESP-32 Data Acquisition & TCP Communications to Client C++ Application

Decoding & Pre-Processing of Image Datasets

🖥 1. | Transforming 2D Points into Cylindrical Coordinates for subsequent translation into 3D Cartesian Space

🖥 2. | 3D Reconstruction (In Cartesian Space) using the Laser's Planar Equation & Camera Extrinsics (Translation Vector)

Client Interface (CLI & OpenGL; Point Cloud Rendering)

Camera Calibration (Intrinsics & Extrinsics) & Laser Plane Estimation with Python

Mechanical & Electrical Docs:

⚙️ Physical Design (CAD Models, Mechanical Specification)

🔌 Hardware Setup (Circuit Diagram & Setup Elaborations)


Showcase!

Datasets that worked fairly well:

Meditating Monk
Dog In Cup
Butter Holder

Datasets with lots of Missing Information & Noise:

Wooden Elephant
Wooden Frog
Headless Luck

ESP-32 Data Acquisition & TCP Communications to Client C++ Application

As shown in the Machine Setup (Physical System), there is a Rotating Plate on which an object is placed, a line laser fixed at 15 degrees, and the ESP-32 Camera Board aligned with the center of the Rotating Plate (though the ESP-32 sits higher than the plate).

For every rotation step, 3 data points need to be captured and sent to the C++ Application in order; see the table below.

| Data Point | Example Data |
| --- | --- |
| 1.) Image with Line Laser Projection | LLP |
| 2.) Image without Line Laser Projection | NLLP |
| 3.) Acknowledgement of Current Rotation of Object | Angle in Degrees, as Float Value |

The ESP-32 joins my WiFi network upon setup. In the loop function, the images & acknowledgements are sent in a particular sequence via TCP, as I need to know which image is which and which acknowledgement they correspond to. TCP is well suited for lossless data transfer over the network.

The C++ Application, on the other end, runs a TCP Server that the ESP writes data to on every loop iteration.

ESP-32 WiFi Connection via WiFiClient & WiFiMulti libraries.
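A minimal sketch of what that acquisition loop can look like on the ESP-32 side, using the standard WiFiMulti / WiFiClient / esp_camera APIs. The server address, port, laser GPIO pin, rotation increment, and length-prefix framing below are illustrative assumptions, not the project's exact firmware:

```cpp
#include <WiFi.h>
#include <WiFiMulti.h>
#include "esp_camera.h"

WiFiMulti wifiMulti;
WiFiClient client;

const char*    HOST      = "192.168.1.100"; // C++ client's TCP server (assumed address)
const uint16_t PORT      = 8080;            // assumed port
const int      LASER_PIN = 12;              // assumed GPIO driving the line laser

// Capture one frame (with or without the laser) and stream it over the TCP socket.
void sendFrame(bool laserOn) {
  digitalWrite(LASER_PIN, laserOn ? HIGH : LOW);
  delay(100);                                  // let the new lighting settle before capture
  camera_fb_t* fb = esp_camera_fb_get();       // grab a JPEG frame from the camera driver
  if (fb) {
    uint32_t len = fb->len;
    client.write((const uint8_t*)&len, 4);     // simple length-prefix framing (assumed)
    client.write(fb->buf, fb->len);
    esp_camera_fb_return(fb);
  }
}

void setup() {
  pinMode(LASER_PIN, OUTPUT);
  // esp_camera_init(...) with the AI-Thinker pin map omitted for brevity
  wifiMulti.addAP("my-ssid", "my-password");        // join the local WiFi network
  while (wifiMulti.run() != WL_CONNECTED) delay(500);
  while (!client.connect(HOST, PORT)) delay(1000);  // connect to the C++ client's TCP server
}

float currentAngle = 0.0f;

void loop() {
  sendFrame(true);                // 1) image with the line laser projection (LLP)
  sendFrame(false);               // 2) image without the projection (NLLP)
  client.print(currentAngle, 2);  // 3) acknowledgement: current rotation in degrees
  currentAngle += 1.8f;           // assumed rotation increment per capture cycle
  delay(500);
}
```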

Further Elaborations about Networking System | ESP32 to C++ Client | (Diagrams & More Info.)

Below is a basic diagram of how the TCP Server works on the C++ Client. The Receiving Thread is effectively the TCP Server: it gets sent messages containing Images & Acknowledgements from the ESP-32 Cam and pushes those buffers onto a queue for the Processing Thread to handle. This way, the server itself incurs no processing overhead and can almost always remain "unblocked" while a message is incoming.

Cpp_client_diagram
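A minimal sketch of that receive/process split, assuming a plain std::queue guarded by a mutex and condition variable; the socket handling and message framing are left out, and none of the names below come from the project's actual code:

```cpp
#include <condition_variable>
#include <cstdint>
#include <mutex>
#include <queue>
#include <vector>

std::queue<std::vector<uint8_t>> messageQueue;  // raw image / acknowledgement buffers
std::mutex queueMutex;
std::condition_variable queueCv;

// Receiving thread (the TCP server): only copies the incoming buffer onto the queue,
// so it stays effectively unblocked while the next message is arriving.
void onMessageReceived(std::vector<uint8_t> buffer) {
    {
        std::lock_guard<std::mutex> lock(queueMutex);
        messageQueue.push(std::move(buffer));
    }
    queueCv.notify_one();
}

// Processing thread: pops buffers and does the heavy decoding / OpenCV work.
void processingLoop() {
    while (true) {
        std::unique_lock<std::mutex> lock(queueMutex);
        queueCv.wait(lock, [] { return !messageQueue.empty(); });
        std::vector<uint8_t> buffer = std::move(messageQueue.front());
        messageQueue.pop();
        lock.unlock();
        // decodeAndStore(buffer);  // image decoding / acknowledgement parsing goes here
    }
}
```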

ESP_STATE_DIAGRAM

In this diagram, the actual sending of information is abstracted away from the main loop; the main loop, however, ensures these functions succeed by calling them until they report success and retrying upon failure (a false return).

State Diagram of the CaptureImage & sendAcknowledgement Functions

ESP_STATE_DIAGRAM_2
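A short sketch of that retry behaviour; the exact signatures of CaptureImage and sendAcknowledgement are assumptions based on the diagrams above:

```cpp
bool CaptureImage(bool laserOn);           // captures a frame and sends it; returns false on failure
bool sendAcknowledgement(float angleDeg);  // sends the current rotation angle; returns false on failure

// Main-loop pattern: each step is attempted until it reports success.
void runCaptureCycle(float currentAngle) {
  while (!CaptureImage(true))   delay(250);               // retry the laser-on (LLP) capture
  while (!CaptureImage(false))  delay(250);               // retry the laser-off (NLLP) capture
  while (!sendAcknowledgement(currentAngle)) delay(250);  // retry the acknowledgement
}
```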

Decoding & Pre-Processing of Image Datasets

After network communications between the ESP-32 CAM and Client Application have completed, the client-side application has received a directory full of images with associated metadata encoded in each title (a binary flag for pair-ordering and the angle in degrees).

In struct.h

struct LazerSlice {
    cv::Mat off_img;
    cv::Mat on_img;
    cv::Mat processed_matrix;

    std::vector<glm::vec3> list_3d_points;

    float angle;
};

The first step is to parse/decode these images into an ordered structure, with the decoded angle in degrees, off_img set to the 1_... image, and on_img set to the 0_... image. Each pair of images is cast to cv::Mat and used in pre-processing, which returns the processed_matrix, the matrix we'll reference in the reconstruction algorithms later on.

IMG

NOTE: When using the second reconstruction method, you shouldn't apply the Perspective Transformation, as the Camera Extrinsics are known and all interpolation of 2D points is done via the Laser's Planar Equation.

By the end of Decoding & Preprocessing, we should have a list of structs, std::vector<LazerSlice> slices = { ... }, where each image pair represents one list entry. (At this point, list_3d_points is still unassigned.)
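A sketch of what that decode step could look like, assuming filenames of the form <pairFlag>_<angle>.jpg (e.g. 0_12.5.jpg for the laser-on image and 1_12.5.jpg for the laser-off image); the real naming scheme and directory layout may differ:

```cpp
#include <opencv2/opencv.hpp>
#include <filesystem>
#include <map>
#include <string>
#include <vector>
#include "struct.h"  // LazerSlice

std::vector<LazerSlice> decodeDataset(const std::string& dir) {
    std::map<float, LazerSlice> byAngle;  // pair the ON/OFF images by their decoded angle
    for (const auto& entry : std::filesystem::directory_iterator(dir)) {
        std::string stem = entry.path().stem().string();   // e.g. "0_12.5"
        char pairFlag    = stem.front();                    // '0' -> laser on, '1' -> laser off
        float angle      = std::stof(stem.substr(2));       // angle in degrees from the title
        cv::Mat img      = cv::imread(entry.path().string());

        LazerSlice& slice = byAngle[angle];
        slice.angle = angle;
        if (pairFlag == '0') slice.on_img = img;            // on_img  <- 0_... image
        else                 slice.off_img = img;           // off_img <- 1_... image
    }
    std::vector<LazerSlice> slices;
    for (auto& [angle, slice] : byAngle) slices.push_back(std::move(slice));
    return slices;  // list_3d_points is still unassigned at this point
}
```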


Transforming 2D Points into Cylindrical Coordinates for subsequent translation into 3D Cartesian Space

After the Data Acquisition & Pre-processing phase, we've got a set of LazerSlices, where the processed_matrix member is essentially a Binary Mask whose activated points are the projected Line Laser at that specific angle.

More pre-processing happens to the Binary Masks, such as extracting one activated (X, Y) point per Y layer of the image, and an undistortion transformation to rectify camera distortion, since I find the ESP's lens slightly barrel-distorts. (The transformation is done with cv::undistortPoints, using the Camera Matrix & Distortion Coefficients found with the Python script.)
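A sketch of those two steps (one activated point per row, then cv::undistortPoints), assuming the binary mask is an 8-bit single-channel cv::Mat and that K and distCoeffs come from the Python calibration script:

```cpp
#include <opencv2/opencv.hpp>
#include <vector>

std::vector<cv::Point2f> extractLaserPoints(const cv::Mat& binaryMask,
                                            const cv::Mat& K,
                                            const cv::Mat& distCoeffs) {
    std::vector<cv::Point2f> raw;
    for (int y = 0; y < binaryMask.rows; ++y) {
        const uchar* row = binaryMask.ptr<uchar>(y);
        int sum = 0, count = 0;
        for (int x = 0; x < binaryMask.cols; ++x)
            if (row[x] > 0) { sum += x; ++count; }
        if (count > 0)  // keep one (X, Y) point per Y layer: the mean activated column
            raw.emplace_back(static_cast<float>(sum) / count, static_cast<float>(y));
    }
    std::vector<cv::Point2f> rectified;
    if (!raw.empty())
        // Passing K again as the new projection matrix keeps the output in pixel coordinates.
        cv::undistortPoints(raw, rectified, K, distCoeffs, cv::noArray(), K);
    return rectified;
}
```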

Now I've got a clean 2D Laser Line Projection on the Object at all captured angles (discrete values, from 0 to 360 degrees). From here I choose an Origin Point (on the X-axis) to define the center for all images, defined as the X-Midpoint.

Cylindrical-2-3D-cartesian

The algorithm takes a 2D image and conceptualizes it as a cross-sectional slice at a specific laser angle, denoted as Θ. This slice is represented in a cylindrical coordinate system, centered around an origin defined by the midpoint of the X-axis in the image. The X and Z coordinates of this slice are then transformed into Cartesian coordinates by dividing them by tan(Θ). Subsequently, these Cartesian coordinates are normalized to fit within the OpenGL coordinate range.

Finally, the list of 3D points extrapolated from the slice undergoes a rotational transformation: each vector (x, y, z) is multiplied by a (3 x 3) Rotation Matrix, using the slice's angle (LazerSlice.angle) as theta.

Observation: Going from cylindrical to Cartesian and then applying the rotation to the 3D points preserves the relative spatial relationships within the 3D points captured on each slice!
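A sketch of that slice-to-3D step under the description above; the axis conventions, the normalization factor, and exactly how the offset is divided by tan(Θ) are illustrative assumptions rather than the project's verified geometry:

```cpp
#include <glm/glm.hpp>
#include <glm/gtc/matrix_transform.hpp>
#include <cmath>
#include <vector>

std::vector<glm::vec3> sliceTo3D(const std::vector<glm::vec2>& laserPixels,
                                 float xMidpoint,      // chosen X-axis origin
                                 float laserAngleDeg,  // fixed line-laser angle (about 15 degrees)
                                 float sliceAngleDeg,  // LazerSlice.angle for this capture
                                 float normalize) {    // scale into the OpenGL coordinate range
    std::vector<glm::vec3> out;
    float tanTheta = std::tan(glm::radians(laserAngleDeg));
    // 3x3 rotation about the vertical (Y) axis by the slice's capture angle
    glm::mat3 R = glm::mat3(glm::rotate(glm::mat4(1.0f),
                                        glm::radians(sliceAngleDeg),
                                        glm::vec3(0.0f, 1.0f, 0.0f)));
    for (const glm::vec2& p : laserPixels) {
        float xOff = p.x - xMidpoint;       // offset from the X-Midpoint origin
        float z    = xOff / tanTheta;       // depth recovered from the laser's offset
        glm::vec3 cart(xOff / normalize, -p.y / normalize, z / normalize);
        out.push_back(R * cart);            // rotate into this slice's angular position
    }
    return out;
}
```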

3D Reconstruction (In Cartesian Space) using the Laser's Planar Equation & Camera Extrinsics (Translation Vector)

C++ Client Interface

Camera Calibration (Intrinsics & Extrinsics) & Laser Plane Estimation with Python


Resources & Inspiration for this project:

Main Inspiration:

hackster.io: 3D Scanning with Raspberry Pi and MATLAB

Very Useful Resources:

ece.cornell.edu, ECE 5725 Embedded OS Project: Laser 3D Scanner

Brown University: The Laser Slit 3D Scanner

Less Useful Resources:

The Complete Guide to 3D Scanners using Laser Triangulation

