Lantern — Phone Scan → 3D

Point a phone at a real object, get a clean, correctly-scaled, CAD-ready .glb mesh — reconstructed on-device, with the depth network running on the Snapdragon Hexagon NPU via ExecuTorch.

Built for the Qualcomm × Meta ExecuTorch Hackathon (Jun 27–28, 2026).

How it works

Faithful reconstruction (not generative): TSDF fusion of ARCore-aligned monocular depth maps.

RGB + ARCore pose + raw depth + intrinsics
   │
   ├─► Depth-Anything-3 Small ......... dense RELATIVE depth (affine-invariant)
   │     (DA-V2 Small = fallback)        NPU-optimized for SM8750 (~43 ms/frame)
   ├─► affine scale/shift solver ...... fit metric ≈ s·pred + t  vs ARCore sparse
   │                                     metric depth → real meters (solve s AND t)
   ├─► TSDF fusion (Open3D) ........... average many views → one clean surface
   ├─► marching cubes ................. raw .glb mesh
   └─► import_and_clean.py (Blender) .. watertight + normals + m→mm → CAD-ready .glb

Two depth sources by design: the monocular depth net gives a dense, smooth depth in unknown units; ARCore gives sparse-but-metric depth. The affine solver marries them so the mesh is both dense and correctly sized. See roadmap.md for the full execution plan, decision log, and risk register.

Depth net: Depth-Anything-3 Small (Apache-2.0, 24.7 M params, 518×518) is the host default and the on-device target — Qualcomm AI Hub already exports it NPU-optimized for the S25 SoC (SM8750) at ~43 ms/frame float on the Hexagon NPU. DA-V2 Small stays as the documented fallback. Generate disparities with depth_anything_v3.py; build the ExecuTorch .pte with export_da3_executorch.py (pip install -r requirements-da3.txt first). See LIVE_MESH_PLAN.md §5.0a for the research log.

pip install -r requirements-da3.txt
python3 depth_anything_v3.py --frames <dataset>/frames --output <dataset>/disparities
python3 export_da3_executorch.py --backend qnn --soc SM8750 -o da3_small_sm8750.pte

Target device

Galaxy S25 / S25+ / S25 Ultra → Snapdragon 8 Elite = SM8750 (Adreno 830 + Hexagon NPU).

⚠️ The S25 FE is Exynos, not Snapdragon — the QNN lane (QnnPartitioner/QnnQuantizer/SM8750) does not target it. On an FE unit, use ExecuTorch's Samsung ENN backend, or fall back to a CPU/GPU (XNNPACK) .pte.

Repo contents

File	What
`roadmap.md`	Full execution roadmap — DAG, 6 phases, 5 decisions, risk register, module cards
`depth_anything_v3.py`	DA3-Small host inference → pipeline disparities (drop-in for `depth_model.py`)
`export_da3_executorch.py`	DA3-Small → ExecuTorch `.pte` (XNNPACK CPU / QNN `SM8750` NPU)
`depth_model.py`	DA-V2 Small host inference (fallback depth net)
`requirements-da3.txt`	Heavy DA3/torch/ExecuTorch extras (kept out of core `requirements.txt`)
`import_and_clean.py`	Host-side mesh cleanup: raw TSDF mesh → watertight, scaled, CAD-ready `.glb`
`test_harness.sh`	Offline smoke test (generates a sphere fixture, runs the script, validates)
`orientation_test.sh`	Distinct-dims box test — verifies the pipeline is orientation-preserving

Mesh cleanup — `import_and_clean.py`

Takes the reconstruction's output mesh (.glb, also .obj/.ply/.stl) and produces a clean mesh: import → join parts → voxel-remesh to watertight → consistent normals → scale m→mm → export. Writes both a .glb (for viewers/web) and an .stl beside it — CAD tools (Fusion 360, FreeCAD) can't read .glb, so the .stl is the actual CAD handoff. Returns a non-zero exit on failure so the pipeline can gate on it.

Verify CAD-importability with cad_check.py (imports the STL through the OpenCASCADE kernel — FreeCAD's kernel — and reports imports-as-mesh-body and solid-convertible; needs cadquery-ocp).

blender --background --python import_and_clean.py -- input.glb output.glb

# options:
#   --scale  FLOAT   uniform scale (default 1000 = m→mm for CAD)
#   --voxel  FLOAT   voxel remesh size in meters (default 0.005); smaller = finer
#   --no-remesh      keep raw topology (normals still fixed)
#   --rotate-x DEG   optional frame correction (default 0 — usually unneeded)
#   --rotate-z DEG

Orientation note: Blender's glTF importer and exporter apply inverse +Y-up↔+Z-up conversions, so import → clean → export is orientation-preserving with no manual rotation (verified by orientation_test.sh). Use --rotate-x/--rotate-z only if a mesh comes in tipped.

Contract (input from the float pipeline): meters (1.0 = 1 m, ARCore world), glTF +Y-up. The script logs imported dims (pre-transform, m) every run — the fastest check that units are right (a coffee mug ≈ 0.10).

Run the tests

./test_harness.sh        # end-to-end smoke test
./orientation_test.sh    # axis-swap / orientation check

Both are offline (they generate their own fixtures) and exit non-zero on failure. Verified on Blender 5.1.

Status

Host-side (~80% of de-risking runs before the phone is unboxed): float pipeline + quantization in progress; mesh cleanup + validation complete and tested. Phone integration is the final phase, not the first.

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
app		app
arcore		arcore
da3_outputs		da3_outputs
demo		demo
docs		docs
gradle		gradle
models		models
pitch		pitch
pulled_models		pulled_models
real_session/arcore		real_session/arcore
scripts		scripts
tests		tests
.gitignore		.gitignore
AGENT.md		AGENT.md
ARCHITECTURE.md		ARCHITECTURE.md
CURRENTLY_WORKING.md		CURRENTLY_WORKING.md
FEATURES.md		FEATURES.md
LIVE_MESH_PLAN.md		LIVE_MESH_PLAN.md
MESH_ENHANCEMENT_PLAN.md		MESH_ENHANCEMENT_PLAN.md
OBJECT_TRACKING_PLAN.md		OBJECT_TRACKING_PLAN.md
QNN_RUNTIME_PLAN.md		QNN_RUNTIME_PLAN.md
QNN_SETUP.md		QNN_SETUP.md
README.md		README.md
REPO_STATUS.md		REPO_STATUS.md
RESPONSIBILITIES.md		RESPONSIBILITIES.md
SCANNING_UX_PROMPT.md		SCANNING_UX_PROMPT.md
STORYTELLING.md		STORYTELLING.md
VERIFY.md		VERIFY.md
analyze_session.py		analyze_session.py
android_session_to_pipeline.py		android_session_to_pipeline.py
box_complete.py		box_complete.py
build.gradle.kts		build.gradle.kts
cad_check.py		cad_check.py
charuco_pose.py		charuco_pose.py
convert_session.py		convert_session.py
cuboid_fit.py		cuboid_fit.py
da3_small_depth.pte		da3_small_depth.pte
depth_anything_v3.py		depth_anything_v3.py
depth_model.py		depth_model.py
depthanything_v2_small_qnn.pte		depthanything_v2_small_qnn.pte
export_da3_executorch.py		export_da3_executorch.py
export_da3_mv_executorch.py		export_da3_mv_executorch.py
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
ground_truth.py		ground_truth.py
import_and_clean.py		import_and_clean.py
inhand_tracker.py		inhand_tracker.py
multiview_da3_to_mesh.py		multiview_da3_to_mesh.py
object_frame.py		object_frame.py
offline_mesh.py		offline_mesh.py
orientation_test.sh		orientation_test.sh
partition_report.txt		partition_report.txt
pipeline_float.py		pipeline_float.py
profile_depth.py		profile_depth.py
pull_session.sh		pull_session.sh
requirements-da3.txt		requirements-da3.txt
requirements.txt		requirements.txt
roadmap.md		roadmap.md
scale_solver.py		scale_solver.py
settings.gradle.kts		settings.gradle.kts
test_depth.py		test_depth.py
test_harness.sh		test_harness.sh
tsdf_fuse.py		tsdf_fuse.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lantern — Phone Scan → 3D

How it works

Target device

Repo contents

Mesh cleanup — `import_and_clean.py`

Run the tests

Status

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Lantern — Phone Scan → 3D

How it works

Target device

Repo contents

Mesh cleanup — import_and_clean.py

Run the tests

Status

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Mesh cleanup — `import_and_clean.py`

Packages