Integrate QUADTRIX C Engine & PyTorch Weight Export Pipeline by Eamon2009 · Pull Request #3 · Eamon2009/Quadtrix.cpp

Eamon2009 · 2026-04-29T12:56:23Z

Description

This PR establishes the core infrastructure for the QUADTRIX Engine. It introduces a hybrid workflow in which models are trained in PyTorch and then exported to a custom binary format for high-performance, dependency-free inference in pure C.

Key Changes

Build System & Configuration

CMakePresets.json: Added standardized build profiles for MinGW (Debug/Release).
Torch Toggle: Introduced QUADTRIX_ENABLE_TORCH as a build-time flag. This allows developers to toggle LibTorch dependencies on or off, ensuring the C engine remains portable even on systems without PyTorch installed.
.gitignore: Configured to exclude heavy binaries (libtorch/), build artifacts, and Python virtual environments to keep the repository lean.

C Inference Engine (src/main.c or engine.c)

Binary Weight Loader: Implemented load_model() to parse .bin files using raw pointer offsets and fread, mapping data directly to C structs.
Transformer Architecture: Built out the structural support for Multi-head Attention, LayerNorm, and Feedforward blocks.
Inference Loop: Added the main generation loop with a rolling context buffer (memmove) and basic token sampling.
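
As a rough illustration of the inference loop described above, the following reference sketch mirrors the rolling context buffer and a basic sampling step. It is written in Python for brevity rather than taken from the C engine. The names `shift_context`, `sample_token`, `block_size`, and `temperature` are assumptions made for this example, and temperature-scaled softmax sampling is only one plausible reading of "basic token sampling".

```python
import math
import random


def shift_context(context, new_token, block_size):
    """Append new_token; once the window is full, drop the oldest token."""
    if len(context) < block_size:
        context.append(new_token)
    else:
        # Python analogue of memmove(ctx, ctx + 1, (block_size - 1) * sizeof(int))
        context[:-1] = context[1:]
        context[-1] = new_token
    return context


def sample_token(logits, temperature=1.0, rng=random):
    """Draw a token index from softmax(logits / temperature)."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(x - peak) for x in scaled]
    threshold = rng.random() * sum(weights)
    running = 0.0
    for index, weight in enumerate(weights):
        running += weight
        if threshold < running:
            return index
    return len(weights) - 1  # guard against rounding at the upper edge


if __name__ == "__main__":
    window = []
    for token in range(5):
        shift_context(window, token, block_size=3)
    print(window)
```

Shifting in place keeps the buffer at a fixed size, which is the property that lets the C version use a single preallocated array.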

Training & Export Pipeline (src/export_weights.py)

Hardware-Agnostic Training: Updated the Python logic to select the device with device = torch.device("cuda" if torch.cuda.is_available() else "cpu"), so the script uses CUDA when it is available and falls back to the CPU otherwise.
Serialization: Added a conversion script to flatten PyTorch .pt tensors into the specific binary sequence expected by the C engine's memory map.
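
The serialization step can be sketched as a round trip: write a small header plus flattened float32 tensors, then read them back at fixed offsets the way a loader based on fread would. Everything about the layout here is an assumption for illustration (the `MAGIC` tag, the int32 config header, little-endian byte order, and the tensor order); the PR does not document the actual format. Nested lists stand in for PyTorch tensors so the example runs without PyTorch installed.

```python
import io
import struct

MAGIC = 0x51545258  # hypothetical file tag, not taken from the PR


def flatten(nested):
    """Yield the values of a nested list in row-major order."""
    if isinstance(nested, (list, tuple)):
        for item in nested:
            yield from flatten(item)
    else:
        yield float(nested)


def export_weights(stream, config, tensors):
    """Write MAGIC, then int32 config values, then each tensor as float32."""
    stream.write(struct.pack("<I", MAGIC))
    stream.write(struct.pack("<%di" % len(config), *config))
    for tensor in tensors:
        flat = list(flatten(tensor))
        stream.write(struct.pack("<%df" % len(flat), *flat))


def read_weights(stream, n_config, sizes):
    """Read the layout back; sizes lists the element count of each tensor."""
    (magic,) = struct.unpack("<I", stream.read(4))
    if magic != MAGIC:
        raise ValueError("unexpected file tag")
    config = struct.unpack("<%di" % n_config, stream.read(4 * n_config))
    tensors = []
    for count in sizes:
        raw = stream.read(4 * count)
        if len(raw) != 4 * count:
            raise ValueError("file is truncated")
        tensors.append(list(struct.unpack("<%df" % count, raw)))
    return config, tensors


if __name__ == "__main__":
    buffer = io.BytesIO()
    export_weights(buffer, (2, 4), [[[1.0, 2.0], [3.0, 4.0]], [0.5, -0.5]])
    buffer.seek(0)
    print(read_weights(buffer, n_config=2, sizes=[4, 2]))
```

Because the file carries no per-tensor metadata in this sketch, the writer and the reader must agree on tensor order and sizes ahead of time, which is the same constraint a struct-mapped C loader imposes.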

Testing & Validation

src/torch_example.cpp: Created a lightweight smoke test to verify LibTorch linking and basic tensor math independently of the main engine.
(#2)

# Description

This PR synchronizes the model interaction logic across both the Python backend utilities and the web frontend. It establishes a consistent way to interface with the model weights and the C++ engine.

## Python Backend (inference.py)

- Goal: Refactor the standalone inference script to support modern weight loading.
- Weight Mapping: Updated to load and map .pt files directly using the refactored architecture.
- Chat Mode: Implemented a robust interactive loop for rapid model testing and verification.

## Frontend Layer (frontend/src/api)

- Goal: Establish the bridge between the UI and the Quadtrix engine.
- Service Definition: Created the base API client to handle requests to the C++ backend.
- Dual-Path Logic: Added handlers for both Training control and Inference/Chat endpoints.
- Stream Support: Prepared the API layer to handle "generation" data chunks for real-time UI updates.

## other PR merge #7 #6 #5 #4 #3

Eamon2009 added 7 commits April 29, 2026 17:40

feat:configure CMake to locate and enable PyTorch

45029fb

feat: add basic Torch smoke test in src/torch_example.cpp

34c52c2

Created a minimal C++ file to verify the LibTorch installation. Implemented a basic tensor operation to confirm successful linking.

chore: add .gitignore to exclude build artifacts and dependencies

a484b61

Ignored build directories and executable files. Excluded Python virtual environments (.venv). Prevented tracking of LibTorch binaries and model files (.pt, .bin).

feat: add device-agnostic training support in main.py

968b3d6

Implemented logic to automatically detect and utilize CUDA if available. Added CPU fallback to ensure the script runs on all hardware configurations.

feat: implement model loading and main inference loop in C

d3d4d4a

Added load_model to parse Transformer weights from binary files. Implemented the main generation loop with token sampling.

feat: add weight converter from PyTorch to binary format

581554b

Implemented a script to extract weights from .pt checkpoints. Formats and flattens tensors into a raw binary .bin file compatible with the C engine.

Delete CUDA directory avoiding merge conflicts

59f3576

Eamon2009 requested a review from codeaddict-119 April 29, 2026 12:56

Eamon2009 assigned Eamon2009 and codeaddict-119 Apr 29, 2026

Eamon2009 added the documentation and enhancement labels Apr 29, 2026

codeaddict-119 approved these changes Apr 29, 2026

codeaddict-119 merged commit 85a5433 into exp Apr 29, 2026

Eamon2009 mentioned this pull request May 1, 2026

Integrate Python Inference Refactor and Frontend API Layer #8

Merged

codeaddict-119 merged 7 commits into exp from master
