Deepbox

The TypeScript Toolkit for AI & Numerical Computing

Deepbox is a comprehensive, type-safe TypeScript library that unifies numerical computing, tabular data workflows, and machine learning into a single modular package. Zero runtime dependencies. 4 009 tests. Production-ready.

Inspired by NumPy, Pandas, PyTorch, scikit-learn, and Matplotlib.

Requirements

Node.js >= 24.13.0

Installation

npm install deepbox

Quick Start

import { tensor, add, parameter } from "deepbox/ndarray";
import { DataFrame } from "deepbox/dataframe";
import { LinearRegression } from "deepbox/ml";

// Element-wise tensor operations (operands here share a shape; broadcasting is also supported)
const a = tensor([
  [1, 2],
  [3, 4],
]);
const b = tensor([
  [5, 6],
  [7, 8],
]);
const c = add(a, b); // tensor([[6, 8], [10, 12]])

// Automatic differentiation
const x = parameter([2, 3]);
const y = x.mul(x).sum();
y.backward();
// x.grad -> tensor([4, 6])

// DataFrame operations
const df = new DataFrame({
  name: ["Alice", "Bob", "Charlie"],
  age: [25, 30, 35],
  score: [85, 90, 78],
});

// Machine learning (XTrain, yTrain, and XTest are assumed to be prepared beforehand)
const model = new LinearRegression();
model.fit(XTrain, yTrain);
const predictions = model.predict(XTest);

Prefer per-module imports for tree-shaking, or use namespaces from the root:

import * as ds from "deepbox";
const t = ds.ndarray.tensor([1, 2, 3]);

Modules

Module	What it provides	Inspired by
`deepbox/core`	Types, errors, validation, dtype helpers, configuration	-
`deepbox/ndarray`	N-D tensors with autograd, broadcasting, 90+ ops, sparse matrices	NumPy, PyTorch
`deepbox/linalg`	SVD, QR, LU, Cholesky, eigenvalue decomposition, solvers, norms	NumPy/SciPy
`deepbox/dataframe`	DataFrame + Series with 50+ operations, CSV I/O	Pandas
`deepbox/stats`	Descriptive stats, correlations, hypothesis tests (t-test, ANOVA, chi-square, etc.)	SciPy/statsmodels
`deepbox/metrics`	40+ ML metrics (classification, regression, clustering)	scikit-learn
`deepbox/preprocess`	Scalers, encoders, normalizers, cross-validation splits	scikit-learn
`deepbox/ml`	Classical ML (Linear, Ridge, Lasso, Logistic, Trees, SVM, KNN, Naive Bayes, Ensembles)	scikit-learn
`deepbox/nn`	Neural networks (Linear, Conv, RNN/LSTM/GRU, Attention, Normalization, Losses)	PyTorch
`deepbox/optim`	Optimizers (SGD, Adam, AdamW, RMSprop, etc.) + LR schedulers	PyTorch
`deepbox/random`	Distributions (uniform, normal, binomial, gamma, beta, etc.) + sampling	NumPy.random
`deepbox/datasets`	Built-in datasets (Iris, Digits, Breast Cancer, etc.) + synthetic generators	scikit-learn/PyTorch
`deepbox/plot`	SVG/PNG plotting (scatter, line, bar, hist, heatmap, contour, ML plots)	Matplotlib

Features

N-Dimensional Arrays

90+ operations: arithmetic, trigonometric, logical, reductions, sorting, manipulation
Automatic differentiation: GradTensor with reverse-mode backpropagation
Broadcasting: NumPy-compatible semantics
Sparse matrices: CSR format with arithmetic and matrix operations
Multiple dtypes: float32, float64, int32, int64, uint8, bool, string
Activation functions: ReLU, Sigmoid, Softmax, GELU, Mish, Swish, ELU, LeakyReLU
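
The broadcasting rule mentioned in the list above can be illustrated without the library. The following is a plain-TypeScript sketch of NumPy's shape-compatibility rule, not Deepbox's implementation; `broadcastShapes` is a hypothetical helper name introduced only for this example.

```typescript
// Plain-TypeScript illustration of NumPy-style shape broadcasting.
// `broadcastShapes` is a hypothetical helper, not part of the Deepbox API.
function broadcastShapes(a: number[], b: number[]): number[] {
  const rank = Math.max(a.length, b.length);
  const out: number[] = new Array<number>(rank).fill(1);
  for (let i = 0; i < rank; i++) {
    // Align shapes from the trailing dimension; missing dimensions count as 1.
    const da = a[a.length - 1 - i] ?? 1;
    const db = b[b.length - 1 - i] ?? 1;
    if (da !== db && da !== 1 && db !== 1) {
      throw new Error(`Shapes [${a}] and [${b}] are not broadcastable`);
    }
    // A dimension of size 1 stretches to match the other operand.
    out[rank - 1 - i] = da === 1 ? db : da;
  }
  return out;
}

console.log(broadcastShapes([2, 2], [2])); // [ 2, 2 ]
console.log(broadcastShapes([3, 1], [1, 4])); // [ 3, 4 ]
```

Two dimensions are compatible when they are equal or when one of them is 1; any other combination is rejected.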

DataFrames & Series

Pandas-like API: filtering, grouping, joining, merging, pivoting, sorting
CSV I/O: read and write CSV files
Descriptive statistics: describe(), value counts, correlation matrices

Linear Algebra

Decompositions: SVD, QR, LU, Cholesky, Eigenvalue (eig, eigh, eigvals, eigvalsh)
Solvers: solve(), lstsq(), solveTriangular()
Properties: det(), trace(), matrixRank(), cond(), slogdet()
Norms: norm() (L1, L2, Frobenius, nuclear, inf)
Inverse: inv(), pinv()

Statistics

Descriptive: mean, median, mode, variance, std, skewness, kurtosis, quantile, percentile
Correlations: Pearson, Spearman, Kendall tau
Hypothesis tests: t-tests (1-sample, independent, paired), ANOVA, chi-square, Shapiro-Wilk, Mann-Whitney U, Kruskal-Wallis, Friedman, Anderson-Darling, KS test
Variance tests: Levene, Bartlett
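
For readers unfamiliar with the correlation measures listed above, here is a minimal plain-TypeScript sketch of the Pearson coefficient. It is an illustration of the textbook formula, not the `deepbox/stats` implementation, and `pearson` is a hypothetical function name.

```typescript
// Plain-TypeScript sketch of the Pearson correlation coefficient.
// `pearson` is a hypothetical helper, not the deepbox/stats API.
function pearson(xs: number[], ys: number[]): number {
  if (xs.length !== ys.length || xs.length < 2) {
    throw new Error("Inputs must have equal length of at least 2");
  }
  const n = xs.length;
  const meanX = xs.reduce((acc, v) => acc + v, 0) / n;
  const meanY = ys.reduce((acc, v) => acc + v, 0) / n;
  let cov = 0;
  let varX = 0;
  let varY = 0;
  for (let i = 0; i < n; i++) {
    const dx = xs[i] - meanX;
    const dy = ys[i] - meanY;
    cov += dx * dy;
    varX += dx * dx;
    varY += dy * dy;
  }
  // Returns NaN when either input is constant (zero variance).
  return cov / Math.sqrt(varX * varY);
}

console.log(pearson([1, 2, 3], [2, 4, 6])); // perfectly linear, so close to 1
console.log(pearson([1, 2, 3], [6, 4, 2])); // perfectly inverse, so close to -1
```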

Machine Learning

Linear models: LinearRegression, Ridge, Lasso, LogisticRegression
Tree-based: DecisionTreeClassifier/Regressor, RandomForestClassifier/Regressor
Ensemble: GradientBoostingClassifier/Regressor
SVM: LinearSVC, LinearSVR
Neighbors: KNeighborsClassifier, KNeighborsRegressor
Naive Bayes: GaussianNB
Clustering: KMeans, DBSCAN
Dimensionality reduction: PCA, t-SNE

Neural Networks

Layers: Linear, Conv1d, Conv2d, MaxPool2d, AvgPool2d
Recurrent: RNN, LSTM, GRU
Attention: MultiheadAttention, TransformerEncoderLayer
Normalization: BatchNorm1d, LayerNorm
Regularization: Dropout
Activations: ReLU, Sigmoid, Tanh, GELU, Mish, Swish, Softmax, LogSoftmax, ELU, LeakyReLU, Softplus
Losses: mseLoss, maeLoss, crossEntropyLoss, binaryCrossEntropyLoss, binaryCrossEntropyWithLogitsLoss, huberLoss, rmseLoss
Containers: Sequential

Optimization

Optimizers: SGD (with momentum), Adam, AdamW, Nadam, RMSprop, Adagrad, AdaDelta
LR Schedulers: StepLR, MultiStepLR, ExponentialLR, CosineAnnealingLR, LinearLR, OneCycleLR, ReduceLROnPlateau, WarmupLR
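
As a rough illustration of what a step scheduler computes, the sketch below implements the step-decay rule as PyTorch defines it for `StepLR`: the learning rate is multiplied by `gamma` once every `stepSize` epochs. That Deepbox's `StepLR` follows the same rule is an assumption, and `stepLr` with its parameter names is hypothetical, not the `deepbox/optim` API.

```typescript
// Step decay: lr(epoch) = baseLr * gamma ^ floor(epoch / stepSize).
// `stepLr` and its parameter names are hypothetical, for illustration only.
function stepLr(baseLr: number, gamma: number, stepSize: number, epoch: number): number {
  if (stepSize <= 0) {
    throw new Error("stepSize must be positive");
  }
  return baseLr * Math.pow(gamma, Math.floor(epoch / stepSize));
}

// With baseLr = 0.1, gamma = 0.5, stepSize = 10:
// epochs 0-9 use 0.1, epochs 10-19 use 0.05, epochs 20-29 use 0.025.
for (const epoch of [0, 9, 10, 25]) {
  console.log(epoch, stepLr(0.1, 0.5, 10, epoch));
}
```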

Preprocessing

Scalers: StandardScaler, MinMaxScaler, RobustScaler, MaxAbsScaler, Normalizer, PowerTransformer, QuantileTransformer
Encoders: LabelEncoder, OneHotEncoder, OrdinalEncoder, LabelBinarizer, MultiLabelBinarizer
Splitting: trainTestSplit, KFold, StratifiedKFold, GroupKFold, LeaveOneOut, LeavePOut

Visualization

Plot types: scatter, line, bar, histogram, heatmap, contour, box plot, violin plot, pie chart
ML plots: confusion matrix, ROC curve, precision-recall curve, learning curves, validation curves, decision boundaries
Output: SVG (browser + Node.js), PNG (Node.js only)

Examples

Automatic Differentiation

import { parameter } from "deepbox/ndarray";

const x = parameter([
  [1, 2],
  [3, 4],
]);
const w = parameter([[0.5], [0.5]]);
const y = x.matmul(w).sum();
y.backward();
// x.grad -> gradients w.r.t. x
// w.grad -> gradients w.r.t. w
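
The gradients in this example can be checked by hand. For y = sum(x · w), the upstream gradient with respect to the product x · w is a matrix of ones, so the gradient with respect to x is ones · wᵀ and the gradient with respect to w is xᵀ · ones. The sketch below reproduces that calculation in plain TypeScript with hypothetical `matmul` and `transpose` helpers; it does not use Deepbox.

```typescript
// Hand-check of the gradients for y = sum(x · w), using plain arrays.
// `matmul` and `transpose` are local helpers, not Deepbox functions.
type Matrix = number[][];

function matmul(a: Matrix, b: Matrix): Matrix {
  return a.map((row) =>
    b[0].map((_, j) => row.reduce((acc, v, k) => acc + v * b[k][j], 0)),
  );
}

function transpose(m: Matrix): Matrix {
  return m[0].map((_, j) => m.map((row) => row[j]));
}

const xData: Matrix = [
  [1, 2],
  [3, 4],
];
const wData: Matrix = [[0.5], [0.5]];

// d(sum)/d(x · w): one entry per element of the 2x1 product.
const upstream: Matrix = [[1], [1]];

const gradX = matmul(upstream, transpose(wData));
const gradW = matmul(transpose(xData), upstream);

console.log(gradX); // [ [ 0.5, 0.5 ], [ 0.5, 0.5 ] ]
console.log(gradW); // [ [ 4 ], [ 6 ] ]
```

Each row of the x gradient is a copy of wᵀ, and each entry of the w gradient is a column sum of x.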

Neural Network Training

import { Sequential, Linear, ReLU, Dropout, mseLoss } from "deepbox/nn";
import { Adam } from "deepbox/optim";

const model = new Sequential(
  new Linear(10, 64),
  new ReLU(),
  new Dropout(0.2),
  new Linear(64, 32),
  new ReLU(),
  new Linear(32, 1),
);

const optimizer = new Adam(model.parameters(), { lr: 0.001 });

// xTrain and yTrain are assumed to be prepared beforehand, with shapes [n, 10] and [n, 1]
for (let epoch = 0; epoch < 100; epoch++) {
  const output = model.forward(xTrain);
  const loss = mseLoss(output, yTrain);
  optimizer.zeroGrad();
  loss.backward();
  optimizer.step();
}

ML Pipeline

import { trainTestSplit, StandardScaler } from "deepbox/preprocess";
import { RandomForestClassifier } from "deepbox/ml";
import { accuracy, f1Score } from "deepbox/metrics";

// X (features) and y (labels) are assumed to be loaded beforehand
const [XTrain, XTest, yTrain, yTest] = trainTestSplit(X, y, {
  testSize: 0.2,
  randomState: 42,
});

const scaler = new StandardScaler();
scaler.fit(XTrain);
const XTrainScaled = scaler.transform(XTrain);
const XTestScaled = scaler.transform(XTest);

const model = new RandomForestClassifier({ nEstimators: 100, maxDepth: 10 });
model.fit(XTrainScaled, yTrain);

const yPred = model.predict(XTestScaled);
console.log("Accuracy:", accuracy(yTest, yPred));
console.log("F1 Score:", f1Score(yTest, yPred));
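
The pipeline fits the scaler on the training split only and reuses those statistics for the test split, which keeps test data from leaking into preprocessing. The plain-TypeScript sketch below shows the idea with hypothetical helpers (`fitScaler`, `transformWith`). It uses the population standard deviation, as scikit-learn's `StandardScaler` does; whether Deepbox makes the same choice is an assumption.

```typescript
// Standardization sketch: statistics come from the training rows only.
// `fitScaler` and `transformWith` are hypothetical helpers, not the Deepbox API.
interface ScalerStats {
  mean: number[];
  std: number[];
}

function fitScaler(rows: number[][]): ScalerStats {
  const n = rows.length;
  const cols = rows[0].length;
  const mean = new Array<number>(cols).fill(0);
  const variance = new Array<number>(cols).fill(0);
  for (const row of rows) {
    for (let j = 0; j < cols; j++) mean[j] += row[j] / n;
  }
  for (const row of rows) {
    for (let j = 0; j < cols; j++) variance[j] += (row[j] - mean[j]) ** 2 / n;
  }
  // Fall back to 1 for constant columns to avoid division by zero.
  const std = variance.map((v) => Math.sqrt(v) || 1);
  return { mean, std };
}

function transformWith(stats: ScalerStats, rows: number[][]): number[][] {
  return rows.map((row) => row.map((v, j) => (v - stats.mean[j]) / stats.std[j]));
}

const trainRows = [
  [1, 10],
  [3, 30],
];
const testRows = [[5, 20]];

const stats = fitScaler(trainRows); // mean [2, 20], std [1, 10]
console.log(transformWith(stats, trainRows)); // [ [ -1, -1 ], [ 1, 1 ] ]
console.log(transformWith(stats, testRows)); // [ [ 3, 0 ] ]
```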

Classical ML Models

import {
  DecisionTreeClassifier,
  GradientBoostingClassifier,
  KNeighborsClassifier,
  LinearSVC,
} from "deepbox/ml";

// XTrain and yTrain are assumed to come from a split such as the one in the ML Pipeline example
const tree = new DecisionTreeClassifier({ maxDepth: 5 });
tree.fit(XTrain, yTrain);

const gb = new GradientBoostingClassifier({
  nEstimators: 100,
  learningRate: 0.1,
});
gb.fit(XTrain, yTrain);

const knn = new KNeighborsClassifier({ nNeighbors: 5 });
knn.fit(XTrain, yTrain);

const svm = new LinearSVC({ C: 1.0 });
svm.fit(XTrain, yTrain);

DataFrame Operations

import { DataFrame } from "deepbox/dataframe";

const df = new DataFrame({
  name: ["Alice", "Bob", "Charlie", "David"],
  age: [25, 30, 35, 28],
  salary: [50000, 60000, 75000, 55000],
  department: ["IT", "HR", "IT", "HR"],
});

const itDept = df.filter((row) => row.department === "IT");
const avgSalary = df.groupBy("department").agg({ salary: "mean" });
const sorted = df.sort("salary", { ascending: false });

Plotting

import { scatter, plot, hist, heatmap, saveFig } from "deepbox/plot";
import { tensor } from "deepbox/ndarray";

scatter(tensor([1, 2, 3, 4, 5]), tensor([2, 4, 5, 4, 6]), { color: "#1f77b4" });
plot(tensor([1, 2, 3, 4, 5]), tensor([2, 4, 5, 4, 6]), { color: "#ff7f0e" });
hist(tensor([1, 2, 2, 3, 3, 3, 4, 4, 5]), { bins: 5 });
heatmap(
  tensor([
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 9],
  ]),
);
saveFig("output.svg");

Performance

Deepbox is pure TypeScript: no native addons, no WebAssembly, no C bindings. Every operation runs on V8's JIT compiler with TypedArray backing. The Python libraries it is benchmarked against use hand-tuned C and Fortran backends (BLAS, LAPACK, ATen), and they win most of the comparisons below (346 of 542). Deepbox is faster in the remaining 196, and it wins the majority of benchmarks in plotting, metrics, and statistics.

542 head-to-head benchmarks across 10 categories, tested on the same machine with identical data sizes and iteration counts:

Category	Deepbox Wins	Python Package Wins	Competing Against
DataFrames	24	31	Pandas (C / Cython)
Datasets	11	30	scikit-learn
Linear Algebra	0	54	NumPy + SciPy (LAPACK)
Metrics	45	18	scikit-learn (C / Cython)
ML Training	16	32	scikit-learn (C / Cython)
NDArray Ops	6	88	NumPy (C / BLAS)
Plotting	43	0	Matplotlib (C / Agg)
Preprocessing	20	23	scikit-learn (C / Cython)
Random	0	44	NumPy (C)
Statistics	31	26	SciPy (C / Fortran)
Total	196	346

Where Deepbox shines

bar (200 bars) — 16488.6x faster (Plotting)
KNeighborsClassifier fit (200x5) — 48.4x faster (ML Training)
transpose (500x500) — 43.9x faster (NDArray Ops)
fbetaScore (β=0.5) (1K) — 34.0x faster (Metrics)
describe (100x5) — 29.4x faster (DataFrames)
chisquare (10 bins) — 18.5x faster (Statistics)
loadLinnerud (20x3) — 15.3x faster (Datasets)
PowerTransformer fit (500x10) — 6.3x faster (Preprocessing)

Context

Python’s numerical libraries delegate heavy lifting to compiled C/Fortran code (OpenBLAS, MKL, LAPACK). Deepbox implements everything in TypeScript, relying on V8’s TurboFan JIT and Float64Array for performance. The gap is largest for BLAS-bound operations (matmul, decompositions) and smallest for memory-layout operations (transpose, reshape, indexing) where Deepbox’s lazy-view architecture has an advantage.
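
One way a lazy view can make transpose cheap is to swap shape and stride metadata instead of copying elements. The sketch below shows that idea in plain TypeScript over a `Float64Array`. It assumes this is roughly what "lazy-view" means here; the `View2D` type and helper names are hypothetical and do not describe Deepbox's internals.

```typescript
// Strided 2-D view: transpose swaps metadata and shares the same buffer.
// `View2D`, `fromRows`, `transposeView`, and `at` are hypothetical names.
interface View2D {
  data: Float64Array;
  shape: [number, number];
  strides: [number, number];
}

function fromRows(rows: number[][]): View2D {
  const r = rows.length;
  const c = rows[0].length;
  const data = new Float64Array(r * c);
  rows.forEach((row, i) => row.forEach((v, j) => { data[i * c + j] = v; }));
  // Row-major layout: moving one row skips c elements, one column skips 1.
  return { data, shape: [r, c], strides: [c, 1] };
}

function transposeView(v: View2D): View2D {
  // Constant-time: no element is read or written.
  return {
    data: v.data,
    shape: [v.shape[1], v.shape[0]],
    strides: [v.strides[1], v.strides[0]],
  };
}

function at(v: View2D, i: number, j: number): number {
  return v.data[i * v.strides[0] + j * v.strides[1]];
}

const m = fromRows([
  [1, 2, 3],
  [4, 5, 6],
]);
const t = transposeView(m);

console.log(t.shape); // [ 3, 2 ]
console.log(at(t, 2, 1)); // 6, the same element as m[1][2]
console.log(t.data === m.data); // true, the buffer is shared
```

The cost moves to element access, which now goes through the strides, so a later operation that needs contiguous memory may still have to copy.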

Run npm run bench:all to reproduce. Full results in benchmarks/RESULTS.md.

Development

See CONTRIBUTING.md for the full development workflow.

npm install         # Install dependencies
npm run build       # Build the package
npm test            # Run 4 009 tests
npm run typecheck   # Type checking
npm run lint        # Lint with Biome
npm run format      # Format with Biome
npm run all         # Run all checks

License

MIT License — see LICENSE for details.

Built by Jehaad Aljohani
