Skip to content
View byoung77's full-sized avatar

Block or report byoung77

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
byoung77/README.md

Brent Young, Ph.D.

Quantitative Modeling | Data Science | Machine Learning Systems

I am an applied mathematician and data scientist with 10+ years of experience designing quantitative models, building machine learning systems, and working with distributed computing frameworks. My work focuses on connecting mathematical theory with practical implementation, especially in machine learning, probabilistic modeling, and scalable data systems.

Example output from fractal visualization tool (C/GTK project below)


πŸ” Current Focus

  • Approximate Nearest Neighbor (ANN) algorithms
  • Retrieval-Augmented Generation (RAG) systems
  • Bayesian modeling and change point detection
  • Distributed computing (Hadoop, Spark)
  • From-scratch implementations of machine learning algorithms

πŸš€ Featured Projects

πŸ”— Approximate Nearest Neighbor (ANN) System

Layered graph ANN structure inspired by HNSW, emphasizing clarity, experimentation, and tunable performance.

  • Cosine and Euclidean similarity
  • Batch construction and incremental insertion
  • Graph connectivity via MST-based reconnection
  • Empirical evaluation (recall, inflation, timing)

πŸ‘‰ https://github.com/byoung77/Approximate-Nearest-Neighbors-Project

Build Times vs. Dataset Size for ANN System


🧠 Neural Network (NumPy, From Scratch)

Configurable feedforward neural network implemented from first principles.

  • Implemented generic feedforward architecture with user-defined layers
  • Coded backpropagation and gradient updates manually
  • Designed modular training loop with support for classification and regression
  • Trained model exposed as a reusable callable class

πŸ‘‰ https://github.com/byoung77/Neural-Net-Implementation

Neural Net Training Loss for Two Moons Dataset


πŸ“š Doctor Who Oracle (RAG System)

End-to-end retrieval-augmented generation system with vector search and reranking.

  • FAISS-based vector search
  • Cross-encoder reranking
  • Citation-aware responses
  • Desktop GUI for interactive querying

πŸ‘‰ https://github.com/byoung77/Doctor-Who-Oracle

Dr. Who Oracle Interface


πŸ’‘ Lights Out

An interactive Python implementation of the classic Lights Out puzzle, extended with multiple algebraic state spaces and nontrivial topological grids.

πŸ‘‰ https://github.com/byoung77/lights-out


πŸ’Ύ Committee Appeals Database

Built a Python/MySQL/LaTeX system to replace fragmented committee records stored across multiple documents, enabling searchable history and automated generation of professional PDF reports.

πŸ‘‰ https://github.com/byoung77/committee-appeals-db


πŸŒ€ Fractal Explorer (C / GTK)

Interactive fractal visualization tool built in C.

  • Mandelbrot and Julia sets
  • Real-time zoom and navigation
  • Custom function exploration

πŸ‘‰ https://github.com/byoung77/GUI-Fractal-Project


πŸ“ˆ HDP-HMM with Topological Emissions (Julia)

Bayesian nonparametric model for change point detection.

  • Hierarchical Dirichlet Process Hidden Markov Model
  • Integration of topological data analysis features
  • Full inference pipeline in Julia

πŸ‘‰ https://github.com/byoung77/hdp-hmm-te


πŸ› οΈ Technical Skills

Programming: Python, Julia, Go, C
Machine Learning: Bayesian modeling, HMMs, clustering, topological data analysis, neural networks
Data & Distributed Systems: Hadoop, Spark, MapReduce
Tools: Git, Linux, LaTeX


πŸ§ͺ Selected Work

  • Built and deployed distributed analytics experiments on an 8-node Hadoop/Spark cluster
  • Designed Bayesian nonparametric change-point detection model (HDP-HMM)
  • Implemented machine learning systems from first principles
  • Developed retrieval-augmented generation (RAG) system with GUI interface

πŸŽ“ Background

  • Ph.D., Mathematics β€” Rutgers University
  • M.S., Data Science β€” Fordham University (2025)
  • Associate Professor of Mathematics, Wilkes University (2015–Present)

My academic work includes mathematical modeling, probability, machine learning, and computational methods.


🎯 Approach

I focus on:

  • Building systems from first principles to understand core mechanics
  • Bridging theory and implementation
  • Evaluating trade-offs through experimentation

πŸ“« Contact


πŸ“„ License

All projects are released under the MIT License unless otherwise noted.

Popular repositories Loading

  1. hdp-hmm-te hdp-hmm-te Public

    Disentangled Sticky Hierarchical Dirichlet Process Hidden Markov Model with Topological Emissions

    Julia

  2. GUI-Fractal-Project GUI-Fractal-Project Public

    Interactive fractal explorer written in C using GTK that renders Julia sets and other fractals with user-defined complex functions.

    C

  3. Doctor-Who-Oracle Doctor-Who-Oracle Public

    A retrieval-augmented chatbot for answering Doctor Who trivia using Wikipedia and FAISS.

    Python

  4. Neural-Net-Implementation Neural-Net-Implementation Public

    From-scratch neural network implementation in NumPy supporting regression, classification, and training diagnostics.

    Python

  5. Approximate-Nearest-Neighbors-Project Approximate-Nearest-Neighbors-Project Public

    Graph-based approximate nearest neighbor search with custom indexing, dynamic insertion, and tunable search accuracy.

    Python

  6. byoung77 byoung77 Public

    Profile README showcasing work in machine learning, applied mathematics, and software systems.