Skip to content
View thiswillbeyourgithub's full-sized avatar

Block or report thiswillbeyourgithub

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Hi there πŸ‘‹

Self-taught developer, currently a medical student, hopefully psychiatry resident soon!

Interests: everything AI & Neuroscience🧠, including in-betweens and even-remotely-related. This obviously includes computational psychiatry.

Approach & Personality: investing my time by coding from the grounds up, hate closed source systems and forcing others to reinvent the same stack, love collaboration, love distributed and resilient systems, love interoperability, love repairing stuff.

To get in touch: one of my email is "2hf immediately followed by w1bgt at mailer.me", or invite me to a private repository to exchange via other means.

I also have a "professional" github account.

Github Projects

To browser my projects, you can also go to my stars page and look at lists: they contain all my public projects (and only my projects) but are thematically organized. Both those "starred lists" and the list below should exhaustively list my public projects. A few of them are pinned to my profile page.

All code is almost always only done by me.

Until somewhere 2025 no AI was used, all code with my actual fingers. Since 2025 I'm using more and more LLMs.

I have some other private repositories that I don't publish here.

The list below was compiled on July 27th 2025.

Click for summaries of my github projects

Medicine & Computer Science

  • wdoc: Multi-modal document querying and summarization system integrating diverse content types with advanced RAG capabilities and LLM-powered semantic processing for comprehensive information extraction.
    • OmniQA: Document indexing tool for seamless querying of any file format using large language models and vector search technology. Was superseeded by wdoc.
  • KnQuant: Advanced text processing library that converts unstructured text into searchable knowledge triplets using multi-modal embeddings and parallel processing architecture.
  • QuestEA: Research project merging numerical survey responses with semantic question embeddings to enhance patient data analysis and uncover hidden patterns in medical questionnaires.
  • Beta-Variational-Autoencoder: Deep learning model implementing beta-variational autoencoder architecture for dimensionality reduction and data compression using the scikit-learn API.
  • TaguchiGridSearchConverter: Statistical optimization tool leveraging Taguchi orthogonal arrays to reduce hyperparameter search space while maintaining effective parameter coverage in machine learning experiments.
  • save_to_zotero: Advanced command-line tool for automated webpage archival to PDF with metadata extraction and seamless integration into Zotero's reference management system across multiple devices.
  • med-pharmacokinetic-simulator: Pharmacokinetic simulation tool that models and visualizes medication blood concentration over time to optimize dosing schedules.
  • ADHD_european_drug_map: Data visualization tool for tracking and mapping the availability of ADHD medications across European countries using official EMA data sources.

Anki

Anki is an open source flashcard/spaced repetition memorization system

  • Voice2Anki: Universal language learning tool combining speech recognition and AI to automatically generate and manage personalized flashcards from voice recordings with adaptive formatting and intelligent memory features.
  • AnkiAiUtils): Advanced toolkit that enhances Anki flashcards through automated AI-powered generation of explanations, mnemonics, illustrations and reformulations, with support for personalized memory systems and semantic similarity matching.
  • AnnA_anki_neuronal_Appendix: Machine learning system that optimizes Anki flashcard review order by analyzing semantic similarities to prevent reviewing related cards together, improving retention while reducing daily workload.
  • py_ankiconnect: Library that simplifies interaction with Anki flashcard software through its AnkiConnect API, offering both command-line and Python interfaces.
  • AnkiAutoMindmap: Tool for automated generation of mindmaps and mermaid diagrams based on Anki flashcard content and tag hierarchies.
  • i3_seach_anki_collection: Integration tool enabling quick Anki flashcard searches directly from the i3 window manager interface.
  • HapaxPredator: Text analysis tool for identifying and displaying word frequencies in Anki flashcards to detect potential misspellings and facilitate natural language processing tasks.
  • IndexableAnki: Tool that converts Anki flashcards into searchable text files for desktop search engine integration.
  • anki_PrioriTag: Tool for automatically prioritizing Anki flashcard reviews by analyzing tag-based failure patterns and generating targeted filtered decks for optimal learning efficiency.
  • anki_autobury_added_today: Automation utility that identifies and buries newly added Anki cards to prevent them from appearing in the same study session.
  • Anki Semantic Search: Semantic search engine utilizing fastText's multilingual word vectors to intelligently search through Anki flashcard collections based on meaning rather than exact matches.
  • pdf2anki: Tool for batch-converting PDF documents into Anki flashcards with searchable text and page images, enabling multi-keyword search across PDF content within Anki's interface.
  • clozolkor: Advanced spaced repetition flashcard template enabling sequential reveal of cloze deletions with customizable hints, shortcuts, and cross-platform compatibility.

Karakeep

Karakeep is an open source read it later app

  • karakeep_python_api: Python interface and command-line tool providing comprehensive access to Karakeep's bookmarking API with automated testing, data validation, and community-contributed scripts.
  • Karanki: Bidirectional synchronization tool for automating flashcard creation from highlights with color-coded retention levels and intelligent state tracking capabilities.
  • freshrss_to_karakeep: Integration tool connecting FreshRSS RSS reader with Karakeep bookmarking service for automated content curation and seamless article transfer between platforms.

Logseq

Logseq is an open source PKM (Personal Knowledge Management) app

  • LogseqMarkdownParser: Parser and utility library supporting seamless reading, manipulation and export of Logseq markdown files with properties and blocks.
  • wallabag_to_logseq_and_omnivore: Python script for seamless migration of articles and highlights from Wallabag to Logseq and Omnivore reading platforms, ensuring continuity of reading history and annotations.
  • LogseqPDFImporter: Tool for importing and converting PDF annotations from various PDF readers into Logseq's native format, preserving highlights, colors, and area annotations.
  • MdXLogseqTODOSync: Synchronization tool for maintaining TODO items across different markdown file formats with support for pattern filtering, custom delimiters, and nested bullet point preservation.

Open-WebUI

Open-WebUI is a self hosted AI platform

  • openwebui_custom_pipes_filters: Comprehensive suite of custom filters, tools, and pipes designed to enhance OpenWebUI's functionality with message tracking, cost monitoring, and extended chat capabilities.

Smartwatch

Mainly for wasp-os on the pinetime

  • sleep_tracker_pinetime: Privacy-oriented sleep tracking and smart alarm system utilizing accelerometer and heart rate data to optimize wake-up times based on sleep cycle analysis and gradual wake-up patterns.
  • pomodoro_wasp_os: Customizable Pomodoro timer application for PineTime smartwatches featuring multiple presets, vibration patterns, and persistent settings.

API

I made my own "reference" libraries to make my other projects more interoperable

  • freshrss_python_api: Python wrapper and API client for the FreshRSS Fever API, enabling programmatic management of RSS feeds, items, and categories with robust error handling and type safety.
  • caldav_tasks_api: Python library and CLI tool for advanced CalDAV task management with features like task synchronization, ELO-based prioritization, and comprehensive task manipulation capabilities.
  • karakeep_python_api: See above
  • py_ankiconnect: See above

Productivity

Tools I use, used or made

  • mini_LiTOY: Minimalist implementation of the ELO-based task prioritization algorithm, enabling automated ranking of to-do lists through interactive comparisons and score calculations.
    • LiTOY: Smart prioritization tool utilizing pairwise comparisons and ELO ratings to rank personal goals based on importance and time investment.
  • BrownieCutter: Template generator for Python projects that automates boilerplate code creation and project structure setup with customizable configurations.
  • zsh-ai: Terminal companion that leverages AI language models to suggest and explain shell commands based on natural language queries with fuzzy-finder integration.
  • HAL: Email summarization and labeling tool that processes daily incoming messages, leveraging LLMs to generate concise summaries and assign categorization labels while maintaining robust security measures and cost controls.
  • github_discussion_parser: Parser tool that extracts and formats GitHub repository discussions into XML-like Markdown files optimized for Large Language Model processing.
  • systemd_unit_maker: Command-line automation tool for generating and managing systemd service and timer units with customizable templates and interactive configuration options.
  • SAIC (SimpleAICommits): Command line tool leveraging OpenAI models to automatically generate semantic commit messages based on git diffs and commit history.
  • Quick_Whisper_Typer: Speech recognition tool for hands-free text input, voice commands processing, and AI conversation, featuring low-latency transcription and multilingual support.
  • simple_voice_chat: Conversational interface integrating various speech-to-text, language model, and text-to-speech services for real-time voice interactions with AI assistants.

"Rot" tools

Tools leveraging deterministic time-based codes

  • wormrot.sh: Secure file transfer automation system using time-synchronized codes and a shared secret for seamless transmission between trusted devices.
  • fowlrot.sh: Time-based code generation system for secure peer-to-peer connections using synchronized, deterministic codes derived from pre-shared secrets.
  • knockd_rotator: Security enhancement to port knocking via time-based sequence rotation and synchronization between client and server, leveraging shared secrets and deterministic generation to prevent replay attacks.

Ntfy

ntfy.sh makes it easy to send and receive notifications, I use it a lot for monitoring

  • Daily_Fact_Ntfy: Notification system that generates and delivers AI-powered interesting facts about user-specified topics through ntfy.sh at randomized intervals.
  • Ntfy_CSV_Reminders: Probability-based notification system for managing recurring tasks with randomized timing to prevent notification fatigue.
  • ntfy_systemd: Monitoring system that tracks systemd service failures and sends instant notifications through ntfy with detailed status reports.
  • ntfy_syncthing_conflict_checker: Monitoring tool for detecting and reporting file conflicts in Syncthing shared folders with support for local and remote notifications.
  • ntfy_fail2ban: Security monitoring tool that integrates Fail2Ban with ntfy.sh to deliver real-time notifications about potential intrusion attempts and IP blocks.
  • weather_notifier: Weather notification system that monitors rainfall forecasts and temperature variations while delivering mobile alerts through ntfy.sh integration.
  • allocine_checker: Movie monitoring tool that automatically checks theater listings for specific films and sends notifications when they become available.

Misc Tools

  • git_scripts_keeper: Automated version control utility for tracking and committing changes across multiple Git repositories through periodic monitoring and auto-generated commit messages.
  • OCR_with_format: OCR tool using pytesseract to maintain original text formatting and spacing while performing optical character recognition on images.
  • HumanReadableSeed: Bidirectional conversion tool transforming cryptographic tokens into human-readable word sequences while ensuring data integrity through automatic verification checks.
  • PersistDict: Thread-safe persistent dictionary implementation with automatic expiration, metadata tracking, and customizable serialization using LMDB as backend storage.
  • whisper_audio_splitter: Speech-to-text based audio splitting tool that automatically segments audio files at detected keywords while supporting multiple STT backends and parallel processing capabilities.
  • corpus_matcher: Optimized substring matching library utilizing dual-algorithm approach with parallel processing and configurable parameters for efficient text search in large corpora.
  • speech.sh: Text-to-speech interface utilizing OpenAI's API with caching capabilities and system-wide integration options for Linux environments.
  • OpenrouterModelFilter: Command-line utility for filtering and sorting OpenRouter API models based on customizable criteria such as pricing, context length, and model characteristics.
  • iptables_rate_limit_modifier: Network management tool that automatically adjusts rate-limiting thresholds in iptables rules to prevent false positives while maintaining security against malicious traffic.
  • Load_Average_Balancer.sh: System resource management tool for delaying CPU-intensive tasks until optimal load conditions are met, integrating with backup solutions like restic.
  • PDF_batch_decryptor: Command-line tool for automated decryption of multiple password-protected PDF files using qpdf and pdfinfo utilities.
  • Spotify_tts: Music player companion that announces currently playing tracks using either basic or AI-powered text-to-speech synthesis.
  • ufw_auto_ssh_whitelist: Automated firewall management system for SSH connections with dynamic whitelisting and rule cleanup capabilities.
  • ufw_block_analyzer: Real-time monitoring and analysis system for UFW firewall logs with Docker network context enrichment and structured TOML output.
  • btrfs_cow_disabler: Command-line utility for disabling Copy-on-Write functionality on Btrfs filesystems through atomic file operations and checksum verification.
  • docker_volume_backup: Robust data backup solution for Docker containers with automated volume management, container state handling, and detailed logging capabilities.
  • ShellArgParser: Command-line utility for converting Python-style arguments into shell environment variables, simplifying argument parsing in shell scripts.
  • IndexableNewsboat: Command line tool that exports RSS entries from Newsboat into text files for seamless integration with desktop search engines.
  • MediaDurationRecursiveChecker: Media duration analysis tool that recursively scans directories to calculate total duration and size of media files while providing real-time processing estimates.
  • MediaMetadataExtractor: Metadata extraction tool for media files that provides comprehensive technical information in Excel and JSON formats, supporting various video and audio formats with a user-friendly GUI interface.
  • MediaSizeOrHashMatcher: Media file comparison tool utilizing size-based matching and parallel hash computation for efficient duplicate detection across directories.
  • llm_agent: Plugin system enabling intelligent task automation through multiple search engines, autonomous web browsing, file manipulation, and shell access with built-in safety controls and persistent memory capabilities.
  • fancontrol_autohealing_config: Configuration parser and updater that maintains fan control settings across system reboots by automatically detecting and adjusting hardware monitoring device numbers.
  • prompt_GPT3: Command-line interface for OpenAI's GPT-3 featuring conversation logging, vim mode, and specialized modes for flashcard creation and translation tasks.
  • pdfannots: Text extraction tool for PDF documents that processes annotations, highlights, and comments into formatted Markdown, JSON, or EDN outputs for academic paper reviews and document analysis.

Others

  • FUTOmeter: Library suite for privacy-preserving usage tracking and context-aware donation prompts in FOSS applications, enabling sustainable funding through data-driven user engagement metrics.

Bounty Track Record

Being dedicated to open source, I try to fund what I can using bounty incentives.

Click here to see my bounty track record
  1. Porting my SleepTk app from a micropython-based OS to a C-based device

  2. Adding highlights on the mobile webapp for Karakeep

  3. The other bounties are currently in progress

Pinned Loading

  1. wdoc Public

    Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, etc

    Python 472 37

  2. QuestEA Public

    QuestionnaireEmbeddingsAnalysis - innovative approach to extracting richer information from clinical survey

    Python 2

  3. Voice2Anki Public

    A powerful tool that converts voice recordings into high-quality Anki flashcards using AI-powered transcription and LLM processing, featuring Few-Shot Learning to adapt to your personal style and s…

    Python 20

  4. AnkiAIUtils Public

    AI-powered tools to enhance Anki flashcards with explanations, mnemonics, illustrations, and adaptive learning for medical school and beyond

    Python 769 26

  5. AnnA_Anki_neuronal_Appendix Public

    Using machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity

    Python 65 1

  6. SleepTk_pinetime_sleep_tracker Public

    SleepTk: sleep tracker with smart alarm for the pinetime on wasp-os

    Python 67 3