DOM-aware tokenization for Hugging Face language models
Updated Jun 1, 2024 - HTML
Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article that proposed a measure of machine intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.
An overview of how artificial intelligence (AI) can serve as the technical basis for a digital product offering: from understanding and personalization, through the design of machine learning models, to their deployment in the cloud through an API built with FastAPI.
AI PoCs: ML, NLP, LLM, vision, classification, clustering, GenAI, Transformers, PyTorch, Keras. All things AI PoCs.
Prompts, notebooks, and tools for generative pre-trained transformers.
List of resources for mineral exploration and machine learning, generally with useful code and examples.
A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
SaprotHub: Making Protein Modeling Accessible to All Biologists
The goal of this project is to develop a machine learning model that can classify movie reviews as positive or negative based on the sentiment expressed in the text.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
The Jieba Chinese Word Segmentation Implemented in Rust
Sentilyze aims to analyze sentiment in text from social media, news, and websites. Real-time analysis, granular classification, customizable settings.
Document classification using the KNN algorithm and a graph-based approach, along with scraped data.
Implementations of various transformer architecture models, with application and fine-tuning code.
Minimalist web-searching app with an AI assistant that runs directly from your browser. Uses Web-LLM, Ratchet-ML, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space
Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Created by Alan Turing