index

A blog indexing service for static-site generation.

Features

Authenticated reindex API that atomically rebuilds indexed posts.
Full-text search API returning title, url, snippet, and score metadata.
Chinese-aware tokenization for query term extraction (2-gram tokenizer in app layer).
MySQL full-text index with ngram parser for Chinese retrieval support.
TOML-based configuration as the only runtime configuration source.

Tech Stack

Go 1.25+
MySQL 9
HTTP: github.com/gin-gonic/gin
Logging: github.com/sirupsen/logrus
Config parser: github.com/pelletier/go-toml/v2

Project Layout

main.go: runnable API entrypoint.
data: data models and validation.
service: business services and orchestration.
adapter/http: Gin router, handlers, auth middleware.
adapter/storage/mysql: MySQL repository implementation.
adapter/index: tokenizer and rune-safe snippet builder.
config: TOML config loading and validation.
app: dependency wiring and server bootstrap.
migrations: SQL schema migrations.

Configuration

Example config: config.example.toml

Create runtime config from the example file:

cp config.example.toml config.toml

Runtime config is file-only. Keep real secrets in your local config.toml and never commit them.

Migration

Apply migrations/001_init_posts.sql to your MySQL database before running the service.

Run

go run .

The default startup config path is config.toml. If the file is missing or incomplete, startup fails.

API

Health

GET /healthz
GET /readyz

Index (Authenticated)

POST /v1/index

Headers:

Authorization: Bearer <token>

Body:

{
  "posts": [
    {
      "title": "Example",
      "url": "https://example.com/p/1",
      "content": "Post content...",
      "published_at": 1710832800
    }
  ]
}

Index semantics:

Each call uploads the full current post set.
The service reindexes all rows in one transaction.
Post IDs are generated by the database automatically.

Search

GET /v1/search?q=keyword&page=1&page_size=10

published_at in requests and search responses uses Unix timestamp seconds.

Response item fields:

title
url
snippet
score (optional)
matched_terms (optional)

Tests

go test ./...

go test -race ./...

Notes on Chinese Search

Database layer uses MySQL full-text index with WITH PARSER ngram.
Application layer uses deterministic 2-gram tokenization for query term extraction and matched-term/snippet generation.
Snippet logic is rune-safe to avoid breaking Unicode text boundaries.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github/workflows		.github/workflows
adapter		adapter
app		app
config		config
data		data
migrations		migrations
service		service
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
USAGE.md		USAGE.md
config.example.toml		config.example.toml
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

index

Features

Tech Stack

Project Layout

Configuration

Migration

Run

API

Health

Index (Authenticated)

Search

Tests

Notes on Chinese Search

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

index

Features

Tech Stack

Project Layout

Configuration

Migration

Run

API

Health

Index (Authenticated)

Search

Tests

Notes on Chinese Search

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 1

Languages

Packages