# str: Unicode-aware string utilities for Gleam

A production-ready Gleam library providing Unicode-aware string operations, with a focus on grapheme-cluster correctness, pragmatic ASCII transliteration, and URL-friendly slug generation.
## Features

| Category | Highlights |
|---|---|
| Grapheme-aware | All operations correctly handle Unicode grapheme clusters (emoji, ZWJ sequences, combining marks) |
| Case conversions | snake_case, camelCase, kebab-case, PascalCase, Title Case, capitalize |
| Slug generation | Configurable slugify with token limits, custom separators, and Unicode preservation |
| Search & replace | index_of, last_index_of, replace_first, replace_last, contains_any/all |
| Validation | is_uppercase, is_lowercase, is_title_case, is_ascii, is_hex, is_numeric, is_alpha |
| Escaping | escape_html, unescape_html, escape_regex |
| Similarity | Levenshtein distance, percentage similarity, hamming_distance |
| Splitting | splitn, partition, rpartition, chunk, lines, words |
| Padding | pad_left, pad_right, center, fill |
| Zero dependencies | Pure Gleam implementation with no OTP requirement |
## Installation

```sh
gleam add str
```

## Quick start

```gleam
import str/core
import str/extra

pub fn main() {
  // Grapheme-safe truncation preserves emoji
  let text = "Hello 👩‍👩‍👧‍👦 World"
  core.truncate(text, 10, "...")
  // -> "Hello 👩‍👩‍👧‍👦..."

  // ASCII transliteration and slugification
  extra.slugify("Crème Brûlée – Recipe 2025!")
  // -> "creme-brulee-recipe-2025"

  // Case conversions
  extra.to_camel_case("hello world") // -> "helloWorld"
  extra.to_snake_case("Hello World") // -> "hello_world"
  core.capitalize("hELLO wORLD")     // -> "Hello world"

  // Grapheme-aware search
  core.index_of("👨‍👩‍👧‍👦 family test", "family")
  // -> Ok(2) - counts grapheme clusters, not bytes!

  // String similarity
  core.similarity("hello", "hallo")
  // -> 0.8 (80% similar)

  // HTML escaping
  core.escape_html("<script>alert('xss')</script>")
  // -> "&lt;script&gt;alert('xss')&lt;/script&gt;"
}
```

## API overview

### Case

| Function | Example | Result |
|---|---|---|
| capitalize(text) | "hELLO wORLD" | "Hello world" |
| swapcase(text) | "Hello World" | "hELLO wORLD" |
| is_uppercase(text) | "HELLO123" | True |
| is_lowercase(text) | "hello_world" | True |
| is_title_case(text) | "Hello World" | True |
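A minimal usage sketch of these helpers; `capitalize` is shown under `str/core` in the quick start, and the remaining functions are assumed to live alongside it. The expected values mirror the table above.

```gleam
import str/core

pub fn case_examples() {
  core.capitalize("hELLO wORLD")    // -> "Hello world"
  core.swapcase("Hello World")      // -> "hELLO wORLD"
  core.is_uppercase("HELLO123")     // -> True
  core.is_lowercase("hello_world")  // -> True
  core.is_title_case("Hello World") // -> True
}
```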
### Slicing

| Function | Example | Result |
|---|---|---|
| take(text, n) | take("👨‍👩‍👧‍👦abc", 2) | "👨‍👩‍👧‍👦a" |
| drop(text, n) | drop("hello", 2) | "llo" |
| take_right(text, n) | take_right("hello", 3) | "llo" |
| drop_right(text, n) | drop_right("hello", 2) | "hel" |
| at(text, index) | at("hello", 1) | Ok("e") |
| chunk(text, size) | chunk("abcdef", 2) | ["ab", "cd", "ef"] |
### Search & replace

| Function | Example | Result |
|---|---|---|
| index_of(text, needle) | "hello world", "world" | Ok(6) |
| last_index_of(text, needle) | "hello hello", "hello" | Ok(6) |
| contains_any(text, needles) | "hello", ["x", "e", "z"] | True |
| contains_all(text, needles) | "hello", ["h", "e"] | True |
| replace_first(text, old, new) | "aaa", "a", "b" | "baa" |
| replace_last(text, old, new) | "aaa", "a", "b" | "aab" |
### Splitting

| Function | Example | Result |
|---|---|---|
| partition(text, sep) | "a-b-c", "-" | #("a", "-", "b-c") |
| rpartition(text, sep) | "a-b-c", "-" | #("a-b", "-", "c") |
| splitn(text, sep, n) | "a-b-c-d", "-", 2 | ["a", "b-c-d"] |
| words(text) | "hello world" | ["hello", "world"] |
| lines(text) | "a\nb\nc" | ["a", "b", "c"] |
### Padding

| Function | Example | Result |
|---|---|---|
| pad_left(text, width, pad) | "42", 5, "0" | "00042" |
| pad_right(text, width, pad) | "hi", 5, "*" | "hi***" |
| center(text, width, pad) | "hi", 6, "-" | "--hi--" |
| fill(text, width, pad, pos) | "x", 5, "-", "both" | "--x--" |
### Validation

| Function | Description |
|---|---|
| is_numeric(text) | Digits only (0-9) |
| is_alpha(text) | Letters only (a-z, A-Z) |
| is_alphanumeric(text) | Letters and digits |
| is_ascii(text) | ASCII only (0x00-0x7F) |
| is_printable(text) | Printable ASCII (0x20-0x7E) |
| is_hex(text) | Hexadecimal (0-9, a-f, A-F) |
| is_blank(text) | Whitespace only |
| is_title_case(text) | Title Case format |
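An illustrative sketch of the predicates; the inputs below are examples chosen to match the descriptions above (they are not taken from the library's own docs), and the `str/core` prefix is assumed.

```gleam
import str/core

pub fn validation_examples() {
  core.is_numeric("12345")       // digits only -> True
  core.is_alpha("Hello")         // letters only -> True
  core.is_alphanumeric("abc123") // letters and digits -> True
  core.is_ascii("héllo")         // contains a non-ASCII character -> False
  core.is_hex("DEADbeef")        // hexadecimal digits -> True
  core.is_blank("   ")           // whitespace only -> True
}
```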
### Prefix & suffix

| Function | Example | Result |
|---|---|---|
| remove_prefix(text, prefix) | "hello world", "hello " | "world" |
| remove_suffix(text, suffix) | "file.txt", ".txt" | "file" |
| ensure_prefix(text, prefix) | "world", "hello " | "hello world" |
| ensure_suffix(text, suffix) | "file", ".txt" | "file.txt" |
| starts_with_any(text, list) | "hello", ["hi", "he"] | True |
| ends_with_any(text, list) | "file.txt", [".txt", ".md"] | True |
| common_prefix(strings) | ["abc", "abd"] | "ab" |
| common_suffix(strings) | ["abc", "xbc"] | "bc" |
### Escaping

| Function | Example | Result |
|---|---|---|
| escape_html(text) | "<div>" | "&lt;div&gt;" |
| unescape_html(text) | "&lt;div&gt;" | "<div>" |
| escape_regex(text) | "a.b*c" | "a\\.b\\*c" |
### Similarity

| Function | Example | Result |
|---|---|---|
| distance(a, b) | "kitten", "sitting" | 3 |
| similarity(a, b) | "hello", "hallo" | 0.8 |
| hamming_distance(a, b) | "karolin", "kathrin" | Ok(3) |
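The similarity helpers as calls, with the `str/core` prefix assumed from the quick start; expected values are from the table.

```gleam
import str/core

pub fn similarity_examples() {
  core.distance("kitten", "sitting")          // Levenshtein distance -> 3
  core.similarity("hello", "hallo")           // -> 0.8 (80% similar)
  core.hamming_distance("karolin", "kathrin") // -> Ok(3), requires equal lengths
}
```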
### Transformations

| Function | Description |
|---|---|
| truncate(text, len, suffix) | Truncate with emoji preservation |
| ellipsis(text, len) | Truncate with … |
| reverse(text) | Grapheme-aware reversal |
| reverse_words(text) | Reverse word order |
| initials(text) | Extract initials ("John Doe" → "JD") |
| normalize_whitespace(text) | Collapse whitespace |
| strip(text, chars) | Remove chars from ends |
| squeeze(text, char) | Collapse consecutive chars |
| chomp(text) | Remove trailing newline |
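A sketch of a few transformations under the assumed `str/core` prefix. The `truncate` and `initials` results come from the quick start and the table; the other inputs are illustrative and their exact outputs are only described in comments.

```gleam
import str/core

pub fn transform_examples() {
  core.truncate("Hello 👩‍👩‍👧‍👦 World", 10, "...") // -> "Hello 👩‍👩‍👧‍👦..."
  core.reverse("abc")                       // grapheme-aware reversal -> "cba"
  core.reverse_words("hello world")         // -> "world hello"
  core.initials("John Doe")                 // -> "JD"
  core.normalize_whitespace("a   b  c")     // collapses runs of whitespace
  core.strip("xxhixx", "x")                 // removes "x" from both ends
  core.chomp("line\n")                      // -> "line"
}
```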
### Multiline

| Function | Description |
|---|---|
| lines(text) | Split into lines |
| dedent(text) | Remove common indentation |
| indent(text, spaces) | Add indentation |
| wrap_at(text, width) | Word wrap |
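An illustrative sketch of the multiline helpers, assuming the `str/core` prefix and that `indent` counts spaces; exact return values are not documented above, so they are only described in comments.

```gleam
import str/core

pub fn multiline_examples() {
  core.dedent("  a\n  b")        // strips the common two-space indentation
  core.indent("a\nb", 2)         // indents each line by two spaces
  core.wrap_at("the quick brown fox jumps", 10) // word-wraps at width 10
}
```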
## str/extra: case conversion, ASCII folding, and slugs

### Case conversions

```gleam
import str/extra

extra.to_snake_case("Hello World")  // -> "hello_world"
extra.to_camel_case("hello world")  // -> "helloWorld"
extra.to_pascal_case("hello world") // -> "HelloWorld"
extra.to_kebab_case("Hello World")  // -> "hello-world"
extra.to_title_case("hello world")  // -> "Hello World"
```

### ASCII folding

```gleam
extra.ascii_fold("Crème Brûlée") // -> "Creme Brulee"
extra.ascii_fold("straße")       // -> "strasse"
extra.ascii_fold("æon")          // -> "aeon"
```

### Slugify

```gleam
extra.slugify("Hello, World!")                     // -> "hello-world"
extra.slugify_opts("one two three", 2, "-", False) // -> "one-two"
extra.slugify_opts("Hello World", 0, "_", False)   // -> "hello_world"
```

## Module layout

```
str/
├── core        # Grapheme-aware core utilities
├── extra       # ASCII folding, slugs, case conversions
├── tokenize    # Pure-Gleam tokenizer (reference)
└── internal_*  # Character tables (internal)
```
## Documentation

| Document | Description |
|---|---|
| Core API | Grapheme-aware string operations |
| Extra API | ASCII folding and slug generation |
| Tokenizer | Pure-Gleam tokenizer reference |
| Examples | Integration examples and OTP patterns |
| Character Tables | Machine-readable transliteration data |
## OTP integration (optional)

The library core is OTP-free by design. For production Unicode normalization (NFC/NFD), supply your own normalizer, for example an external binding to Erlang's unicode module:

```gleam
import str/extra

// In your application code: bind OTP's unicode module for NFD normalization
@external(erlang, "unicode", "characters_to_nfd_binary")
pub fn otp_nfd(s: String) -> String

// Use with str:
extra.ascii_fold_with_normalizer("Crème", otp_nfd)
extra.slugify_with_normalizer("Café", otp_nfd)
```

## Development

```sh
# Run the test suite
gleam test

# Regenerate character tables documentation
python3 scripts/generate_character_tables.py
```

The test suite includes:

- Tests covering all public functions
- Unicode edge cases (emoji, ZWJ, combining marks)
- Grapheme cluster boundary handling
- Cross-module integration tests
## Contributing

Contributions are welcome! Areas for improvement:

- Expanding character transliteration tables
- Additional test cases for edge cases
- Documentation improvements
- Performance optimizations

```sh
gleam test  # Ensure tests pass before submitting PRs
```

## License

MIT License. See LICENSE for details.
Made with 💜 for the Gleam community
