fic-time

Estimate elapsed narrative ("fictive") time in passages of English-language fiction using large language models. This library factors Ted Underwood's public GPT-4 experiment into reusable, testable components (preprocessing, prompting, LLM gateway, parsing, evaluation) plus a convenient CLI.

✨ Key Features

Chunking & normalization of raw long fiction text (Preprocessor)
Prompt construction with (optionally shuffled) four Underwood exemplars (UnderwoodPrompter) or a minimal prompt (Prompter)
Pluggable LLM gateway abstraction (LLMGateway) with OpenAI implementation + retry logic (OpenAIClient) and an offline mock (DummyGateway)
Robust parsing that extracts summary, reasoning, unit conversion, final minutes, and confidence (ResponseParser)
High-level synchronous & async estimation (NarrativeTimeEstimator)
Structured result objects (TimeEstimate, TimeEstimateBatch) with DataFrame export
Correlation utilities for human vs model comparison (CorrelationEvaluator)
CLI command fic-time for quick experimentation; JSONL output option for pipelines

🔧 Installation

Minimal (without OpenAI client – only dummy gateway):

pip install fic-time

With OpenAI support:

pip install "fic-time[openai]"

Optional scientific stack (already included in base dependencies: pandas, scipy). Ensure Python ≥ 3.9.

Set your key (required only when using OpenAIClient):

export OPENAI_API_KEY="sk-..."

🗂 Public API Overview

Component	Purpose	Typical Use
`Preprocessor`	Normalize & split long text into size-limited passages	Custom max chars / external sentence splitter integration
`Prompter` / `UnderwoodPrompter`	Build chat message lists	Few-shot replication / reproducibility (shuffle control)
`LLMGateway`	Abstract async chat() interface	Implement new providers (Anthropic, Azure, etc.)
`OpenAIClient`	OpenAI chat with retry (tenacity)	Production usage
`DummyGateway`	Offline deterministic response	Tests / examples without API calls
`ResponseParser`	Parse raw model output to minutes	Custom formatting experiments
`NarrativeTimeEstimator`	High-level batch estimation (sync & async)	Main entry point
`TimeEstimateBatch`	Container + DataFrame helper	Downstream analysis
`CorrelationEvaluator`	Pearson r (raw/log)	Validation against human annotations

All symbols are re-exported from fic_time.__init__.

🚀 Quick Start (Synchronous)

from fic_time import NarrativeTimeEstimator, UnderwoodPrompter, DummyGateway

text = """It was raining again the next morning, a fine curtain of mist..."""

# Offline deterministic demo (no API key needed)
estimator = NarrativeTimeEstimator(gateway=DummyGateway(), prompter=UnderwoodPrompter(shuffle=False))
batch = estimator.estimate(text)
for item in batch.items:
	print(item.index, item.total_minutes, item.confidence)

Using OpenAI

from fic_time import NarrativeTimeEstimator, UnderwoodPrompter, OpenAIClient

estimator = NarrativeTimeEstimator(
	gateway=OpenAIClient(),          # requires OPENAI_API_KEY
	prompter=UnderwoodPrompter(shuffle=True),  # shuffle exemplars (default)
	max_chars=1200,
)
batch = estimator.estimate(open("novel_segment.txt", "r", encoding="utf-8").read())
print(batch.to_dataframe().head())

📦 CLI Usage

Basic estimation:

fic-time estimate passage.txt --provider openai --model gpt-4o-mini

Offline (dummy):

fic-time estimate passage.txt --provider dummy

Deterministic (disable exemplar shuffle):

fic-time estimate passage.txt --provider openai --no-shuffle

JSON Lines output (for piping / later aggregation):

fic-time estimate passage.txt --provider openai --jsonl > out.jsonl

Key options:

--model : OpenAI model name (default: gpt-4o-mini)
--max-chars : Passage split threshold
--no-shuffle : Keep exemplar order fixed (reproducibility)

Exit code 0 on success; non-estimate subcommand prints help (reserved for future extensions).

🧵 Streaming / Async Pattern

For large corpora you may prefer streaming results without holding all passages:

import asyncio
from fic_time import NarrativeTimeEstimator, OpenAIClient, UnderwoodPrompter

async def run(text: str):
	est = NarrativeTimeEstimator(gateway=OpenAIClient(), prompter=UnderwoodPrompter())
	async for item in est.estimate_async_iter(text):
		print(item.index, item.total_minutes)

asyncio.run(run(open("long.txt", "r", encoding="utf-8").read()))

🔀 Prompt Strategies

UnderwoodPrompter(shuffle=True) (default): builds system + 4 few-shot exemplars + target passage (order randomized each call)
UnderwoodPrompter(shuffle=False): stable order → reproducible experiments
Prompter(): minimal single-pass prompt (no exemplars) – faster, potentially weaker performance

Switching is as simple as providing a different prompter to the estimator.

🧪 Parsing Raw Responses

If you want to inspect / debug model formatting:

from fic_time import ResponseParser
raw = "1: A quick summary 2: Reasoning... 3: 1.5 hours => 90 minutes 4: 90 minutes 5: High confidence"
parsed = ResponseParser().parse(raw)
print(parsed.minutes, parsed.confidence)

The parser tolerates ranges (60-90 minutes), decimal hours (1.5 hours), and long units (2 weeks).

🧩 Working With Results

batch = estimator.estimate(text)
minutes_list = batch.to_minutes_series()   # [float | None, ...]
df = batch.to_dataframe()                  # requires pandas
df.to_csv("estimates.csv", index=False)

Each TimeEstimate contains:

index – passage order
raw_text – the passage string
model_judgment – full multi-step LLM answer
total_minutes – parsed numeric value (may be None if parsing failed)
confidence – free-text (e.g., High / Moderate / Low)

📈 Correlation With Human Annotations

Assume a TSV file: segment_id<TAB>minutes matching passage ordering.

import csv
from fic_time import CorrelationEvaluator

human = []
with open("human.tsv", "r", encoding="utf-8") as f:
	for row in csv.reader(f, delimiter='\t'):
		human.append(float(row[1]))

model_minutes = [m for m in minutes_list if m is not None]

r_log, p_log = CorrelationEvaluator().pearson_log(human, model_minutes)
r_raw, p_raw = CorrelationEvaluator().pearson_raw(human, model_minutes)
print("log r=", r_log, "p=", p_log)

The log transform uses log(x + 0.1) mirroring the original notebook to avoid log(0).

🛠 Custom Gateway Example

Implement another provider by subclassing LLMGateway:

from fic_time import LLMGateway
from typing import Sequence, Dict

class MyGateway(LLMGateway):
	async def chat(self, messages: Sequence[Dict[str, str]], model: str) -> str:
		# Call your provider here (pseudo-code)
		resp = await my_client.generate_chat(messages=messages, model=model)
		return resp.content

# Usage
from fic_time import NarrativeTimeEstimator, Prompter
estimator = NarrativeTimeEstimator(gateway=MyGateway(), prompter=Prompter())

Contract: chat(messages, model) -> str returns a single textual reply.

🪄 Reproducibility Tips

Pin package & OpenAI model versions in requirements.txt
Use UnderwoodPrompter(shuffle=False) or set --no-shuffle in the CLI
(Optional) Seed Python's random: import random; random.seed(42) before constructing the prompter
Keep raw model outputs (--jsonl) if auditing later

⚠️ Error Handling & Retries

OpenAI calls retry up to 3 times with exponential backoff (tenacity)
If parsing fails to find a numeric expression, total_minutes becomes None; handle downstream accordingly
For batch operations, one malformed passage does not abort the rest

🧵 Processing Large Corpora (Pattern)

from fic_time import NarrativeTimeEstimator, OpenAIClient, UnderwoodPrompter
texts = [open(p).read() for p in corpus_paths]  # Or pre-split yourself
est = NarrativeTimeEstimator(gateway=OpenAIClient(), prompter=UnderwoodPrompter())
all_minutes = []
for t in texts:
	for item in est.estimate(t).items:
		all_minutes.append(item.total_minutes)

Consider adding rate limiting / sleep if provider quotas apply.

🧷 Minimal Unit Test Style (Using DummyGateway)

from fic_time import NarrativeTimeEstimator, DummyGateway, Prompter

def test_basic():
	est = NarrativeTimeEstimator(gateway=DummyGateway(), prompter=Prompter())
	batch = est.estimate("Short passage.")
	assert len(batch.items) == 1

🙋 FAQ

Why minutes as the canonical unit? Enables direct aggregation and comparison; parser converts larger units.

Does the parser trust model arithmetic? It re-extracts the first numeric + unit pattern and performs its own conversion.

Why not tokenize instead of max char split? Simplicity & model-agnostic; you can pre-chunk with your own logic before passing text.

Can I keep the five-step format but change wording? Provide a custom PromptTemplate and your own Prompter subclass.

🧾 License

MIT License. See LICENSE.

🔍 Changelog (Excerpt)

0.1.0 – Initial packaged release (structure extracted from experiment notebooks)

🤝 Contributing

Issues / PRs welcome: add new gateways, better parsing heuristics, additional evaluation metrics.

Citation (Suggested)

If this toolkit supports published research, cite Ted Underwood's original experiment/blog ( https://github.com/tedunderwood/fictional-time-with-GPT4/tree/main / https://tedunderwood.com/2023/03/19/using-gpt-4-to-measure-the-passage-of-time-in-fiction/ ).

Happy analyzing!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
fic_time		fic_time
tests		tests
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fic-time

✨ Key Features

🔧 Installation

🗂 Public API Overview

🚀 Quick Start (Synchronous)

Using OpenAI

📦 CLI Usage

🧵 Streaming / Async Pattern

🔀 Prompt Strategies

🧪 Parsing Raw Responses

🧩 Working With Results

📈 Correlation With Human Annotations

🛠 Custom Gateway Example

🪄 Reproducibility Tips

⚠️ Error Handling & Retries

🧵 Processing Large Corpora (Pattern)

🧷 Minimal Unit Test Style (Using DummyGateway)

🙋 FAQ

🧾 License

🔍 Changelog (Excerpt)

🤝 Contributing

Citation (Suggested)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

fic-time

✨ Key Features

🔧 Installation

🗂 Public API Overview

🚀 Quick Start (Synchronous)

Using OpenAI

📦 CLI Usage

🧵 Streaming / Async Pattern

🔀 Prompt Strategies

🧪 Parsing Raw Responses

🧩 Working With Results

📈 Correlation With Human Annotations

🛠 Custom Gateway Example

🪄 Reproducibility Tips

⚠️ Error Handling & Retries

🧵 Processing Large Corpora (Pattern)

🧷 Minimal Unit Test Style (Using DummyGateway)

🙋 FAQ

🧾 License

🔍 Changelog (Excerpt)

🤝 Contributing

Citation (Suggested)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages