Ahmet2001/foresightLM

ForesightLM

ForesightLM studies whether token-level autoregressive language models can acquire sentence-level foresight through an auxiliary objective, applied at sentence boundaries, that predicts the semantics of a future sentence.

The project includes:

  • Core ForesightLM training scripts
  • Baseline / Core / K-horizon ablations
  • Semantic reranking experiments
  • Future-head-aware reranking and calibration sweep
  • WikiText-103 and WritingPrompts domain-transfer experiments
  • Domain generation diagnostics
  • Bootstrap confidence intervals
  • Qualitative example mining
  • Compute-cost accounting
  • Human-evaluation blind item sheet

Main idea

ForesightLM preserves token-level autoregressive generation while adding an auxiliary objective at sentence boundaries. The model predicts the embedding of a future sentence using a frozen sentence encoder and a learned projection head.
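The idea above can be sketched in a few lines of framework-free Python. This is only an illustration under stated assumptions, not the repository's training code: the cosine-distance loss, the `aux_weight` mixing coefficient, the `horizon` indexing, and every function name below are hypothetical. A real implementation would use a tensor library and a trained projection head.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def project(hidden: Vector, weight: Sequence[Vector]) -> List[float]:
    """Linear projection head (assumed form): hidden state -> embedding space."""
    return [sum(w * h for w, h in zip(row, hidden)) for row in weight]


def cosine_distance(a: Vector, b: Vector) -> float:
    """1 - cosine similarity; returns 1.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    if norm == 0.0:
        return 1.0
    return 1.0 - dot / norm


def foresight_loss(
    boundary_hiddens: Sequence[Vector],
    sentence_embeddings: Sequence[Vector],
    weight: Sequence[Vector],
    horizon: int = 1,
) -> float:
    """Mean distance between projected boundary states and future embeddings.

    Assumed convention: boundary_hiddens[i] is the LM hidden state at the end
    of sentence i, and sentence_embeddings[j] is the frozen encoder's embedding
    of sentence j. The target for boundary i is sentence i + horizon.
    """
    losses = []
    for i, hidden in enumerate(boundary_hiddens):
        j = i + horizon
        if j < len(sentence_embeddings):  # skip boundaries with no future target
            losses.append(cosine_distance(project(hidden, weight), sentence_embeddings[j]))
    return sum(losses) / len(losses) if losses else 0.0


def total_loss(token_nll: float, aux_loss: float, aux_weight: float = 0.1) -> float:
    """Token-level LM loss plus the weighted auxiliary term (weight is assumed)."""
    return token_nll + aux_weight * aux_loss
```

Because the auxiliary term only adds to the training loss, token-by-token generation at inference time is unchanged in this sketch. The `horizon` argument is one plausible way to express the K-horizon ablations listed above.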

Reproducibility

See:

  • results/reproducibility/foresightlm_repro_package/README_REPRODUCIBILITY.md
  • results/reproducibility/foresightlm_repro_package/manifest.json
  • results/reproducibility/foresightlm_repro_package/referenced_large_files_manifest.csv

Large JSONL files, model checkpoints, caches, and cluster logs are not committed. Their paths and SHA256 hashes are listed in the reproducibility manifests when available.
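Locally regenerated or downloaded files can be checked against the listed hashes with a short script like the one below. It is a sketch under assumptions: the column names `path` and `sha256` are hypothetical and may not match the actual header of `referenced_large_files_manifest.csv`, so adjust them after inspecting the file.

```python
import csv
import hashlib
from pathlib import Path
from typing import Dict


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large checkpoints do not need to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_manifest(
    manifest_csv: Path,
    root: Path,
    path_column: str = "path",      # assumed column name
    hash_column: str = "sha256",    # assumed column name
) -> Dict[str, str]:
    """Return a status per listed file: ok, mismatch, missing, or no_hash."""
    results: Dict[str, str] = {}
    with manifest_csv.open(newline="", encoding="utf-8") as handle:
        for row in csv.DictReader(handle):
            rel = (row.get(path_column) or "").strip()
            expected = (row.get(hash_column) or "").strip().lower()
            if not rel:
                continue
            target = root / rel
            if not target.is_file():
                results[rel] = "missing"
            elif not expected:
                results[rel] = "no_hash"  # manifest lists hashes only when available
            elif sha256_of(target) == expected:
                results[rel] = "ok"
            else:
                results[rel] = "mismatch"
    return results
```

A call such as `verify_manifest(Path("referenced_large_files_manifest.csv"), Path("."))` returns a dictionary that can be printed or filtered for anything other than `ok`.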

Human evaluation

Blind annotator files are under:

  • results/human_eval/human_eval_blind_items.csv
  • results/human_eval/human_eval_instructions.md

The answer key is intentionally excluded from Git while annotation is active.

Hugging Face checkpoint

The Core ForesightLM DistilGPT-2 checkpoint is available at:

https://huggingface.co/Mandotosh/foresightlm-core-distilgpt2
