We introduce STAR (Stiefel-based Activation Steering for Diverse Reasoning), a method to enhance diversity in reasoning processes during inference.
The repository is organized into two main tasks:
Located in the testeval/ directory.
bestofn.py: Baseline temperature sampling.steering.py: Proposed STAR algorithm.data_utils.py: Data loading and processing utilities.eval_overall.py: Evaluation scripts.prompt_utils.py: Prompt handling functions.load_data.ipynb: Notebook for preparing and exploring datasets.croissant.json: Metadata/configuration file.
Located in the liveideabench/ directory.
bestofn.py: Baseline temperature sampling.steering.py: Proposed STAR algorithm.data/,keywords_data/: Benchmark datasets.
Clone the repository and install dependencies:
git clone https://github.com/lythk88/STARS
cd STARS
pip install -r requirements.txtpython testeval/bestofn.py
python liveideabench/bestofn.py python testeval/steering.py
python liveideabench/steering.py - TestEval: Test case generation benchmark.
- LiveIdeaBench: Scientific discovery benchmark.
Each benchmark evaluates the diversity and correctness of reasoning paths under baseline sampling and STAR steering.