GitHub - doptime/DualModelIterativeReasoning

This Project aims to realize openAI o1 reasoning

Methodology:

Question reasoing & rephrasing
- In most case, a question can not be well-solved, becasue the model can not find out the real meaning/intension of the question. So the LLM tring to solve question literally, ability can't be fully triggered.
- Try rephrasing the question, and until the boundary condition, restriction, context, and destination is converged.
Planing & Solving in Iterative way
- Planing & Solving are try to carried as soon as possible to close the improving loop . The real important thing is End2End Iterating, review the planning and solution, and iterate.
Mutual Reasoning
- As rStar paper shows, self-iterating and correcting is not effective, and multi-model is necessary to boost the performance.
- Use Multi-Model or Dual Modal to cross-review the logic chain. rStar paper shows that self-iterating and correcting is not effective, and multi-model is necessary to boost the performance.
Parallel-beam searching
- Use parallel-beam searching to balance the exploration and exploitation. Single instance may have performance degradation during exploration. Reasoning improvement can not rely on single branch iteration.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
batchop		batchop
main		main
message		message
models		models
query		query
tools		tools
.DS_Store		.DS_Store
Meta_ideas_v2.md		Meta_ideas_v2.md
example.md		example.md
example.querys.toml		example.querys.toml
go.mod		go.mod
go.sum		go.sum
readme.md		readme.md

Provide feedback