Inline math; speed up LLM calls; allow local models
Inline math
Marker will handle inline math if --use_llm is set. This makes reading scientific papers a lot nicer! The feature has been optimized for speed.
Local LLMs
We now support Ollama - when you're passing the --use_llm flag, you can select the Ollama inference service like this:
marker_single FILEPATH --use_llm --llm_service marker.services.ollama.OllamaServiceYou can set the options --ollama_base_url and --ollama_model. By default, it will use llama3.2-vision.
Batch LLM calls
LLM calls are now batched across processors for a significant speedup if you're passing --use_llm.
Misc fixes
- Biology PDFs now work a lot better - leading line numbers are stripped
- Improved OCR heuristics
- Updated the examples
What's Changed
- Batch together llm inference requests by @VikParuchuri in #536
- Add another heuristic to clean up line numbers by @iammosespaulr in #538
- Add Inline Math Support by @tarun-menta in #517
- Factor out llm services, enable local models by @VikParuchuri in #544
- Improve LLM speed; handle inline math; allow local models by @VikParuchuri in #537
Full Changelog: v1.4.0...v1.5.0