
Add swe-bench test - Not to run in regular PRs #100

Merged
0xba1a merged 9 commits into main from bala/swe-bench
Feb 11, 2026

Conversation

0xba1a (Member) commented Jan 28, 2026

No description provided.

codecov-commenter commented Jan 28, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 91.84%. Comparing base (cf15005) to head (2700bd0).

Additional details and impacted files


@@            Coverage Diff             @@
##             main     #100      +/-   ##
==========================================
- Coverage   92.27%   91.84%   -0.43%     
==========================================
  Files          21       21              
  Lines         880      834      -46     
==========================================
- Hits          812      766      -46     
  Misses         68       68              
Flag           Coverage Δ
integration    60.55% <60.00%> (-4.00%) ⬇️
ollama_local   64.62% <33.33%> (+2.01%) ⬆️
slow-browser   53.71% <33.33%> (+2.23%) ⬆️
slow-other     70.74% <73.33%> (+4.37%) ⬆️
unit           65.22% <93.33%> (-10.12%) ⬇️

Flags with carried forward coverage won't be shown.

Files with missing lines              Coverage Δ
src/microbots/MicroBot.py             99.33% <100.00%> (-0.11%) ⬇️
src/microbots/llm/anthropic_api.py    100.00% <100.00%> (ø)
src/microbots/llm/llm.py              100.00% <100.00%> (ø)
src/microbots/llm/ollama_local.py     98.30% <100.00%> (+0.09%) ⬆️
src/microbots/llm/openai_api.py       100.00% <100.00%> (ø)

command: str = ""

class LLMInterface(ABC):
    def __init__(self, system_prompt: str, max_retries: int = 3):
Collaborator
LLMInterface.__init__ was removed, but _validate_llm_response still relies on self.retries, self.max_retries, and self.messages. This means any new subclass that forgets to initialize these manually will fail at runtime, and nothing in the interface signals that they are required.
Maybe it's better to restore a base __init__ and have subclasses call super().__init__(). The only subclass-specific part is whether messages includes a system-prompt entry, which can be handled after the super call by appending to messages in the subclass. This also removes the three duplicated lines in each subclass.
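The suggested refactor might look like the following minimal sketch. It is an illustration of the pattern, not the repo's actual code: the subclass name `ExampleBackend` and the message-dict shape are assumptions; only `LLMInterface`, `max_retries`, `retries`, and `messages` come from the review comment.

```python
from abc import ABC


class LLMInterface(ABC):
    """Base class owning the retry/message state that the validation
    helper (per the review, _validate_llm_response) depends on."""

    def __init__(self, system_prompt: str, max_retries: int = 3):
        self.system_prompt = system_prompt
        self.max_retries = max_retries
        self.retries = 0
        self.messages: list[dict] = []


class ExampleBackend(LLMInterface):
    """Hypothetical provider subclass (standing in for e.g. the
    openai_api / ollama_local backends)."""

    def __init__(self, system_prompt: str, max_retries: int = 3):
        # The shared state is initialized in one place via super();
        # subclasses can no longer forget retries/max_retries/messages.
        super().__init__(system_prompt, max_retries)
        # Only the provider-specific part remains: how (or whether) the
        # system prompt enters the message list for this provider.
        self.messages.append({"role": "system", "content": system_prompt})
```

With this shape, a subclass that skips the system-prompt append still gets valid `retries`/`messages` state from the base `__init__`, so the interface's requirements are enforced by construction rather than by convention.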

@0xba1a 0xba1a merged commit a68a96c into main Feb 11, 2026
12 of 13 checks passed
@0xba1a 0xba1a deleted the bala/swe-bench branch March 3, 2026 08:05
