Initial skeleton by Darktex · Pull Request #1 · huggingface/OpenEnv

Darktex · 2025-10-03T19:21:56Z

No description provided.

pankit-eng · 2025-10-05T19:17:31Z

Did we need to check in cpython files?

pankit-eng · 2025-10-06T05:08:46Z

+
+
+@dataclass
+class ExecutionResult:


A few points to consider:

stdout and stderr could be long streams and may cause env container to OOM if we store it in memory. Let's discuss on how the policy would leverage this information. Better to minimize the context sharing from inside and outside of the container.

One of the paradigms we are seeing with SWE agent training is that exit_code, failure reason are generally a good starting point for execution result. Lets discuss whether this paradigm can be applied here too.

Updating list of supporters with LastMile AI

FIX: Handle double-nested observation in client parser

Initial skeleton

Updating list of supporters with LastMile AI

FIX: Handle double-nested observation in client parser

Fix websearch

* Upload current REPL state * use official prompt * unify REPLEnv api * Update default model in server side * Updated example using IP * Updated with prompt * inject final answer --------- Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>

…x rank Two config fixes surfaced by Daniel Han's "LoRA Without Regret" guidance at the Scaler workshop 2026-04-22: 1. LORA_TARGETS was attention-only (q/k/v/o). Adding MLP projections (gate_proj, up_proj, down_proj) covers the MLP block. Per Daniel, MLP adapters materially close the gap with full fine-tuning at near-zero VRAM cost and were flagged as the huggingface#1 silent underperformance in attention-only LoRA setups. 2. lora_alpha was LORA_RANK (naive PEFT default = alpha equals rank). New LORA_ALPHA = LORA_RANK * 2 follows the 2x-rank convention that Thinking Machines documented as the regime where LoRA closes the gap with full fine-tuning on small-to-medium models. Both scripts share constants via train_grpo_real.py -> train_sft_warmstart.py import, so the SFT checkpoint slots cleanly into the GRPO phase without re-init. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Grepped src/openenv/core/rubrics/ and confirmed the Rubric base class + container set (WeightedSum, Sequential, Gate, RubricList, LLMJudge) already exist per RFC 004. Updated the README section to show exactly which container our rewards.py functional composition maps to, one row per component in a new mapping table. Does NOT refactor rewards.py (invariant huggingface#1 per ONSITE_BRIEFING.md). The narrative is: functional composition honors the composable-rubrics philosophy in component independence + per-component audit trail + CI contract over multi-component defense-in-depth, even though the class-inheritance refactor is deferred to avoid regressing the 6 red-team ceiling tests. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Basic PR from claude

b803a3f

meta-cla Bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 3, 2025

Claude added Pytest

5ae9f7b

pankit-eng reviewed Oct 5, 2025

View reviewed changes

Comment thread src/__pycache__/types.cpython-310.pyc Outdated

Copy link
Copy Markdown

Contributor

pankit-eng Oct 5, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did we need to check in cpython files?

pankit-eng reviewed Oct 6, 2025

View reviewed changes

Darktex closed this Oct 6, 2025

Darktex reopened this Oct 6, 2025

Darktex and others added 4 commits October 6, 2025 13:58

Current state

853d5c4

Merge branch 'main' into skeleton

f103c0c

Removed pycache

267c93d

Readd env removed by gitignore

54824d2

Darktex merged commit 1b6e3ff into main Oct 6, 2025
1 check passed

jspisak pushed a commit that referenced this pull request Oct 22, 2025

Merge pull request #1 from andrew-lastmile/patch-1

9959530

Updating list of supporters with LastMile AI

This was referenced Oct 27, 2025

Add DIPGSafetyEnv for Medical AI Safety Research` #97

Merged

[UTIL] deployment script for single env on any namespace #102

Merged

pankit-eng pushed a commit that referenced this pull request Nov 3, 2025

Merge pull request #1 from surfiniaburger/dipg-research

919833c

FIX: Handle double-nested observation in client parser

rycerzes referenced this pull request in rycerzes/OpenEnv Nov 19, 2025

Merge pull request #1 from facebookexternal/skeleton

d214e7b

Initial skeleton

rycerzes referenced this pull request in rycerzes/OpenEnv Nov 19, 2025

Merge pull request #1 from andrew-lastmile/patch-1

cbf899b

Updating list of supporters with LastMile AI

rycerzes referenced this pull request in rycerzes/OpenEnv Nov 19, 2025

Merge pull request #1 from surfiniaburger/dipg-research

322295b

FIX: Handle double-nested observation in client parser

Darktex mentioned this pull request Nov 25, 2025

[RFC 003] Add MCP (Model Context Protocol) support - Phase 1 #224

Merged

3 tasks

burtenshaw pushed a commit that referenced this pull request Dec 8, 2025

Merge pull request #1 from burtenshaw/fix-websearch

1895f2a

Fix websearch

Darktex mentioned this pull request Apr 25, 2026

docs(mcp): add tutorial + tighten lifecycle guide #602

Merged

17 tasks

Darktex mentioned this pull request Apr 25, 2026

Add pathway analysis env #611

Open

12 tasks

greptile-apps Bot mentioned this pull request May 2, 2026

Fix coding_env API compatibility and safety reward false positives #635

Open

12 tasks

Darktex mentioned this pull request May 16, 2026

refactor agent sandbox infrastructure with new agent backends + rework logprobs capture #694

Open

12 tasks

Darktex mentioned this pull request May 24, 2026

feat(mini_swe_env): add SWE-Gym async GRPO environment with Pi interception and HF Space deployment #695

Open

9 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial skeleton#1

Initial skeleton#1
Darktex merged 6 commits into
mainfrom
skeleton

Darktex commented Oct 3, 2025

Uh oh!

pankit-eng Oct 5, 2025

Uh oh!

pankit-eng Oct 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Darktex commented Oct 3, 2025

Uh oh!

pankit-eng Oct 5, 2025

Choose a reason for hiding this comment

Uh oh!

pankit-eng Oct 6, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants