Skip to content

v0.5.0

Latest
Compare
Choose a tag to compare
@you-n-g you-n-g released this 18 Jun 07:08
923a326

0.5.0 (2025-06-18)

Features

  • add a check for whether values in score_df are NaN (#756) (d9cc780)
  • add competition level filter and extract constants to utils (#869) (b40b605)
  • add DocDev for auto-generating workspace documentation (#781) (bcba6ea)
  • add drafting pipeline (#832) (efedddf)
  • add last_exp_fb to DSTrace and update feedback retrieval usage (#910) (10531fd)
  • add mlflow logger in RD loop to log (#815) (b91b54f)
  • add naive experiment generator and update proposal configurations (#759) (75494f4)
  • add RD-Agent-Quant scenario (#838) (6e42d52)
  • add reasoning_effort parameter to LiteLLMAPIBackend and LLMSett… (#754) (113889f)
  • add reviewer in feedback (#765) (1a95bee)
  • advanced checkpoint selectors (#790) (50ea033)
  • archive python and csv files in workspace to maintain results (#814) (67d0e01)
  • checkpoint selection (#744) (a15a06a)
  • custom data (#810) (6322916)
  • dump model (#776) (b49481e)
  • enable to set different version of idea-proposal for multi traces (#895) (236c28f)
  • enhance compatibility with more LLM models (#905) (8800624)
  • idea pool integrated to exp_gen & add timer to RD-Agent & pause-resume to RD-loops (#795) (e62aefa)
  • joblib cache (#749) (83a0411)
  • log api status to mlflow (#860) (049921b)
  • log reaching max time limit before breaking CoSTEER evolution (#921) (837fff2)
  • merge failed and successful traces together (#766) (3a2aa8c)
  • merge selectively (#888) (06ba314)
  • multi-trace online merge (#886) (2112d67)
  • new proposal (structured outputs) prompts (#887) (150796a)
  • parallel loop running based on asyncio (#932) (c63e207)
  • propose hypothesis across multiple parts in pipeline (#827) (acb0e21)
  • pull image with progress (#777) (5cad086)
  • raise error when timeout in api call (#793) (eafd4df)
  • raise policy violation (#894) (5b9d007)
  • reanalyze competition info & pipeline coding evaluator prompt (#837) (f7b5258)
  • refine merge (#842) (99463b4)
  • refine prompt (#760) (a91b182)
  • replace hard-coded cache paths with dynamic cache_path config (#952) (db56894)
  • revert draft stage into a soft decay in hypothesis selection (#849) (d41db0c)
  • trace merging (#836) (a3d5473)
  • truncate by time (#863) (2b9427a)
  • update prompt to improve json respond format of some LLM models (#928) (0b84709)
  • using different chat model in different part (#822) (c052ea6)

Bug Fixes

  • 'DSProposalV2ExpGen' object has no attribute 'COMPONENT_TASK_MAP… (#950) (e353895)
  • adapting UI to mock trace (#841) (8a5754c)
  • add missing semicolon after chmod in env shell command (#955) (1128eaa)
  • add time to timer when api timeout bug (#826) (f45d6ae)
  • add wait_retry to exp_gen v2 (#783) (b9fb7cf)
  • adjust ds_trace lookup and add stderr redirect to mlebench command (#853) (4e53108)
  • align competion_full_desc and scenario_all_desc, remove redundant info in problems proposal (#808) (76d8536)
  • bug fix in timer start (#807) (9af7161)
  • bug in problem identification (#806) (e1d5a29)
  • conda error information (#941) (fd39a94)
  • default cost to NaN when calculation fails in LiteLLM backend (#912) (51a4048)
  • ds trace (#929) (127e441)
  • duplicate model names test in pipeline coder & runner (#763) (be3ee9d)
  • filter system metadata dirs and init missing DSTrace attribute (#946) (10050ef)
  • fix a bug in docker result extraction (#824) (e1c0f98)
  • fix competition metric direction (#784) (3be0057)
  • fix model input shape bug and costeer_model bug (#821) (b34bd89)
  • fix some minor bugs (#758) (963f96e)
  • fix some minor bugs in qlib scenario (#817) (79962a7)
  • fix the bug in the regular expression matching for stdout (#890) (ee57e37)
  • fix the bug of Exceed-LLM-Context in online merge of multi-tarce (#892) (f760a3e)
  • fix the problems weights bug (#898) (013d79f)
  • fixed CI execution failures caused by document builds (#857) (5c116b2)
  • get_metric_direction for aerial-cactus-identification (#970) (70dc62d)
  • import path of T (#787) (ac008a6)
  • improve eval alignment check (e.g. small-scale finetuning) (#802) (d391578)
  • improve file tree and _walk symlink handling (#877) (516cb69)
  • log info (#965) (f1dbc21)
  • main bug (#938) (c6d34d6)
  • non-exist variable test_eval.py (#847) (4948c38)
  • refine feedback prompt (#901) (12bb2c4)
  • refine the time/memory constraints prompt in hypothesis proposal (#856) (51ce8ef)
  • Set PYTHONPATH in env.run_ret_code call in FBWorkspace class (#755) (68b5018)
  • task_gen for better understanding (#752) (6bfc1e5)
  • trace list but (#852) (32cdc57)
  • typo in workflow (#861) (0e54c9f)
  • update DS env setup with competition volume and timeout (#878) (816ada0)
  • update feedback.py (#772) (133778c)
  • update metric direction to return bool (#791) (0bf365e)
  • update runner max loop to 1 in DS scenario (#820) (3da378e)
  • use fallback messages for missing submission and scores files (#882) (898fdea)
  • use simple stdout and stderr (#966) (0b1c445)
  • use trace count as index (#909) (b87de56)
  • wrong variable test_eval.py (#846) (808ea6c)