Release 1.1.0rc1 · microsoft/debug-gym

What's Changed

arxiv url by @xingdi-eric-yuan in #106
Readme install from pypi by @matheper in #108
Change credential order in AzureOpenAILLM for proper token retrieval by @matheper in #110
AzureOpenAI Chained Credential by @matheper in #111
Function Calling Syntax by @xingdi-eric-yuan in #109
Update human class to use tool calls by @matheper in #114
make sure the working dir is always in sys path by @xingdi-eric-yuan in #115
Validate human input by @matheper in #116
Bugfix human test by @matheper in #118
Disable strict mode by @xingdi-eric-yuan in #117
Fix env.rewrite_counter by @matheper in #119
Fixed EvalTool being called with extra kwargs when reacting to events by @matheper in #120
Add logs viewer for Froggy by @MarcCote in #95
Fix: Use json.dumps for tool arguments in OpenAILLM by @ShiZhengyan in #123
Tools observations by @matheper in #121
Fix 'utf-8' codec error with surrogate pairs in Unicode strings by @Copilot in #129
Add max_retries parameter to Human class to limit terminal read attempts by @Copilot in #126
Improve test coverage report and default pytest configs by @matheper in #98
Clean up pytest.ini, partially reverting #98 by @matheper in #132
Remove current file by @matheper in #127
Add start and end args to ViewTool by @matheper in #133
Minor Fixes by @xingdi-eric-yuan in #124
Fix kwargs by @xingdi-eric-yuan in #135
Fix kwargs for pdb tool by @xingdi-eric-yuan in #136
Fix type annotation: tool_call_list should be list not dict by @ShiZhengyan in #134
Enhance Agent logging to include step number and reason for termination by @matheper in #137
Improve visualization by @xingdi-eric-yuan in #138
Pdb current frame file by @matheper in #139
Pdb breakpoint handling by @matheper in #140
Refactor llm_api into debug_gym.llms subpackage by @MarcCote in #142
Resolve absolute path from RepoEnv by @matheper in #144
Use better command completion for Human Mode by @MarcCote in #143
Fix: resolve_path and is_editable to account for ignored and read-only files by @matheper in #145
Ignore files from .gitignore by @matheper in #146
Fix Aider ignore patterns and add tests for path resolution and ignored/read-only files by @matheper in #147
Fix issue resolving env.working_dir by @matheper in #148
Set default RepoEnv.dir_tree_depth to 1 by @matheper in #150
replace unescape by filtering non-utf8 chars in system prompts by @xingdi-eric-yuan in #151
Adding SWE-Smith support by @MarcCote in #122
Only load image for instance_id we want to tests by @MarcCote in #154
No eval shortcut by @matheper in #152
Parallel execution by @matheper in #153
Trajectory Filtering by @xingdi-eric-yuan in #141
For SWE-Smith, add a new test split distinct from train-789 by @MarcCote in #156
Fix ViewTool handling empty files by @matheper in #157
Improve Retry by @xingdi-eric-yuan in #158
Support pickling PDBTool instances by @threewisemonkeys-as in #166
Resolve path mismatch issue raised on macOS by @dkokkotas in #159
Context change for SFT by @xingdi-eric-yuan in #162
Fix PDB indentation mismatch in list output context by @Copilot in #161
Fix init obs by @xingdi-eric-yuan in #171
Integrating thinking by @xingdi-eric-yuan in #172
Show pytest traceback for test failures. by @MarcCote in #173
Rich logger by @matheper in #170
Agents rich progress by @matheper in #174
A set of fixes by @xingdi-eric-yuan in #175
Add get_problem_ids (formerly get_dataset_split) to all benchmark env by @MarcCote in #176
Pin swe-smith version by @matheper in #181
Change to when tool call is on auto parsing by @icwhite in #182
Add memory limit to Docker containers by @matheper in #183
Disable rich live in human mode by @matheper in #180
Pin dataset revision for SWE-Smith dataset by @MarcCote in #188
Fix progress skip by @matheper in #186
Agent timeout by @matheper in #189
Always use debug level when logging to file by @matheper in #191
Add logfile to progress tracker. Stop pending tasks' spinner. by @MarcCote in #190
PyPI release action by @matheper in #192
Use hasattr instead of check dict to support subclassed methods. by @MarcCote in #194
Fix release pipeline by @matheper in #195
Dump experiment info by @matheper in #193
Avoid copying pycache in the temp workding dir. by @MarcCote in #196

New Contributors

@ShiZhengyan made their first contribution in #123
@Copilot made their first contribution in #129
@threewisemonkeys-as made their first contribution in #166
@dkokkotas made their first contribution in #159
@icwhite made their first contribution in #182

Full Changelog: 1.0.0...1.1.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

1.1.0rc1

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

What's Changed

New Contributors

Contributors

Uh oh!