1.1.0rc1
Pre-release
Pre-release
What's Changed
- arxiv url by @xingdi-eric-yuan in #106
- Readme install from pypi by @matheper in #108
- Change credential order in AzureOpenAILLM for proper token retrieval by @matheper in #110
- AzureOpenAI Chained Credential by @matheper in #111
- Function Calling Syntax by @xingdi-eric-yuan in #109
- Update human class to use tool calls by @matheper in #114
- make sure the working dir is always in sys path by @xingdi-eric-yuan in #115
- Validate human input by @matheper in #116
- Bugfix human test by @matheper in #118
- Disable strict mode by @xingdi-eric-yuan in #117
- Fix env.rewrite_counter by @matheper in #119
- Fixed EvalTool being called with extra kwargs when reacting to events by @matheper in #120
- Add logs viewer for Froggy by @MarcCote in #95
- Fix: Use json.dumps for tool arguments in OpenAILLM by @ShiZhengyan in #123
- Tools observations by @matheper in #121
- Fix 'utf-8' codec error with surrogate pairs in Unicode strings by @Copilot in #129
- Add max_retries parameter to Human class to limit terminal read attempts by @Copilot in #126
- Improve test coverage report and default pytest configs by @matheper in #98
- Clean up pytest.ini, partially reverting #98 by @matheper in #132
- Remove current file by @matheper in #127
- Add
startandendargs to ViewTool by @matheper in #133 - Minor Fixes by @xingdi-eric-yuan in #124
- Fix kwargs by @xingdi-eric-yuan in #135
- Fix kwargs for pdb tool by @xingdi-eric-yuan in #136
- Fix type annotation: tool_call_list should be list not dict by @ShiZhengyan in #134
- Enhance Agent logging to include step number and reason for termination by @matheper in #137
- Improve visualization by @xingdi-eric-yuan in #138
- Pdb current frame file by @matheper in #139
- Pdb breakpoint handling by @matheper in #140
- Refactor llm_api into debug_gym.llms subpackage by @MarcCote in #142
- Resolve absolute path from RepoEnv by @matheper in #144
- Use better command completion for Human Mode by @MarcCote in #143
- Fix:
resolve_pathandis_editableto account for ignored and read-only files by @matheper in #145 - Ignore files from .gitignore by @matheper in #146
- Fix Aider ignore patterns and add tests for path resolution and ignored/read-only files by @matheper in #147
- Fix issue resolving
env.working_dirby @matheper in #148 - Set default
RepoEnv.dir_tree_depthto 1 by @matheper in #150 - replace unescape by filtering non-utf8 chars in system prompts by @xingdi-eric-yuan in #151
- Adding SWE-Smith support by @MarcCote in #122
- Only load image for instance_id we want to tests by @MarcCote in #154
- No eval shortcut by @matheper in #152
- Parallel execution by @matheper in #153
- Trajectory Filtering by @xingdi-eric-yuan in #141
- For SWE-Smith, add a new test split distinct from train-789 by @MarcCote in #156
- Fix ViewTool handling empty files by @matheper in #157
- Improve Retry by @xingdi-eric-yuan in #158
- Support pickling
PDBToolinstances by @threewisemonkeys-as in #166 - Resolve path mismatch issue raised on macOS by @dkokkotas in #159
- Context change for SFT by @xingdi-eric-yuan in #162
- Fix PDB indentation mismatch in list output context by @Copilot in #161
- Fix init obs by @xingdi-eric-yuan in #171
- Integrating thinking by @xingdi-eric-yuan in #172
- Show pytest traceback for test failures. by @MarcCote in #173
- Rich logger by @matheper in #170
- Agents rich progress by @matheper in #174
- A set of fixes by @xingdi-eric-yuan in #175
- Add get_problem_ids (formerly get_dataset_split) to all benchmark env by @MarcCote in #176
- Pin swe-smith version by @matheper in #181
- Change to when tool call is on auto parsing by @icwhite in #182
- Add memory limit to Docker containers by @matheper in #183
- Disable rich live in human mode by @matheper in #180
- Pin dataset revision for SWE-Smith dataset by @MarcCote in #188
- Fix progress skip by @matheper in #186
- Agent timeout by @matheper in #189
- Always use debug level when logging to file by @matheper in #191
- Add logfile to progress tracker. Stop pending tasks' spinner. by @MarcCote in #190
- PyPI release action by @matheper in #192
- Use hasattr instead of check dict to support subclassed methods. by @MarcCote in #194
- Fix release pipeline by @matheper in #195
- Dump experiment info by @matheper in #193
- Avoid copying pycache in the temp workding dir. by @MarcCote in #196
New Contributors
- @ShiZhengyan made their first contribution in #123
- @Copilot made their first contribution in #129
- @threewisemonkeys-as made their first contribution in #166
- @dkokkotas made their first contribution in #159
- @icwhite made their first contribution in #182
Full Changelog: 1.0.0...1.1.0