Natural language to bash environment#302
Open
PLippmann wants to merge 4 commits intoNousResearch:mainfrom
Open
Natural language to bash environment#302PLippmann wants to merge 4 commits intoNousResearch:mainfrom
PLippmann wants to merge 4 commits intoNousResearch:mainfrom
Conversation
Collaborator
|
Reviewing this week <3 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
PR Type
📝 General Information
Description
This PR adds a NL2Bash Generation Environment for training LLMs to convert natural language instructions into executable Bash commands.
Key Features:
1.0for correct matches,-1.0for incorrect answers🔖 Environment Snapshot
🧪 Zero-Training Test Results
Click to expand test results
Unit Tests: 23/23 passing
LLM Integration Test: Qwen2.5-1.5B-Instruct on NVIDIA A100
Examples:
✓ Good Example (Score: 1.0)
yazi✗ Bad Example (Score: -1.0)
tctl disconnecttailscale down✅ Developer & Reviewer Checklist