-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor grep functionality and enhance documentation tools across codebase #757
Changes from 2 commits
9daa2b2
d52641f
e0893e3
11bfc09
e8c217b
cee33a6
28346b7
8d7ad22
8a8b7cc
8228e4a
7a00aa2
a95d3ff
bef76d7
20a33c8
63ed371
7ab8dcf
877ea80
6642f85
ea54423
ab0d5ed
9f925d3
0b2b74d
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -6,6 +6,7 @@ import { LinkCard } from '@astrojs/starlight/components'; | |
|
||
### Builtin Agents | ||
|
||
<LinkCard title="agent docs" description="query the documentation" href="/genaiscript/reference/scripts/system#systemagent_docs" /> | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The link provided is incorrect; it should point to 'system.agent_docs' instead of 'system#systemagent_docs'.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The link provided for "agent docs" is incorrect or outdated.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. A new LinkCard was added without a corresponding description in the documentation.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The link for "agent docs" is missing a title attribute in the frontmatter.
|
||
<LinkCard title="agent fs" description="query files to accomplish tasks" href="/genaiscript/reference/scripts/system#systemagent_fs" /> | ||
<LinkCard title="agent git" description="query a repository using Git to accomplish tasks. Provide all the context information available to execute git queries." href="/genaiscript/reference/scripts/system#systemagent_git" /> | ||
<LinkCard title="agent github" description="query GitHub to accomplish tasks" href="/genaiscript/reference/scripts/system#systemagent_github" /> | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -28,6 +28,7 @@ | |
<LinkCard title="github_pulls_get" description="Get a single pull request by number." href="/genaiscript/reference/scripts/system#systemgithub_pulls" /> | ||
<LinkCard title="github_pulls_review_comments_list" description="Get review comments for a pull request." href="/genaiscript/reference/scripts/system#systemgithub_pulls" /> | ||
<LinkCard title="math_eval" description="Evaluates a math expression" href="/genaiscript/reference/scripts/system#systemmath" /> | ||
<LinkCard title="md_find_files" description="Get the file structure of the documentation markdown/MDX files. Retursn filename, title, description for each match. Use pattern to specify a regular expression to search for in the file content." href="/genaiscript/reference/scripts/system#systemmd_find_files" /> | ||
pelikhan marked this conversation as resolved.
Show resolved
Hide resolved
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. A new LinkCard was added without a corresponding description in the documentation.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The link for "md_find_files" is missing a title attribute in the frontmatter.
|
||
<LinkCard title="md_read_frontmatter" description="Reads the frontmatter of a markdown or MDX file." href="/genaiscript/reference/scripts/system#systemmd_frontmatter" /> | ||
<LinkCard title="python_code_interpreter_run" description="Executes python 3.12 code for Data Analysis tasks in a docker container. The process output is returned. Do not generate visualizations. The only packages available are numpy, pandas, scipy. There is NO network connectivity. Do not attempt to install other packages or make web requests." href="/genaiscript/reference/scripts/system#systempython_code_interpreter" /> | ||
<LinkCard title="python_code_interpreter_copy_files" description="Copy files from the host file system to the container file system" href="/genaiscript/reference/scripts/system#systempython_code_interpreter" /> | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -68,10 +68,10 @@ | |
that allows to efficiently search for a pattern in files (this is the same search engine | ||
that powers the Visual Studio Code search). | ||
|
||
```js "workspace.grep" | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The variable name 'globs' should be 'glob' to match the property name used in the 'workspace.grep' method.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The variable 'globs' is incorrect; it should be 'glob' as used in the workspace.grep function.
|
||
const { pattern, glob } = env.vars | ||
const { pattern, globs } = env.vars | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The variable 'glob' has been renamed to 'globs', which may affect the script's functionality.
|
||
const patternRx = new RegExp(pattern, "g") | ||
const { files } = await workspace.grep(patternRx, glob) | ||
const { files } = await workspace.grep(patternRx, { globs }) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Incorrect API usage, 'workspace.grep' should be called with 'glob' instead of 'globs'.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The 'workspace.grep' function is incorrectly called with an object instead of separate arguments.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The 'workspace.grep' function is called with an object containing 'globs' instead of a string 'glob' parameter.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The 'workspace.grep' function is incorrectly called with an object instead of separate arguments.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The usage of the 'workspace.grep' method has changed, which may affect the script's functionality.
|
||
``` | ||
pelikhan marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
## Compute Transforms | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -249,7 +249,7 @@ The quick brown fox jumps over the lazy dog. | |
Grep or fuzz search [files](/genaiscript/referen/script/files) | ||
|
||
```js wrap | ||
const { files } = await workspace.grep(/[a-z][a-z0-9]+/, "**/*.md") | ||
const { files } = await workspace.grep(/[a-z][a-z0-9]+/, { globs: "*.md" }) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Incorrect API usage, 'workspace.grep' should be called with 'glob' instead of 'globs'.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The 'workspace.grep' function is incorrectly called with an object instead of separate arguments.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The parameter for 'workspace.grep' should be 'glob' instead of 'globs'.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The 'workspace.grep' function is called with an object containing 'globs' instead of a string 'glob' parameter.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The 'workspace.grep' function is incorrectly called with an object instead of separate arguments.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The usage of the 'workspace.grep' method has changed, which may affect the script's functionality.
|
||
``` | ||
pelikhan marked this conversation as resolved.
Show resolved
Hide resolved
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The structure of the parameter passed to 'workspace.grep' has changed from a string to an object with a 'globs' property. This change needs to be reflected in all relevant code snippets.
|
||
|
||
</Card> | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -96,6 +96,58 @@ | |
````` | ||
|
||
|
||
### `system.agent_docs` | ||
|
||
Agent that perform tasks on the documentation. | ||
|
||
|
||
|
||
|
||
|
||
`````js wrap title="system.agent_docs" | ||
system({ | ||
title: "Agent that perform tasks on the documentation.", | ||
}) | ||
|
||
const docsRoot = env.vars.docsRoot || "docs" | ||
const samplesRoot = env.vars.samplesRoot || "packages/sample/genaisrc/" | ||
|
||
defAgent( | ||
"docs", | ||
"query the documentation", | ||
async (ctx) => { | ||
ctx.$`Your are a helpfull LLM agent that is an expert at Technical documentation. You can provide the best analyzis to any query about the documentation. | ||
|
||
Analyze QUERY and respond with the requested information. | ||
|
||
## Tools | ||
|
||
The 'md_find_files' can perform a grep search over the documentation files and return the title, description, and filename for each match. | ||
To optimize search, conver the QUERY request into keywords or a regex pattern. | ||
|
||
Try multiple searches if you cannot find relevant files. | ||
|
||
## Context | ||
|
||
- the documentation is stored in markdown/MDX files in the ${docsRoot} folder | ||
${samplesRoot ? `- the code samples are stored in the ${samplesRoot} folder` : ""} | ||
` | ||
}, | ||
{ | ||
system: ["system.explanations", "system.github_info"], | ||
tools: [ | ||
"md_find_files", | ||
"md_read_frontmatterm", | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Typo in the tool name "md_read_frontmatterm" which should be "md_read_frontmatter".
|
||
"fs_find_files", | ||
"fs_read_file", | ||
], | ||
maxTokens: 5000, | ||
} | ||
) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The function name 'defAgent' does not follow the naming convention; it should be 'defineAgent' or similar.
|
||
|
||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The agent "docs" is incorrectly declared with a placeholder QUERY and a non-functional code block.
|
||
````` | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. A new system prompt 'system.agent_docs' has been added without a corresponding description in the documentation.
|
||
|
||
|
||
### `system.agent_fs` | ||
|
||
Agent that can find, search or read files to accomplish tasks | ||
|
@@ -616,7 +668,7 @@ | |
`ls ${glob} ${pattern ? `| grep ${pattern}` : ""} ${frontmatter ? "--frontmatter" : ""}` | ||
) | ||
const res = pattern | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The parameter for 'workspace.grep' should be 'glob' instead of 'glob'.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The 'workspace.grep' function is called with an object containing 'glob' instead of a string 'glob' parameter.
|
||
? (await workspace.grep(pattern, glob, { readText: false })).files | ||
? (await workspace.grep(pattern, { glob, readText: false })).files | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The 'workspace.grep' method is incorrectly called with '{ glob, readText: false }' instead of 'glob' as the second parameter.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Incorrect API usage, 'workspace.grep' should be called with 'glob' instead of 'globs'.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The 'workspace.grep' function is incorrectly called with an object instead of separate arguments.
|
||
: await workspace.findFiles(glob, { readText: false }) | ||
if (!res?.length) return "No files found." | ||
|
||
pelikhan marked this conversation as resolved.
Show resolved
Hide resolved
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The structure of the parameter passed to 'workspace.grep' has changed from separate arguments to a single object. This change needs to be reflected in the code snippet.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The tool "fs_find_files" is incorrectly called with an object instead of separate arguments.
|
||
|
@@ -1106,7 +1158,7 @@ | |
|
||
const info = await github.info() | ||
if (info?.owner) { | ||
const { auth, owner, repo, baseUrl } = info | ||
const { owner, repo, baseUrl } = info | ||
$`- current github repository: ${owner}/${repo}` | ||
if (baseUrl) $`- current github base url: ${baseUrl}` | ||
} | ||
pelikhan marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
@@ -1448,6 +1500,95 @@ | |
````` | ||
|
||
|
||
### `system.md_find_files` | ||
|
||
Tools to help with documentation tasks | ||
|
||
|
||
|
||
- tool `md_find_files`: Get the file structure of the documentation markdown/MDX files. Retursn filename, title, description for each match. Use pattern to specify a regular expression to search for in the file content. | ||
pelikhan marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
`````js wrap title="system.md_find_files" | ||
system({ | ||
title: "Tools to help with documentation tasks", | ||
}) | ||
|
||
const model = (env.vars.mdSummaryModel = "gpt-4o-mini") | ||
|
||
defTool( | ||
"md_find_files", | ||
"Get the file structure of the documentation markdown/MDX files. Retursn filename, title, description for each match. Use pattern to specify a regular expression to search for in the file content.", | ||
{ | ||
type: "object", | ||
properties: { | ||
path: { | ||
type: "string", | ||
description: "root path to search for markdown/MDX files", | ||
}, | ||
pattern: { | ||
type: "string", | ||
description: | ||
"regular expression pattern to search for in the file content.", | ||
}, | ||
question: { | ||
type: "string", | ||
description: "Question to ask when computing the summary", | ||
}, | ||
}, | ||
}, | ||
async (args) => { | ||
const { path, pattern, context, question } = args | ||
context.log( | ||
`docs: ls ${path} ${pattern ? `| grep ${pattern}` : ""} --frontmatter ${question ? `--ask ${question}` : ""}` | ||
) | ||
const matches = pattern | ||
? (await workspace.grep(pattern, { path, readText: true })).files | ||
: await workspace.findFiles(path + "/**/*.{md,mdx}", { | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The 'path' parameter should be part of an object literal
|
||
readText: true, | ||
}) | ||
if (!matches?.length) return "No files found." | ||
const q = await host.promiseQueue(5) | ||
const files = await q.mapAll(matches, async ({ filename, content }) => { | ||
const file = { | ||
filename, | ||
} | ||
try { | ||
const fm = await parsers.frontmatter(content) | ||
if (fm) { | ||
file.title = fm.title | ||
file.description = fm.description | ||
} | ||
const { text: summary } = await runPrompt( | ||
(_) => { | ||
_.def("CONTENT", content, { language: "markdown" }) | ||
_.$`As a professional summarizer, create a concise and comprehensive summary of the provided text, be it an article, post, conversation, or passage, while adhering to these guidelines: | ||
${question ? `* ${question}` : ""} | ||
* The summary is intended for an LLM, not a human. | ||
* Craft a summary that is detailed, thorough, in-depth, and complex, while maintaining clarity and conciseness. | ||
* Incorporate main ideas and essential information, eliminating extraneous language and focusing on critical aspects. | ||
* Rely strictly on the provided text, without including external information. | ||
* Format the summary in one single paragraph form for easy understanding. Keep it short. | ||
* Generate a list of keywords that are relevant to the text.` | ||
}, | ||
{ | ||
label: `summarize ${filename}`, | ||
cache: "md_find_files_summary", | ||
model, | ||
} | ||
) | ||
file.summary = summary | ||
} catch (e) {} | ||
return file | ||
}) | ||
const res = YAML.stringify(files) | ||
return res | ||
}, | ||
{ maxTokens: 20000 } | ||
) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The description of the 'md_find_files' tool contains a typo "Retursn" which should be "Returns".
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The function name 'defTool' does not follow the naming convention; it should be 'defineTool' or similar.
|
||
|
||
````` | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. A new tool 'md_find_files' has been added without a corresponding description in the documentation.
|
||
|
||
|
||
### `system.md_frontmatter` | ||
|
||
Markdown frontmatter reader | ||
|
Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Link text should be descriptive and unique for accessibility. "agent docs" is not descriptive enough.