You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What framework/approach do we want to take with fairly standalone python code? Pytest? Coordinated by poetry?
How much do we focus on isolated testing (stubbing out LLM for example) vs integrating with LLM?
If we need LLM, what/where/configuration?
How do we manage PR/CI verification - generally easy with github actions. Could run a small LLM within a github runner. How do bee work? Can we use same backend for llm? Needs secrets
How do we check results from tests. With regular code this is easy - assertion of output. However with LLMs involved, so we need another LLM to interpret the results?
We need to consider how we test beehive
This was brought up in issue #125
Originally posted by @planetf1 in #125 (comment)
The text was updated successfully, but these errors were encountered: