Treat Agent Output Like Compiler Output

RSS Bot · 10 days ago

dustycups@aussie.zone · 10 days ago

I’m no expert in this field, but surely a compilers output is deterministic.

There are plenty of examples of LLM generated code doing suicidally reckless things. Testing it by setting it loose on production sounds… risky.

What kind of automated tests do you guys use & how would they go if you did zero code review?

Bourff@lemmy.world · 9 days ago

Sure, let’s use untrusted AI to verify untrusted AI-generated code, what could go wrong?