

I meant they’re specifically not going for that though. The experiment isn’t about improving the environment itself, it’s about improving the LLM. Otherwise they’d have spent the paper evaluating the effects of different environments and not different LLMs.

My job only exists because it was cheaper to hire several thousand tech workers to replace all the oracle products in house over several years than to keep using Oracle lmao