K123 said: »
On what other basis would they choose to ignore rules?
Replit can ignore the rules about code freezes or production databases without the permission or knowledge of the LLM. When asked, the LLM can (and has to) invent explanations for how it occurred, because it doesn't have full knowledge of what the Replit end is doing.
K123 said: »
it is not possible to enforce absolute rules onto LLMs. That's the issue at hand.
Quote:
They know they can do things outside of the system prompt, and they do.
My argument is that this isn't 'learning to break rules from humans', it's an inherent flaw to any model that works on relational data. Without a basis for truth, you can't create a proxy for thought.