Human vs AI: Who Writes Better Cypress Tests?
In the first post I asked whether the AI surprised you with what it caught or missed. After running this locally against Sauce Demo, here is my honest answer. It did surprise me. I went in expectin...

Source: DEV Community
In the first post I asked whether the AI surprised you with what it caught or missed. After running this locally against Sauce Demo, here is my honest answer. It did surprise me. I went in expecting the human to win. That is not quite what happened. After indexing the three docs into ChromaDB and running cy.prompt() with that context, I ran both tests. The same app, the same flows, one written by a human and one grounded in RAG context. The AI knew the locked out user scenario because it was in the bug history doc. It knew the exact selectors because they were in the component doc. It did not guess. It worked from what I gave it. Here are both tests it generated: Both tests passed. But here is where it gets interesting. The AI verified that an error message existed. It did not verify that the message said "Sorry, this user has been locked out." That is intent knowledge. It lives in someone's head, not in a doc. The human catches that. The AI does not. And anything that was never docume