It seems that the e2e_tests.sh script is flaky, for example:
Anyway, these failures turn the tree red at seemingly random moments. The cause looks like an LLM capacity issue rather than a problem with the changes under test.
We need to fix, or at least alleviate, this before we can start enforcing "do not land on red" policies.