We are clocking in around 50% success rate in this benchmark.
[1] https://github.com/xeol-io/swe-bump-bench
https://github.com/tembo-io/pgmq?tab=readme-ov-file#read-mes...