Coding with LLMs (Claude Code, OpenAI Codex) is often presented as the ‘killer app’ for Generative AI. But looking at data, it seems the one piece of the puzzle missing is actual cost. …
I use it a lot, and if you are getting these kinds of results you are either trolling, or just flat out not providing the details and guardrails required with your prompts.
I’ve been in software for decades, and if used correctly, yes it can accelerate velocity of building code out. 10x? No… if you are lucky and careful perhaps 2-4x.
As ALWAYS the human should be in the loop and is on the hook for any code generated.
Just gotta configure and tweak until it gives outputs you find indistinguishable from correct. Just gotta train it to gaslight you properly. Come on don’t you want to be given and endless stream of stuff that looks correct?
I was using a set of template files designed for LLMs to review that project. It is absolutely the fault of Claude that it tools me to do something one way, then told me to try another and when I reverted it said it was the optimal approach.
Where I find it helps is in getting initial starts and as a start to code review. But in both cases they aren’t ever operating on their own and their feedback is filtered through myself or another senior dev.
I use it a lot, and if you are getting these kinds of results you are either trolling, or just flat out not providing the details and guardrails required with your prompts.
I’ve been in software for decades, and if used correctly, yes it can accelerate velocity of building code out. 10x? No… if you are lucky and careful perhaps 2-4x.
As ALWAYS the human should be in the loop and is on the hook for any code generated.
You just need a better prompt, bro
Just gotta configure and tweak until it gives outputs you find indistinguishable from correct. Just gotta train it to gaslight you properly. Come on don’t you want to be given and endless stream of stuff that looks correct?
I was using a set of template files designed for LLMs to review that project. It is absolutely the fault of Claude that it tools me to do something one way, then told me to try another and when I reverted it said it was the optimal approach.
Where I find it helps is in getting initial starts and as a start to code review. But in both cases they aren’t ever operating on their own and their feedback is filtered through myself or another senior dev.