I’ve seen a few articles saying that instead of hating AI, the real quiet programmers young and old are loving it and have a renewed sense of purpose coding with llm helpers (this article was also hating on ed zitiron, which makes sense why it would).
Is this total bullshit? I have to admit, even though it makes me ill, I’ve used llms a few times to help me learn simple code syntax quickly (im and absolute noob who’s wanted my whole life to learn code but cant grasp it very well). But yes, a lot of time its wrong.
I’m enjoying it, mostly. It’s definitely great at some tasks and terrible at orhers. You get a feel for what those are after a while:
-
Throwaway projects - proof of concepts, one-off static websites, that kind of thing: absolutely ideal. Weeks of dev becomes hours, and you barely need to bother reviewing it if it works.
-
Research (find a tool for doing XYZ) where you barely know the right search terms: ideal. The research mode on claude.ai is especially amazing at this.
-
Anything where the language is unfamiliar. AI bootstraps past most of the learning curve. Doesn’t help you learn much, but sometimes you don’t care about learning the codebase layout and you just need to fix something.
-
Any medium sized project with a detailed up front description.
What it’s not good for:
- Debugging in a complex system
- Tiny projects (one line change), faster to do it yourself
- Large projects (500+ line change) - the diff becomes unreviewable fairly quickly and can’t be trusted (much worse than the same problem with a human where you can at least trust the intent)
-
Its an absolute gamechanger, IMO - the research phase of any task is reduced to effectively nothing, and I get massive amounts of work done when I walk away from my desk, because I plan for and keep lists of longer tasks to accomplish during those times.
You need to review every line of code it writes, but that’s no different than it ever was when working with junior devs 🤷♂️ but now I get the code in minutes instead of weeks and the agents actually react to my comments.
We’re using this with a massive monorepo containing hundreds of thousands of lines of code, and in tiny tool repos that serve exactly one purpose. If our code quality checks and standards werent as strict as they have been for the past decade, I think it wouldn’t work well with the monorepo.
The important part is that my company is paying for it - I have no clue what these tools cost. I am definitely more productive, there is absolutely no debate there IMO. Is the extra productivity worth the extra cost? I have literally no idea.
I’m not against AI use in software development… But you need to understand what the tools you use actually do.
An LLM is not a dev. It doesn’t have the capability to think on a problem and come up with a solution. If you use an LLM as a dev, you are an idiot pressing buttons on a black box you understand nothing about.
An LLM is a predictive tool. So use it as a predictive tool.
- Boilerplate code? It can do that, yeah. I don’t like to use it that way, but it can do that.
- Implementing a new feature? Maybe, if you’re lucky, it has been trained on enough data that it can put something together. But you need to consider its output completely untrustworthy, and therefore it will require so much reviewing that it’s just better to write it yourself in the first place.
- Implementing something that solves a problem not solved before? Just don’t. Use your own brain, for fuck’s sake. That’s what you have been trained on.
The one use of AI, at the moment, that I actually like and actually improves my workflow is JetBrains’ full line completion AI. It very often accurately predicts what I want to write when it’s boilerplate-ish, and shuts up when I write something original.
Yes they do have the abikity to think and reason just like you (generally mush faster and slightly better)
https://medium.com/@leucopsis/how-gpt-5-compares-to-claude-opus-4-1-fd10af78ef90
96% on the AIME with zero tools. Only reading the question and reasoning through the answer
Absolutely not. This comment shows you have absolutely zero idea how an LLM works.
This is not true. They do not think or reason. They have code that appears to reason, but it definitely is not.
Once it gets off track it doesn’t consider that it is obviously wrong.
A simple math problem can fail and it is really obvious to a human for example.
No, they can’t think and reason. However, they can replicate and integrate the thinking and reasoning of many people who have written about similar problems. And yes, they can do it must faster than we could read a hundred search result pages. And yes, their output looks slightly better than many of us in many cases, because they are often dispensing best practices by duplicating the writings of experts. (In the best cases, that is.)
I use llms from both ends. It helps me plan an think through complex code architecture and helps me do the little stuff i do too infrequent to remember. Putting it all together is usually all me.
I use it to vet ideas, concepts, approaches, and paradigms. It’s amazing for rubber ducking. I don’t use it for wholesale code gen though.
And as a documentation companion it’s pretty rad. Not always right but generally gets things in the correct direction.
I’m pretty sure every time you use AI for programming your brain atrophies a little, even if you’re just looking something up. There’s value in the struggle.
So they can definitely speed you up, but be careful how you use it. There’s no value in a programmer who can only blindly recite LLM output.
There’s a balance to be struck in there somewhere, and I’m still figuring it out.
I’m pretty sure every time you use AI for programming your brain atrophies a little, even if you’re just looking something up. There’s value in the struggle.
I assume you were joking but some studies have come out recently that found this is exactly what happens and for more than just programming. (sorry it was a while ago so I dont have links)
Doesn’t sound like they’re joking to me.
There are similar studies on the effects of watching a Youtube video instead of reading a manual.
This is literally the exact same argument made against using books and developing writing.
From my experience it’s great at doing things that have been done 1000x before (which makes sense given the training data), but when it comes to building something novel it really struggles, especially if there’s 3rd party libraries involved that aren’t commonly used. So you end up spending a lot of time and money hand holding it through things that likely would have been quicker to do yourself.
the 1000x before bit has quite a few sideffects to it as well.
- lesser used languages suffer because there’s not enough training data. this gets annoying quickly when it overrides your static tools and suggests nonsense.
- larger training sets contain more vulnerabilities as most code is pretty terrible and may just be snippets that someone used once and threw away. owasp has a top 10 for a reason. take input validation for example, if I’m working on parsing a string there’s usually context such as is this trusted data or untrusted? if i don’t have that mental model where I’m thinking about the data i might see generated code and think it looks correct but in reality its extremely nefarious.
Its also trained on old stuff.
And because its old, you get some very strange side effects and less maintainability.
It’s decent at reviewing its own code, especially if you give it different lenses to look though.
“Analyze this code and look for security vulnerabilities.” “Analyze this code and look for ways to reduce complexity.”
And then… think about the response like it’s a random dude online reviewing your code. Lots of times it raises good issues but sometimes it tries too hard to find little shit that is at best a sidegrade.
this
The pycharm AI integration completes each line. That’s very useful when you are repeating a well known algorithm and not distracting when you are doing something unusual. So overall, for small things AI is a speed up. I haven’t tried asking chatgpt for bigger coffe chunks, I haven’t had the greatest experience with it up to now and ii don’t want to spend more time debugging than I am already.
Oh man, the Codeium auto complete in PyCharm has been just awful for me. Slow enough that it doesnt come up fast enough that I ever expect it (and rarely comes up when I pause to wait for it) then goes away instantly when I invariably continue typing when it comes up. Then won’t come back if I backspace, erase the word and start retyping it, etc. And competes with the old school pycharm auto complete sometimes which adds another layer of fun.
I don’t see how it could be more effecient to have AI generate something that you then have to review and make sure actually works over just writing the code yourself, unless you don’t know enough to code it yourself and just accept the AI generated code as-is without further review.
You can type at 300 words per minute with zero mistakes. Youre able to do than on systems youve never worked on before in languages youve never seen. #Doubt
I don’t see how it could be more effecient to have [a junior developer write] something that you then have to review and make sure actually works over just writing the code yourself…
- A junior dev wont be a junior dev their whole career, code reviews also educates them
- You can’t trust the quality of a junior’s work, but you can trust that they are able to understand the project and their role in it. LLMs are by definition unable to think and understand. Just pretty good at pretending they are. Which leads to the third point:
- When you “vibe code”, you don’t “just” have to review the produced code, you also have to constantly tell the LLM what you want it to do. And fight with it when it fucks up.
if the only point of hiring junior devs were to skill them up so they’d be useful in the future, nobody would hire junior devs
LLMs aren’t the brain: they’re exactly what they are… a fancy auto complete…
type a function header, let if fill the body… as long as you’re descriptive enough and the function is simple enough to understand (as all well structured code should be) it usually gets it pretty right: it’s somewhat of a substitute for libraries, but not for your own structure
let it generate unit tests: doesn’t matter if it gets it wrong because the test will fail; it’ll write a pretty solid test suite using edge cases you may have forgotten
fill lines of data based on other data structures: it can transform text quicker than you can write regex and i’ve never had it fail at this
let it name functions based on a description… you can’t think of the words, but an LLM has a very wide vocabulary and - whilst not knowledge - does have a pretty good handle on synonyms and summary etc
there’s load of things LLMs are good for, but unless you’re just learning something new and you know your code will be garbage anyway, none of those things replace your brain: just repetitive crap you probably hate to start with because you could explain it to a non-programmer and they could carry out the tasks
if the only point of hiring junior devs were to skill them up so they’d be useful in the future, nobody would hire junior devs
I never said that, and a single review already will make a junior dev better off the bat
LLMs aren’t the brain: they’re exactly what they are… a fancy auto complete
I agree, but then you say…
type a function header, let if fill the body… as long as you’re descriptive enough and the function is simple enough to understand (as all well structured code should be) it usually gets it pretty right: it’s somewhat of a substitute for libraries, but not for your own structure
…which says the other thing. Implementing a function isn’t for a “fancy autocomplete”, it’s for a brain to do. Unless all you do is reinventing the wheel, then yeah, it can generate a decent wheel for you.
let it generate unit tests: doesn’t matter if it gets it wrong because the test will fail; it’ll write a pretty solid test suite using edge cases you may have forgotten
Fuck no. If it gets the test wrong, it won’t necessarily fail. It might very well pass even when it should fail, and that’s something you won’t know unless you review every single line it spits out. That’s one of the worst areas to use an LLM.
fill lines of data based on other data structures: it can transform text quicker than you can write regex and i’ve never had it fail at this
I’m not sure what you mean by that.
let it name functions based on a description… you can’t think of the words, but an LLM has a very wide vocabulary and - whilst not knowledge - does have a pretty good handle on synonyms and summary etc
I agree with that, naming or even documenting is a good way to use an LLM. With supervision of course, but an imprecise name or documentation is not critical.
Implementing a function isn’t for a “fancy autocomplete”, it’s for a brain to do. Unless all you do is reinventing the wheel, then yeah, it can generate a decent wheel for you.
pretty much every line of code we write in modern software isn’t unique… we use so many orders of magnitude more lines of other people’s code than our own, we’re really just plumbing pipes together
most functions we write that aren’t business logic specific to the problem domain of our software (and even sometimes then) has been written before… the novel part isn’t in the function body: the low level instructions… the novel part is how those instructions are structured… that may as well be pseudocode, and that pseudocode may as well take the form of function headers
Fuck no. If it gets the test wrong, it won’t necessarily fail. It might very well pass even when it should fail, and that’s something you won’t know unless you review every single line it spits out. That’s one of the worst areas to use an LLM.
write tests, tests fail, write code, tests slowly start to pass until you’re done… this is how we’ve always done TDD because it ensures the tests fail when they should. this is a good idea with or without LLMs because humans fuck up unit tests all the time
I’m not sure what you mean by that.
for example, you have an external API of some kind with an enum expressed via JSON as a string and you want to implement that API including a proper Enum object… an LLM can more easily generate that code than i can, and the longer the list of values the more cumbersome the task gets
especially effective for generating API wrappers because they basically amount to function some_name -> api client -> call /api/someName
this is basically a data transformation problem: translate from some structure to a well-defined chunk of code that matches the semantics of your language of choice
this is annoying for a human, and an LLM can smash out a whole type safe library in seconds based on little more than plain english docs
it might not be 100% right, but the price for failure is an error that you’ll see and can fix before the code hits production
and of course it’s better to generate all this using swagger specs, but they’re not always available and tend not to follow language conventions quite so well
for a concrete example, i wanted to interact with blackmagic pocket cinema cameras via bluetooth in swift on ios: something they don’t provide an SDK for… they do, however document their bluetooth protocols
(page 157 if you’re interested)
it’s incredibly cumbersome, and basically involves packing binary data into a packet that represents a different protocol called SDI… this would have been horrible to try and work out on my own, but with the general idea of how the protocol worked, i structured the functions, wrote some test case using the examples they provided, handed chatgpt the pdf and used it to help me with the bitbanging nonsense and translating their commands and positionally placed binaries into actual function calls
could i have done it? sure, but why would i? chat gpt did in 10 seconds what probably would have taken me at least a few hours of copying data from 7 pages of a table in a pdf - a task i dont enjoy doing, in a language i don’t know very well
fill lines of data based on other data structures: it can transform text quicker than you can write regex and i’ve never had it fail at thisI’m
not sure what you mean by that.
Not speaking for them, but I use LLMs for this. You have lines of repetitive code, and you realize you need to swap the order of things within each line. You could brute force it, or you could write a regex search/replace. Instead, you tell the LLM to do it and it saves a lot of time.
Swapping the order of things is just one example. It can change capitalization, insert values, or generate endless amounts of mock data.
yup! absolutely this too - i provided a different example in my reply, but honestly this is exactly the thing i use it for most… type a couple of lines, it gets the idea of what you’re trying to copy, and then it’s just hitting accept until it’s done… it’s pretty close to 100% accurate, and even if it’s not… fixing it ain’t exactly hard!
Ah! That does seem useful indeed! Even just generating a bunch a dummy data.
I was tasked once with writing a front-end for an API that didn’t exist yet, but I had a model. I could have written a loop that generated “Person Man 1”, “Person Man 2”, etc. with all of the associated fields, but instead I gave the LLM my class definition and it spat out 50 people with unique names, phone numbers, emails, and everything. It made it easy to test the paging and especially the filtering. It also took like 30 seconds to ask for and receive.
I originally asked it to make punny names based on celebrities, and it said “I can’t do that.” ☹️
The junior developer can (hopefully) learn and improve.
LLMs are also improving though.
They’ll never be able to learn, though.
A LLM is merely a statistical model of its training material. Very well indexed but extremely lossy compression.
It will always be outdated. It can never become familiar with your codebase and coding practices. And it’ll always be extremely unreliable, because it’s just a text generator without any semblance of comprehension about what the texts it generates actually mean.
All it’ll ever be able to do is reproduce the standards as they were when its training model was captured.
If we are to compare it to a junior developer, it’d be someone who suffered a traumatic brain injury just after leaving college, which prevents them from ever learning anything new, makes them unaware that they can’t learn, and incapable of realising when they don’t know something, makes them unable to reason or comprehend what they are saying, and causes them to suffer from verbal diarrhoea and excessive sycophancy.
Now, such a tragically brain damaged individual might look like the ideal worker to the average CEO, but I definitely wouldn’t want them anywhere near my code.
I like using it. Mostly for quick ideation, and also for getting rid of some of the tedious shit I do.
Sometimes it suggests a module or library I have never heard of, then I go and look it up to make sure it is real, not malicious and well documented.
I also like using my self hosted AI to document my code base in a readme following a template I provide. It gets it pretty good and usually is like 60-80% accurate and to the form I like. I just edit up the remaining and correct mistakes. Saves me a ton of time.
I think the best way to use AI is to use it like a tool. Don’t have it write code for you, but use it to enhance your own ability.
In the grand scheme of things, I think AI code generators make people less efficient. Some studies have come out that indicate this. I’ve tried to use various AI tools, as I do like fields of AI/ML in general, but they would end up hampering my work in various ways.
You can either spend your time generating prompts, tweaking them until you get what you want and then using more prompts to refining the code until you end up with something that does what you want…
or you can just fucking write it yourself. And there’s the bonus of understanding how it works.
AI is probably fine for generating boiler plate code or repetitive simple stuff, but personally I wouldn’t trust it any further than that.
There is a middle ground. I have one prompt I use. I might tweak it a little for different technologies, languages, etc. only so I can fit more standards, documentation and example code in the upload limit.
And I ask it questions rather than asking it to write code. I have it review my code, suggest other ways of doing something, have it explain best practices, ask it to evaluate the maintainability, conformance to corporate standards, etc.
Sometimes it takes me down a rabbit hole when I’m outside my experience (so does Google and stack overflow for what it’s worth), but if you’re executing a task you understand well on your own, it can help you do it faster and/or better.
Not total bullshit, but it’s not great for all use cases:
For coding tasks the output looks good on the surface but often I end up changing stuff, meaning it would have been faster up do myself.
For coding I know little about (currently writing some GitHub actions), it’s great at explaining alternatives, pros and cons, to give me a rudimentary understanding of stuff
I’ve also used it to transcribe tutorial screencasts, and then afterwards having a secondary LLM use the transcription to generate documentation (include in prompt: "when relevant, generate examples, use markdown tables, generate plantuml etc)
I use it mainly to tweak things I can’t be bothered to dig into, like Jekyll or Wordpress templates. A few times I let it run and do a major refactor of some async back-end code. It botched the whole thing. Fortunately, easy to rewind everything from remote git repo.
Last week I started a brand new project, thought I’d have it write the boilerplate starter code. Described in detail what I was looking for. It sat there for ten minutes saying ‘Thinking’ and nothing happened. Killed it and created it myself. This was with Cursor using Claude. I’ve noticed it’s gotten worse lately, maybe because of the increased costs.
The hate is ridiculous as is the hype.
It’s a new tool that is often useful when used correctly. Don’t use it to write entire applications - that’s a recipe for disaster.
But if you’re learning a new language it’s amazing. You have an infinitely patient and immediately available tutor that can teach you a language’s syntax, best practices, etc. I don’t know why that would make you “ill” besides all the shame “real developers” seem to want to lump on anybody who uses AI. If you’re not concerned about passing some “I don’t use an IDE” nerd’s purity test you’ll be fine.
I wouldn’t know about professionally as I don’t work in the industry, but anecdotally a lot of young people I see use LLMs for everything. Meanwhile in the FOSS community online I see very little of AI/LLMs. I think it’s a cultural thing that will vary depending on what circle of people you’re looking at.