AI has a multiplying effect on existing technical skills

Waterluvian · 2026-05-22T14:01:24 1779458484

I had an Iron Man moment last week where I was “vibe coding” a UI design with component tests live on the other screen. Iterating by asking it to move things, reduce emphasis of an element, exploring layout options, etc. The loop was near realtime and felt amazing.

The code it generated was awful. The kind of garbage that people who don’t know any better would ship: it looked right and it worked. But it was instantly a maintenance dead end. But I had an effortless time converging on a design that I wouldn’t have been able to do on my own (I’m not a designer). And then I had a reference design and I manually implemented it with better code (the part I am good at).

adjejmxbdjdn · 2026-05-22T15:36:56 1779464216

Could it be that the fact that the thing you’re an expert at looked like garbage to you, but the things you’re not an expert at, looked just fine, is not a coincidence?

You can talk to a bunch of designers who will say the opposite. Claude Design Studio generated this garbage UI, that I fixed manually, but it created great code j never could have that made it work.

reactordev · 2026-05-22T16:11:07 1779466267

This is the juxtaposition the general public is in. They don’t have advanced tech skills to know any better so they see an output that they can’t produce from their skills and think it’s great. Maybe it is, maybe it isn’t. What does the code look like?

genxy · 2026-05-22T16:30:01 1779467401

Both had a working prototype. The flaw everyone is making is that they are over focusing on the artifact and not that they have a shared tangible object that they can both editorialize and iterate on.

These systems should allow rapid iteration on discovery and thinking. One can now make a prototype a day that would have taken a week. That means that we should be able to converge on a much better design in the same amount of time it would have taken to make a v0 that turns how to have systemic flaws.

AI should scale our understanding of systems, not just shovel out half baked features and apps.

bluejellybean · 2026-05-22T18:44:57 1779475497

Road to hell is paved with a lot of 'shoulds' reality is a very different place filled with piles of trash and half baked ideas.

lobf · 2026-05-22T20:44:46 1779482686

This is where I’m at. I’ve always been a computer tinkerer but a novice coder at best. I work in the film industry, so I don’t need to know how to code.

Where I’m at when building personal applications for my home / life is: does the code execute and perform the desired task?

If so, what do I care how shitty it is? I’m not publishing these projects (for the most part… I have one joke application up at songshift.reachnick.co) so efficient, clean, secure code are not really a priority for me.

rhubarbtree · 2026-05-22T20:08:16 1779480496

Colleague (non-designer) generated UI with Claude. It was awful and broke basic design rules. So yes you may be right.

vslira · 2026-05-22T17:47:33 1779472053

Maybe specialists have a higher bar than consumers, and as a design consumer he's right about the design, and the designer is right about the code, if "being right" means "understanding what the end customer will think about this".

12_throw_away · 2026-05-22T20:21:36 1779481296

> Could it be that the fact that the thing you’re an expert at looked like garbage to you, but the things you’re not an expert at, looked just fine, is not a coincidence?

Well when you put it that way ... monetizing the Dunning-Kruger effect does actually sound like a very good business idea.

tomgp · 2026-05-22T16:26:59 1779467219

I think this is true, it's like a close relation of the Gell-Mann amnesia effect

https://en.wikipedia.org/wiki/Michael_Crichton#:~:text=%5B14...

chrismorgan · 2026-05-22T17:50:40 1779472240

More robust link (to the heading by ID, rather than by text directive with pre/post text that will change): https://en.wikipedia.org/wiki/Michael_Crichton#%22Gell-Mann_...

MichaelZuo · 2026-05-22T17:10:55 1779469855

This!

People are never perfectly even in intelligence across all possible disciplines.

ajross · 2026-05-22T19:05:27 1779476727

It's worth pointing out that Crichton coined that term during a period in his life where he was rapidly descending into conspiracy and iconoclastic thought, and this is of a piece with that.

Gell-Mann's observation was a sincere and thoughtful caution about the way we transmit information about complicated ideas. Crichton's "amnesia effect" is an excuse to ignore media you dislike.

dataflow · 2026-05-22T16:45:30 1779468330

I'm confused, this doesn't make sense. The target they're iterating on (UI) is the same one whose quality they're assessing, not a different one (source code).

You're suggesting that (a) their UI skills are lacking (based on what? isn't UI exactly what they were iterating on and trying to improve?), and (b) that a real UI expert would've somehow felt the UI they were working on was consistently garbage, despite how many times they iterate on it?

Which means you're saying you don't believe anyone can actually produce high quality (to an expert) output with AI on the same target they're working on, and if they think they are, that just means they don't have a good sense of quality?

danielmarkbruce · 2026-05-22T17:08:54 1779469734

It's not confusing. It makes sense.

8note · 2026-05-22T19:48:42 1779479322

no, it is confusing.

the llm produced something the operator thought was garbage for the design too, and the operator iterated it from garbage to good.

they could also have the llm iterate the underlying code from garbage to good, if they wanted.

most likely a specialist would say its neither good nor bad, since its not considering the right things, and hasnt collected the right useability feedback, but making straightforward designs isnt that hard, and counting clicks and interactions, and avoiding hidden functionality is all measureable stuff

danielmarkbruce · 2026-05-22T20:38:45 1779482325

is functional but has bad UI/layout/etc is a thing.

It's only confusing because you don't know the field. Which is kind of the point.

lobf · 2026-05-22T20:47:44 1779482864

> is functional but has bad UI/layout/etc is a thing.

Tell me about it… I was forced to use a program called Farmer’s Wife for a time. What a fucking nightmare of a UX.

michaelchisari · 2026-05-22T17:19:16 1779470356

Without proper training, what looks good may be trash. I always thought pixel art generated by diffusion models looked damn good. Then I started watching and reading reviews by actual pixel artists, and all they saw was flaws. And it wasn't just nitpicking, it was things that were fundamentally wrong, difficult to fix and would look awful and amateurish and distracting to the player in production.

vunderba · 2026-05-22T17:25:32 1779470732

Much of this comes from the fact that, as is true for almost everything, an LLM (generative model etc) presents itself as an expert. It'll very confidently produce results that, to a layperson, look quite good. But the more of an expert you are in a field, the more apparent the cracks become.

AI pixel art looks particularly bad because most users don’t even go through the effort of downscaling and then upscaling it using something as simple as nearest-neighbor scaling, which by itself will squash out a lot of high-frequency noise that manifests in the form of terrible looking "fringing". Proper grid alignment also makes a big difference. It’s not perfect by a long shot, but it helps.

holoduke · 2026-05-22T18:36:02 1779474962

Ai is a hammer. Use it right and it makes you very powerful. But it's not an easy tool.

yawnxyz · 2026-05-22T19:26:59 1779478019

this is why people still enjoy eating at Olive Garden and Chipotle and Sweetgreen

basically the AI-slop version of food, yet still they thrive

jmount · 2026-05-22T16:25:42 1779467142

Good point on "Gell-Mann Amnesia Effect."

akersten · 2026-05-22T14:10:26 1779459026

> The code it generated was awful. The kind of garbage that people who don’t know any better would ship: it looked right and it worked. But it was instantly a maintenance dead end.

In the Tailwind thread the other day I was explicitly told that the intended experience of many frameworks is "write-only code" so maybe this is just the way of the future that we have to learn to embrace. Don't worry how it's all hooked up, if it works it works and if it stops working tell the AI to fix it.

It's kind of liberating I guess. I'm not sure if I've reached AI nirvana on accepting this yet, but I do think that moment is close.

hollowturtle · 2026-05-22T16:46:30 1779468390

I'm pretty you wouldn't want the same for code that runs healthcare, banks or transport. Only useless shitty web projects could embrace what's you're saying. And no there's no "Claude review the code and improve it" magical formula

QuercusMax · 2026-05-22T18:30:53 1779474653

I work in the health software space and there are tons of internal tools which aren't production code that can benefit massively from throwaway "write-only code". Putting a web UI on top of a management CLI tool so support ops can run things without needing an oncall engineer can be a huge win. I recently built a testing UI that doubles a demo-scenario-setup tool. Is it well-engineered? Who cares - it pokes the right things into the database and runs the right backend tasks, and has helped me catch and fix dozens of real bugs in the UIs that customers see.

There is an enormous untapped market for crappy low-effort apps which previously weren't worth the time - but with the effort so low put together a simple dashboard or one-off tool it becomes much more attractive.

simmerup · 2026-05-22T14:21:27 1779459687

The problem is it’s impossibly hard to test all the edge cases

Which is probably why so many random buttons in microsoft/apple/spotify just stop working once you get off the beaten path or load the app in some state which is slightly off base

marcosdumay · 2026-05-22T14:57:55 1779461875

The problem is worse than that.

The number of edge cases in a software is not fixed at all. One of the largest markers of competence in software development is being able to keep them at minimum, and LLMs tend to make that number higher than humanely possible.

disgruntledphd2 · 2026-05-22T14:22:46 1779459766

Yeah, the biggest thing I've noticed from LLMs is that large tech products now have even more bugs. Turns out the humans weren't so bad after all...

michaelcampbell · 2026-05-22T14:27:48 1779460068

> Turns out the humans weren't so bad after all...

The people pushing AI _over_ humans never thought they were. They just don't care about 'good' or 'bad', only 'time-to-market'. A bad app making money is better than a good one that isn't deployed yet. And who cares about anything past the end of the quarter? That's the next guy's problem.

louiereederson · 2026-05-22T14:30:52 1779460252

I'm wondering if companies are 'diverting' engineering resources from core products to AI products with the view that the former are legacy. Kind of two sides of the same coin though.

SpicyLemonZest · 2026-05-22T16:33:15 1779467595

I'm sure there's a lot of AI investment, but I've definitely also seen fixed sets of core product engineers shipping a lot more bugs these days.

noosphr · 2026-05-22T19:56:43 1779479803

The if in there is doing a lot of heavy lifting.

giancarlostoro · 2026-05-22T14:27:01 1779460021

Easy, have Claude review the code, tell it to be critical and that it needs to be easier to understand, follow Clean Code, SOLID principles and best practices. Lie to it, say you got this from a Junior developer, or "review it as if you were a Staff Level Engineer reviewing Junior code" the models can write better code, just nobody tells them to.

itsalwaysgood · 2026-05-22T20:34:04 1779482044

I've had success with this approach also. You do feel empowered, 10x or whatever: but then you're looking at more projects, context switching a lot more, and it can burn you out.

marcosdumay · 2026-05-22T14:53:32 1779461612

Lol, the only thing worse than a junior developer following Clean Code and SOLID has to be an LLM messing with code so it looks like it follows.

giancarlostoro · 2026-05-22T15:12:34 1779462754

Clean Code has its really "meh" areas, but the core idea and spirit of it is sound, heck Python's best guide is PEP-8 if you follow that, it forces you to write much better Python code.

In terms of "junior dev following" it would be the model trying to think and write it as a Senior or Staff Level engineer would.

HappySweeney · 2026-05-22T14:33:10 1779460390

Code review is the main thing I use LLMs for. I have found it to be remarkably candid when you tell it the code came from another LLM (even name it). I was running Kimi K2.6 Q4 locally, seeing if it could SIMD a bit-matrix transpose function, and it was slow enough that I would paste its thinking into Gemini every few minutes. Gemini was savage.

datsci_est_2015 · 2026-05-22T15:13:39 1779462819

> Gemini was savage.

Humorously, this could be the result of LLMs vacuuming up all the sentiment on the web that the code that LLMs produce is trash-tier.

kenjackson · 2026-05-22T14:36:58 1779460618

This is it. I've had a similar experience in just playing around I asked it to clean up some code it wrote to increase maintainability and readability by humans. After a few iterations it had generated quite solid code. It also broke the code a couple of times along the way. But it does get me thinking that these pipelines with agents doing specific tasks makes a lot of sense. One to design and architect, one to implement, one to clean, one to review, one to test (actually there's probably a bunch of different agents for testing -- testing perf/power, that it matches the requirements/spec, matches the design, is readable/maintainable, etc...).

giancarlostoro · 2026-05-22T14:41:57 1779460917

I built GuardRails after some frustrations with Beads which I love, and this whole exchange made me realize, because I have "gates" after tasks, I could add a "Review the code" type of gate, and probably get insanely better output, I already get reasonably good output because I spec out the requirements beforehand, that's the other thing, if you can tell the LLM HOW to build before it does, you will have better output.

ar_lan · 2026-05-22T15:59:55 1779465595

Why wouldn't Claude just impose this same loop in the code it writes - or better, write better code before it needs such review?

y1n0 · 2026-05-22T16:13:30 1779466410

Because language models don’t think before doing, they think by doing.

Maybe a more idealized training set could improve things, but at least for today’s SOTA, you have to get the shitty first draft out and then improve it.

Harnessing makes a difference, but it’s only shuffling around when and where the tokens get generated. It can trade being slower by doing a hidden first draft and only showing the output after doing a self review. But the models still need to generate it all explicitly.

AlecSchueler · 2026-05-22T16:56:39 1779468999

Why would it? It doesn't do anything with intention without being prompted. When you ask it to do something it's going to give you what seems like the most likely result, it isn't striving to give you the most correct result, those things just have some overlap.

giancarlostoro · 2026-05-22T16:10:12 1779466212

I assume it would involve wasting a lot more tokens reasoning about this. It is known that GPT uses less tokens than Claude, but Claude uses them to reason about problems more, which is part of its "secret sauce" and why so many swear by Claude Code.

enraged_camel · 2026-05-22T14:45:56 1779461156

Even better, if you have access to multiple models, tell it you got the code from another AI agent.

I did an experiment on this a few weekends ago and Codex for example was a lot more adversarial and thorough in its review when given Claude-authored code compared to when given the same code with "I wrote this, can you review it?"

giancarlostoro · 2026-05-22T14:47:43 1779461263

If it's within its context window, it will know you're lying, so either compact or start a new chat (don't do this on Claude, it dings your usage, always has).

therouwboat · 2026-05-22T15:56:19 1779465379

Is this a joke? Smartest people on the planet never thought about telling AI to just write better code?

ryandrake · 2026-05-22T16:55:00 1779468900

Kind of wild that you have to tell an LLM things like "do it right" and "make the code maintainable" and "don't make mistakes". Shouldn't that be the default? I wouldn't accept a calculator application that got math wrong unless you pressed a button labeled "actually solve the problem."

jmalicki · 2026-05-22T17:04:07 1779469447

> Kind of wild that you have to tell an LLM things like "do it right" and "make the code maintainable" and "don't make mistakes". Shouldn't that be the default?

It's not the default, because the training data is full of unmaintainable code done wrong with mistakes. People literally complain that LLMs write too many tests or add comments.

If instead of "do it right", you give it specific actionable advice of how to right code, it does surprisingly well. Newer frontier models also do a great job of mimicking the style and rigor of the surrounding codebase without prompting, if you're working in an established codebase, for better or worse.

tayo42 · 2026-05-22T17:22:29 1779470549

The default isn't necessarily what ever you consider maintianable or do it right, which are ambiguous terms anyway.

You never wrote quick exploratory code? One off scripts? How is the Ai suppsed to know unless you tell it.

If you tell another person to write some code, how are they suppsed to know? If you have your boss come to you and ask you to write some code to do some data analysis are you going to spend weeks writing units tests and perfect abstractions? Or do it quick and get the data and result?

giancarlostoro · 2026-05-22T16:11:10 1779466270

You forget that this all takes tokens from the model, so it has to be very stingy and whatever it comes up with "first" is what it goes with. I've seen people do the same as me, tell the model NOT TO GUESS but to do research first, which yields better output and saves time. Models today are better when they review the context directly, the focus shifted from it knowing everything in its training data to being able to dynamically learn new things and use that information in a meaningful way.

For example, I built up a programming language from scratch with Claude, it knows nuances about my languages syntax, and can write code in my language effectively. I did it mostly as a test. It definitely helped that my language is heavily mostly Python based.

naravara · 2026-05-22T17:49:58 1779472198

I have been wondering recently that if the cost of just throwing everything out and building it from scratch again gets low enough, maybe maintainability becomes less of a priority? Can we just embrace the thing like those Zen carpenters who build wooden fire shrines do where they just accept that the thing will keep burning down and they make a discipline around getting really good at rebuilding it?

Granted, the load bearing thing here is whether we’re actually getting good at rebuilding up to any sort of standard of quality. Or if the tooling is even structurally capable of doing that rather than just introducing new baskets of problems with each build.

albedoa · 2026-05-22T16:17:56 1779466676

I'm looking at that Tailwind thread. Do you really think that your comment here is a fair assessment of what you were told there? Come on now.

https://news.ycombinator.com/item?id=48166334

onion2k · 2026-05-22T20:28:18 1779481698

But it was instantly a maintenance dead end.

It doesn't really make sense to suggest AI can work on something any make it now and work correctly, and at the same time say it's unmaintainable. It is maintainable with AI.

The real question is whether or not you're happy to ship AI-generated code that you can't modify to production unless you use AI. Few developers are there yet, plenty of non-tech people are there already. I don't know which group is actually wrong.

jvanderbot · 2026-05-22T14:25:48 1779459948

I wonder how much of this is momentum.

At the moment, we understand the basic tech, could reasonably DIY, but choose not to knowing full well there's a mess of understandable code somewhere we could go clean up but dont want to. We accept fast iterations because we know roughly the shape of how it "should be" and can guide an automated framework towards that. This is especially true on our own projects or something we built originally! Stark/Iron man knew/moved, the suit assisted by adding momentum.

We're riding our "knowledge momentum".

If companies can hold out long enough, that knowledge completely fades, and the tool is all you have. At that point, they are locked in. Then it's not Iron man, it's an Iron lung (couldn't resist!)

lelanthran · 2026-05-22T19:46:02 1779479162

The blog post is spot on: AI is exactly like an Iron Man suit - if you never take it off your muscles will atrophy to nothing!

Waterluvian · 2026-05-22T14:37:16 1779460636

Yeah that’s my main concern. It feels so so easy to be lazy and do a bad job now. And then my skills weaken and what makes me valuable fades.

I love the Iron lung reference. Perfect.

theptip · 2026-05-22T15:17:22 1779463042

We collectively have to re-learn what operations are expensive and what are cheap.

Prototypes are practically free now. You can ask the AI try each architectural or stylistic option and just see which code you like better.

To your point, another interesting note is that rewriting and rearchitecting are also very good.

One pattern I like is to vibe code a set of solutions, pick the approach, then backfill tests and do major refactors to make it maintainable.

Here the skill is knowing what good architecture looks like, and knowing how to prompt and validate (eg what level of tests will speed up the feedback cycle or enable me to make the LLM’s changes legible).

To be fair the “ready, fire, aim” approach of rapid prototyping has been known for a long time, but you need to be quite quick at coding in old world for it to work well IMO.

iwiwk · 2026-05-22T15:29:49 1779463789

Free? Lol

eithed · 2026-05-22T14:34:29 1779460469

That's the model I've arrived to as well:

- first I've created a skill how the architecture of the system should look like

- I'll tell the LLM to follow the guidelines; it will not do that 100%, but it will be good enough

- I'll go through what it produced, align to the template; if I like something (either I've not thought about the problem in that way, or simply forgot) I add that to the skill template

- rinse and repeat

This is not only for architecture of the system, but also when (and how to) write backend, frontend, e2e tests, docs. I know what I want to achieve = I know how the code should be organized and how it should work, I know how tests should be written. LLMs allow me to eliminate the tediousness of following the same template every time. Without these guardrails it switches patterns so often, creating unmaintainable crap

Bear in mind - the output requires constant supervision = LLM will touch something I told it not to touch, or not follow what I told it to do. The amount of the output can also sometimes be overwhelming (so, peer review is still needed), but at this point I can iterate over what LLM produces with it, with another LLM, then give to a human if it together makes sense

dylan604 · 2026-05-22T14:08:56 1779458936

> But I had an effortless time converging on a design that I wouldn’t have been able to do on my own (I’m not a designer).

I'm not a designer either, but I've been around designers long enough to recognize when something is bad but just not know what is needed to make it better/good. I've taken time to find sites that are designed well and then recreated them by hand coding the html/css to the point that I consider myself pretty decent at css now. I don't need libraries or frameworks. My css/html is so much lighter than what's found in those frameworks as well. I still would not call myself a designer, but pages look like they were designed by a mediocre designer rather than an engineer :shrug:

onlyrealcuzzo · 2026-05-22T19:14:03 1779477243

I'm trying to test if vibe coding can actually scale... And man is it painful.

AI is great at creating slop that almost works.

But, my god, it is terrible at following clear as day instructions on how to cleanup slop.

It wrote 150k lines of code that almost works in 2 months. It's taken 1 month to delete about 2000 lines of broken architecture and fix it, and it still hasn't gotten it done, despite nonstop repeated efforts to do something not that hard.

I definitely could've fixed it less time then I've spent prompting at this point (but no way I'd have gotten the other 150k lines). But doing it myself is not the point. It's to see if it can actually scale.

The answer is yes... But my god is it agonizing.

The creating garbage part that almost works is fun.

The inevitable cleanup is not.

And unfortunately I don't see this aspect materially improving in the short term.

If you want it to code you something about 5-10k lines of code that's already been done 1000 times before or only slightly different, it's great.

Most people want more than that.

the__alchemist · 2026-05-22T14:08:37 1779458917

Tangent: I never learned how to make the sorts of websites people find "professional" or "pretty" I could make functional and easy-to-use webapps, but not something people would think looks good or like something they would want to use. LLMs crushed this, without performance overhead; can still be HTML/CSS/targetted JS.

snarf21 · 2026-05-22T14:06:44 1779458804

I feel the same but the question I struggle most with is this: "Does it matter when the people who are going to come along and maintain this are just going to use AI to fix or adjust this maintenance nightmare?"

Waterluvian · 2026-05-22T14:11:10 1779459070

At that point the code becomes a compile target, and then you need a new source of truth.

Which I think is perfectly worthy of exploration. Some people want to check in the prompts. Or even better, check in a plan.md or evenest betterest: some set of very well-defined specifications.

I'm not sure what the answer will be. Probably some mix of things. But today it is absolutely imperative that the code I write for the case I wrote it in is good quality and can be maintained by more than just me.

ff317 · 2026-05-22T14:55:28 1779461728

When we want to maintain a reliable, stable "product" in traditional software development (a binary executable artifact that ships out to users, or the binary engine of some SaaS the company sells to users), we don't just check in (to the source of truth repo) the actual application-layer source code. We also check in build instructions (think autoconf/cmake/etc) and have some concept of compiler compatibilities / versions, build environments, and papering over their runtime differences. And then our official executable output is not just defined by "Tag v1.23.45 of the application source code repo" - it's additionally defined by the build environment (including, critically, the compiler version, among many others).

It's tempting to move out a layer and try making prompts and plan.md the "source code", and then the generated actual-source-code becomes just another ephemeral form of "intermediate representation" in the toolchain while building the final executable product. But then how are you versioning the toolchain and maintaining any reasonable sense of "stability" (in terms of features/bugs/etc) in the final output?

Example: last week, someone ran our "LLM inputs" source code through AgentCo SuperModel-7-39b, and produced a product output that users loved and it seemed to work well. Next week, management asks for a new feature. The "developer" adds the new feature to the prompting with a few trial iterations, but the resulting new product now has 339 new subtle bugs in areas that were working fine in last week's build owing the fact that, in the meantime, AgentCo has tweaked some weights in SuperModel-7-39b under the hood because of some concern about CSAM results or whatever and this had subtle unrelated effects. Or better yet: next month, management has learned that OtherCo MegaModel-42.7c seems to be the new hotness and tells everyone to switch models. Re-building from our "source" with the new model fixes 72 known bugs filed by users, fixes another 337 bugs nobody had even noticed yet, and causes 111 new bugs to be created that are yet-unknown.

If you treat the output source code as a write-only messy artifact, and you don't have stable, repeatable models, and don't treat model updates/changes as carefully as switching compiler vendors and build environments, this kind of methodology can only lead to chaos.

And don't even get me started on the parallel excuses of "Your specifications should be more-perfect" (perfection is impossible), or "An expansive testsuite should catch and correct all new bugs" (also impossible. testing is only as good as the imperfect specification, and then layers in its own finite capabilities to boot).

user34283 · 2026-05-22T14:24:20 1779459860

I don't see the benefit of checking in either prompts or specs.

I never tried spec driven development for myself, but if I review other's MRs I am typically exhausted after the first 10 lines.

And there are hundreds of lines, nearly always with major inaccuracies.

For myself I always found the plan mode to work well. Once the implementation is done, the code is the source of truth. If it works, it works.

When I want to add more functionality or change it, I just tell the agent what I want changed.

I doubt walls of semi-accurate existing specs are going to be beneficial there, but maybe my work differs from yours.

gbear0 · 2026-05-22T15:05:30 1779462330

Those checked-in specs become the requirements for the system. So the next time you ask the AI to make a fix, it can use those specs as part of the solution and not break another requirement. Basically the code underneath keeps getting rewritten over and over, but that doesn't matter as long as it hits the required specs.

jmcodes · 2026-05-22T16:46:21 1779468381

Do you rewrite the specs with new requirement changes if they've already been implemented? How do you supercede a spec?

I've been using LLMs daily and I spun up a few spec driven flows once or twice but like the person above I think the code is the source of truth.

Also why wouldn't you use TDD to enforce the 'spec' then?

macintux · 2026-05-22T14:45:44 1779461144

I value traceability, and I value understanding the "why" of the code. For me, the prompts are useful for both.

mehagar · 2026-05-22T14:12:44 1779459164

Same. Messy code makes it harder for us to understand and thus maintain the code (which is why people often refer to code as a liability), but is that the case for AI tools as well? If not, it seems like clean code may not matter as much anymore.

n_e · 2026-05-22T14:26:39 1779459999

The problem with crappy frontend code is not only the maintenance. It's that stuff such as responsive design, accessibility or cross-browser compatibility that work nearly for free with elegant code won't work at all.

flyinglizard · 2026-05-22T14:12:20 1779459140

The problem is that technical debt is compounding. Bad LLM architectural and implementation decisions just blend in to the background and you build layer upon layer of a mess. At some point it becomes difficult and expensive (token wise) to maintain this code, even for an agent.

I mitigate this by few things: 1. Checkpoints every few days to thoroughly review and flag issues. Asking the LLM to impersonate (Linus Torvalds is my favorite) yields different results. 2. Frequent refactors. LLMs don't get discouraged from throwing things out like humans do. So I ask for a refactor when enough stuff accumulates. 3. Use verbose, typed languages. C# on the backend, TypeScript on the frontend.

Does it produce quality code? Locally yes, architecturally I don't know - it works so far, I guess. Anyway, my alternative is not to make this software I'm writing better but not making it at all for the lack of time, so even if it's subpar it still brings business value.

worldsayshi · 2026-05-22T14:08:54 1779458934

> The code it generated was awful.

I suppose you could solve that in two ways. Manually rewrite it as you did. Or formalize an architecture and let the AI rewrite it with that in mind. I suspect that either works.

bentcorner · 2026-05-22T17:55:49 1779472549

I suspect at some point AI-written code will be eventually artifacts generated build-to-build. The design docs and UI tests are the source and the model follows instructions to generate the product. If you make the models deterministic then model improvements give you code improvements across your entire codebase "for free".

neals · 2026-05-22T16:24:44 1779467084

Very recognizable and hard to reason about! I did something similar, but while looking at the code, it looked so procedural, hardly abstract, no vision. How was I ever going to maintain this? I guess Ai will do it forever?

EasyMark · 2026-05-22T15:11:42 1779462702

Did you try to have it clean up the code and refactor? I find while the code is usually low to mid tier that it’s a lot better than the first pass. I of course back up the working version lol. Usually I can coax something better out of it

noosphr · 2026-05-22T19:56:04 1779479764

This isn't an indictment of how good AI is but how poor our tools are. We had gui makers in 1996 that made slop which allowed you to iterate in real time. They didn't need a datacenter worth of compute and a nuclear reactors worth of power to run.

HikeThe46 · 2026-05-22T14:33:21 1779460401

If you are just blindly vibe coding without any parameters, guardrails, architecture, or broad guidance; you're going to have a mess of slop.

the power comes from creating the machine you can steer. Treat AI like an over eager college intern who you need to hand hold, but do tasks.

nomel · 2026-05-22T17:11:43 1779469903

> But it was instantly a maintenance dead end.

I gave up on this recently. It achieved the goal now, and in a year or two, when you actually want to add whatever feature, the SOTA AI will probably be able to clean it up as it does so. What does "maintain" even mean anymore?

If you don't agree, how many years into the future do we need until you would agree?

dbingham · 2026-05-22T17:17:10 1779470230

The problem is that people keep saying this, but the code keeps being bad. Every time I commit myself to trying to build something with AI, I end up wasting a ton of time and backing it out or completely rewriting it without the AI. The code it generates just isn't where it needs to be.

And people have been saying this exact thing for years now. Someone said this very thing two years ago. And we're still at the "maintenance dead end" stage. So let me flip it back on you: how many years are we going to pour an obscene amount of resources into this thing that is always going to be able to clean up its own messes "in a year or two" before we realize its a dead end (at best) and we need to be using those resources elsewhere? And, similarly, what happens to you when the SOTA AI in two years can't clean up the code it wrote for you two years ago, but people are depending on it and your still on the hook for maintaining it?

nomel · 2026-05-22T18:03:57 1779473037

> If you don't agree, how many years into the future do we need until you would agree?

Respectfully, I asked first. ;)

> before we realize its a dead end (at best)

You've declared the future, which doesn't leave much room for a conversation. So, cheers!

znpy · 2026-05-22T18:19:25 1779473965

> The kind of garbage that people who don’t know any better would ship: it looked right and it worked.

I feel what you write, but then again: every now and then i write small greasmonkey scripts to remove annoyances from webpages, and to do so i have to look at the html and the kind of trash you describe is already there.

threethirtytwo · 2026-05-22T16:45:57 1779468357

Well here's the million dollar question. It's a maintenance dead end for humans to read and edit. But for an LLM, is it a maintenance deadend? Could the LLM iterate on that same code base and be highly effective on it?

hollowturtle · 2026-05-22T16:48:26 1779468506

Yes it is a maintenance deadend for LLMs it's notorious codebases start accumulating enormous amount of tech debt and that it gets almost impossible to unravel it even with the best agent

threethirtytwo · 2026-05-22T17:39:51 1779471591

I've never seen it happen. You say this and it's likely one anecdotal claim. At the same time there's counter points like Bun getting rewritten in rust.

This is talk and talk is cheap. Prove it, otherwise it's still a million dollar question... unanswered.

HN is notoriously mentally deficient when it comes to AI. They were wrong about self driving cars (I sit in AI cars daily), they were wrong about AI getting used for coding (I don't use an IDE or type code anymore as a SWE). So I have to say unless there's something evidence based or substantial here it's likely given HN track record that most people here will end up being another wrong, baseless and over confident answer.

I'm looking for legit answers not confidently biased statements with no evidence.

rustyminnow · 2026-05-22T19:41:00 1779478860

Bun getting rewritten in Rust is not really the counter point you think it is. The rust version hasn't shipped yet, so there hasn't even been a chance to see if the code can be maintained. It's an impressive feat no doubt, but until they've maintained it on a months to years timeline, it's also just talk with no evidence.

ios-contractor · 2026-05-22T17:08:09 1779469689

I used AI to write code (features) then I used AI to refactor the architecture using best practices and get rid of the technical debt. I don't remember the last time I modified code by hand honestly.

danielmarkbruce · 2026-05-22T17:10:50 1779469850

This is the part people are missing. I spend 30-40% of my time "vibe coding" doing "vibe clean up". It's fine, I'm still 10x more productive than I ever was.

wiseowise · 2026-05-22T14:48:49 1779461329

> I had an Iron Man moment

Iron Man created Jarvis whose capabilities are way beyond any models in the near future. So it wasn’t an Iron Man moment.

Supermancho · 2026-05-22T18:59:21 1779476361

>> I had an Iron Man moment

> Iron Man created Jarvis whose capabilities are way beyond any models in the near future. So it wasn’t an Iron Man moment.

Like an LLM, you misunderstood the context. The voyeuristic experience doesn't require fiction to be reality.

etiam · 2026-05-22T15:12:53 1779462773

He was presumably also not constructing a powered exoskeleton of from fictional materials or a physically implausible power source, but since you obviously caught the reference, how about some benevolent interpretation instead, for a decent shorthand about working smoothly with AI assistance.

(And on a personal note, I'm glad we don't have a publicly released Jarvis before we get our act together about the use.)

gbear0 · 2026-05-22T14:51:12 1779461472

Why was it a maintenance dead end? It sounds like you were able to iteratively work on it in its current state, but are you going to be the one maintaining the code?

I keep asking myself the same questions, and the conclusion I keep coming to is the clean modeled structure we want to see is for humans to maintain and extend, but the AI doesn't need this.

There's definitely an efficiency angle here where it's faster for AI to go from a clean modeled solution to the desired solution because it's likely been trained on cleaner code. Is this really going to matter though?

The best argument I can come up with is the clean modeled solution is better for existing development tools because it's less likely to get confused by the patch work of vibes throughout the code; but this feels like it ultimately becomes an efficiency concern as well.

This just might be the new reality, and we need to stop looking behind the curtain and accept what the wizard presents us.

ttd · 2026-05-22T15:00:30 1779462030

> the clean modeled structure we want to see is for humans to maintain and extend, but the AI doesn't need this.

This does not match my experience. I do a lot of AI-assisted coding at this point, and what I've seen is that when the AI is asked to extend or modify existing code, it does a much better job on clean, well-structured and well-abstracted code.

I think the reason is simple, and tracks for humans as well: well-structured code is simply easier to understand and reason about, and takes a smaller amount of working-set memory. Even as LLMs get better with coding, I expect that they would converge on the same conclusion, namely that good structure + good abstractions make for code that is more efficient to work with.

empath75 · 2026-05-22T15:02:55 1779462175

Yeah I have had claude take over multiple internal (human written) projects that were in a dire state and spent a week just completely refactoring them and adding exhaustive tests before doing any new features. It's worth starting from a clean slate.

K0balt · 2026-05-22T15:12:35 1779462755

I keep hearing the assertion that you can’t make high quality, maintainable code with LLMs. The last two years using AI have shown me exactly the opposite.

I think it’s all about the structure you use to work in and how you use the model. We are shipping better, more human friendly code, with less bugs, then we ever did before and doing it at 1/10 the cost before LLMs.

But we are definitely not vibe coding, and the key seems to be devs with years of experience managing teams, managing the LLM instead. Basically you create the same kind of formal specifications, conventions, and documentation that you would develop for a project with two or three teams, then use that to keep the project on the rails recursively looping back through the docs as you go along. I’ve only had to back out of a couple of issues over the last year, and even though that cost a couple of hours, it was still extremely cheap.

Meanwhile we are shipping at 4x speed with 1/4 the labor, and the code is better than it was because the “overhead” of writing maintainable, self documented code has inverted into the secret ingredient to shipping bug free code at unprecedented speed.

If you just explain the standards to which you want the code written, use a strict style guide, have a separate process that ensures test coverage (not in the same context) you can get example quality code all the way through. Turns out that’s also in the training data.

njovin · 2026-05-22T15:03:31 1779462211

Many of us recognize that the days of nearly-free tokens is quickly drawing to a close, and at some point humans may very well have to dig their keyboards out of cold storage and return once again to the code mines.

iwiwk · 2026-05-22T15:42:14 1779464534

OAI and Anthropic need to generate cash flows from operations - once they go public that’s it. Any future funding for reinvestment has to come from internal funds beyond existing raising + IPO.

So yeah, it’s imminent. Let’s see how demand shifts in response in the future.

dawnerd · 2026-05-22T16:04:47 1779465887

June 1st for all the folks taking advantage of copilot. It was an astounding deal and a lot of people were “abusing” it.

ahnick · 2026-05-22T15:12:08 1779462728

The reason why you will never get software engineers (in companies) to accept the man behind the curtain is liability. If a human software engineer is still responsible for what happens when the AI developed code has a catastrophic bug or security vulnerability, then the only way for the human to know if there is a problem is to be able to read through the code or run it through some <insert advanced formal verification tool here> that guarantees zero issues.

I think we eventually end up at the tool approach via vendors providing the tools to other companies, but it still feels like there's a long road ahead to get there.

lambda · 2026-05-22T15:02:25 1779462145

> but the AI doesn't need this

That's not true. The LLM performance will degrade as the codebase gets messier as well. You get to a point where every fix breaks something else and you can't really make forward progress.

Yes, you might be able to get a bit further with a messy codebase just because the LLM won't complain and will just grind through fixing things, but eventually it will just start disabling failing tests instead of actually fixing things.

dawnerd · 2026-05-22T16:05:20 1779465920

The token cost to fix might surpass what a human would cost to just do it.

glhaynes · 2026-05-22T15:03:21 1779462201

Sometimes I think the main value in AI-maintained code being “high quality” is when the structure can enforce invariants. If invalid states aren’t representable, then the AI can’t easily add bugs in the future.

Of course that just leads to: what’s the best way to achieve that goal? Through elegant code or adding lots of tests? Which is a debate from long before LLMs existed.

otabdeveloper4 · 2026-05-22T14:58:15 1779461895

> Why was it a maintenance dead end?

LLMs have a limit to how deep they can understand and refactor architectural issues.

That limit is far, far lower than a human's.

jplusequalt · 2026-05-22T15:03:43 1779462223

>This just might be the new reality, and we need to stop looking behind the curtain and accept what the wizard presents us.

This is how societies become shittier. People who are ostensibly responsible for doing their jobs not giving a damn about quality.

znnajdla · 2026-05-22T20:22:34 1779481354

Bingo. AI is NOT making raw skill/talent obsolete, rather it’s actually making it more valuable. Deep technical knowledge now has even more leverage in the real world, not less, because you then have more “surfaces” to apply the use of AI. And this realization is what actually inspired me to build my own homelab datacenter to host my tech SaaS rather than use cloud services like AWS. The value of learning basic networking, devops, and server hardware is now multiplied because that expertise can be applied faster and farther with AI. Before AI I would have to spend several hours or days learning RouterOS, for example, to be able to configure a datacenter-class Mikrotik router. With Claude that became a 20 minute job and I learned a lot about routing configuration in the process — that gives me unique controls over my product offering that I simply wouldn’t have if I “just used the cloud”. In fact I am actually tempted to build my own OS — something I wouldnt have dared to think of before AI.

simonw · 2026-05-22T15:19:31 1779463171

The more time I spend accelerating my work with AI tools the more I realize how incredibly hard the craft of shipping useful software actually is.

Sure, Claude Code and Codex can write (most of) the code for me - but the amount of technical knowledge I need to decide what and how to build remains enormous.

As an example: I'm working on a system right now that works like Claude Artifacts, allowing custom HTML+JS apps to safely run in an iframe sandbox inside a larger application.

Just understanding why that's a useful thing that can be built requires deep knowledge of sandboxing, security threats, browser security models, and half a dozen different platform features that have been evolving over a couple of decades.

A vibe coded without that technical understanding would have zero chance of prompting such a thing into existence, no matter how much guidance the LLMs gave them.

It really saddens me to see some developers talk about literally quitting their careers over AI, right when the benefits of existing deep technical experience have never been more valuable.

miyoji · 2026-05-22T17:50:55 1779472255

> It really saddens me to see some developers talk about literally quitting their careers over AI, right when the benefits of existing deep technical experience have never been more valuable.

1. Because the experience of interacting with AI is miserable. I like writing code. I don't like finding the magic incantation that gets the machine to write the correct code. I don't like correcting the machine when it gets things wrong. I don't like any of this, it's awful and I would never have gone into this field if someone had told me that it would be like this one day.

2. I cannot condone the means by which these tools were created, which is, as far as I am concerned, theft. I think it's unethical to use them at all, because they were created unethically. I dislike using stolen work, I think it's wrong, and I think everyone who uses it is making the world worse and normalizing theft. If continuing in my career means that I have to compromise my ethics, I wouldn't do it even if I loved this stuff, and see point (1).

3. Is anyone going to pay me more for my "more valuable" skills? Doesn't seem like it, engineering salaries on the whole are going down right now. You can believe they'll go up eventually if you like, but there's no evidence that will happen, or that it's happening. If my employer captures all the value, why should I care whether I'm creating more of it?

echelon · 2026-05-22T18:48:37 1779475717

> Because the experience of interacting with AI is miserable. I like writing code.

I'm your exact opposite.

I've felt like code is 1960's punch card tech my entire career. I've always wanted to do more.

So much of coding is plumbing. Or paying attention to tiny little details. Or hunting down stupid bugs. Or changing requirements and refactoring. That shit sucks. All of it.

I've never had so much fun with software. It's starting to feel like magic. And because we possess deep understanding, we are uniquely positioned to take advantage of this.

The AST is not the objective. The finished product is. Our DNA is by all accounts filled with garbage. Let your feelings about code purity and sanctity go. It's the job to be done that matters.

Code is not holy. In 100 years people will look at our ephemeral artifacts as silly little things. Treat it that way today. Means to an end.

customguy · 2026-05-22T20:26:01 1779481561

"the sand doesn't matter, only the beach does"? Makes no sense.

Perfection is achieved when there is nothing left to take away.

> In 100 years people will look at our ephemeral artifacts as silly little things

Whereas they'll totally admire the hamster wheels in which people shoveled product? Well, I don't care either way. Craftsmanship and care have their own rewards, and shape the person engaging in them for the better.

keybored · 2026-05-22T15:52:29 1779465149

I can’t just quit the “career” that I’ve spent years building (for what else?). I’ll just fade somewhat gradually into unemployment, I imagine.

JumpCrisscross · 2026-05-22T16:06:29 1779465989

> can’t just quit the “career” that I’ve spent years building

One, I think the talk about AI replacing developers is tripe. We’re still correcting the post-Covid hiring binge.

Two, even if that level is breached, I’d consider your skillset more broadly than what you can literally do right now. Organizing people and technical systems is hard. And the article highlights how that doesn’t seem to be something AI is focused on improving on right now. (Would take larger context windows. Which would make inference more expensive.)

simonw · 2026-05-22T15:58:21 1779465501

I firmly believe that your existing skills and experience are more valuable in a world where the AI tools can speed up the bit where you type the code.

NiloCK · 2026-05-22T18:47:47 1779475667

It's great that you believe this, but are you hiring?

I don't intend this to read as pure snark, but someone's abstract value isn't much good to them if the job market itself can't / won't recognize it.

reconnecting · 2026-05-22T13:58:51 1779458331

> I think AI tools are more like Iron Man's suit.

There's an interesting repository with 63600 stars on GitHub (1). The developer of the repository is No 1 at the GitHub's trending contributors list (2). However, it seems like the application isn't what it's described to be (3), and the developers, on their end, are unable to clearly answer whether this is real or not, as it's just messy LLM output.

Proof that the suit alone doesn't make anyone Iron Man.

1. https://github.com/ruvnet/RuView

2. https://github.com/trending/developers?since=weekly

3. https://github.com/deletexiumu/wifi-densepose

tambourine_man · 2026-05-22T14:47:35 1779461255

> After a thorough independent code audit with cross-verification from three AI systems (Claude, Codex/GPT-5.2, Gemini), we confirm that this project is a non-functional facade.

So, a nonfunctional project is created by AI and AI is used to attests its nonfunctionality.

What a brave new world.

reconnecting · 2026-05-22T15:01:26 1779462086

It seems like a perfect ouroboros illustration for the current world.

AI creates a delusional product, people don't trust their own opinion regarding it and follow it, another AI is needed to prove that the product is unreal.

In the loop.

keeda · 2026-05-22T18:57:13 1779476233

> Proof that the suit alone doesn't make anyone Iron Man.

I do believe that is a running theme in the Iron Man comics and movies.

pelasaco · 2026-05-22T14:14:08 1779459248

the whole thing is creepy. The ruvnet, has multiple projects.. its just AI. A lot of AI. It floods GH infra.. Kind of easy to understand why GH struggles.

reconnecting · 2026-05-22T14:18:38 1779459518

On the other hand, there are 8,400 forks, and it looks very real, so developers seem to have confidence in it.

krapht · 2026-05-22T18:09:41 1779473381

I went and looked at the code. It's AI bots and a few confused trend followers all the way down. There's no way anything in there works.

chrisweekly · 2026-05-22T14:40:43 1779460843

The forks are as meaningful as the stars.

irchans · 2026-05-22T17:54:59 1779472499

I also "had an Iron Man moment last week" as a mathematician. I've been doing joint math research with two friends (professors) on a project for several years. Last week, I decided to explore part of our research using chat GPT. 1) I would have a thought. 2) Present it to GPT. 3) Ask GPT to write theorems that were easy to prove and put those proofs into LaTeX. (I always have to check the proofs carefully.) 4) Then I would ask it to generate code (I mostly use Mathematica the language.) 5) Running the code would help verify the proofs. It would also inspire more thoughts and I would go back to step 2). This worked very well, but at one point I could not bound a particular expression and I was not understanding it well, so I got out the pencil and paper and redid the derivation myself which helped a lot. This whole process worked about 10 times faster than doing it without GPT. At the end of a couple of hours I had around 20 pages of correct proofs and all the code needed to do numerical simulations related to the proofs.

wccrawford · 2026-05-22T19:03:03 1779476583

I think that it's not a multiplier on skills.

It's a reducer of time.

For less experienced developers, it's an immediately reduction at the start of a project. But then they will almost certainly have problems later when their initial decisions come back to haunt them.

For senior devs, it's like having a junior or mid-level dev that will instantly do things within their capability, so long as it's explained to them well enough. This junior dev will do things fairly smartly, but any important decisions left to them will be wholly or subtly wrong. And the subtle ones are the worst ones, because they're so hard to detect.

But if that senior dev sets the guidelines well enough, and notices the problems, development is so, so, so much faster. It's wild.

xnx · 2026-05-22T14:04:23 1779458663

An "elephant in the room" is a big topic that no one is talking about. Everyone is talking about AI.

Better headline: "Why AI Multiplies Developer Skills Rather Than Replacing Them"

joshwcomeau · 2026-05-22T20:40:16 1779482416

For context, this was a newsletter issue sent as part of a marketing campaign for my latest course. It was an elephant in the room in the sense that up until that point, my marketing campaign hadn't talked about AI.

This is the "view on web" link designed for people to read my emails if they don't display correctly in their email client . It’s not really intended for a broader audience.

bluefirebrand · 2026-05-22T16:24:23 1779467063

The outcome is the same, though

Fewer developers required to achieve the same things means a lot of people are going to be unemployed

It also means that the people who remain will likely be paid less. Why would you pay a senior salary when you could pay a junior salary plus AI subscription and get "the same result"?

I think Software Devs are in for a rough time. I've been doing this for 15 years now, and I'm not looking forward to it. I'm honestly thinking about re-skilling to a different industry. Even if it pays less, it's probably worth it to sit out this shitshow.

joshwcomeau · 2026-05-22T20:47:01 1779482821

> Fewer developers required to achieve the same things means a lot of people are going to be unemployed

I don’t think that's a foregone conclusion. Every company I’ve worked for has had a huge list of tasks we'd do if we had more engineering resources. There's never been a shortage of worthwhile things we could do, it's always been ruthless prioritization to find the 10% of tasks that are the most important.

Look up Jevons' Paradox. This is a thing that has happened a bunch of times before.

nyantaro1 · 2026-05-22T16:32:12 1779467532

I agree with most of your take, but I don't really think those left are going to be paid less. I am not one of them, but I don't see why they would be paid less.

bluefirebrand · 2026-05-22T16:37:36 1779467856

Simple economics. There are fewer software developers required to achieve the same goals, so there are fewer jobs for software developers. That means there's an oversupply of software development labor. That means salaries for software developers will go down

It's the same thing that happened to every other skilled profession that was automated in the past. That's why unions became a thing and they started busting heads until their employers paid them more.

Edit: the only way I can see software developer salaries staying the same is if the amount of work available for them expands dramatically.

Hypothetically if half of software developers are laid off and replaced by AI, there will need to be twice as many software development jobs in order for salaries to not go down

Also keep in mind that even if salaries remain flat, inflation means you're making less.

snide · 2026-05-22T14:01:47 1779458507

I mostly share Josh's opinion, but I think a lot of these posts that talk about Senior vs. Junior experience when working with AIs is kind of rubbish. Sure, you get better results as a Senior working with AI tooling and struggle more as a Junior. Nothing has changed in that equation except the amplification.

What folks seem to avoid is that a Junior (in ANY subject) has the ability to LEARN so much faster with an AI research assistant, and that becoming an expert has accelerated for those with the personal stamina to dig deep (this as a requirement hasn't changed). I spend just as much time with my AI tooling asking questions as I do asking it to "build" or "fix" things. "How does this work?". "Can you suggest other tools?".

I think some people always think about AI as an input / output relationship, when a lot of the time, the fiddling in between, with or without AI was always the important part. Yes people will suck in the beginning, against they always did. I think the good folks though will suck for a MUCH shorter time than I did getting into things.

A lot of people will drop out and get discouraged. That happened before too. Learning things requires persistence. I think the only real case to be made is that AI's sense of immediate pleasure can neuter people away from running into friction. AI natives likely won't understand friction and question it.

djeastm · 2026-05-22T17:24:24 1779470664

>I think the only real case to be made is that AI's sense of immediate pleasure can neuter people away from running into friction. AI natives likely won't understand friction and question it.

This is key, I think, and gets overshadowed by people being offended by seeing bad vibecode or claims of 10x speeds, etc.

The most important learning that happens is not when we ask and get the answer to our question right away. It's when we stretch ourselves to seek out the answer, fail a few times, think deeply, then perhaps after a nap, solve the problem. That kind of knowledge is priceless because it not only gets you an answer it gets you some errant paths you can use to avoid problems in future problem solving as well as getting you increased trust in your own thinking.

If the next generations skip this step, they'll always think answers are supposed to be easy to find and will find themselves more and more dependent on AI and less and less confident in their own brains.

bluefirebrand · 2026-05-22T17:58:04 1779472684

> If the next generations skip this step, they'll always think answers are supposed to be easy to find and will find themselves more and more dependent on AI and less and less confident in their own brains

This seems like a very polite way of saying they will become less intelligent and less capable

lelanthran · 2026-05-22T17:11:32 1779469892

> What folks seem to avoid is that a Junior (in ANY subject) has the ability to LEARN so much faster with an AI research assistant,

You don't learn by reading, you learn by doing.

In this case, simply reading the output of an LLM isn't going to substantially educate you.

ok_dad · 2026-05-22T19:43:23 1779479003

You’re not as senior as you think if you think reading code isn’t worth it. Do you think novelists just write novels from nothing? They read books. Software developers need to read software, too. When was the last time you read the code for the best open source software in your industry? I routinely read the libraries I use.

JumpCrisscross · 2026-05-22T14:17:20 1779459440

> a Junior (in ANY subject) has the ability to LEARN so much faster with an AI research assistant

I’m not seeing this. And based on what we’re seeing at the university level, I’m not expecting to.

keeda · 2026-05-22T19:17:05 1779477425

I think the key word is ability, and I fully agree with that. Using GenAI as a teaching aid can supercharge learning, especially as it makes it very easy to learn by doing. The problem is that people use GenAI to do and hence don't learn.

(The preliminary research so far supports this: using AI to do the hard assignments produces poor learning outcomes, but using AI as a tutor, or even just for help with the hard assignments, produces slightly better learning outcomes.)

I think what you're seeing is the effect of the incentives of the system. The system uses simplistic numbers like grades as proxies for actual learning, and these grades heavily influence students' job prospects, and so you're simply seeing Goodhart's Law in action. Given how easy current methods of skill assessment are to game with AI, my guess is the entire system has to be overhauled.

JumpCrisscross · 2026-05-22T19:23:20 1779477800

> using AI as a tutor, or even just for help with the hard assignments, produces slightly better learning outcomes

Source? The few people I’ve seen try to do this wind up with a terrible understanding of the material, with large knowledge gaps and one or two fundamental fuckups. In every case, an introductory textbook would have been better. (It would also have been harder.)

sonofhans · 2026-05-22T14:25:35 1779459935

Yes, I agree, the skills are orthogonal. Digital typesetting is vastly quicker than manually putting down metal type, and since you’re exposed to more type you have the opportunity to learn faster. But getting good at typography with digital tools will help you very little if you need to lay out type manually.

JumpCrisscross · 2026-05-22T14:31:07 1779460267

> getting good at typography with digital tools will help you very little if you need to lay out type manually

The analogy is unlimited typing in Gmail won’t make you a better writer or typesetter on its own.

vanuatu · 2026-05-22T15:35:06 1779464106

I wonder how much of this is due to poor incentives at the university level?

I've seen this work well at a job when there's a feedback loop for juniors that incentivized them to learn with more scope and compensation

judahmeek · 2026-05-22T19:14:34 1779477274

How did that business evaluate that the juniors were actually mastering concepts they had not known before?

xxs · 2026-05-22T15:01:04 1779462064

> has the ability to LEARN so much faster with an AI research assistant, and that becoming an expert has accelerated for those with the personal stamina to dig deep (this as a requirement hasn't changed)

If anything it allows to be as lazy as possible. I have not seen anyone digging deeper with the AI tools.

dmoose · 2026-05-22T19:48:10 1779479290

I have been having a blast going back through topics I learned in college and haven't used in years. Being able to rubber duck specific questions and follow a path based on what I remember vs don't is much faster with LLM than it would be with textbook. However, I'm doing this because it is personally fun. I'm guessing if presented with a task I wasn't interested in the LLM would create exactly the opposite outcome. Thankfully I'm at a point in my career where I don't have a lot of stuff forced on me externally so this hasn't come up, but I can picture teenage me taking a much lazier path with a much different end result.

qweiopqweiop · 2026-05-22T18:09:42 1779473382

If you decide to dig deeper, it's an incredible tool. Getting a summary of the internals of something you only use as an API, then getting it to test you on it until you understand. It really allows you to learn a lot.

runarberg · 2026-05-22T14:28:44 1779460124

> a Junior (in ANY subject) has the ability to LEARN so much faster with an AI research assistant

This is a testable hypotheses with severe lack of citations. Intuition would argue the opposite. We learn by using our brains, if we offload the thinking to a machine and copy their output we don‘t learn. A child does not learn multiplication by using a calculator, and a language learner will not learn a new language by machine translating every sentence. In both cases all they’ve learnt is using a tool to do what they skipped learning.

simonw · 2026-05-22T16:23:44 1779467024

This seems to me like one of those things where people go into it with widely different initial assumptions.

1. AI is for cheating and doing the work for you. Obviously it won't help you learn faster because you won't have to do any thinking at all.

2. AI is an always-available question answering machine. It's like having a teaching assistant who you can ask about anything at any time. This means you can greatly accelerate the process of learning new things.

I'm in team 2, but given how many people are in team 1 (and may not even acknowledge team 2 as even being a possibility) I suspect there may be some core values or different-types-of-people factors at play here.

runarberg · 2026-05-22T16:48:37 1779468517

This is also a testable hypothesis. I would like to see usage statistics before making assumptions here but my gut feeling is that an overwhelming AI usage (like > 90%) would fall into your category 1.

But even with category 2. I think that still does not absolve AI as a cheating machine. Doing research is a skill and if you ask AI to do the research for you that is a skill a junior developer simply never learns.

simonw · 2026-05-22T17:18:42 1779470322

This is interesting and relevant: https://www.sciencedirect.com/science/article/pii/S095947522...

"The expertise reversal effect is present when instructional assistance leads to increased learning gains in novices, but decreased learning gains in experts."

There's a whole lot of depth to the question of how AI tools support or atrophy learning for different levels of expertise.

keeda · 2026-05-22T19:32:01 1779478321

Actually, you're both right. Using AI as a supplementary learning aid -- i.e. students use AI as a personalized tutor but still do the assignments themselves -- produces better outcomes. But using AI as a crutch -- i.e. using it to do the assignments -- produces worse outcomes.

There is even preliminary research evidence for this, e.g. https://www.mdpi.com/2076-3417/14/10/4115 and https://www.sciencedirect.com/science/article/pii/S2666920X2...

runarberg · 2026-05-22T20:38:17 1779482297

> students use AI as a personalized tutor but still do the assignments themselves.

So your first study actually concludes the opposite. It concluded that all AI users performed worse, but the effect was smaller for students which used AI as a tutor.

The second meta analysis I don‘t quite understand. I understand they conclude that using AI tutor shows significant improvement, but I don‘t understand the methodology. I may be misunderstanding but it seems to simply count papers which shows positive outcomes and reaches conclusion that way. I think that methodology is deeply flawed as it will amplify whichever biases are present in the studies it uses. I also think the lack of control groups is a major issues. If we are comparing AI tutor to nothing, off course the AI tutor is gonna perform better. We need to compare to traditional methods. And this is especially relevant in our discussion because junior developers usually have excellent access to senior developers (via peer review, pair programing, etc.), much better then student’s access to tutors for that matter.

So out of the meta-analysis I picked the paper with the strongest claim (trying to steel-man it) which is this one: https://online-journal.unja.ac.id/JIITUJ/article/view/34809/...

It claims the following in the abstract:

> The results indicated that students employing AI tutors shown significant improvements in problem-solving and personalized learning compared to the control group.

Now when I look at the control group it claims this (also in the abstract):

> Participants were allocated to a control group receiving conventional training and an experimental group utilizing AI technology,

But when I look into the methodology section I see this:

> The researchers classified the patients into two groups: MathGPT and Flexi 2.0

MathGPT and Flexi 2.0 are both AI tutors. Now I am confused, where is the control group and how was this “conventional training conducted”?

The methodology section actually tells a different story from the abstract:

> This research utilized a quantitative methodology via a quasi-experimental design.

By quasi-experimental design they mean that they tested the same students before and after AI intervention. And concluded that the AI tutor helped them improve. Now this is not what control group means, so the researchers are actually lying by omission in the abstract. This is a spectacularly bad experimental design and I wonder how it would pass peer review, so I look at the publisher Jurnal Ilmiah Ilmu Terapan Universitas Jambi. So not exactly a reputable journal.

I still stand by my no evidence for a testable hypotheses. I suspect that your first link is actually correct in that AI is bad for students and just less bad if it is used as a tutor.

jononor · 2026-05-22T15:07:43 1779462463

As a precondition I think we have to assume that the person in question 1) wants to learn and 2) is smart enough to absorb new info and apply it and 3) reflects enough to adjust their approach when hitting bottlenecks or making mistakes 4) has a drive to create. Without these, self driven learning is not viable - and that has very little to do with AI.

For such a person, I believe AI can be very empowering for learning. Like Google, wikipedia and stack overflow, Arxiv before it - AI tools give access to a lot of information. It allows to quickly dig deep into any topic you can imagine. And yes, the quality is variable - so one needs to find ways to filter and synthesize from imperfect info. But that was also the case before. Furthermore AI tools can be used to find holes in arguments or a paper. And by coding one can use it to test out things in practice. These are also powerful (albeit imperfect) learning tools. But they will not apply themselves.

runarberg · 2026-05-22T15:21:11 1779463271

Who is talking about self driven learning? Every workplace teachers their juniors how to do their job, and how to become better at their jobs.

And as we are talking about junior developers it is safe to assume your conditions (1), (2), and (4) are all true, if any of them are false, then why did that person apply for and get a job as a junior developer? As for condition (3), all workplaces eventually hires a person who does not fulfill this, then they either fire that person, or they give them a talk and the developer grows out of it and changes their behavior to fulfill that condition.

Aside: you listed 4 conditions for learning. I am not sure these are actually conditions recognized as such by behavior science. In fact, I doubt they are and that these conditions are just your opinions (man).

renticulous · 2026-05-22T14:54:14 1779461654

There are other axes as well.

Companies with AI will move faster than those without.

AI itself could subsume what we collectively consider as Engineering Taste.

AI is faster at what it does. So even if a junior costs less on his own than AI. Paying extra for AI means gaining first mover advantage.

judahmeek · 2026-05-22T19:18:00 1779477480

> AI itself could subsume what we collectively consider as Engineering Taste.

Only if AI feeds on more taste than garbage.

bluefirebrand · 2026-05-22T16:30:04 1779467404

> that becoming an expert has accelerated for those with the personal stamina to dig deep (this as a requirement hasn't changed)

This is a contradictory statement imo.

Digging deep still takes the same amount of time it used to. AI accelerates the surface level (badly, tbh), it doesn't accelerate digging deep. Becoming an expert still takes time and effort, there really aren't shortcuts.

To torture the Iron Man metaphor a bit. If you're not an expert without the AI, then you're not an expert with it.

justinhj · 2026-05-22T15:08:52 1779462532

Smart, motivated juniors have incredible tools to amplify their learning and capabilities.

samdjstephens · 2026-05-22T14:31:18 1779460278

Many or even most software engineers are experts in their own codebases though, which means a large proportion of engineers are getting high value out of AI.

What’s not clear to me is: if writing more code per engineer is possible, does that result in fewer engineers or just more software, especially in areas that traditionally got squeezed: UX, testing, DevEx, documentation, etc. Perhaps the bar just gets raised?

techblueberry · 2026-05-22T15:52:21 1779465141

I had this observation talking to Claude one time. I forget the context exactly but I said something like:

Me: Isn’t it crazy that X is better than Y.

Claude: what an insightful critique, Y is better than X because of x, y, and z reasons.

- And this answer from Claude was good. Thoughtful, well-reasoned. But it was opposite the point I wanted to make so I said :

Me: “oh, you heard me say Y is better than X, I actually was being counter/intuitive and said X is better than Y”

And Claude responded:

Claude: oh you’re absolutely right, X is better than Y for the following reasons (and Claude again provided a well reasoned response here)

And this is sort of that dumb smart genius meme.

“It’s just autocomplete” “No it’s way more than that it has a model in its mind” “It’s just autocomplete”

I liken it to the library of babel. All the genius in the world, but only if you have the right index keys.

zephen · 2026-05-22T18:06:15 1779473175

It is a well-known trope that LLMs are prediction engines.

But, to get the best out of them, you really have to consider what that means in the small. They are predicting, to a first order of approximation, what you want or expect them to answer.

Its response to your first prompt is hilarious, because the LLM completely misunderstood you and based its prediction on what it thought you wrote. Its response to your second prompt further cements that its goal is to predict what you want or expect to see.

It's also well known that LLMs are prone to hallucinations. One of the biggest triggers for hallucinations is when the LLM's interpretation of your expectations doesn't match reality.

Because the LLM will try to make reality match what it perceives to be your expectations.

One of the best ways to reduce hallucinations is to work hard to remove any assertions from your prompts.

For example "Isn’t it crazy that X is better than Y." contains an explicit assertion. The LLM misunderstood the direction of the assertion, but certainly understood that an assertion was there, and so it gave you reasons why reality matched its understanding of your assertion.

When you clarified the assertion, it switched, and again gave you reasons why reality matches its understanding of your assertion.

Lawyers often get into trouble for made-up citations. "Claude, find me case law that shows X" is a recipe for disaster. Instead "Claude, what is the case law on X?" is probably a better starting point.

techblueberry · 2026-05-22T18:55:02 1779476102

Yeah I think this concerns me in two fronts and this specific tendency may be the line in the sand separating LLMs from AGI.

On the getting work done front, if what you’re trying to do is remotely subjective, than you really need to be sure you’re asking it the right thing and not expecting it to correct you and provide capital T truth.

On the social element. Like wow using it as a therapist or a mentor or to bounce ideas off of. What a huge trap if you expect it to correct you with a semblence of objective reality.

zephen · 2026-05-22T19:12:31 1779477151

> What a huge trap if you expect it to correct you with a semblence of objective reality.

Yes, as a few cases have shown, people can go off the deep end, and the LLM goes right there along with them.

The LLM has no understanding of objective reality. It's even worse than any one of the blind men trying to describe an elephant, because it has no true experience of either the thing, or the other thing it is trying to compare it to.

voidUpdate · 2026-05-22T13:56:13 1779458173

> "I think AI tools are more like Iron Man’s suit. It can do incredible things, but not on its own."

Someone needs to watch iron man 3...

acbart · 2026-05-22T13:56:52 1779458212

Or "Age of Ultron".

wg0 · 2026-05-22T17:35:52 1779471352

I'm a severe AI critic. Bitterly so. But this much I would agree that AI has a multiplying effect for the expert who can still do the job by hand albeit slower no matter how much slower but a good job nevertheless.

And still worth repeating that AI has net negative gain in team settings but is a booster for lone wolves like me.

travisgriggs · 2026-05-22T15:14:18 1779462858

What is unclear to me is how less skilled people gain useful experience, when using these amplifying tools. I’ve been at this for 35 years; I like to think that sometimes i get some pretty amazing results.

I work with two pretty green developers. The rate that they can make a mess is now phenomenal. And the sense of confidence the tools give them with early successes, means any experience I might have to offer means less now. Which is ok, I’m not going to be that “my experience has to be useful to you so I still fell relevant” old guy. But I do find myself curious how “lessons are learned” that lead to greater and greater tool exploitation in this brave new world.

rglover · 2026-05-22T18:13:27 1779473607

> But I do find myself curious how “lessons are learned” that lead to greater and greater tool exploitation in this brave new world.

(I think I'm reading this the right way but if not feel free to correct).

In a word: pain.

Until there's a legitimate threat to their well-being (emotional, psychological, or financial), the lessons won't be truly "learned." Until you know the true cost of a decision, you're flying high.

Older engineers have dealt with this organically so it's kind of encoded into their DNA. The very reason certain things aren't done (or a certain way) is because that pain has already been felt/encouraged learning a better way.

sarreph · 2026-05-22T14:07:52 1779458872

I agree with the author that -- right now -- we're still in the part of the AI adoption / product development curve that it's an extreme force multiplier.

I like to think of it as a normal distribution, the further away a programmer is to the right of the mean, the more their benefit. It's almost like it's their standard deviation squared (σ²). So someone like Matt Perry (as OP mentioned), who is a >99.99% programmer for argument's sake and is therefore four standard deviations away from the mean... Matt gets a (4×4) 16x multiplying effect on their productivity.

Someone who is a slightly above average programmer might see a 2 or 3x boost on their productivity, which is huge(!) and might also make them fear for their job. Which tracks with the level of moral panic we are seeing and experiencing. This math kinda still holds up for "bad programmers" too (i.e. left of the mean), as in they still see a boost to their productivity (negative squared is a positive number)... but there's something iffy about their results. The technical debt is unmaintainable and because they don't _understand_ the systems that they're operating in, they end up in the "3 hour" prompt loops that the OP refers to.

> Similarly, if Matt Perry handed me the keys to the Motion repository and told me to take over, I wouldn’t have the same results even though I have access to the same set of LLM tools.

The question is -- how long is this multiplier going to exist for? Some people would wager "for the foreseeable long-term future"; some people think it will widen further; and some people think it will diminish or god forbid even collapse. It feels like most arguments at the moment (like this article's) are that the humans who "know what they are doing" will be able to baton the hatches and avoid being usurped by ever-capable models. I saw it in a café yesterday: someone was using a coding agent to build a marketing website for their project, getting more and more frustrated by not getting the outcome they wanted. Their friend typed a couple of sentences on their keyboard and got a "Dude! How did you do that? That was sick!" a minute or so later. "I used to build websites" the friend said. -- The friend 'knew what they were doing'.

How much longer is knowing what you're doing going to be a moat?

disgruntledphd2 · 2026-05-22T14:26:51 1779460011

> How much longer is knowing what you're doing going to be a moat?

For a looooonnnnngggg time, unless there's massive progress in AI research.

Fundamentally, next token prediction is limited. Granted, I'm pretty amazed at how well it's done, but if you can't activate the right parts of the models (with your prompts), then you're not going to get good results.

And to be fair, for lots of things this doesn't matter. Steve in Finance or Mindy in Marketing can create dashboards that actually help them, and the code quality mostly doesn't matter.

For stuff that needs to be shipped, monitored and maintained you still need to know what you're doing.

iwiwk · 2026-05-22T15:25:08 1779463508

We also need to consider the price. At some point the price will need to go up (assuming cost of producing each token doesn’t drop dramatically) to generate enough revenues to cover not only operating expenses and taxes (once the nol carry forward’s are used up) but reinvestment. OAI and Anthropic are burning through their cash balances. OAI has also stated some very ambitious plans to develop models beyond just programming… I will be very intrigued to see how they are going to generate enough revenue to fund all this in the future.

DarkTree · 2026-05-22T14:29:20 1779460160

> How much longer is knowing what you're doing going to be a moat?

To me, I don't see how this will ever not be an advantage. All software requires constraints. Some of those constraints might be objective (scale, performance, etc.) but a lot of them are subjective and require active decision making (architecture, UI, readability).

So if there was only one way to do something or only one desired output, then yes I think models would surpass humans. But like art, I don't think there is a objective truth to software and because of that, humans get the opportunity to play an important role.

Now whether that is valued from a business/industry perspective is a question that I think we all know the answer to unfortunately.

r_lee · 2026-05-22T18:22:17 1779474137

> Now whether that is valued from a business/industry perspective is a question that I think we all know the answer to unfortunately.

sounds like "no moat" to me

idopmstuff · 2026-05-22T14:27:23 1779460043

100% agree with this. I think takes like OP's would be much more interesting if they staked out a position in the future. I think it's pretty uncontroversial to say that someone with a great deal of technical expertise is going to be a hugely more effective LLM user today.

The question that really matters is whether that will continue to be the case. My guess is that technical expertise matters less over time, and the ability to specify the desired outcome is eventually the only thing that becomes important. But I could be wrong! The direction this all goes is pretty fuzzy in my mind.

chasd00 · 2026-05-22T16:45:03 1779468303

> My guess is that technical expertise matters less over time, and the ability to specify the desired outcome is eventually the only thing that becomes important

if you look at LLMs based coding as another step up in programming abstraction then it's clear this is the case. Think about the progression of programming languages. Over time, we go further and further from the hardware and closer and closer to specifying the desired outcome. The terminology, structure, and completeness of a user story that guides a codingagent to the desired output, and only the desired output, is the new programming language.

lelanthran · 2026-05-22T17:15:26 1779470126

> if you look at LLMs based coding as another step up in programming abstraction then it's clear this is the case. Think about the progression of programming languages. Over time, we go further and further from the hardware and closer and closer to specifying the desired outcome. The terminology, structure, and completeness of a user story that guides a codingagent to the desired output, and only the desired output, is the new programming language.

But that entire narrative follows from one, single, very big "If". It is not a given that AIs are a step up in abstraction.

Like, copying the answers in a test isn't considered an abstraction, I don't consider copy-pasting AI into your codebase an abstraction.

chasd00 · 2026-05-22T19:01:49 1779476509

in the case of tools like claudecode there's no copy/pasting. Claudecode updates files directly, runs tests, starts/stops server, everything else on its own (with your permission).

I guess to take it a step further, you can lay your requirements in order with guidance in a markdown file called 'myprogram.md'. Then tell ClaudeCode to read that file and do what it says. In that way, myprogram.md, actually your requirements doc, is the programming language being turned into the 1s and 0s the computer understands.

hypfer · 2026-05-22T17:08:02 1779469682

Cafés are no proper workspace though.

They're full of noise and distractions. They offer no ergonomics, no proper screens, no nothing.

Anything that happens or doesn't happen there is mostly irrelevant to relevant software at large.

furyofantares · 2026-05-22T14:38:31 1779460711

> So, on the one hand, I’m seeing the most talented developers I know amplify what they can do with AI, and on the other, I’m seeing people with less domain knowledge struggle to get past the “MVP” stage.

Those are people who weren't making it to the MVP stage before LLMs.

There is no doubt that highly technical people are getting A LOT more out of LLMs than people without dev experience, in an absolute sense. I think it's less clear in a relative sense.

A question I also ask myself a lot: What are the skills I'm leveraging, exactly, as a highly experienced developer that's now doing a lot of vibe coding?

1) I'm choosing good technology for the task, and thinking about what LLM-agents are good at and choosing technology that they can work well with.

2) I'm choosing good workflows for the LLM-agent, starting a new context at the right time, having it test things, making sure it has logging that it can inspect, making sure it can operate the application in a way that it can debug and inspect it.

3) I'm thinking about the code even though I'm not looking at it, I'm telling it how I want things implemented, I'm telling it how to debug things.

I think these are all hard things for non-developers to do, but I also think non-developers will be able to replicate a large chunk of #1 and #2 relatively quickly. I only have to figure out that it's valuable to tell the LLM-agent to use playwright when working on web page visuals once, and then I can tell you to do that too. Or the coding agents will come with that knowledge built-in (to the model or as a builtin skill or whatever). Knowledge around this will accumulate and become easier for non-developers to access, and in many cases be builtin to the models or harnesses.

block_dagger · 2026-05-22T19:39:16 1779478756

> You could give me Jimi Hendrix’s exact guitar but it would sound very different if I tried to play it!

Guitars do not think. AI does. The analogies that try to paint AI as "just another inanimate tool" are way off base, and so is the conclusions of this article.

tj-teej · 2026-05-22T19:42:24 1779478944

I think the better analogy is "you could give me an afternoon in the studio with Jimi Hendrix's sound engineer and the record we create would sound very different from Jimi's albums."

sandeepkd · 2026-05-22T16:26:21 1779467181

Its an interesting topic and it will take a while to really know how things pan out. One thing is for sure, it allows some one to ramp up in an unknown knowledge based territory fairly quickly(physical work still requires craftsmanship).

It can allow a skilled engineer to have multiplied effect of repeating their skills HOWEVER it would take away their ability to question, think and improve themselves. The syntax highlighting by editors is a good example, most engineers cant work without it, however its a static skill which does not needs constant improvement so its an acceptable support risk.

rspoerri · 2026-05-22T15:24:52 1779463492

I am working on a project since 10 months that is solely written by ai. The second iteration of the code is even coded in a language that i can really write in (rust), it uses advanced and very complex structures (crdt, plugins, parallel threaded webviews). None of the advanced features were an idea of the llm's, but a vision i had in my mind or requirements that came up when encountered with problems.

The software is a tool specifically designed around my requirements of managing lectures that need to be prepared, managed, have presentations, grading etc. I wanted one big space where i can quickly access all related data in a workspace, fold and unfold important aspects while also editing and moving contents across multiple days/lectures.

The first version is a vscode plugin, which i now use since about 4 months without or with minor modifications to manage my lectures and private data. The second version is a standalone application which improves on the ideas of the first version and goes a few steps further.

AI can make you something that looks like its running quickly. But when you try to finish it takes way longer then you'd think. You need to specify every little detail. You need to make its KISS and DRY etc. You let it analyze the application structure and simplify and cleanup nearly the same amount of times as you add features. While fixing bugs you might need to run the same thing multiple times and revert any unrequired changes. You need to think about good level of debug logs and ways that the program can help you find errors and report them quickly.

I hope my project will be ready in about 2 to 3 months. The current version is according to a quick analysis over 850 files with 250'000 lines of code.

I spent about 2000$ on ai subscriptions in that time. 200$ claude for a while down to 100$ a month now. 20$ to openai which is very important for architecture and reviews. 20$ on tests with other ai's, but i rarely use them in the works. I also spent 1500$ on 2 * 3090's to hopefully have a local ai agent in the future.

I spend about 2 to 4 hours each day (including weekends) to check that app and write prompts.

I would never have been able to create such a large and complex project next to my other tasks and i am very confident that the final product will be good enough for productive work.

giancarlostoro · 2026-05-22T15:27:27 1779463647

> None of the advanced features were an idea of the llm's, but a vision i had in my mind or requirements that came up when encountered with problems.

This is the correct way to code with AI. If you don't understand the code, we're not yet in a point where the model can do it all, well it can, but where you can confidently move forward knowing its been thoroughly built and reviewed by a model up to par. Some day maybe, but not currently.

x187463 · 2026-05-22T14:16:19 1779459379

> Without guidance, LLMs tend to paint themselves into a corner, because they’re generating code to solve individual prompts, not thinking holistically about an application’s architecture.

I've found I can prevent the LLM, in many cases, from thrashing on a bug/feature for long periods of time by switching into plan mode and, even in the middle of a conversation, having it reassess the structure around the problem, first. If you keep prompting about the same bug, it may keep producing variations of the problem code. But forcing it to stop and 'think' for a bit, has yielded much better results.

bluegatty · 2026-05-22T15:01:54 1779462114

"Without guidance, LLMs tend to paint themselves into a corner, because they’re generating code to solve individual prompts, not thinking holistically about an application’s architecture." user error, mostly.

But the general argument of 'we will need skilled operators' still holds.

For every 'junior' displaced by AI, there will be some other kind of relevant role they're needed for.

Agentic workflows, integration, all the data science stuff, new UX paradigms.

I don't think the job numbers will dwindle, just shift.

muldvarp · 2026-05-22T14:13:55 1779459235

The fact that AI currently requires some human supervision to produce valuable results is not a good predictor that it will stay this way sadly. LLMs were basically unable to reason two years ago. They are now better at many reasoning tasks than most people. If there is even a remote chance that LLMs will make your job obsolete I would pivot as fast as I could. This includes first and foremost software engineering.

mohsen1 · 2026-05-22T14:39:03 1779460743

I agree with you. A lot of "AI code is not clean" is hopeful thinking. In two years it might be able to design and architect better than most humans too.

geraneum · 2026-05-22T14:20:23 1779459623

The people you see in the TV are not actually in the TV box. It looks real until you try to shake one’s hand. It’s kind of the same thing with AI (reasoning and whatnot).

muldvarp · 2026-05-22T14:21:48 1779459708

I don't think it matters if the reasoning is philosophically "real" if it can solve real problems.

geraneum · 2026-05-22T14:35:28 1779460528

If you read my analogy in the context of the article, it should be clearer what I meant.

muldvarp · 2026-05-22T14:40:36 1779460836

I think it would be even more clear if you just write what you mean.

worldsayshi · 2026-05-22T14:02:58 1779458578

I see two points:

1. AIs aren't yet good at architecture.

2. AIs aren't yet good at imagining technically exciting stuff to build.

And I agree that there's still space there to build a career in the short to medium term (plus Jevons Paradox). When both those points are no longer true we are certainly much closer to, dear I say it, agi. I suspect that (1) will be solved for somewhat limited domains in the near future using harnesses. And it could snowball from there.

zaphar · 2026-05-22T14:16:47 1779459407

Nearly every argument that hinges on the word "yet" is just an example of over-extrapolation[0] at play.

0: https://www.fallacyfiles.org/overxtra.html

muldvarp · 2026-05-22T15:03:25 1779462205

But saying "LLMs are not good at architecture so software engineering has a bright future" is _also_ extrapolation.

zaphar · 2026-05-22T16:18:40 1779466720

Anyone who claims to know the correct strategy to be best positioned for the future is lying or misinformed. The most you can reasonably say is that for the moment an LLM in the hand of an non-expert or naive user is unlikely to produce high quality results or create an absurd boost in productivity. We can make reasoned decisions now and continue to monitor things.

It would be just as unwise to ignore the progression of LLM agents as it would be to over-index on them.

muldvarp · 2026-05-22T17:00:36 1779469236

I think be both agree that the future of software engineering is very uncertain right now (and likely will be for years). For me personally that alone is enough to recommend not investing money and time to get into software engineering anymore.

iwiwk · 2026-05-22T15:17:55 1779463075

Moreover we haven’t seen all the effects of today’s tech to understand the net benefits.

E.g ‘productivity’ is seemingly increasing but what is the effect on a firms financial position? It’s all speculative and experimental right now.

worldsayshi · 2026-05-22T14:22:40 1779459760

You're probably on point there.

hoppp · 2026-05-22T18:05:25 1779473125

Yeah I mean I can now program in every language. Before I used to learn them and it took me a week to a month to start working with a new language (talking about similar syntax languages)

Now I can just get to it. I know what I want and can organize the codebase, whatever the code is I can generate it.