Yeah this is exactly my view. We've had several years of work on the tech, and L...

selridge · 2026-02-27T00:51:27 1772153487

Did google not rely on Gemini to do their ISA changeover?

Was Gemini worse than no tool at all there?

parliament32 · 2026-02-27T01:21:42 1772155302

Probably. According to the paper, 83.82% of automated commits were already made by algorithmic tools (non-LLM). For the remainder, a three-phase LLM approach was tried, and achieved a success rate of 30%. Based on these numbers, it probably would have been faster, cheaper, and more efficient to just enhance their current strategy rather than screwing around with text generators.

selridge · 2026-02-27T15:31:18 1772206278

I think that’s a bad faith read on that paper.

johnfn · 2026-02-27T00:30:55 1772152255

Do you really think that Opus 4.6 hallucinates to exactly the same degree as GPT-3.5? I am mystified how you can hold this perspective.

parliament32 · 2026-02-27T00:47:58 1772153278

If you're not seeing the hallucinations, I'd assert you're either not using it enough, or (more likely) you don't have enough knowledge in the subject matter to notice when it's hallucinating.

johnfn · 2026-02-27T01:08:19 1772154499

I'm not interested in getting into some argument about who has "more knowledge in the subject matter". I'm genuinely curious: do you think Opus 4.6 hallucinates just as much as GPT-3.5?

segfaultex · 2026-02-27T03:10:00 1772161800

Yes. I see it hallucinate method names for 3rd party libraries constantly.

It’s useful, but when users here say they’re vibe coding 98% of their work, I have to think they’re not working on anything complex.

andoando · 2026-02-27T03:47:36 1772164056

Hmm no way. Ive used to see hallucinations like 50% of the time prompting gpt3.5 for simple functions.

I don't remember the last time ive seen a made up library/methods these days and Im definitely using way more for more complex stuff. The tool calling changed the game.

Even for work I do almost 100% of my coding telling claude what to do. I mean I break down the tasks and tell it more or less exactly what I want but I find "rename this thing across these two repos" easier than doing it myself

hdgvhicv · 2026-02-27T09:15:15 1772183715

I ran into the non existent methods and functions far more a year ago than I do today. I hadn’t even considered it as I don’t write a lot of code, Most of my job is talking with people to understand the problems and to drive strategy.

Orygin · 2026-02-27T11:25:57 1772191557

What a condescending post. You either haven't used any recent models if you make that statement. Anyone who used GPT3.5 and any other newer model know that hallucinations have gone down tremendously.

Of course it's not perfect, here and there are inaccuracies or plain hallucinations, but it's impossible to state that it's still the same garbage it was 3 years ago.

selridge · 2026-02-27T00:52:20 1772153540

LMFAO does it hallucinate to the same degree as GPT 3?

Which is what was questioned.