The problem, unfortunately, is the scale. It's always scale. Humans make all the kinds of mistakes that we ascribe to LLMs, but LLMs can make them much faster and at much larger scale.
Models have gotten ridiculously better, they really have, but the scale has increased too, and I don't think we're ready to deal with the onslaught.
Scale is very different, but I wonder if human trust isn't the real issue. As a group, we trust technology too much: we not only expect perfection, we assume it. That might be because the machines output confident-sounding answers and humans default to treating confidence as a proxy for accuracy, but I think there's another level where people just blindly trust machines because they're so used to using them for algorithms that tend to give correct responses.
Even before LLMs were part of the public discourse, I would have businesses ask about using AI instead of building some algorithm manually, and when I asked whether they had considered the failure rate, they would either return blank stares or say failures would count as bugs. To them, AI meant an algorithm just as good as one built to handle every edge case in the business logic, but easier and faster to implement.
We can generally recognize when AI is off in our own area of expertise, but some AI variant of Gell-Mann Amnesia is at play that leads us to go right back to trusting AI when it gives outputs in areas where we are novices.
I find the concept of LLM "brain surgery" fascinating, precisely because of how opaque the network is. One of the first things I did back when llama.cpp first got vision model support was hack the code to zero out (or otherwise modify) random numbers in the image embedding generated by the projector and then ask the LLM to describe the image. It was absolutely fascinating.
It would go from a normal description of the item in the picture to suddenly seeing people clapping in the background that were not there, or making up some other stuff. I kinda stopped after a while, but I should pick that back up and do a more coherent experiment to see if I can find any correlation between vector dimensions and "meaning."
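The actual experiment patched llama.cpp's C++ where the projector emits the image embedding, but the core idea can be sketched in a few lines of Python (the function name and parameters here are my own, purely illustrative):

```python
import random

def perturb_embedding(embedding, fraction=0.1, seed=None):
    """Zero out a random fraction of an embedding vector's dimensions,
    mimicking the llama.cpp hack described above. Illustrative only --
    the real version modified the projector output in-place."""
    rng = random.Random(seed)
    out = list(embedding)
    k = max(1, int(len(out) * fraction))  # how many dims to clobber
    for i in rng.sample(range(len(out)), k):
        out[i] = 0.0
    return out

# Toy 8-dim "embedding"; real projector outputs are far larger.
emb = [0.25, -0.1, 0.9, 0.33, -0.7, 0.05, 0.6, -0.4]
perturbed = perturb_embedding(emb, fraction=0.25, seed=42)
print(sum(1 for a, b in zip(emb, perturbed) if a != b))  # 2 dims zeroed
```

Feeding the perturbed embedding back to the LLM and diffing its description against the unperturbed run is what surfaced the hallucinated details.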
Maybe not, but in the past some here have pointed out that the blog itself is the product being promoted.
Even in this thread alone (https://news.ycombinator.com/item?id=47314929) some commenters are clearly annoyed with the way AI is being shoved into every place they don't want it.
I don't care, but I can see why many here are getting tired of it.
Heck yeah! Love the VisiData shoutout. Echoing other people's desire for a web UI, mostly so I don't have to be the sole Maintainer of the Truth as the only resident household technomancer.
EDIT: alternatively, exposing the data/functionality via MCP or similar would allow me to connect this to an agent using Home Assistant Voice, so anybody in the house could ask for changes or add new information.
This is super interesting. I do have a GitHub issue for LLM-powered data entry: "Add a landscaping project to do the backyard. Still ideating, thinking a budget of $40k."
Funny enough, Saul and I recently hacked on getting visidata's Ibis integration updated, so you can use visidata for poking around databases of any size, really. You might like that, but visidata also has non-Ibis support for SQLite, I believe.
Maybe your point was that, but that's not the point the person you replied to was addressing. Nobody was arguing about the specific definition of "modern."
The original commenter made a very clear claim: that the most recent "peak" system was the Xbox, which was discontinued in 2005, and that everything after that has been a rehash.
If the pace of quality titles is such that people have to fish through multiple decades to assemble what they believe is a convincing list of titles to compete with the OG Xbox, that is an indication that yes, even with more games coming out, fewer good games are coming out.
Also, unlike OpenAI's implicit caching, Anthropic's prompt caching is explicit (you set up to 4 cache "breakpoints"), meaning that if you don't implement caching, you don't benefit from it.
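For anyone who hasn't used it: a breakpoint is set by attaching `cache_control` to a content block in the request, and everything up to that block becomes cacheable. A rough sketch of the payload shape, as a plain dict (the model name and document text are placeholders, and no request is actually sent):

```python
# Stand-in for a large document you reuse across many requests --
# the kind of prefix worth caching.
long_context = "<many thousands of tokens of reference material>"

payload = {
    "model": "claude-model-placeholder",  # placeholder, not a real model id
    "max_tokens": 1024,
    "system": [
        {"type": "text", "text": "You are a helpful assistant."},
        {
            "type": "text",
            "text": long_context,
            # Explicit breakpoint: the prefix up to and including this
            # block is cached for subsequent requests.
            "cache_control": {"type": "ephemeral"},
        },
    ],
    "messages": [{"role": "user", "content": "Summarize the document."}],
}

# Anthropic allows at most four such breakpoints per request.
breakpoints = [b for b in payload["system"] if "cache_control" in b]
assert len(breakpoints) <= 4
```

Forget the `cache_control` markers and the request is valid but nothing gets cached, which is exactly the "no benefit unless you implement it" point above.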