
For my grug brain, can somebody translate this into ELIgrug terms?

Does this mean I would be able to run a 500B model on my 48GB MacBook without losing quality?




It's KV cache compression, i.e. it reduces how much memory the model needs to extend its context. It does not affect the weight size.
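A rough back-of-envelope calculation shows why: for a hypothetical 500B model (all dimensions below are made up for illustration), the weights dominate memory no matter how much you compress the KV cache.

```python
# Hypothetical dimensions for a 500B-parameter transformer (not any real model)
layers = 100          # assumed layer count
kv_heads = 8          # assumed number of KV heads (with grouped-query attention)
head_dim = 128        # assumed per-head dimension
bytes_per_val = 2     # fp16/bf16

def kv_cache_bytes(seq_len):
    # 2x for keys and values, stored per layer, per KV head, per token
    return 2 * layers * kv_heads * head_dim * seq_len * bytes_per_val

# Weights: 500B params at 2 bytes each is ~1 TB, independent of context length
weights_gb = 500e9 * bytes_per_val / 1e9

print(f"weights: {weights_gb:.0f} GB")
for n in (8_192, 131_072):
    print(f"KV cache at {n:>7} tokens: {kv_cache_bytes(n) / 1e9:.1f} GB")
```

So compressing the KV cache makes long contexts cheaper, but it does nothing for the ~1 TB of weights that keep a 500B model off a 48GB laptop in the first place.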

I wrote a more intuitive explanation that you might find helpful:

https://prabal.ca/posts/google-long-context-cheaper/



