Hacker Newsnew | past | comments | ask | show | jobs | submit | ipsum2's commentslogin

I read the title 2-3 times, and every time I swore it said "Robot cat ears". Wtf.

Yep, same here. Its the only reason I clicked on the article TBH

Literally same, saw “Robot cat ears”. Somehow I not only misread the words, but flipped the order as well, multiple times.

For those who need help recovering from the crushing self induced disappointment, here’s some brainwave controlled cat ears: https://www.youtube.com/watch?v=CqSZg0oiYuM


I read it multiple times, read your comment and clicked on the article and I still got it wrong.

I saw Robot ear cats and couldn’t make sense of it. But clicking in the article and seeing the title in huge whatever sizes fonts made it click.

I think it’s the small font sizes in HN causing our brains to ‘fill in the blanks’.


Then you complain about the Rs in strawberry...

That's exactly why we complain about the Rs in strawberry. We can get funny-stupid human interpretations all day long. What we can't get is cold facts, and isn't that what was promised by AI (at least, before ChatGPT was released in 2022)?

Unfortunately we fed this current iteration of AI with human behaviour (not only that: human behaviour on the Internet...)


> What we can't get is cold facts, and isn't that what was promised by AI (at least, before ChatGPT was released in 2022)?

Not that I know of. An entity dealing only in cold facts is not intelligent, it's a theorem prover- extremely narrow, rigid and incapable of interpretation and insight- basically of bridging the smallest gap of knowledge. That's exactly what intelligence isn't.


“Robots eat cats” for me

Definitely read it that way at first too

I was disappointed there were no cat ears in the article...

Agreed I was going to buy a set of robotic cat ears for my partner.

It looks pretty good! I'm using it now and its a meaningful improvement to the existing site.

Out of curiosity, why did you make a new Github account for the extension instead of developing it on your own account?


Thanks!

Because I see this becoming bigger than me and a separate organization made sense. There is a super thin backend component right now too. There is the potential to also add in some extra features that require a server/db. I'm kind of inspired by the atuin model of things.

I've got a LONG list of features I'd like to implement over time.


It made it realistic. A pelican is much more likely to be flying in the sky than riding a bicycle.

I don't get why writers do this. It makes sense for fiction, but why a factual, non-fiction article?

You said factual. But what is factual for you and I may not be for someone else. There are a lot of recollections in the article where sama remembers one version or doesn't remember at all and the other party remembers something else. Combine that with the nature of the article and the legal issues considering egos and sums involved. To top all of that New Yorker is known for fact checking that is exhaustive to the point of paranoia.

I am just speculating but if @ronanfarrow is still checking the discussion here, it would be amazing to hear the actual reasons.


> There are a lot of recollections in the article where sama remembers one version or doesn't remember at all and the other party remembers something else.

Or he's being dishonest about what he remembers.


> I don't get why writers do this.

Do what, edit?


It keeps it shorter.

Parakeet is significantly more accurate and faster than Whisper if it supports your language.

Are you running Parakeet with VoiceInk[0]?

[0]: https://github.com/beingpax/VoiceInk



i am, working great for a long time now

Right, and if you're on MacOS you can use it for free with Hex: https://github.com/kitlangton/Hex

Or write your own custom one with the library that backs it: https://github.com/FluidInference/FluidAudio

I did that so that I could record my own inputs and finetune parakeet to make it accurate enough to skip post-processing.


There's a fork of FluidAudio that supports the recent Cohere model: https://github.com/altic-dev/FluidAudio/tree/B/cohere-coreml...

It's used by this dictation app: https://github.com/altic-dev/FluidVoice/


I have been using Parakeet with MacWhisper's hold-to-talk on a MacBook Neo and it's been awesome.

Parakeet supports japanese now, but I cant find a version ported to apple silicone yet.

And indeed, Ghost Pepper supports parakeet v3

How good is the guitar to MIDI detection? Can it recognize chords?

Unfortunately not right now, it's in the works. Polyphonic guitar to midi is a problem I am yet to understand and try solving in this one. Jam Origin's Midi Guitar is good like that, I still need to get there.

Would be useful to have a video demoing it if people don't have a guitar or MIDI instrument handy.

You can use your computer keyboard as well. Setup the IAC buses and use the keyboard and you should be sorted.

I just started reading Asimov press. It has a weird name, I thought they were a sci-fi publishing company at first.

It had a unique blend of popular science writing that was sorely missing from the internet. Alas I hardly knew thee.


Isaac Asimov vote a huge amount of popular science book as well. They just have a shorter shelf life.

I enjoyed his “science for the layman” books, a lot more than his sci-fi stuff.

He was really good at explaining very complex stuff, in a simple, approachable manner.


My mother bought me "The Planet that Wasn't" when I was a kid - excellent book that I re-read many times:

https://en.wikipedia.org/wiki/The_Planet_That_Wasn%27t


This is why I read HN's comments.

I get MANY recommendations for books/movies/shows/topics I knew nothing about etc. here.

Just ordered this book for my 10-year-old grandson.


Maybe I should have mentioned that was about 50 years ago - still a good book though!

I suspect much of it is still relevant.

Well I'm pretty sure that Vulcan still isn't there!

The question is, has this had anything to do with Asimov the writer? Was he involved at the start or endorsed it somehow?

Judging by the .press domain it's too new for that.


Judging by the fact that you DNRTFA, we can't help you.

Actually I did look them up and that Asimov was never involved with them or their parent company.

A bit dishonest don't you think?


Everyone is surprised at the $300k/year figure, but that seems on the low end. My previous work place spends tens of millions a year on GPU continuous integration tests.


The $300K/year figure is surprising because it was for something that didn't need to exist (RPC calls).


Also not free nor open source, it just calls to fal.


prior version used modal to host the sam audio model (https://github.com/sambarrowclough/clearaudio/pull/2). fal made things simpler & faster


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: