flowinghorse's comments

flowinghorse · 2025-12-25T00:18:21 1766621901

              *
             / \
            /   \
           /  o  \
          /   .   \
         /  +   o  \
        /   .    .  \
       /  o    @     \
      /_______________\
            |   |
            |___|

flowinghorse · 2025-12-22T08:41:52 1766392912

Local models less than 2b are good enough for code auto completion. Even you don't have 128G memory.

flowinghorse · 2025-12-15T01:56:54 1765763814

Actually when the outage happened, my first action was to check Cloudflare status.

flowinghorse · on Nov 9, 2023

Just note that any organization is quite complex, and surely at different times different people in an organization have different purposes, and they may have different understandings if they share the same purposes. So it is pretty common that everybody's actions, understanding, and purposes are not coordinated.

The purpose of an organization is never simple. From Hebert Simon's <Administrative Behavior>:

> The survival and success of organizations depend on their providing sufficient incentives to their members to secure the contributions that are needed to carry out the organizations' tasks

flowinghorse · on Oct 29, 2023

Google’s Helpful Content Update is the Beginning and the Beginning of the End of SEO, and SEO for HCU is actually LLMO.

flowinghorse · on Oct 2, 2023

Or the operator of the most ambiguity

flowinghorse · on Sept 23, 2023

"The Habit Loop is a neurological loop that governs any habit. The habit loop consists of three elements: a cue, a routine, and a reward. Understanding these elements can help in understanding how to change bad habits or form better ones."

- Duhigg, C.

flowinghorse · on Sept 20, 2023

GPTBot — the official crawler of OpenAI, has been announced for nearly 2 months. GPTBot is for crawling web information to improve the models of OpenAI, e.g. GPT-4. We are wondering what the reactions from the Internet are. Is the bot being accepted or rejected?

Bender · on Sept 20, 2023

I suspect not many website operators/developers are aware this exists. Usage of robots.txt is unenforceable and would only show intent to OpenAI. This would not be useful for other LLM's as Google, Bing and other search engines already have decades of ingested data to feed their LLM's.

In my poor armchair quarterback opinion if people wish for something to not be crawled then they must make a best effort to ensure only humans are accessing it with strong authentication, legal agreements, best-effort bot detection and also have binding legal contracts that implement punitive actions for doing something with data it was not approved for and then actually follow through with legal action for breach of contract.

flowinghorse · on Sept 20, 2023

The number of disallow we found in the robots.txt files actually surprises us.

Companies like OpenAI also have to do a lot of things to ensure the compliance to the regulation.

Bender · on Sept 20, 2023

to ensure the compliance to the regulation

Did legislation pass requiring people and their bots to obey robots.txt? If so I totally missed it. That would be big news if so.

socrateslee · on Sept 20, 2023

ChatGPT only swallows data or content but bring back no traffic to content creators. Maybe this is how Google is different from ChatGPT. But what will become of Google if the experimental generative AI's answer replace all the SERPs? An closed web where no search engine no AI can actually enter?

flowinghorse · on July 15, 2023

Actually, the EU-US data privacy framework help preserve a cross continent market for small businesses