More

ithkuil · 2026-06-19T06:29:18 1781850558

We wouldn't have identities either if we were all clones and our memories could be edited and shuffled at each conversation.

For an agent to have an identity we would have to intentionally make it hard to context engineering and limit it to append only messages that mimick human communication.

I can implant a thought into your head. If I say "Don't think about a green elephant" for a moment you'll think about a green elephant. There are more sophisticated examples of a person implanting thoughts in somobodies head (e.g. propaganda) but that's about it, I can't literally edit thoughts.

Why on earth do we want to limit our ability to do more powerful context engineering in a substrate that offers that ability natively?

Presumably because for some use cases you want the context of an agent to belong to a different "administrative domain" and you so want to have control over what information reaches it and how can it affect it?

ithkuil · 2026-06-18T12:40:13 1781786413

I guess the signal-to-noise ratio matters.

In the last 3 months I received 700 spam/scam calls to my phone, my wife received about 400. We can't turn off ringing for unknown callers and we're getting mad. A few days ago I vented to one of those call-center people trying to sell me a cheaper power utility for the Nth time, and told her to find another job or something like that; she actually called me back yelling at me that "any job is worth", and yelled at her that I cannot fucking receive sometimes up to 20 calls in a day, sometimes at quite annoying times of the day! It's getting ridiculous.

EDIT: I know not everybody is having the same experience in my country. Some people are only getting a few calls per week; I registered our phones in https://registrodelleopposizioni.it/ and also I'm using android's spam filter which filters out additional hundreds of calls automatically.

EDIT 2: I sometimes wonder if we're being harassed by somebody ; I cannot tell. The voices are often quite similar, but it might be the albanian accent that makes them sound similar.

EDIT 3: caller id numbers are always different

hellojesus · 2026-06-18T17:26:15 1781803575

> I vented to one of those call-center people trying to sell me a cheaper power utility for the Nth time, and told her to find another job or something like that

I threaten to kill and rape them all the time, but that usually doesn't do much.

I've found that politely asking them to kill themselves elicits much more engagement, and I hope it at least implants some lasting memory.

ithkuil · 2026-06-19T05:44:32 1781847872

I assumed they are used to people being exhausted by those calls, so I was frankly surprised when she actually called again to complain

ithkuil · 2026-06-16T11:41:14 1781610074

I wonder if opus 4.8 would also be able to fix the code too

InsideOutSanta · 2026-06-16T12:03:22 1781611402

In my experience, most models are pretty good at finding security vulnerabilities and fixing them. I can run GLM-5.2, Kimi K2.7, or even a Mistral model, and it'll find issues and propose reasonable fixes.

My impression is that Anthropic's point about Mythos is that it is uniquely good at finding vulnerabilities and then using them to create working exploit chains.

zozbot234 · 2026-06-16T12:25:55 1781612755

Exactly. Which is somewhat helpful for cyber defense because it helps prioritize fixes for those bugs that are in fact involved in a viable exploit chain. But it makes sense that one would want to restrict the ability of building those until the vulnerable software has been comprehensively fixed.

There is some meaningful evidence that Fable is fine-tuned or steered away from helping on this very task, which is not something that can be feasibly circumvented by a basic jailbreak.

HarHarVeryFunny · 2026-06-16T17:32:41 1781631161

It's not even clear if Anthropic care. If they genuinely think the user is trying to do something dangerous, then "OK, sure, but you're going to have to use Opus 4.8 for that" doesn't make a whole lot of sense.

Maybe this is just Anthropic pre-IPO marketing to try to convince people how much better Mythos is than Opus 4.8. There sure seemed to be a lot of shills out on release day talking about how it was a "step change" (exact phrase) in capability.

b--l · 2026-06-17T09:29:59 1781688599

I doubt it. It's a shit a model.

ithkuil · 2026-06-16T05:33:24 1781588004

But LLM can write code that can do math and count. Tool use, more broadly, has proven to be a very powerful way to let LLMs do what they're good at (handle the fuzzy and imprecise nuances of natural language, which includes the scooping of a lot of context) and delegate other things they're not good at to external tools, some of which if can write on the spot.

If you think about it, we humans do that all the time too.

I'm crap at 4 digit multiplication in my head, but I have no problem doing that with pencil and paper

razorbeamz · 2026-06-17T00:04:52 1781654692

> But LLM can write code that can do math and count.

They cannot, however, execute that code. They can feed that code into an external program they've been given access to, but they can't execute it themselves.

ithkuil · 2026-06-15T17:40:34 1781545234

I'm honestly unsure if I'm more annoyed by slop or by the anyslop police at this point

ithkuil · 2026-06-13T06:19:44 1781331584

Fair enough, there _could_ be powerful models that are hidden from the general public, but I wouldn't call it "naive" to think the current capitalistic incentives are such that the only way to produce such models is to do exactly what we see out in the open with a handful of companies each trying their hardest to outcompete the other

ithkuil · 2026-06-11T06:31:47 1781159507

An interesting way to rate limit access while also getting some data to analyze. They will lift this restriction later when they have more capacity

ithkuil · 2026-06-10T23:47:08 1781135228

Why stop at bytes? Let's split it in individual bits and then look up the bits in pi!

But Pi's binary expansion is not very practical for this purpose, since it's 11.0010...

OTOH. e is 10.1011...

Let's stick to fractional digits (the ones right of the binary point) at index 0 we have 1 and at index 1 we have 0.

So, to encode a stream of bytes so that each bit is encoded as the index of that bit in the e, all you need to do is to xor it with 0xFF

nvader · 2026-06-11T00:46:55 1781138815

Hang on hang on let me write a CUDA kernel for this. This is going to be really huge.

hatthew · 2026-06-11T00:40:19 1781138419

genius

ithkuil · 2026-06-10T23:33:49 1781134429

You'll find this an interesting watch:

Reinventing Entropy Compression is Intelligence Part 1

3blue1brown https://youtu.be/l6DKRf-fAAM?is=ne73FCJ7ErXhzZ-v

nz · 2026-06-11T11:37:32 1781177852

You, and the HN users, `lojban`, `klingon`, `ido`, `brithenig`, `solresol`, `babm`, and `tokipona`, may want to start a club. Amusingly, nobody seems to have registered the `esperanto`, `volapuk`, `interslavic`, `balaibalan`, and `dothraki` usernames.

dothraki · 2026-06-11T13:56:51 1781186211

What can I say other than thank you for the inspiration.

idiotsecant · 2026-06-11T13:32:39 1781184759

I feel like I am having a stroke reading this comment

lompad · 2026-06-11T14:10:13 1781187013

The user names all describe conlangs[0]. Though I'd suggest nz to join as well, considering only a true conlang-connisseur would actually notice.

[0]: https://en.wikipedia.org/wiki/Constructed_language

cestith · 2026-06-11T15:15:35 1781190935

I don’t see users with ‘khuzdul’, ’sindarin’, or ‘quenya’ either.

sam_lowry_ · 2026-06-11T06:12:27 1781158347

Also this article by Ted Chiang as a literary explanation of the connection between intelligence and compression: https://www.newyorker.com/tech/annals-of-technology/chatgpt-...

ithkuil · 2026-06-08T06:43:22 1780901002

Italy is no Denmark but you still require to register before selling you scrap copper.

I think it's a reasonable response for a real problem and refusing to do this due to some idealistic free market principle appears to me to be a sign of fanaticism.