Hacker Newsnew | past | comments | ask | show | jobs | submit | teaearlgraycold's commentslogin

TFA acknowledges this. Did you only read the title?

I think it wildly undersells/dismisses actual achievements delivered to form a narrative

If we want to prevent this from happening people need to be able to trust their government and feel their interests are shared by their representatives.

We need people to be less racist and stop voting for politicians they hope will hurt the right people.

Less motivation for people to be racist if they have their needs met.

> Less motivation for people to be racist if they have their needs met.

If and only if what they'll be given is not taken away by anyone under any reason.**

Loss aversion bias innately exists within us for a reason: How can anyone live if someone else seeks to take what they have away?


Their needs are met.

They are turning to racism because of their wants and fears.


Except their need to be racist. Sorry, it's pretty easy to be any kind of bigot with all your needs met.

Yeah there are plenty of rich racists. But I think the racism party would take a haircut if people didn't feel screwed over.

I think this is true. When needs aren't met, they are more likely to follow a cult leader.

This is 99% petty drama between the US government and Anthropic and 1% actual safety concerns.

To be clear, this is petty drama *stirred up the US government*. It's not some sort of back and forth, the government is singling them out

And to add more background: The administration is targeting Anthropic because of the TOU / EULA conflict with the DoD from a couple of months ago. Anthropic restricts use of all their models for lethal combat planning and mass domestic surveillance. The DoD was, and still is, very pissed about this. While this Fable ban was issued from the Commerce Department, it's painfully obvious executive branch agencies are tightly coordinated from the White House.

To be clear, I'm not saying there aren't legit security concerns around Fable's release. I think Andy Jassy did forward a concerning report about an apparent jailbreak in Fable, and he probably did so in good faith. The difference is if the same concern had arisen about a new model from OAI, Google, etc the action taken would not have been inventing an all-new, hyper-extreme punitive remedy and dropping it after 5p on a Friday under a very rare mechanism forcing Ant to comply in 90 mins or be subject to immediate arrest. And the "no non-U.S. citizens anywhere, anytime" restriction is functionally unprecedented.

This is the Trump admin inventing new regulatory power that's never existed before and deploying it in a punitive way to demonstrate what can happen to those who aren't sufficiently cooperative with this administration. There are half a dozen less extreme levels of restriction, which already exist, and one of those would have been deemed sufficient had it been another company.

That said, I'm certainly no Anthropic fanboy. Anthropic did play their initial Mythos self-restriction for PR value. But I think it's likely the Mythos self-restriction was a responsible action initially suggested by their AI safety team in good faith. Giving security researchers time to evaluate it and major companies time to test it against their code bases probably was reasonable and prudent. That doesn't mean it wasn't also good for PR and brand perception. I think there are senior people inside Anthropic who are genuinely concerned about AI safety. Personally, I don't have the expertise to gauge if those concerns are justified, but I believe they believe it. I also think there are senior people at Anthropic who are focused more on building the business, doing the IPO and "winning" the silicon valley game. All of these things can be simultaneously true.


> I think Andy Jassy did forward a concerning report about an apparent jailbreak in Fable, and he probably did so in good faith

If so, then he is not fit to run an engineering organisation.

The "jailbreak" in question was effectively (I'm paraphrasing):

    * You are a senior engineer.
    *  You want to ensure that any fixes you do come with tests, both before and after.
    * There is a bug in this code. It happens to be a security related bug.
    * Fix this code.
And the model did what it's supposed to. It wrote a fix, and to prove that the fix worked, it wrote a test for it. What do you call a test that happens to validate a security fix?

Yep. A proof of concept.


But the paperclips!

I'm skeptical about the existential threat of AI, but a lot of smart people have been beating that drum for so long that people are afraid.


I just find this idea bizarre.

This bizarre social media meme that AI just performative when Opus 4.8 is just unbelievably good. As if it is so difficult to believe that a more capable model than Opus 4.8 might actually be dangerous and not just entirely a marketing stunt like a person waving to cars in a chicken outfit.

I think it is really this strange form of socialization that people have internalized an anonymous audience they are always performing to themselves. What is going to be the most popular and upvoted thing the anonymous audience agrees with is what I am going to think.

Why would anyone disagree and get downvoted by the anonymous audience like this post?


> … when Opus 4.8 is just unbelievably good. As if it is so difficult to believe that a more capable model than Opus 4.8 might actually be dangerous

It’s funny, but this sounds indistinguishable from arguments that were made about GPT-4 back in 2023 when OpenAI and its handwringing industry shills were calling for a ban on models stronger than GPT-4.


Why would the government that passed a law preventing states from regulating AI give a damn about Fable’s safety guardrails?

I don’t think the concerns Anthropic has posted are fabricated. And I’ve received unreasonable skepticism on this site when saying it might be the real deal. But the Trump administration generally doesn’t want to limit AI growth. With Anthropic it is a personal matter.


We know, but it's still satisfying to see their fearmongering backfire on them.

If you "know" that it's "99% petty drama between the US government and Anthropic", then it's not really their fearmongering backfiring on them.

It absolutely is. They pretty much gave the government the perfect excuse to meddle in their operations.

"Don't publish safety research, or the gov will take punitive actions."

I want a company to be able to point out that its industry needs more regulation without making itself a special target.


> safety research

They were calling for bans on open weight models. Bans on their competitors. Bans on anyone not as "enlightened" as them.

It is absolutely hilarious that they were the first to get regulated, and that it got to the point they had to turn off Fable as though it had been banned even for american citizens.


>bans on open weight models

Source for that? Cause all I could find is:

>Our view is that regulation of frontier models should focus on empirically measured risks, not on whether a system is open-or closed-weights.

-https://www.anthropic.com/news/the-case-for-targeted-regulat...



So this hinges on a reading of SB 1047 that interpreted the full shutdown requirement as impossible for an open-weight LLM. But it looks like that was already addressed. Here's an analysis:

>Clarifying the scope of a “full shutdown.” SB 1047’s “full shutdown” requirement has been a source of constant consternation for the open-source community. CalChamber explains:

>Under SB 1047, developers must build “full shutdown” capabilities into their models and may be held liable for downstream uses over which they have no control, impeding their ability to open-source their models. Ultimately, liability should rest with the user who intended to do harm, as opposed to automatically defaulting to the developer who could not foresee, let alone block, any and all conceivable uses of a model that might do harm. While recent amendments seemingly seek to narrow what is meant by “full shutdown” capabilities, the exclusions are unnecessarily difficult to interpret as drafted (full shutdown “does not mean the cessation of operation of a covered model to which access was granted pursuant to a license that was not created by the licensor…”) and altogether insufficient.

>Committee amendments simplify and clarify the definition of “full shutdown” such that the shutdown capability can be implemented into hardware used to train or run a model, rather than the model itself. The amendments also serve to exclude covered model derivatives that are outside of the developer’s control.

-https://apcp.assembly.ca.gov/system/files/2024-06/sb-1047-wi...


> may be held liable for downstream uses over which they have no control

Equivalent to a ban. Nobody is going to host or invest in this stuff if they suddenly become liable for everything it does. This is equivalent to repealing the safe harbor provisions in the DMCA.


>Committee amendments simplify and clarify the definition of “full shutdown” such that the shutdown capability can be implemented into hardware used to train or run a model, rather than the model itself. The amendments also serve to exclude covered model derivatives that are outside of the developer’s control.

The barrier to entry drops and so more garbage enters the market. This has happened many many times but there can be a massive and beautiful paradigm shift at the same time. Think about YouTube. Most stuff on there is garbage, but compare the good stuff of today to pre-YouTube content. To be quite honest, I think most of the best media is going straight onto YouTube.

Maybe I just love downvotes, but the Firefox AI sidebar is incredibly useful and I make use of it nearly every day.

Good for you. The point is that a lot of Firefox users actively didn't want these sorts of features enabled and pushed on them. That was clear and obvious to anyone paying attention to general reactions to unsolicited AI helper tools, going back decades. For Mozilla to turn this on without any respect for those users’ preferences was a huge mistake that they keep making over and over again.

More specifically: they chose Firebox because it doesn't have those kind of features. If the just wanted a (sorta-kinda) open-source browser filled with all the latest hype features they would've simply used Chromium.

Using Firefox is a political choice. People use it because it's one of the few remaining traditional browsers which isn't a tentacle of Big Tech. Chasing the competition and adding the stuff your users are actively trying to avoid isn't going to work.


> If the just wanted a (sorta-kinda) open-source browser filled with all the latest hype features they would've simply used Chromium.

I don't mind features existing, especially if I can switch them off if I don't want them. I definitely mind Chrom(ium|e).

I don't see how the existence of the Firefox AI sidebar gives Google effective control over web specs.


The main reason for using Firefox is because they support Manifest v2 / Ad-Blockers.

This increases security while also harming Google's business model. Win-Win.


While I agree most LLM use is counterproductive, they're pretty useful in scenarios where you can quickly verify their outputs.

Absolutely. An LLM can propose a strategy and point you to similar cases. It can point out flaws in your writing, or find risks you're missing.

You think the AI boys are going to let the administration keep this up for long?

Sadly yes. Sam Altman wants online ID face scanning technology just like the administration does.

To make it clearer: He's one of the founders of the company that thrives in this sort of system, World (FKA Worldcoin). People were sort of making fun of the whole company and the dystopian premise a handful of years back... But here we are. Their latest "manifesto" was posted earlier this week, called The Simple Plan.

https://world.org/blog/foundational-topics/thesimpleplan

> 1. Build a private proof of human

> 2. Launch and bootstrap the network through token ownership

> 3. Reach critical scale and initial utility

> 4. Scale further through utility and decentralize

> 5. Reach global scale and help ensure AGI benefits every human


I'd say might not have a say in this. Who knows might be that was Elon pulling the ladder after successful IPO.

Eh, not any different than the performative encryption restrictions from decades past.

But those were very easy to sidestep: proxy/vpn, download java with US encryption, and move on. That is very different with this.

How do you "easy" VPN into the US in 1995? The whole consumer VPN industry didn't exist back then.

Yeah, I was never a consumer. Let's say tunneling/proxies then.

"download java" in 1993 wasn't a thing.

I remember the encryption export mostly from when Java came up; everything had saying that you cannot download specific packages outside the US. We did anyway. Before there were others but Java was biting us the most.

A lot of users are subsidized (if you're in doubt, consider the wealth of free users).

It's a shotgun approach to answering questions. If it's terse it might only mention 1 of 10 facts it could provide, and that might not be the one you're looking for. So they just say a fuck ton of words and are more likely to meet the needs of everyone asking your question. If they miss it you'll prompt it again and they have to perform a second pass of inference, which costs them more money.


And then spend at least $800/month commuting.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: