More

nzeid · 2026-06-25T13:25:36 1782393936

What!? Amazing.

nzeid · 2026-05-23T23:31:29 1779579089

I want this. It's the reason why every time I shop for desks I look for workbenches. Desks are always TINY and I never understood why.

throwaway2037 · 2026-05-24T03:20:04 1779592804

I agree. I recently bought a beast from IKEA: TROTTEN (160x80 cm). Crazy cheap and built like a tank.

bschwindHN · 2026-05-24T15:15:59 1779635759

I recently upgraded to a desk of the same size, it's great! I have a habit of sitting cross legged and my legs were always bumping into my old desk whose legs were spaced too close together.

I had to the wooden top cut at a home center and then I sanded it and applied a finish myself because I couldn't find anywhere local that sold a desk top that size. Combined with a dual motor standing desk set of legs, it turned out great!

I also 3D printed brackets for all the power bricks of various devices I have and mounted them on the underside.

throwaway2037 · 2026-05-24T16:02:08 1779638528

This would be a good blog post or YouTube video. Do you have anything public to share? Your project sounds cool.

bschwindHN · 2026-05-25T00:39:40 1779669580

I was actually planning to write a small blog post about it because some parts are specific to where I live, but I'm sure there are plenty of articles already around. Just look up "custom standing desk" or "countertop/butcherblock desk".

The woodworking was fun but you're basically screwing a big piece of wood to some pre-made metal legs that have motors to raise it up and down.

nzeid · 2026-05-19T21:42:48 1779226968

> # External mode — you manage llama-server, forge proxies it

> python -m forge.proxy --backend-url http://localhost:8080 --port 8081

This is a good example because I've currently stuck with llama.cpp's UI. I can read your code (or throw Gemma at it =p ) but thought I'd ask anyway.

In this example, what is it exactly that your proxy is fortifying? The HTTP SSE requests? (Those would be `/chat/completions`.)

zambelli · 2026-05-19T21:49:29 1779227369

Yes that's correct !

/v1/chat/completions is the entry point.

In proxy mode, here's what forge applies on each request (handler.py builds these):

Response validation: ResponseValidator(tool_names) checks each tool call against the declared tools array. If the model emits a call to a name not in tools[], or a malformed call shape, it's caught before the response goes back.

Rescue parsing: When the model emits tool calls in the wrong format — JSON in a code fence, [TOOL_CALLS]name{args} (Mistral), <tool_call>...</tool_call> (Qwen XML) — rescue parsers extract the structured call and re-emit it in the canonical OpenAI tool_calls schema. This is the biggest practical lift, especially on Mistral-family models that ignore native FC and emit their own bracket syntax.

Retry loop with error tracking: ErrorTracker(max_retries=N) — if validation fails, forge retries inference up to N times with a corrective tool-result message on the canonical channel, rather than returning a malformed response to your caller. From your perspective the proxy looks like a single request that just took a few extra ms.

What proxy mode does NOT do (because it's single-shot, not multi-turn): prerequisite/step enforcement (those need a workflow definition spanning turns), context compaction, session memory. For that surface you wrap the WorkflowRunner class in Python — proxy mode trades that depth for "use forge with your existing setup, no Python rewrite."

So yes — the proxy is fortifying the response shape and retry behavior of /v1/chat/completions. The full agentic guardrails are at the Python class level above it.

For greenfield projects, I've been building on forge native using WorkflowRunner so I get all guardrails. But obviously as a drop-in replacement in existing systems then proxy is the way to go.

cyanydeez · 2026-05-19T21:53:10 1779227590

the funniest thing I see in opencode with tool calling is the model calls 10.0 and opencode says it's an error because the spec is an integer, even though it's obvious to anyone that if a float can be coerced properly to a integer, then that should be a success.

zambelli · 2026-05-19T21:58:41 1779227921

Yeah it's a delicate balance between precise and silly, and too permissive.

I'm definitely still iterating on forge, but so far sending the model a friendly and gracefully handled error message works wonders (instead of barfing a stack trace or something).

nzeid · 2026-05-11T17:31:38 1778520698

> They've essentially gotten roped into maintaining a huge chunk of internet infrastructure, for free. If they ever shut it down the whole world would end up rioting because it's so widely used.

Not even remotely true. They regularly shut down products and services with impunity. If Gmail cost more than the data they directly or indirectly mine and sell from their users, Gmail wouldn't exist either.

traderj0e · 2026-05-11T17:35:20 1778520920

The stuff they've shut down has been nowhere near as important as Gmail.

duskwuff · 2026-05-11T18:05:25 1778522725

Shutting down GMail would practically amount to shutting down email. It's by far the largest email provider in the US (and probably in the world but I don't have that data). There's no other provider who could take up the slack; if it were to abruptly shut down, a lot of users would simply lose access to email altogether.

elmomle · 2026-05-11T18:13:46 1778523226

They'd generate a huge amount of ill will by shutting it down, and that in turn would likely lead to a nontrivial share of people moving away from Google core products (like search) out of pure spite.

loloquwowndueo · 2026-05-12T02:43:24 1778553804

Wait, Google does search these days?

KennyBlanken · 2026-05-12T03:32:38 1778556758

To what? Google Search sucks thanks to the idiot who ran Yahoo into the ground, but everyone else sucks more. Every time I try to use non-google search the results are virtually useless.

Google has firmly been in the "we're so big we can suck at everything, but you'll still use our stuff because you have no other choice" phase that Microsoft was (is?) in.

They've dominated email so much that their spam filter makes it a very risky proposition to run your own domain; chances are very good it'll just start dropping your messages. Even if chances aren't great, can you take the risk of an important email getting zapped?

To this day I still routinely have to fish out my gmail spam folder dozens of emails from various open source mailing lists that have been around for a decade or two, some hosted on kernel.org, because the spam filter is convinced they're spam. Google is too fucking stupid or lazy to whitelist sites like kernel.org.

FFS even google groups I'm in that are technical get obviously-not-spam messages tagged as spam!

throwaway173738 · 2026-05-12T13:23:11 1778592191

kagi has been pretty good. Not great but way better for searches for information that happen to have a lot of people selling you something.

bigstrat2003 · 2026-05-12T01:42:14 1778550134

No, that wouldn't happen. Lots of people don't have email through Google, for one. Those people will still use email just fine. Moreover, the people who do use Gmail will simply sign up with another provider. It won't be a big deal.

duskwuff · 2026-05-12T02:27:58 1778552878

> No, that wouldn't happen. Lots of people don't have email through Google, for one.

Based on some data I collected around five years ago, roughly 80% of US customers used GMail for personal email. It was overwhelmingly the most common choice. I suspect that number has only drifted upwards since.

(What about the rest? 15% were using Yahoo; the rest were spread thinly across AOL, Microsoft, ISPs, and colleges.)

autoexec · 2026-05-12T01:35:12 1778549712

At one point AOL was the largest ISP and email provider on Earth too. If gmail died off people would just move to something else. It'd be annoying, but it wouldn't be the end of email

a2128 · 2026-05-12T02:26:21 1778552781

Google could actually do everyone a solid by killing gmail. They have enough influence in the industry that they could create a standard for email address portability, and then slowly force everybody to move off. By the end, one of the biggest problems with email would be solved and people would be able to switch email providers like how we can switch phone providers without needing to change our phone numbers. And Google would get to save a lot of money by no longer needing to provide everyone's emails

Sohcahtoa82 · 2026-05-13T16:53:23 1778691203

In the days when AOL was the largest AOL, the only people on the internet were middle class and above and the uber-nerds. The landscape has changed.

traderj0e · 2026-05-12T16:37:57 1778603877

When AOL was the largest email provider, there weren't as many people using email, at least not for important things

omcnoe · 2026-05-11T19:13:46 1778526826

I’d honestly expect to see regulatory intervention if they tried this.

idle_zealot · 2026-05-11T19:35:19 1778528119

In a better time I would expect the government to step in a acquire this fundamental service and fund it with tax money. Right now? The only intervention I would expect is a massive subsidy to pay Google to keep providing it, while also letting them continue to spy on everyone's mail (which is a crime, but not if the mail is on a computer, I guess).

icase · 2026-05-12T04:04:00 1778558640

oh yes, government-run email.

what could possibly go wrong

idle_zealot · 2026-05-12T05:40:53 1778564453

Why is this inconceivable? I don't know where you live, but the Post Office is extremely cheap and reliable around here. What drives you to pretend that states can't provide services to their people?

traderj0e · 2026-05-13T18:17:21 1778696241

https://en.wikipedia.org/wiki/HealthCare.gov was not a hard problem and they struggled. Gmail tackles a hard problem that even other large tech companies struggle with.

idle_zealot · 2026-05-13T18:48:17 1778698097

An excellent example of how not to do a government program!

> On October 1, 2013, HealthCare.gov was rolled out as planned, despite the concurrent partial government shutdown. The launch was marred by serious technological problems, making it difficult for the public to sign up for health insurance.[4] The deadline to sign up for coverage that would begin January 1, 2014, was December 23, 2013, by which time the problems had largely been fixed. The open enrollment period for 2016 coverage ran from November 1, 2015, to January 31, 2016.[5] State exchanges also have had the same deadlines; their performance has been varied.[6][7][8]

> The design of the website was overseen by the Centers for Medicare and Medicaid Services and built by a number of federal contractors, most prominently CGI Inc. of Canada. The original budget for CGI was $93.7 million, but this grew to $292 million prior to launch of the website. While estimates that the overall cost for building the website had reached over $500 million prior to launch[1][9][10][11][12] and in early 2014 HHS Secretary Sylvia Mathews Burwell said there would be "approximately $834 million on Marketplace-related IT contracts and interagency agreements,"[13] the Office of Inspector General released a report in August 2014 finding that the total cost of the HealthCare.gov website had reached $1.7 billion[14] and a month later, including costs beyond "computer systems," Bloomberg News estimated it at $2.1 billion.[15]

Got it. So if you're fighting an obstinate faction that would rather the government not exist than provide services then that can cause issues. Further, contractors will fleece you for everything you're worth. Compare to a successful project like the Post Office that gets pushed through with overwhelming political will and is run directly by a government agency (oddly structured as a government-owned corporation) and then even despite attempts to destroy it it continues to provide good service.

It's not easy; you need someone competent heading it up and setting it up for success. If the Democrats were to propose it in 2028 under president Gavin I would expect it to be a boondoggle. That doesn't change the fact that I want it to be done and done well.

account42 · 2026-05-12T08:33:43 1778574823

It's already called G-mail. Perfect fit.

traderj0e · 2026-05-11T21:09:36 1778533776

Government-operated Gmail would become such a massive cesspool of spam and hijacked accounts. It'd be spectacular.

WalterBright · 2026-05-11T20:01:48 1778529708

Do you believe that if the government provided email, that the government wouldn't keep track of everything you did on it?

idle_zealot · 2026-05-11T20:13:17 1778530397

Depends on the health of our institutions. In the US at least they're legally obligated not to by the highest law in the land. It gets ignored now, but it's a more promising path to privacy-preserving digital infrastructure than letting the private market handle it.

WalterBright · 2026-05-11T21:08:45 1778533725

Oh, I think it's been ignored for a long time. Remember Snowden?

> but it's a more promising path to privacy-preserving digital infrastructure than letting the private market handle it.

The history of governments suggests otherwise.

bigstrat2003 · 2026-05-12T01:45:20 1778550320

Unfortunately, the Constitution has been flagrantly ignored by the federal government for close to 100 years now, if not longer. Everything that FDR did was blatantly unconstitutional, but nobody stopped him, nor did they roll it back when he was gone. The Constitution has no real practical power to restrain the government if the people don't exercise their rights as voters to hold it accountable, and it is abundantly clear that the unconstitutional stuff the government gets up to is (largely) actually pretty popular.

s1mplicissimus · 2026-05-11T20:35:24 1778531724

Do you believe the government doesn't keep track of your email, just because it's hosted on googles servers?

WalterBright · 2026-05-11T21:04:16 1778533456

I used a private mail server for years, and the government didn't keep track of it. Of course, what happened at the email's destination, who knows?

icase · 2026-05-12T03:25:35 1778556335

oh no. what a shame that would be.

redeeman · 2026-05-13T13:31:12 1778679072

bullshit, email exists outside of gmail, and email would continue to exist without it. many would have to get a new account somewhere, but that would be not a problem. there are shitloads of providers that would be quite happy

lukan · 2026-05-11T18:04:12 1778522652

Yeah, but they still don't run a charity. They sell ads and information - and gmail provides them with lots of valuable information.

If that ceases to be true, goodbye (free) gmail.

LightBug1 · 2026-05-11T20:07:59 1778530079

Shutting down GReader ruined my life.

traderj0e · 2026-05-11T21:16:47 1778534207

Has nobody made a better RSS reader since then? Or is the issue that GReader was so popular that shutting it down made everyone stop using RSS?

negura · 2026-05-11T22:13:14 1778537594

it's insane to frame anything a company like Google does as some kind of goodwill. rather than an amoral profit optimization. contrary to OP, what people often overlook about GMail is not their "plight". but the powerful brand awareness it creates

eatsyourtacos · 2026-05-11T18:03:57 1778522637

Yes it is remotely true. Name one thing they have shut off that a large number of people actually used and it was important. We all joke about Google dropping things and yes they have, but saying they can just drop Gmail is.. well, insane.

negura · 2026-05-11T22:19:20 1778537960

they essentially shut down the old (useful) google search when they prioritized ad-heavy websites in the ranking

nzeid · 2026-05-11T18:14:29 1778523269

This fixation on "importance" is laughable. It is "insane" to drop Gmail because it makes them a shitload of money. That is how corporations work.

traderj0e · 2026-05-11T18:23:49 1778523829

The reason people mention importance is because corps like Google don't just care about per-product profitability, they assess how one product affects the rest of their business.

nzeid · 2026-05-05T19:34:57 1778009697

A few days ago I switched again from Qwen3.6 to Gemma 4 - for personal use I've experienced better average performance with the 26B version of the latter than the 27B of the former.

For someone who's been running local models for a long while, these are very very exciting times.

girvo · 2026-05-06T00:42:18 1778028138

Oh that's fascinating. 3.6 27B is pretty damned good, but slow in wall-clock times on my DGX Spark-alike. It generates huge reams of thinking before it gets the (usually correct!) answer, so wall-clock time is rough for tasks even at ~20tk/s

I'm surprised the 26B-A4B is better? It should be faster too, interesting. I'm excited to try 31B with MTP, because MTP-2 is what makes 27B bearable on the GB10.

What are you using it for? Agent-based coding, or something else?

nzeid · 2026-05-06T19:10:58 1778094658

General purpose, mostly internet research in the form of slow-crawling. (Emphasis on slow - I've ultimately landed on Scrapling's API for seamless content rendering, and I use image support so as not to exclude informative images or weirdly rendered text.)

For coding I don't need image support so I stuff the entire GPU with text-only mode. I don't have a workflow where I send LLMs off to generate thousands of lines of code but what little coding I did I did with Qwen3.6 and it was spectacular, as you likely suggest.

glenngillen · 2026-05-06T02:44:56 1778035496

I've been thinking about doing more of this too. What spec machine are you running? And are you using long-running autonomous agents or more of the IDE/co-pilot style of collaboration?

apexalpha · 2026-05-05T20:25:15 1778012715

I’ve been swapping between these too as well.

However I find qwen unbeatable for toolcallling. I think gemma wasnt trained on that at all.

sigmoid10 · 2026-05-05T20:30:41 1778013041

Gemma certainly was trained for tool calling, but the implementation in llama.cpp has been troubled because Gemma uses a different chat template format. The processor from the transformers library works fine though.

apexalpha · 2026-05-06T11:10:07 1778065807

Oh I must've missed this.

The AI space moves so fast! I'll check it out again.

intothemild · 2026-05-06T13:32:39 1778074359

Don't forget to update the gguf you have too. The templates in them were updated recently too

nzeid · 2026-05-05T20:39:03 1778013543

I'm using llama.cpp with Gemma and tool calling is mission critical. It's perfectly fine on my end.

There are definitely differences in the eagerness to tool-call that you'll need to manage. And for all local models I've ever used, I've had to micromanage the tools provided by servers to eliminate any possibility that they reach for something wonky or confusing.

magicalhippo · 2026-05-06T00:30:04 1778027404

> However I find qwen unbeatable for toolcallling. I think gemma wasnt trained on that at all.

Gemma4 chat template seems to had multiple issues, at least with llama.cpp, not sure they're all fixed yet. It assumed simple types for parameters for example.

nzeid · 2026-04-30T04:00:39 1777521639

I'm a huge neal.fun fan but I still worried that this was some scammy LLM coding YouTube clickbait.

Love the joystick for mobile users.

nzeid · 2026-04-29T17:54:38 1777485278

Also, embedded servers are now much much much more popular. Stuff an HTTP server directly into your application and do whatever you gotta do without gateways.

agwa · 2026-04-29T18:17:54 1777486674

That is way! Unfortunately, sometimes you have to do path-based routing to different backends, and now you're back to needing a proxy between your clients and your applications.

nostrademons · 2026-04-29T18:37:43 1777487863

This is the way only if you're operating in a trusted environment (eg. homelab, intranet) or you're sticking CloudFlare or some other "reverse proxy as a service" in front of it. If you expose an embedded HTTP app server directly to the Internet you're almost guaranteed to get pwned, as the Internet has now become an extremely hostile place.

agwa · 2026-04-29T18:41:02 1777488062

Go's embedded HTTP server can handle it just fine: https://blog.gopheracademy.com/advent-2016/exposing-go-on-th...

winstonwinston · 2026-04-29T18:57:03 1777489023

These are often not enough ‘battle-tested” and come with a warning to never expose to public internet. So then you put a WAF in front of it, and you are back to HTTP reverse proxy setup.

nzeid · 2026-04-29T20:28:56 1777494536

I've always chuckled at this. Just don't used bad HTTP server libraries. I wouldn't put something like that on my intranet either.

But even if you disagree with me the point is that I can count on only one hand the number of times I went "oh man, I need a FastCGI middle end".

winstonwinston · 2026-04-30T16:12:11 1777565531

I agree with your point but this is the reality:

F.E. Python stdlib http.server comes with a warning: Warning http.server is not recommended for production. It only implements basic security checks.

The `standard` way is then to use WSGI or ASGI, not FastCGI, but it is similar interface implementation.

nzeid · 2026-04-29T17:45:43 1777484743

The paper isn't saying "AI can't have one" it's saying (very approximately) that behavioral mimicry is not the path to one.

FrustratedMonky · 2026-04-29T17:57:03 1777485423

That is good point.

Just wondering, once an 'AI Model of Some Form', is in a Physical Body a 'robot', and is provided with some rules about survival so it doesn't fall into a hole. After a series of these events, does it matter? Does mimicry become reality, or no longer differentiable.

Kind of the philosophical zombie argument. If a robot can perfectly mimic a human, can you really know the internal state of the 'real' one is different from the 'mimicked' one.

nzeid · 2026-04-29T18:20:39 1777486839

The paper isn't concerned specifically with survival. It's saying that you cannot achieve "abstraction" (presumably the structure that underlies critical thinking, creativity, etc.) through shear mimicry.

Again, just echoing the paper here. I don't know that I'm doing it justice.

nzeid · 2026-04-21T13:24:59 1776777899

No, Brussels is Belgium.

And Brussels is not the capitol of the EU because the EU is not a country.

nzeid · 2026-04-16T13:21:19 1776345679

This was surprisingly complicated for me on Altice/Optimum, which is why my home didn't have IPv6 for a while even after they started provisioning.

We actually have a /128 address only, and had to tweak several settings including enabling IPv6 masquerading (NAT).

I haven't the slightest clue why they didn't give us a block.