Thanks for sharing that. Zerobox _does_ use the native OS sandboxing mechanisms (e.g. seatbelt) under the hood. I'm not trying to reinvent the wheel when it comes to sandboxing.
Re the URLs, I agree, that's why I added wildcard support, e.g. `*.openai.com` for secret injection as well as network call filtering.
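To illustrate the kind of matching I mean (a rough Python sketch, not the actual Zerobox code - the pattern list and hosts are made-up examples):

```python
# Hypothetical sketch: one wildcard allowlist gating both secret injection
# and outbound network calls.
from fnmatch import fnmatch

ALLOWED_HOSTS = ["*.openai.com"]  # example pattern, as above

def host_allowed(host: str) -> bool:
    """True if the host matches any allowlisted wildcard pattern."""
    return any(fnmatch(host, pattern) for pattern in ALLOWED_HOSTS)

assert host_allowed("api.openai.com")
assert not host_allowed("evil.example.com")
```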
You know, the thing is that it's super easy to create tools like this with AI nowadays. And if you create your own, you can avoid these unnecessary abstractions. You get exactly what you want.
Zerobox creates a cert in `~/.zerobox/cert` on the first proxy run and reuses it after that. The MITM process uses that cert to make the calls, inject secrets, etc. This is actually done by the underlying Codex crate.
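Conceptually it's just "generate once, reuse forever" - roughly along these lines (a hypothetical Python sketch; the real logic lives in the Codex crate, and the file names here are invented):

```python
# Hypothetical sketch of first-run CA creation and reuse for a MITM proxy.
from datetime import datetime, timedelta, timezone
from pathlib import Path

from cryptography import x509
from cryptography.x509.oid import NameOID
from cryptography.hazmat.primitives import hashes, serialization
from cryptography.hazmat.primitives.asymmetric import rsa

CERT_DIR = Path.home() / ".zerobox" / "cert"   # invented file names below

def load_or_create_ca() -> tuple[bytes, bytes]:
    """Return (cert_pem, key_pem), generating them only on the first run."""
    cert_path, key_path = CERT_DIR / "ca.pem", CERT_DIR / "ca.key"
    if cert_path.exists() and key_path.exists():
        return cert_path.read_bytes(), key_path.read_bytes()

    CERT_DIR.mkdir(parents=True, exist_ok=True)
    key = rsa.generate_private_key(public_exponent=65537, key_size=2048)
    name = x509.Name([x509.NameAttribute(NameOID.COMMON_NAME, "zerobox mitm ca")])
    now = datetime.now(timezone.utc)
    cert = (
        x509.CertificateBuilder()
        .subject_name(name)
        .issuer_name(name)   # self-signed: subject == issuer
        .public_key(key.public_key())
        .serial_number(x509.random_serial_number())
        .not_valid_before(now)
        .not_valid_after(now + timedelta(days=365))
        .add_extension(x509.BasicConstraints(ca=True, path_length=None), critical=True)
        .sign(key, hashes.SHA256())
    )
    cert_path.write_bytes(cert.public_bytes(serialization.Encoding.PEM))
    key_path.write_bytes(key.private_bytes(
        serialization.Encoding.PEM,
        serialization.PrivateFormat.PKCS8,
        serialization.NoEncryption(),
    ))
    return cert_path.read_bytes(), key_path.read_bytes()
```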
Yeah, but how does the sandboxed process “know” that it has to go through the proxy? How does it trust your certificate? Is the proxy fully transparent?
Great question! On Linux, yes, network namespaces enforce it: all net traffic goes through the proxy. Direct connections are blocked at the kernel level even if the program ignores proxy env vars, but I will test this case a bit more (unsure how to, though, since most network calls respect HTTPS_PROXY and similar env vars).
That being said, the default behaviour is no network, so nothing will be routed if it's not allowed regardless of whether the sandboxed process respects env vars or not.
On macOS, the proxy is best effort. Programs that ignore HTTPS_PROXY/HTTP_PROXY can connect directly. This is a platform limitation (macOS Seatbelt doesn't support forced proxy routing).
BUT, the default behaviour (no net) is fully enforced at the kernel level. Domain filtering relies on the program respecting proxy env vars.
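One way to test the "program ignores proxy env vars" case, by the way, is to open a raw TCP connection yourself - raw sockets never consult HTTPS_PROXY/HTTP_PROXY, so inside the Linux namespace this should fail, while on macOS (with network allowed) it may get through. A rough Python sketch; the host and port are just examples, not anything Zerobox-specific:

```python
# Deliberately bypass proxy env vars: a raw socket never reads HTTPS_PROXY.
import socket

def try_direct_connect(host: str = "api.openai.com", port: int = 443) -> bool:
    """Return True if a direct (non-proxied) TCP connection succeeds."""
    try:
        with socket.create_connection((host, port), timeout=5):
            return True   # got through without the proxy -> enforcement not in effect
    except OSError:
        return False      # blocked or unreachable -> enforcement held

if __name__ == "__main__":
    print("direct connection succeeded:", try_direct_connect())
```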
It does, but because I'm inheriting the Seatbelt settings from Codex, I'm not resetting them in Zerobox (I thought that was the safer option). Let me look into this; there should be a way to take Codex's profile and safely combine/modify it.
There is no such thing as an agentic codebase. If humans don't understand it, nothing really does. Agents give zero fucks about anything. If they burn 100 or a million tokens to add a feature, they don't care.
It's the developer's responsibility to keep it under control.
100% this. With these new tools it's tempting to one-shot massive changesets crossing multiple concerns in preexisting, stable codebases.
The key is to keep any changes to code small enough to fit in your own "context window." Exceed that at your own risk. Constantly exceeding your capacity for understanding the changes being made leads to either burnout or indifference to the fires you're inevitably starting.
Be proactive with these tools w.r.t. risk mitigation, not reactive. Don't yolo out unverified shit at scales beyond basic human comprehension limits. Sure, you can now randomly generate entirely new (unverified) software into being, but 95% of the time that's a really, really bad idea. It's just gambling, and likely some part of our lizard brains finds it enticing, but in order to prevent the slopification of everything, we need to apply some basic fucking discipline.
As you point out, it's our responsibility as human engineers to manage the risk reward tradeoffs with the output of these new tools. Anecdotally, I can tell you, we're doing a fucking bad job of it rn.
- A Chrome extension the original "developer" can no longer work on because the bots will wreck something else on every new feature he tries to add or bug he tries to fix
I think we're a ways out from truly complex code bases that only agents understand.
I've seen a bunch of hype videos where people spend lord knows how much money to have a bunch of these things run around and, I guess... use Facebook, and make reports to distribute amongst themselves, and then the human comes in and spends all their time tweaking this system. And then apparently one day it's going to produce _something_, but it's been two years and counting and, much like bitcoin, I've yet to see much of this _something_ materialize in the form of actual, working, quality software that I want to use.
My buddy made a thing that tells him how many people are at the gym by scraping their API and pushing it into a small app package... I guess that's kind of nice.
> Rewriting it in C can prove that the problem is not in your programming language
It's a lot of work to rewrite it in C. It's doable, but impractical. And such a rewrite may introduce new bugs.
Proving that the problem isn't in my language wasn't necessary - there was no room for it to introduce that kind of bug. Language bugs usually manifest themselves in a very different way (like broken binary code leading to crashes, or invalid code being accepted). That's why I asked my question in the first place - I was already sure it wasn't a language bug.
> Outdated OS can be the problem as well.
An outdated OS can't be the problem. Basic socket functionality was implemented eons ago, and all known bugs should be fixed even in pretty outdated versions of the kernel.
> What kind of advice did you expect?
Eventually I found out myself that my test program was creating too many connections to the TCP server and its internal connection queue was overflowing. But it took some time to find, way longer than it would have if my SO question had been answered. The problem was not that hard to spot, considering I had described exactly what I was doing. Even providing code examples wasn't necessary - the textual description was enough.
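For reference, the failure mode boils down to something like this (a simplified Python sketch, not my original code - the actual program was more involved):

```python
# Simplified repro: a listener with a tiny backlog that never accepts, while
# the client keeps opening connections until the kernel's pending-connection
# queue overflows and connect() starts timing out / failing.
import socket

server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
server.bind(("127.0.0.1", 0))
server.listen(2)                       # tiny backlog, and accept() is never called
port = server.getsockname()[1]

clients = []
for i in range(200):
    c = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    c.settimeout(1.0)
    try:
        c.connect(("127.0.0.1", port))
        clients.append(c)
    except OSError as exc:             # queue full: timeout or connection refused
        print(f"connection {i} failed: {exc}")
        break
```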
It could very well be that your language was the issue, and writing a small proof of concept for the specific problematic use case in a battle-tested language other people know is a standard troubleshooting step. Especially if it was a rate-limiting error, that sounds like a trivial thing to implement in C with a simple for loop and perhaps some dummy data.
Same with the OS. Just because socket functionality is decades old doesn't mean you can't hit a recently introduced bug that has already been fixed in a newer version. That also is a standard troubleshooting step.
It doesn't become bad advice just because you're too lazy to do it.
Getting Java to run is a base requirement for running most software written in Java.
However, there is a dedicated Dockerfile for creating a native image (Java-speak for "binary") that shouldn't require a JVM. I haven't tested running the binary myself, so it's possible there are dependencies I'm not aware of, but I'm pretty sure you can just grab the binary out of the container image and run it locally if you want to.
It'll produce a Linux binary of course; if you're on macOS or Windows you'd have to create a native image for those platforms yourself.
Isn't a Docker image basically a universal binary at this point? It's a way to ship a reproducible environment with little more config than setting an ENV var or two. All my local stuff runs under a docker compose stack, so I have a container for the db, a container for redis, LocalStack, etc.
I'm not saying it's ideal, just saying that's what we've shifted to for repeatable programs. Your Linux "universal" binary certainly won't work on your Mac directly either...
Except compatibility, but the biggest gap there is browser support, which is in the process of being closed. Chrome has shipped JXL support behind a flag, and Firefox is in the process of implementing support.
In Chrome you can enable JXL from here:
chrome://flags/#enable-jxl-image-format
It is much better than JPEG. However, because still images are not really a problem in terms of bandwidth and storage, we just use bigger JPEGs if we need more quality. The extra complexity and the break from established standards aren't worth it.
This is different for video: because video uses a whole lot more bandwidth and storage, we are more willing to accept newer standards.
That's where webp comes from: the idea is that images are like single-frame videos, so we could use a video codec (webm/VP8) for still images and it would be more efficient than JPEG.
That's also the reason why JPEG-XL is taking so long to be adopted: efficient video codecs are important, browsers want to support webm, and they get webp almost for free. JPEG-XL is an entirely new format just for still images; it is complex, and unlike with video, there is no strong need for a better still-image format.
IMO most of JPEG-XL's value is in the feature set. Having a format that can do transparency, proper HDR, and is efficient for everything from icons to pixel art to photographs is a really strong value prop. JPEG, Webp, and AVIF all have some of these, but none have all. (AVIF is the closest, but as a video codec it still has a pretty significant overhead for small images like icons).
A MITM proxy is a nice idea to avoid leaking secrets. Isn't it very brittle though? Anthropic changes some URLs and it'll break.