Because the fundamental task many of these programs are doing is neither complicated nor resource intensive.
In the age of cheap custom software solutions everyone should at least try to make something themselves that's fit for purpose. It doesn't take much to be a dangerous professional these days, and certainly more than ever before can a dangerous professional be truly dangerous.
Thank you, I get so confused when people think a $5/month VPS shouldn't be able to do much. We're talking about the 99% of small businesses that might have 5 concurrent users max.
2 gigs of ram should be considered overkill to cover every single business case for a variety of tools (analytics, mailer/newsletter, CRM, socials, e-commerce).
I heartily agree with you except for the ongoing childhood-screentime pandemic where kids aren't going outside to play, but instead are staying inside, alone, and maybe playing with others virtually, but with more exposure to harm (e.g. gambling). This is clearly going to cause some serious long term generational fallout.
I'm grateful that I got the best of both worlds. When I was young I could play outside with freedom and climb around on highly dangerous playground equipment and now that I'm older and more fragile I get to stay inside on the couch and play amazing video games all day.
It's a shame that kids today don't get the option to do crazy kid stuff while they're young and healthy enough to bounce back from injury. I can't blame the tech for that though. It's parents who don't restrict screentime and our society that thinks it's okay to call the police on parents who let their kids walk down the street unattended.
Agreed--we're already seeing some of that, and I fully support minimizing kids' exposure to that.
I probably should have been explicit that I don't think technology has no downsides--it most certainly does. It's just, IMHO, the benefits outweigh the risks. And, over time, we figure out how to ameliorate the downsides.
I think you're confused. Kindles need to be connected to the Internet so you can purchase and read books on them. The SIM card removed friction from the process e.g. buying books while on vacation or at the airport or whatever.
They didn't put SIM cards in there to spy on you. They were always an opt-in (at additional cost) option for a better user experience.
Modern medicine absolutely short-circuits natural selection. If you have an older sibling who was delivered via C-section chances are you wouldn't exist.
Large awards in medical malpractice trials were the reason doctors push for a C-section if there’s any possibility of a complication. (Sometimes called defensive medicine.)
Most people point to the cases won by John Edwards, trial lawyer and vice presidential candidate, as the reason for the great increase in C-sections. His case wins include 30 trials at which he won at least $1 million each.
In my generation (80s-90s), pretty much everyone in Brazil who was born in a hospital was delivered by C-section. Only recently has the practice of defaulting to C-section begun to fade.
This is just a wrapper around sandbox-exec. It's nice that there are a ton of presets that have been thought out, since 90% of wielding sandbox-exec is correctly scoping it to whatever the inner environment requires (the other 90% is figuring out how sandbox-exec works).
I like that it's just a shell script.
I do wish that there was a simple way to sandbox programs with an overlay or copy-on-write semantics (or better yet bind mounts). I don't care if, in the process of doing some work, an LLM agent modifies .bashrc -- I only care if it modifies _my_ .bashrc
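For the curious, the scoping work mostly comes down to writing an SBPL profile. A rough sketch below — the paths and the allow-by-default posture are placeholder assumptions for illustration, not a hardened profile:

```shell
# Hypothetical sandbox-exec profile: allow everything by default, then
# deny writes, then re-allow writes only inside the project dir and /tmp.
# (In SBPL the last matching rule wins.)
cat > /tmp/ro-outside.sb <<'EOF'
(version 1)
(allow default)
(deny file-write*)
(allow file-write* (subpath "/Users/me/project")
                   (subpath "/private/tmp"))
EOF

# sandbox-exec only exists on macOS, so guard the invocation.
if [ "$(uname)" = "Darwin" ]; then
  sandbox-exec -f /tmp/ro-outside.sb touch /private/tmp/sandbox-ok
fi
```

Swap in your own project path; the hard part in practice is discovering which extra paths (caches, toolchains, sockets) the inner program actually needs.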
Thanks, I picked Bash because I’m scared of all Go and Rust binaries out there!
Re “overlay FS” - I too wish this was possible on Macs, but the closest I got was restricting agents to be read-only outside of CWD which, after a few turns, bullies them into working in $TMP. Not the same though.
I took a more paranoid approach to sandboxing agents. They can do whatever they want inside their container, and then I choose which of their changes to apply outside as commits:
┌─ YOLO shell ──────────────────────┬─ Outer shell ─────────────────────┐
│                                   │                                   │
│ yoloai new myproject . -a         │                                   │
│                                   │                                   │
│ # Tell the agent what to do,      │                                   │
│ # have it commit when done.       │                                   │
│                                   │ yoloai diff myproject             │
│                                   │ yoloai apply myproject            │
│                                   │ # Review and accept the commits.  │
│                                   │                                   │
│ # ... next task, next commit ...  │                                   │
│                                   │ yoloai apply myproject            │
│                                   │                                   │
│                                   │ # When you have a good set of     │
│                                   │ # commits, push:                  │
│                                   │ git push                          │
│                                   │                                   │
│                                   │ # Done? Tear it down:             │
│                                   │ yoloai destroy myproject          │
└───────────────────────────────────┴───────────────────────────────────┘
Works with Docker, Seatbelt, and Tart backends (I've even had it build an iOS app inside a Seatbelt sandbox).
I've been working on an OSS project, Amika[1], to quickly spin up local or remote sandboxes for coding workloads. We support copy-on-write semantics locally (well, "copy-and-then-write" for now... we just copy directories to a temp file-tree).
It's tailored to play nicely with Git: spin up sandboxes from the CLI, expose TCP/UDP ports of apps to check your work, and if running hosted sandboxes, share the sandbox URLs with teammates. I basically want running sandboxed agents to be as easy as `git clone ...`.
Docs are early and edges are rough. This week I'm starting to dogfood all my dev using Amika. Feedback is super appreciated!
FYI: we are also a startup, but local sandbox mgmt will stay OSS.
This is just a thin wrapper over Docker. It still doesn't offer what I want. I can't run macOS apps, and if I'm doing any sort of compilation, now I need a cross-compile toolchain (and need to target two platforms??).
Just use Docker, or a VM.
The other issue is that this does not facilitate unpredictable file access -- I have to mount everything up front. Sometimes you don't know what you need. And even then copying in and out is very different from a true overlay.
It sounds like a big part of your use case is to safely give an agent control of your computer? Like, for things besides codegen?
We're probably not going to directly support that type of use case, since we're focused on code-gen agents and migrating their work between localhost and the cloud.
We are going to add dynamic filesystem mounting, for after sandbox creation. Haven't figured out the exact implementation yet. Might be a FUSE layer we build ourselves. Mutagen is pretty interesting as well here.
This is what I was going for with Treebeard[0]. It combines sandbox-exec, worktrees, and a COW/overlay filesystem. The overlay filesystem is nice, in that you have access to git-ignored files in the original directory without having to worry about those files being modified in the original (due to the COW semantics). Though, truthfully, I haven’t found myself using it much since getting it all working.
This approach is too complex for what is provided. You're better off just making a copy of the tree and simply using sandbox-exec. macFUSE is a shitshow.
The main issue I want to solve is unexpected writes to arbitrary paths should be allowed but ultimately discarded. macOS simply doesn't offer a way to namespace the filesystem in that way.
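For contrast, here's roughly what that looks like on Linux with overlayfs, where writes land in an "upper" dir and the original tree stays untouched. Paths are illustrative, and the mount itself needs root (or a user namespace), hence the guard:

```shell
# Set up lower (original), upper (writes), work, and merged (the view).
mkdir -p /tmp/ovl/lower /tmp/ovl/upper /tmp/ovl/work /tmp/ovl/merged
echo original > /tmp/ovl/lower/file.txt

# The overlay mount requires root; skip it otherwise.
if [ "$(uname)" = "Linux" ] && [ "$(id -u)" = "0" ]; then
  mount -t overlay overlay \
    -o lowerdir=/tmp/ovl/lower,upperdir=/tmp/ovl/upper,workdir=/tmp/ovl/work \
    /tmp/ovl/merged
  # A write through the merged view is copied up, not written through.
  echo modified > /tmp/ovl/merged/file.txt
  umount /tmp/ovl/merged
fi

# Either way, the original file is untouched.
cat /tmp/ovl/lower/file.txt
```

APFS has snapshots and copy-on-write clones, but nothing that lets an unprivileged process present a writable union view of someone else's directory like this.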
Completely agree; my approach was not the most practical. I mostly wanted to know how hard it would be and, as I said, haven’t used it much since. Yes, macFUSE is messy to rely upon.
I feel as though the right abstraction is simply unavailable on macOS. Something akin to chroot jails — I don’t feel like I need a particularly hardened sandbox for agentic coding. I just need something that will prevent the stupid mistakes that are particularly damaging.
It's quite naive to assume that. There is a reason why it is deprecated by Apple.
Apple is likely preparing to replace it with a secure alternative, and all it takes is someone finding one vulnerability (or several) in sandbox-exec to give everyone a wake-up call about why they were using it in the first place.
I predict that there is a CVE lurking in sandbox-exec waiting to be discovered.
On the other hand, the underlying functionality for sandboxing is used heavily throughout the OS, both for App Sandboxes and for Apple’s own system processes. My guess is sandbox-exec is deprecated more because it never was adequately documented rather than because it’s flawed in some way.
> the underlying functionality for sandboxing is used heavily throughout the OS, both for App Sandboxes and for Apple’s own system processes.
Security researchers will leverage every part of the OS stack to bypass the sandbox in XNU, which they have done multiple times.
Now there is a good reason for them to break the sandbox, thanks to the hype around 'agents'. It could even take a single file to break it. [0]
> My guess is sandbox-exec is deprecated more because it never was adequately documented rather than because it’s flawed in some way.
You do not know that. I am saying that it has been bypassed before, and having it used all over the OS doesn't mean anything. It actually makes it worse.
You could apply this same reasoning to any feature or technology. Yes there could be a zero day nobody knows about. We could say that about ssh or WebKit or Chrome too.
I hear what you're saying about the deprecation status, but as I and others mentioned, the fact that the underlying functionality is heavily used throughout the OS by non-deprecated features puts it on more solid footing than a technology that's an island unto itself.
As I understand it, Chrome, Claude Code, and OpenAI Codex all use sandbox-exec. I’m not sure Apple could remove it even if they were sufficiently motivated to.
Is it though? If the way I'm going to edit those files is by typing the same natural-language command into Claude Code, and the edit operation to maintain it takes 20 seconds instead of 10, that seems pretty materially the same to me.
Modern developers are predisposed to reach for off-the-shelf solutions, full stop. They're afraid of, or perhaps allergic to, just reading and writing files.
If you can learn to get past this you can unlock a whole universe of problem solving.
This post is amusing to me because after solving the problem in ~2 seconds the author boils the ocean to get that down further, then finally ends with questioning what the problem statement even is?
Classic software engineer pitfall. First gather the requirements!
Second, if their initial interpretation was correct, and it's a one-shot operation, then the initial solution solves it. Done! Why go any further?
I get that it's fun to muse over solutions to these types of problems but the absurdity of it all made me laugh. Jeff's answer was the best, because it describes a solution which makes the assumptions crystal clear while outlining a straightforward implementation. If you wanted something else, it's obvious you need to clarify.
They don't actually solve the problem in 2 seconds - at that point, they are running on a sample of only 3,000 vectors! Then they get it down further, but still find it will take a loooooong time to get through all 3B:
"With these small improvements, we’ve already sped up inference to ~13 seconds for 3 million vectors, which means for 3 billion, it would take 1000x longer, or ~3216 minutes." ...which is about two days.