More

imiric · 2026-04-26T06:04:16 1777183456

> So this is proof of the models actually getting stronger (previous generations of LLMs were unable to solve this one).

No, it's not.

While I don't dispute that new models may perform better at certain tasks, the fact that someone was able to use them to solve a novel problem is not proof of this.

LLM output is nondeterministic. Given the same prompt, the same LLM will generate different output, especially when it involves a large number of output tokens, as in this case. One of those attempts might produce a correct output, but this is not certain, and is difficult if not impossible for a human not expert in the domain to determine this, as shown in this thread.

notahacker · 2026-04-26T12:32:04 1777206724

As others have pointed out, a key part of the prompt used here may have been "don't search the internet" as it would most likely have defaulted to starting off with existing approaches to that problem...

imiric · 2026-04-25T05:31:22 1777095082

This resonates a lot with me.

Long breaks help. Take your mind off of things that bothered you. Do things you enjoy. Which may include tech work, but on your own terms.

I wouldn't be surprised if you decide to not go back. The status quo of most organizations is grim. But there are still people who care about the same things as you. You can seek them out and work together, much like you did 15 years ago. This is more difficult now among the noise, but you can tune that out. The industry will never recover altogether, but this current period is a blip of high insanity, which will subside in a few years.

Good luck!

imiric · 2026-04-24T14:32:38 1777041158

As disturbing as the film "Midsommar" was, I found the concept of a human life being divided into 4 seasons of 18 years each pretty compelling. Not necessarily that life should end after Winter, but a person's contributions to society probably should. Having politicians in office pushing 80 is a disgrace.

imiric · 2026-04-24T05:57:55 1777010275

For crying out loud, why are we discussing and paying attention to articles and claims about a product that doesn't even exist yet?!

If this isn't a sign of a bubble, where marketing is more important than the actual product, I don't know what is. This industry has completely lost the plot.

imiric · 2026-04-23T15:53:33 1776959613

> A password manager does not need a CLI tool.

That's a wild statement. The CLI is just another UI.

The problem in this case is JS and the NPM ecosystem. Go would be an improvement, but complexity is the enemy of security. Something like (pass)age is my preference for storing sensitive data.

imiric · 2026-04-20T22:36:27 1776724587

Very cool, thanks for sharing.

I built something similar recently on top of Incus via Pulumi. I also wanted to avoid libvirt's mountain of XML, and Incus is essentially a lightweight and friendlier interface to QEMU, with some nice QoL features. I'm quite happy with it, though the manifest format is not as fleshed out as what you have here.

What's nice about Pulumi is that I can use the Incus Terraform provider from a number of languages saner than HCL. I went with Python, since I also wanted to expose a unified approach to provisioning, which Pyinfra handles well. This allows me to keep the manifest simple, while having the flexibility to expose any underlying resource. I think it's a solid approach, though I still want to polish it a bit before making a public release.

khimaros · 2026-04-20T23:29:49 1776727789

check out https://github.com/lnussbaum/incant

imiric · 2026-04-21T07:44:46 1776757486

Ah, that's neat as well.

I took a slightly different approach in that I don't want to use YAML as the authoritative source. Many projects abuse it, and end up creating a DSL on top of it with all sorts of hacks to achieve the flexibility of a programming language. Pulumi and Pyinfra already provide user-friendly primitives and idempotent(ish) APIs that work much better than YAML. I simply want to expose some (opinionated) building blocks to make them easy to use, and allow users to customize them and add their own as needed. E.g. I definitely don't want to write any shell scripts inside YAML. :)

BTW, Pulumi already supports YAML[1], which can be used with any provider. But to me it's too verbose and generic, and of course, it lacks the provisioning primitives.

[1]: https://www.pulumi.com/docs/iac/languages-sdks/yaml/

zeroecco · 2026-04-20T22:46:44 1776725204

Your path makes a ton of sense too! You get typed languages, state management, and the Incus team's work on the QEMU layer. The tradeoff I wasn't willing to make is the daemon + state store: Incus wants to own the VM lifecycle the way libvirtd does, and once you have that you're back to "two sources of truth" if you ever shell out. Holos is deliberately stateless on the host; Everything lives under one directory per project, rm -rf is a valid uninstall (though it will abandon the VMs if running). Different answer to the same frustration. Would genuinely like to see your thing when it's public.

d3Xt3r · 2026-04-20T23:16:40 1776727000

This is exactly what I dislike about incus. But what I do like about incus is how I can easily spin up and configure VMs directly using the CLI, without preparing a config first (I hate yaml).

So would be nice if holos could replicate that docker/incus CLI functionality, like say "holos run -d --name db ubuntu:noble bash -c blah".

zeroecco · 2026-04-20T23:27:26 1776727646

Docker does have both docker run and docker compose... Holos is compose-only right now. A holos run on top of the same VM machinery isn't a huge lift; mostly it's deciding what the sensible defaults are (image, cloud-init user, port, volume scope); Noted. it's not in 0.1 but I hear the ask.

imiric · 2026-04-21T07:54:39 1776758079

> Incus wants to own the VM lifecycle the way libvirtd does, and once you have that you're back to "two sources of truth" if you ever shell out.

That's true. But I didn't want to reinvent what Incus or any hypervisor abstraction does. I simply wanted to add some sugar on top that allows me to easily declare infra using small abstractions, and to tie in the provisioning aspect along the way. I still use Incus directly, and can benefit from their work, as you say. State is also managed by Pulumi, so really, there are 3 places for it to exist. There are some challenges with this, of course, but I think the tradeoff is worth it.

Good luck with your project, I'll be keeping an eye on it. I'll probably make a Show HN post when I release mine. Cheers!

imiric · 2026-04-20T07:46:14 1776671174

That's ludicrous hair splitting.

If I have evidence that a crime has been committed based on my layperson understanding of the law, I will surely inform others before the case is even brought to courts. Journalists can and should do the same.

By your logic, reporting based on evidence provided by whistleblowers shouldn't exist. Things like Watergate would likely have never happened.

Journalists shouldn't accuse anyone of committing a crime, and goes without saying that facts shouldn't be fabricated, which is unfortunately common nowadays as well, but they should report events that happened based on the information they have, whether these happen to be related to crimes or not.

testfoobar · 2026-04-20T15:29:45 1776698985

>If I have evidence that a crime has been committed based on my layperson understanding of the law, I will surely inform others before the case is even brought to courts. Journalists can and should do the same.

In the US, careful journalistic organizations follow ethical and legal guidelines that often split hairs.

Have a look here: New York Times - Ethical Journalism A Handbook of Values and Practices for the News and Opinion Departments

https://www.nytimes.com/editorial-standards/ethical-journali...

ndsipa_pomu · 2026-04-20T09:26:56 1776677216

Reporting based on evidence is definitely allowed in the UK. Any accusation of libel/slander could be defended by producing the evidence and thus proving that the statements were true.

Going beyond the evidence and jumping straight to the crime is where the situation becomes tricky as the defense would be unlikely to prove beyond doubt that the accused person was actually guilty - that's why terms are used such as "alleged child abuser". Alternatively, the evidence/facts can be reported e.g. "Trump featured in many victim reports as an abuser".

imiric · 2026-04-17T15:53:18 1776441198

I can't say whether this was machine-generated or not, but the reason LLMs use these patterns is because they're often used by humans, which is what they're trained to mimic. LLM spam has now made it annoying, but there are many people who still write like this. Asking them to change their writing patterns because LLMs have ruined it for readers is not just unfair—it's offensive. (See what I did there? Double whammy!)

AlecSchueler · 2026-04-17T18:00:56 1776448856

> I can't say whether this was machine-generated or not, but the reason LLMs use these patterns is because they're often used by humans, which is what they're trained to mimic.

It's definitely written by an AI. I understand that people use these same rhetoric devices but the word "mimic" is exactly right. They're not writing like humans, they're mimicking human writing in a way that feels extremely uncanny.

(Hey look, I did the thing too!)

thewebguyd · 2026-04-17T18:13:55 1776449635

Thank you for saying this. I use those writing devices a lot, always have since I was young. I've always said I don't write like an LLM, the LLMs are copying me. Its also a common hallmark of neurodivergent writing, and it's frustrating to frequently get dismissed for being an LLM just because of writing patterns?

I'm now having to deliberately re-word my emails and comments, spending additional time, to avoid being accused of being an LLM.

vardump · 2026-04-17T17:37:23 1776447443

Sigh. That's why I stopped using — (em dash)...

imiric · 2026-04-16T15:46:01 1776354361

A major reason for that is because there's no way to objectively evaluate the performance of LLMs. So the meme projects are equally as valid as the serious ones, since the merits of both are based entirely on anecdata.

It also doesn't help that projects and practices are promoted and adopted based on influencer clout. Karpathy's takes will drown out ones from "lesser" personas, whether they have any value or not.

imiric · 2026-04-16T14:21:51 1776349311

This Mozilla?[1] The company whose 85% of revenue depends on an adtech giant?

They're certainly doing better than others in this space, but their track record does not inspire confidence for anyone concerned about their privacy and data.

[1]: https://en.wikipedia.org/wiki/Mozilla#Controversies

baal80spam · 2026-04-16T14:30:08 1776349808

[flagged]

dralley · 2026-04-16T14:45:53 1776350753

Coughing baby vs. atom bomb

Wolfrich · 2026-04-16T14:53:04 1776351184

that is the firefox groupn not thunderbird. Diff bro