Hacker Newsnew | past | comments | ask | show | jobs | submit | varispeed's commentslogin

Until thumb drives became large enough to fit most datasets it stopped becoming Big Data. Just normal data.

To some degree IMO big data is still a mindset when it might take a day to process your data in a normal SQL query. Some tech doesn't scale to the data size for all use cases, and you need different solutions.

We have thumb drives that can store petabytes of data?

Or did you mean the "big data" crowd which thought 500GB was noteworthy? I don't think anyone took those serious, neither in 2010s nor now. That was always "small" data


My rule of thumb was "can it fit in RAM on a server?" If it can, then it's not big data.

500GB is in the "fits" category.


> We have thumb drives that can store petabytes of data

We do?


It was a question that you've edited out the punctuation. You're asking the exact same thing as the person you've replied

Please provide a link.

You would need 4 and change of these 245tb Kioxias to hold 1 petabyte, and an entire server grade computer to run them.

https://www.tomshardware.com/pc-components/ssds/kioxia-unvei...

Or 250 of these ~$400 4tb flash drives and an insane number of dongles to connect them all:

https://www.slashgear.com/1847725/largest-usb-thumb-drive-hi...


Most companies using term "big data" had datasets in TB region. One company I had a gig at had full Hadoop cluster setup and their whole dataset was 40GB. Their marketing had all the big data adjacent keywords over the brochures for clients.

Russians started the war together with Germans. The idea that they could "help" is Western propaganda that tries to whitewash helping Soviet Union.

Don't forget that Germans were able to quickly find people they wanted to exterminate by going through municipal and church records. Today it will be much easier thanks to push for Digital ID. Choose demographic and the dashboard will show where everyone is and their connections.

Corporations already have a "digital ID" for everyone. As we know from WW2 (and more recent conflicts) private corporations would be lining up to work with the next Nazi regime if the price was right.

Ten years ago multiple tech giants openly stated they would not help the Trump administration build a Muslim registry [1]. Since then, several of them have bowed the knee and donated to his second inauguration. I’m from Germany, and keep wondering how much more damage the NSDAP could have done if they had access to the data these companies now have on everybody. [1] https://www.theverge.com/2016/12/16/13990234/google-muslim-r...

Germany has the Schufa. Should be trivial to narrow down to a specific group based on name only.

Again a reminder that Google etc have data far more than historical dictatorship like Stasi, KGB etc. just for better marketing. we need better data protection though GDPR is a good start.

Digital ID? Kiddos <25 years old give away 3D face scans and fingerprints at the gym, grocery store, and in the gambling mobile app.

The choosen demographic in Poland was easy to find without any records. Jews were mostly orthodox and easily visibly different and poor.

Polish were basically "anyone there" no records necessary.


That's a big simplification. There were plenty of ethnic Jews who already converted to Catholicism (or remained atheist) and identified as regular Polish citizens. There were also mixed families who were exterminated as a whole nonetheless.

They were going after key people like academics, engineers, doctors, artists first.

Yeah the Nazis had future plans for Poland. The idea was to make the entire country into illiterate slaves.

A lot of people don't know that although what the Nazis did was obviously terrible it was nothing compared to what they wanted to do.

Poland survived 60 years of communism. It would not have survived national socialism.


There was a version which considered leaving 15%-20% of the Polish population to be slaves, but "In 1941, the German leadership decided to destroy the Polish nation completely, and in 15–20 years the Polish state under German occupation was to be fully cleared of any ethnic Poles and settled by German colonists.[16]: 32 A majority of them, now deprived of their leaders and most of their intelligentsia (through mass murder, destruction of culture, banning education above the absolutely basic level, and kidnapping of children for Germanization), would have to be deported to regions in the East and scattered over as wide an area of Western Siberia as possible. According to the plan, this would result in their assimilation by the local populations, which would cause the Poles to vanish as a nation.[46]"

https://en.wikipedia.org/wiki/Generalplan_Ost#Poland


Specifically a national socialism predicated on a definition of “nation” that was based on dubious race science and excluded anyone whose physical appearance or genealogy was “Polish”.

Germany already has religion logged in the Melderegister


I'm a little confused. It's still a field on the registration form and is used to collect Church tax, right? My last Anmeldung was in 2019 so I might have missed something.

Issue with these benchmark also is that they measure a model you are unlikely going to be routed to. My experience with Anthropic is that despite using Opus 4.6 and 4.7, most of the time the performance is matching low B parameter Qwen. I think there should be a way to verify what model is actually being used to process prompts - that should be independently verified. At the moment it is so bad, you have to ask verification question to the model in form of a non-trivial problem. If it solves it, then there is a chance you actually get Opus and not an impostor and so you can continue the session instead of restarting it hoping you get routed correctly. But that does not help if model is replaced with cheaper one mid session. I've got so much work lost because of these shenanigans.

> My experience with Anthropic is that despite using Opus 4.6 and 4.7, most of the time the performance is matching low B parameter Qwen.

Is this just the next level of the "they're serving quantized models!" theory?


Not a theory buy lived experience. You never know when you get the nerfed session.

I'm sure some inference providers don't, but most intentionally obfuscate this data. They have the full trace logs- my impression is that they don't share them because it's their competitive advantage, and it's easier for a competitor to distil their model if they did.

You can quickly get something "working" until you realise it has a ton of subtle bugs that make it unusable in the long run.

You then spend months cleaning it up.

Could just have written it by hand from scratch in the same amount of time.

But the benefit is not having to type code.


It also seems to me they route prompts to cheaper dumber models that present themselves as e.g. Opus 4.7. Perhaps that's what is "adaptive reasoning" aka we'll route your request to something like Qwen saying it's Opus. Sometimes I get a good model, so I found I'll ask a difficult question first and if answer is dumb, I terminate the session and start again and only then go with the real prompt. But there is no guarantee model will be downgraded mid session. I wish they just charged real price and stopped these shenanigans. It wastes so much time.

You're describing a Taravangian prompt situation (a character in a book series who wakes up with a different/random intelligence level each day and has a series of tests for himself to determine which kind of decisions he's capable of that day). https://coppermind.net/wiki/Taravangian

I always agree if this is for academic purposes, if it helps with research etc. I can't see why I shouldn't. We are just meat that will expire one day.

The Western corruption is called "lobbying" and is only allowed for the rich.

No. Lobbying is indeed legal for everyone.

But when's the last time you had $300 million in your personal budget to spend on advertising to a specific human being to improve your personal income?

When's the last time you got a call from an actual politician begging you for money and "support"?

US congress members spend the vast vast majority of their time on the phone begging a list of rich people for a piss of nickles to fund advertising for their next election. There's always a subtle threat of strings attached.

Both the prince and pauper are forbidden from sleeping under the bridge.


I think they are routing to cheaper models that present themselves as e.g. Opus. I add to prompts now stuff to ensure that I am not dealing with an impostor. If it answers incorrectly, I terminate the session and start again. Anthropic should be audited for this.

It appears that Opus 4.7 has been nerfed already. Can't get any sensible results since yesterday. It just keeps running in circles. Even mention that it is committing fraud by doing superficial work it has been told specifically not to do doesn't help.

oh yes. I tried to get some review of a code base after some refactoring. CC produced a complete garbage review. After pointing that out it admitted that that was garbage - and promptly produced another pile of garbage. After the third failed attempt I had to call it a day.

Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: