More

Lapel2742 · 2026-02-21T12:31:10 1771677070

> These models fail significantly in understanding real-world social norms (Rezaei et al., 2025), aligning with human moral judgments (Garcia et al., 2024; Takemoto, 2024), and adapting to cultural differences (Jiang et al., 2025b). Without consistent and reliable moral reasoning, LLMs are not fully ready for real-world decision-making involving ethical considerations.

LOL. Finally the Techbro-CEOs succeeded in creating an AI in their own image.

runlaszlorun · 2026-02-21T15:14:23 1771686863

I think this issue is way overlooked. Current LLMs embed a long list of values that are going to be incongruent with a large percentage of the population.

I don't see any solution longer term other than more personalized models.

throw310822 · 2026-02-21T14:06:56 1771682816

> These models

Which models? The last ones came out this week.

Lapel2742 · 2026-02-16T07:39:41 1771227581

GLM 5.0:

> You should *drive*.

> Here is why:

> If you walk to the car wash, you will arrive there empty-handed. Since your car is still at home, you won't have anything to wash.

> While driving 50 meters is a very short trip (and technically not great for a cold engine), it is the only way to get the car to the car wash to complete your goal.

Kimi K2.5:

> You should drive, but with an important caveat.

> Since your goal is to wash your car, you must bring the vehicle to the car wash. Walking there without the car does not advance your goal (unless you are simply checking availability or buying tokens first).

> However, driving only 50 meters is bad for your car:

> ...

> Better options:

> Wash at home: Since the car wash is only 50 meters away, you likely have access to water at home. Hand-washing in your driveway avoids the cold-start issue entirely.

> ...

Current models seem to be fine answering that question.

Retric · 2026-02-16T08:27:27 1771230447

> seem to be fine

Now repeat the question to the same model in different contexts several times and count what percentage of the time it’s correct.

Lapel2742 · 2026-02-08T15:02:39 1770562959

> Anyone found some good use cases beyond a better interface for AI code assistance?

Yesterday, I saw a demo of a product similar to OpenClaw. It can organize your files and directories and works really great (until it doesn't, of course). But don't worry, you surely have a backup and need to test the restore function anyway. /s

Edit:

So far, I haven’t found a practical use case for this. To become truly useful, it would need access to certain resources or data that I’m not comfortable sharing with it.

Lapel2742 · 2026-01-03T13:52:14 1767448334

> Did we just remove a terrible regime in a comparably bloodless way?

You captured Maduro in an blatantly illegal act of war and until now the Regime is still there.

I hope for the people in Venezuela that this will end without a bloodshed. AFAIK Maduro has still support, especially in the poorer part of the population.

Lapel2742 · 2026-01-03T13:33:50 1767447230

> How much oil did we get out of the deal?

Apparently a lot.

> The 2003 Iraq War, initiated as a U.S. unilateral action, has also been viewed through the lens of economic interests, particularly oil access. Following the conflict, significant American business opportunities arose, notably through contracts with oil companies to exploit Iraqi oil fields, marking the end of Iraq’s long-standing oil nationalization policy. Technological advancements were another key economic byproduct of these wars; innovations developed for military use often transitioned into civilian applications, influencing various sectors.

> Additionally, a trend towards privatization emerged, as private firms undertook roles traditionally held by the military, further intertwining the defense industry with the economy. This shift raised ethical concerns and sparked debate regarding the implications of privatizing military functions. Overall, the Iraq wars illustrate the complex intersection of military action, resource control, and economic interests within American foreign policy.

https://www.ebsco.com/research-starters/military-history-and...

Lapel2742 · 2026-01-03T13:17:39 1767446259

> And while Venezuela has oodles of oil, is this really the case of America wanting Venezuelan oil?

Yes it is.

> But Trump has also made his desire for Venezuelan oil clear. He said that the blockade of sanctioned oil tankers going to and from the country would remain “until such time as they return to the United States all of the oil, land, and other assets that they stole from us.” He did not clarify what land and “other assets” he was referring to.

> In a social media post, Miller also characterized the expropriations as an injustice against the US. “American sweat, ingenuity and toil created the oil industry in Venezuela,” wrote. “Its tyrannical expropriation was the largest recorded theft of American wealth and property.”

> And in a 2023 speech, Trump was even more pointed about his designs on the country’s oil. “When I left, Venezuela was ready to collapse,” he said, referring to the end of his first term in the White House. “We would have taken it over, we would have gotten all that oil, it would have been right next door.”

https://www.bloomberg.com/news/articles/2025-12-17/trump-s-v...

Lapel2742 · 2025-12-28T09:02:52 1766912572

> This is what works for me.

Than you are the lucky one.

E.g.: Just for entertainment I recently searched for information on a new LLM model. The result was filled with AI garbage. I stumbled upon videos with AI generated content, presented by an AI generated voice and an AI generated human. And if it is not something like that, YouTube at least lies to me about the language of the video and tries to make me listen to an AI voice which is equally vile.

There are times where I constantly have to close videos after a couple of seconds. It is unbearable. At least they should mark these videos clearly. At best they should allow me to filter them out but of course they won't do that or else they couldn't make the experience on their platform even worse than it already is, what seems to be their real goal.

koakuma-chan · 2025-12-28T09:05:28 1766912728

That's a skill issue on your part. Do not use YouTube to search for information. If you need to find information on a new LLM model, look for official documentation from the model's maker. This applies to everything.

Lapel2742 · 2025-12-28T09:22:52 1766913772

> That's a skill issue on your part.

No, it's not. I search for videos and i get AI garbage as a result. It shouldn't matter what i search for or if you think that it is dumb to search for it. It shouldn't return garbage in the first place unless I explicitly ask for it.

> Do not use YouTube to search for information. If you need to find information on a new LLM model, look for official documentation from the model's maker.

That is what I do but I like the entertainment factor of some YouTubers. I know that I do not get the best information from them but sometimes I'm entertained. That's why I'm using YouTube after all. Entertainment (and I'm not entertained by AI).

> This applies to everything.

So you are saying: In this glorious new AI world I need a search diploma just to evade the endless AI slop YouTube is filled with nowadays. And than there are topics that are taboo to search for at all. From what angle do I have to look at this to discover the good part?

koakuma-chan · 2025-12-29T02:38:53 1766975933

> In this glorious new AI world I need a search diploma just to evade the endless AI slop YouTube is filled with nowadays. And than there are topics that are taboo to search for at all. From what angle do I have to look at this to discover the good part?

It has always been this way, even before AI. When people need to find information on anything, they would rather trust some social media influencer, than look at official docs.

Lapel2742 · 2025-12-22T16:14:26 1766420066

>Only issue I have found with llama.cpp is trying to get it working with my amd GPU.

I had no problems with ROCm 6.x but couldn't get it to run with ROCm 7.x. I switched to Vulkan and the performance seems ok for my use cases

Lapel2742 · 2025-12-22T15:40:11 1766418011

> OK now slap the wrists of Alphabet and Meta.

Google is probably next (Antitrust case(s)). AFAIK the EU is currently probing a case.

And before the Nationalists get mad again: If I sell in the US I'm naturally obliged to follow US rules and regulations. I wouldn't even think twice about this. The same is true in other markets. So for the Single Market: If you play on European turf, you play by European rules.

mk89 · 2025-12-22T19:53:53 1766433233

The weirdest thing is that all fines from EU countries are about privacy and anti-competitive behavior, which literally every citizen can benefit from. The implementation might be something questionable sometimes, but this hatred is totally nonsense.

It strikes me as odd that the land of opportunity has become the land of bigX that must overtake everything and everyone just accepts it. This isn't the spirit of the Americans I know, who actually challenge and see opportunity everywhere. How can you just choose to bend to Apple/Google/Meta etc? I understand they are great companies, but they do really ugly things to push competitors out, to allow scam/phishing, etc.

Lapel2742 · 2025-12-22T09:58:12 1766397492

> I highly doubt a RTX 5090 can run anything that competes with Sonnet 3.5 which was released June, 2024.

I don't know about the capabilities of a 5090 but you probably can run a Devstral-2 [1] model locally on a Mac with good performance. Even the small Devstral-2 model (24b) seems to easily beat Sonnet 3.5 [2]. My impression is that local models have made huge progress.

Coding aside I'm also impressed by the Ministral models (3b, 8b and 14b) Mistral AI released a a couple of weeks ago. The Granite 4.0 models by IBM also seem capable in this context.

[1] https://mistral.ai/news/devstral-2-vibe-cli

[2] https://www.anthropic.com/news/swe-bench-sonnet

cmrdporcupine · 2025-12-22T16:10:19 1766419819

Thing is you can pay basically fractions of cents a query to e.g. DeepSeek Platform or DeepInfra or Z.Ai or whatever and have them run the same open models for far cheaper and faster than you could ever build out at home.

It's neat to play with, but not practical.

The only story that I can see that makes sense for running at home is if you're going to fine tune a model by taking an open weight model and <hand waving> doing things to it and running that. Even then I believe there's places (hugging face?) that will host and run your updated model for cheaper than you could run it yourself.

Aurornis · 2025-12-22T16:13:23 1766420003

> Even the small Devstral-2 model (24b) seems to easily beat Sonnet 3.5 [2].

I've played with Devstral 2 a lot since it came out. I've seen the benchmarks. I just don't believe it's actually better for coding.

It's amazing that it can do some light coding locally. I think it's great that we have that. But if I had to choose between a 2024-era model and Devstral 2 I'd pick the older Sonnet or GPTs any day.