These sites all suffer from the same defect: Amazon PA-API pricing is NOT consistent, in any region, with the carted values an end user will be shown. This is well known if you have worked with that API before, and you are essentially just dropping the author's 24-hour Amazon affiliate cookie so they earn off all your other purchases. Not to say that's bad, but the value add of a price comparison site like this is minimal to the end user, as you will very likely not get the shown price.
Unfortunately, this is what I sometimes experience, and I am not sure there's much that can be done. I am already trying to filter out outliers, but if a price looks "plausible", this filtering doesn't do much.
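To make the limitation concrete, here is a minimal sketch (not the site's actual code, which isn't shown) of a common outlier filter over recent price observations, using the median absolute deviation. It illustrates the point above: a wildly wrong price gets dropped, but a "plausible" wrong price close to the median sails straight through. All numbers are made up for illustration.

```python
def filter_outliers(prices, threshold=3.0):
    """Keep prices within `threshold` scaled MADs of the median.

    Illustrative only -- a stale-but-plausible price from, say, a
    third-party seller is indistinguishable from a correct one here.
    """
    s = sorted(prices)
    median = s[len(s) // 2]
    deviations = sorted(abs(p - median) for p in prices)
    mad = deviations[len(deviations) // 2] or 1e-9  # avoid division by zero
    return [p for p in prices if abs(p - median) / mad <= threshold]

prices = [19.99, 20.49, 19.79, 99.99, 21.25]
print(filter_outliers(prices))  # → [19.99, 20.49, 19.79, 21.25]
# The absurd 99.99 is dropped, but if 21.25 is a wrong (stale or
# third-party) price, no statistical filter can tell.
```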
Sometimes I get prices for items that are unavailable or completely off (perhaps from a 3rd-party seller?).
Is this really it? The prices just seem completely wrong from the links I clicked. I can’t imagine the PA-API is really that far off for every product, unless something has changed drastically from the last time I used the API.
If you are putting something out for free for anyone to see and link and copy, why is LLM training on it a problem? How’s that different from someone archiving it in their RSS reader or it being archived by any number of archive sites?
If you don’t want to give it away openly, publish it as a book or an essay in a paid publication.
The problem is that LLM “summaries” do not cite sources. Furthermore, they don’t distinguish between summarizing and quoting directly; that “summary” is often directly lifting text that someone wrote. LLMs don’t cite in either case. It’s a clear case of plagiarism, but tech companies are being allowed to get away with it.
Publishing in a paid publication is not a solution because tech companies are scraping those too. It’s absolutely criminal. As an individual, I would be in clear violation of the law if I took text someone else wrote (even if that text was in the public domain) and presented it as my own without attribution.
From an academic perspective, LLM summaries also undermine the purpose of having clear and direct attribution for ideas. Citing sources not only makes clear who said what; it also allows the reader to know who is responsible for faulty knowledge. I’ve already seen this in my line of work, where LLMs have significantly boosted incorrect data. The average reader doesn’t know this data is incorrect and in fact can’t verify any of the data because there is no attribution. This could have serious consequences in areas like medicine.
It's important to consider others' perspectives, even if inaccurate. When I suggested "why not write a blog" to a relative who is into niche bug photography and collecting, they said they didn't want to give away their writing, and especially their photos, to be trained on. They have valid points, honestly, and an accurate framing of what will happen: it will likely get ingested eventually. I think they overestimate their work's importance a tad, but they still seemed to have a pretty accurate gauge of the likely outcomes. Let me flip the question: why should they not be able to choose "not for training uses" even if they put it up publically?
> why should they not be able to choose "not for training uses" even if they put it up publically?
I'm having trouble even parsing that question; "publically" means that you put yourself out there, no? It sounds to me like the Barbra Streisand thing of building an ostentatious mansion and expecting no one to post photos of it.
I suppose you could try to publish things behind some sort of EULA, but that's expressly not public.
As I understand it, terms of use on a publicly accessible page aren't enforceable. That's why it's legal to e.g. scrape pages of news sites regardless of any terms of use. If it's curlable, it's fair game (but it's fair for the site to try to block my scraping).
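Enforceability aside, publishers do have one widely recognized way to signal "not for training": robots.txt rules targeting known AI crawler user-agents (OpenAI's documented crawler identifies itself as "GPTBot"). A small sketch of what honoring that signal looks like, using Python's standard `urllib.robotparser`; `example.com` and the rules are placeholders:

```python
from urllib.robotparser import RobotFileParser

rp = RobotFileParser()
# parse() takes the robots.txt file's lines directly, so this
# also works offline; a real crawler would use set_url()/read().
rp.parse([
    "User-agent: GPTBot",   # OpenAI's documented crawler user-agent
    "Disallow: /",          # ask the training crawler to stay away
    "",
    "User-agent: *",
    "Allow: /",             # everyone else (readers, search) welcome
])

print(rp.can_fetch("GPTBot", "https://example.com/post/1"))       # False
print(rp.can_fetch("SomeBrowser", "https://example.com/post/1"))  # True
```

Whether this is a legal restriction or merely a polite request is exactly the open question in this thread: compliance is voluntary on the crawler's side.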
This is not an answer to your question, but one issue is that if you write about some niche sort of thing (as you do, on a self-hosted blog) that no one else is really writing about, the LLM will take it as a sole source on the topic and serve up its take almost word for word.
That's clearly plagiarism, but it's also interesting to me because there's really no way for the user querying their favorite AI chatbot to tell whether the answer has any truth to it.
I don't see how this is different from the classic citogenesis process; no AI needed. If a novel claim is of sufficient interest, then someone will end up actually doing proper research and debunking of it, probably having fun and getting some internet fame.
Agreed, it's definitely a problem, but I'm just saying that it's the basic problem of "people sometimes say bullshit that other people take at face value". It's not a technical problem. The most relevant approach to analyze this is probably https://en.wikipedia.org/wiki/Truth-default_theory
Are you suggesting that the AI chatbot have this built in? Because the chance that I, an amateur writing about a subject out of passion, have gotten something wrong approaches 1 in most circumstances, and the chance that the person receiving the now-recycled information will perform these checks every time they query an AI chatbot is 0.
These scrapers can bring a small website to its knees. Also, my "contribution" will be drowned in the mass, making me undiscoverable. Further, I can't help fearing a nightmare where someday I'm accused of using AI when I'm only plagiarizing myself.
Fear of AI scraping? I'm just amused at the idea of my words ending up manipulating chatbots to rewrite stuff that I've written, force-feeding it in some distorted form to people silly enough to listen.
We are a major-sized user cohort, and using social platforms is just not worth the energy is my feeling also. Granted, it's not a family tradeoff in my case; I just don't have free time to waste.
It's sad that even hitting these metrics will result in little actual growth. Bluesky is devoid of shareable content. Threads is... just go to Threads and use it, and I bet you come away feeling like it's unusable, like I did. The Fediverse, when I browse it, is like venturing into a ghost town. Every time I see a blog with a linked account, I check it out; they are always devoid of interactions. WordPress blogs have real comments (sometimes) with real interactions happening at a decent clip. That's the real state of things. "Numbers go up" predictions like this make no sense to me for one big fat reason: where are the interactions? (I want it to work, just FTR.)
Hasn't been my experience with Lemmy and some closer knit communities on Mastodon. My interests are niche, though.
IMO if you were used to the smaller communities of the pre-social media internet, fediverse stuff feels familiar. You aren't going to get 256k upvotes like you will on Reddit, but you can have some interesting conversations.
Point me to publicly posted Bluesky links or embeds, then. Not F2F shares. That's what I am talking about. X and FB have those in droves, and they are a real growth driver.
Not embeds or links, but there are tons of skeet screenshots on Reddit, daily. Ubiquitous. Not remotely as common as xeets, but the platform is way less common, so that checks out.
And the overwhelming majority of Xitter content that breaks out of platform are just screenshots as well and not direct links/embeds.
This is not really a guide to local coding models, which is kinda disappointing. I would have been interested in a review of all the cutting-edge open-weight models in various applications.