Hacker News

I would rather not. It is already highly questionable to use it normally, because it steals open-source code, but let's give that a pass for this thought experiment. It probably scraped the multiple git repositories of leaked Windows source code, in which case it would ABSOLUTELY undermine the project's ability to say it's a clean-room implementation.


If they use Copilot it is probably fair game.


How do you steal open source code? It's open.


You violate the license (such as the GPL).


Copyright licenses are not one word. They are written with intent, and usually at minimum that intent is to credit the original author.


"it probably scraped the multiple git repositories of leaked Windows source code, in which case it would ABSOLUTELY undermine the project's ability to say it's a clean-room implementation"

If an LLM has been fed leaked code, then that is a general problem for that model and for its use for anything. Singling out its use for an open-source project and denouncing that as a potential problem, while otherwise keeping quiet about it, just makes no sense. Take legal action against the model if there's anything plausible to warrant that; don't weaponize it against open-source projects.


All LLMs probably have, as they scrape GitHub, and to this day there are still multiple copies of the Windows XP source code live on it (I won't give links, but they are pretty easy to find). And I'd bet there are way more than just Windows leaks on there...



