You have a mechanism that can regurgitate (digest, remix, emit) without attribution all of the world's code and all of the world's art.
With these systems, you're giving everyone the ability to plagiarize everything, effortlessly and unknowingly. No skill, no effort, no time required. No awareness of the sources of the derivative work.
My work is now your work. Everyone can "write" my code, without ever knowing I wrote it, without ever knowing I existed. Everyone can use my hard work, regurgitated anonymously, stripped of all credit, stripped of all attribution, stripped of all identity and ancestry and citation.
It's a new kind of use not known (or imagined?) when the copyright laws were written.
Training must be opt in, not opt out.
Every artist, every creative individual, must EXPLICITLY OPT IN to having their hard work regurgitated anonymously by Copilot or Dall-E or whatever.
If you want to donate your code or your painting or your music so it can easily be "written" or "painted", in whole or in part, by everyone else, without attribution, then go ahead and opt in.
But if an author or artist does not EXPLICITLY OPT IN, you can't use their creative work to train these systems.
All these code/art washing systems, that absorb and mix and regurgitate the hard work of creative people must be strictly opt in.
What would you think if models were bundled with a second model, the "copyright filter". Just as humans know to keep their creations sufficiently far away from copyrighted material, you could distribute models which are trained on copyrighted materials but know well enough not to produce anything so close so something copyrighted that it infringes.
This would prevent anybody from accidentally infringing when using these tools. Does that seem like a reasonable solution, or is your concern greater than accidental infringement?
But why? It really seems to me that things like Copilot will save millions of man hours and make the world a better place. The only harms people have come up with are highly speculative and far smaller in magnitude.
That is the underlying theory of copyright law. We make the speculation that if we don't have copyright law then people won't have the incentive to create future works.
A world without copyright would however save more than just a few millions of man hours that copilot might do. Allowing people and companies to freely use the best software available, view the best art, enjoy the most relaxing music, have the best recreational time with the best films. The only harm is the highly speculative claim that people won't be creating the best software, the best art, the best music or the best films.
You have a mechanism that can regurgitate (digest, remix, emit) without attribution all of the world's code and all of the world's art.
With these systems, you're giving everyone the ability to plagiarize everything, effortlessly and unknowingly. No skill, no effort, no time required. No awareness of the sources of the derivative work.
My work is now your work. Everyone can "write" my code, without ever knowing I wrote it, without ever knowing I existed. Everyone can use my hard work, regurgitated anonymously, stripped of all credit, stripped of all attribution, stripped of all identity and ancestry and citation.
It's a new kind of use not known (or imagined?) when the copyright laws were written.
Training must be opt in, not opt out.
Every artist, every creative individual, must EXPLICITLY OPT IN to having their hard work regurgitated anonymously by Copilot or Dall-E or whatever.
If you want to donate your code or your painting or your music so it can easily be "written" or "painted", in whole or in part, by everyone else, without attribution, then go ahead and opt in.
But if an author or artist does not EXPLICITLY OPT IN, you can't use their creative work to train these systems.
All these code/art washing systems, that absorb and mix and regurgitate the hard work of creative people must be strictly opt in.
That's how the law needs to be.