from the we-violate-all-open-source-licenses-equally dept.
The Copilot tool has been trained on mountains of publicly available code
[...] When GitHub announced Copilot on June 29, the company said that the algorithm had been trained on publicly available code posted to GitHub. Nat Friedman, GitHub’s CEO, has written on forums like Hacker News and Twitter that the company is legally in the clear. “Training machine learning models on publicly available data is considered fair use across the machine learning community,” the Copilot page says.
But the legal question isn’t as settled as Friedman makes it sound — and the confusion reaches far beyond just GitHub. Artificial intelligence algorithms only function due to massive amounts of data they analyze, and much of that data comes from the open internet. An easy example would be ImageNet, perhaps the most influential AI training dataset, which is entirely made up of publicly available images that ImageNet creators do not own. If a court were to say that using this easily accessible data isn’t legal, it could make training AI systems vastly more expensive and less transparent.
Despite GitHub’s assertion, there is no direct legal precedent in the US that upholds publicly available training data as fair use, according to Mark Lemley and Bryan Casey of Stanford Law School, who published a paper last year about AI datasets and fair use in the Texas Law Review.
[...] And there are past cases to support that opinion, they say. They consider the Google Books case, in which Google downloaded and indexed more than 20 million books to create a literary search database, to be similar to training an algorithm. The Second Circuit upheld Google’s fair use claim (a ruling the Supreme Court declined to review), on the grounds that the new tool was transformative of the original work and broadly beneficial to readers and authors.
“GitHub Copilot understands significantly more context than most code assistants. So, whether it’s in a docstring, comment, function name, or the code itself, GitHub Copilot uses the context you’ve provided and synthesizes code to match. Together with OpenAI, we’re designing GitHub Copilot to get smarter at producing safe and effective code as developers use it.”
One of the main criticisms of Copilot is that, as a paid service, it goes against the ethos of open source. Microsoft would arguably justify this by pointing to the costly resources needed to train the AI. Still, the training is problematic for some people, who argue that Copilot trains on snippets of open source code and then charges users for the result.
Is it fair use to auto-suggest snippets of code that are under an open source copyright license? Does using Copilot potentially bring your own code under that license?
One glorious day, code will write itself without developers.
Copilot on GitHub
Twitter: GitHub Support just straight up confirmed in an email that yes, they used all public GitHub code, for Codex/Copilot regardless of license.
Hacker News: GitHub confirmed using all public code for training Copilot regardless of license
OpenAI warns AI behind GitHub’s Copilot may be susceptible to bias
The Free Software Foundation (FSF) has published five of the white papers it funded regarding questions about Microsoft's GitHub Copilot. After Microsoft acquired GitHub, it built Copilot, a machine learning system trained on GitHub's archive of software. Both the approach chosen and even the basic activity itself raise many questions, starting with those of licensing.
Microsoft GitHub's announcement of an AI-driven Service as a Software Substitute (SaaSS) program called Copilot -- which uses machine learning to autocomplete code for developers as they write software -- immediately raised serious questions for the free software movement and our ability to safeguard user and developer freedom. We felt these questions needed to be addressed, as a variety of serious implications were foreseen for the free software community and developers who use GitHub. These inquiries -- and others possibly yet to be discovered -- needed to be reviewed in depth.
In our call for papers, we set forth several areas of interest. Most of these areas centered around copyright law, questions of ownership for AI-generated code, and legal impacts for GitHub authors who use a GNU or other copyleft license(s) for their works. We are pleased to announce the community-provided research into these areas, and much more.
First, we want to thank everyone who participated by sending in their papers. We received a healthy response of twenty-two papers from members of the community. The papers weighed in on the multiple areas of interest we had indicated in our announcement. Using an anonymous review process, we concluded there were five papers that would be best suited to inform the community and foster critical conversations to help guide our actions in the search for solutions.
As projected here back in October, there is now a class action lawsuit, albeit in its earliest stages, against Microsoft over its blatant license violation through its use of the M$ GitHub Copilot tool. The software project, Copilot, strips copyright licensing and attribution from existing copyrighted code on an unprecedented scale. The class action lawsuit insists that machine learning algorithms, often marketed as "Artificial Intelligence", are not exempt from copyright law nor are the wielders of such tools.
The $9 billion in damages is arrived at through scale. When M$ Copilot rips code without attribution and strips the copyright license from it, it violates the DMCA three times. So if only 1% of its 1.2M users receive such output, the licenses were breached 12k times, which translates to 36k DMCA violations, at a very low-ball estimate.
"If each user receives just one Output that violates Section 1202 throughout their time using Copilot (up to fifteen months for the earliest adopters), then GitHub and OpenAI have violated the DMCA 3,600,000 times. At minimum statutory damages of $2500 per violation, that translates to $9,000,000,000," the litigants stated.
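The complaint's arithmetic can be checked with a quick back-of-the-envelope calculation. The figures below (1.2M users, three DMCA violations per unattributed output, $2,500 minimum statutory damages) are taken from the quotes above; the variable names are my own:

```python
# Back-of-the-envelope check of the damages figures quoted in the complaint.
USERS = 1_200_000              # Copilot users cited in the suit
VIOLATIONS_PER_OUTPUT = 3      # each license-stripped output allegedly violates the DMCA three times
MIN_STATUTORY_DAMAGES = 2_500  # minimum per-violation statutory damages quoted in the complaint

# Low-ball scenario from the article: only 1% of users ever receive such output.
lowball_breaches = int(USERS * 0.01)                           # 12,000 breached licenses
lowball_violations = lowball_breaches * VIOLATIONS_PER_OUTPUT  # 36,000 DMCA violations

# Complaint's scenario: every user receives at least one violating output.
total_violations = USERS * VIOLATIONS_PER_OUTPUT          # 3,600,000 violations
total_damages = total_violations * MIN_STATUTORY_DAMAGES  # $9,000,000,000

print(lowball_violations)  # 36000
print(total_damages)       # 9000000000
```

Both scenarios reproduce the article's numbers exactly, so the $9 billion headline figure is simply the minimum statutory award applied once per user, times three.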
Besides open-source licenses and DMCA (§ 1202, which forbids the removal of copyright-management information), the lawsuit alleges violation of GitHub's terms of service and privacy policies, the California Consumer Privacy Act (CCPA), and other laws.
The suit is on twelve (12) counts, including:
– Violation of the DMCA.
– Breach of contract (two counts).
– Tortious interference.
– False designation of origin.
– Unjust enrichment.
– Unfair competition.
– Violation of the California Consumer Privacy Act.
– Civil conspiracy.
– Declaratory relief.
Furthermore, these actions are contrary to what GitHub stood for prior to its sale to M$ and mark yet another step in M$'s ongoing attempts to undermine and sabotage Free and Open Source Software and the communities that support it.