The Linux Foundation has created one open-data licence framework to rule them all, allowing users to collaborate on data-driven projects.
Today at the Open Source Summit in Prague, executive director Jim Zemlin announced the Community Data License Agreement, which is designed for non-proprietary data.
The org says data producers can now share the goods "with greater clarity about what recipients may do with it".
One branch "puts terms in place to ensure that downstream recipients can use and modify that data, and are also required to share their changes", while the other does not oblige users to share those changes.
The idea is to accelerate machine learning in open source.
(Score: 4, Insightful) by shrewdsheep on Tuesday October 24 2017, @11:52AM
Just putting out licenses feels like a cheap and uniformed shot on behalf of the Linux Foundation. While there might be a subset of small datasets for which this might be of use there are other licenses out there already. For serious data such as scientific data (except for the astronomical/physical variety), web-traffic, social networking these data are governed by privacy laws which overrule some points in the licenses. Anyway, the difficult part about data is to host it, have meta-data available and have some quality assurances (or least proper descriptions on the measurement itself). In the biomedical sciences this has turned out to be an extremely hard problem, the solution to which so far are publicly funded data repositories. The licenses are non-standardized but licenses seem to be irrelevant compared to the other challenges anyway.