Research Paper
Open Source License Inconsistencies on GitHub
Author(s)
T. Wolter, A. Barcomb, D. Riehle and N. Harutyunyan
Download
PDF
Year
2022
Keywords
Abstract
Almost all software, open or closed, builds on open source software and therefore needs to comply with the license obligations of the open source code. Not knowing which licenses to comply with poses a legal danger to anyone using open source software. This article investigates the extent of inconsistencies between licenses declared by an open source project at the top level of the repository, and the licenses found in the code. We analysed a sample of 1,000 open source GitHub repositories. We find that about half of the repositories did not fully declare all licenses found in the code. Of these, approximately ten percent represented a permissive vs. copyleft license mismatch. Furthermore, existing tools cannot fully identify licences. We conclude that users of open source code should not only look at the declared licenses of the open source code they intend to use, but rather examine the software to understand its actual licenses.
BibTeX
@article{wolter:2022:open, title = {Open Source License Inconsistencies on GitHub}, author = {Wolter, Thomas and Barcomb, Ann and Riehle, Dirk and Harutyunyan, Nikolay}, year = {2022}, doi = {https://dl.acm.org/doi/10.1145/3571852}, journal = {Transactions on Software Engineering and Methodology}, }