On Code Reuse from StackOverflow: An Exploratory Study on Jupyter
Notebook
- Yutian Tang,
- Mingke Yang,
- Yuming Zhou,
- Bixin Li
Yuming Zhou
Nanjing University Department of Computer Science and Technology
Author ProfileBixin Li
Southeast University School of Computer Science and Engineering
Author ProfileAbstract
Jupyter Notebook is a popular tool among data analysts and scientists
for working with data. It provides a way to combine code, documentation,
and visualizations in a single, interactive environment, facilitating
code reuse. While code reuse can improve programming efficiency, it can
also decrease readability, security, and overall performance. We conduct
a large-scale exploratory study of code reuse practices in the Jupyter
Notebook development community on the Stack Overflow platform to
understand the potential negative impacts of code reuse. Our findings
identified 1,097,470 Jupyter Notebook clone pairs that reuse Stack
Overflow code snippets, and the average code snippet has 7.91 code
quality violations. Through our research, we gain insight into the
reasons behind Jupyter Notebook developers' decision to reuse code and
the potential drawbacks of this practice.23 Feb 2023Submitted to Software: Practice and Experience 27 Feb 2023Submission Checks Completed
27 Feb 2023Assigned to Editor
15 Mar 2023Review(s) Completed, Editorial Evaluation Pending
18 Mar 2023Reviewer(s) Assigned
24 Sep 2024Editorial Decision: Revise Major