prof. Jan Vitek, MSc., Ph.D.

Theses

Master theses

Analysing JavaScript and NPM at scale

Author
Jakub Žitný
Year
2017
Type
Master thesis
Supervisor
prof. Jan Vítek, MSc., Ph.D.
Reviewers
Ing. Antonín Procházka
Summary
In this work, we looked at different sources of JavaScript code and metadata, collected rich datasets, and analysed them. We created a pipeline for detecting duplicates among millions of JavaScript files and found that more than 90% of the JavaScript files in GitHub projects are cloned from somewhere else. We discuss the reasons for the high percentage of clones and present a few ideas for analyses that could be done in the future with the collected data.