Abstract
Cloud-native applications increasingly provision infrastructure resources programmatically through Infrastructure as Code (IaC) scripts. These scripts have in turn become the subject of empirical software engineering research. However, the software ecosystems that have grown around IaC languages are often overlooked. For example, Galaxy is an ecosystem for the popular Ansible IaC language. Galaxy features a large number of so-called “roles”, which are reusable collections of Ansible code akin to libraries for general-purpose languages. Despite their similarities to library ecosystems for general-purpose languages, such IaC ecosystems have received far less attention in the literature.
In this data showcase paper, we present Andromeda, the first dataset capturing the Ansible Galaxy ecosystem, its roles, and their evolution. Andromeda provides structural representations of more than 125000 role versions, and upwards of 800000 concrete changes between such versions extracted from the underlying git repositories. Andromeda aims to provide an extensive view of the contributor side of the Galaxy ecosystem, which we hope will stimulate additional research on IaC ecosystems.
Original language | English |
---|---|
Title of host publication | Proceedings of the 18th International Conference on Mining Software Repositories (MSR 2021), Data Showcase Track |
Publisher | IEEE |
Pages | 580-584 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-7281-8710-5 |
ISBN (Print) | 978-1-7281-8710-5 |
DOIs | |
Publication status | Published - May 2021 |
Event | 18th International Conference on Mining Software Repositories (MSR2021) - Virtual; Duration: 17 May 2021 → 19 May 2021; Conference number: 18; https://conf.researchr.org/home/msr-2021 |
Publication series
Name | Proceedings - 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR) |
---|
Conference
Conference | 18th International Conference on Mining Software Repositories (MSR2021) |
---|---|
Abbreviated title | MSR |
Period | 17/05/21 → 19/05/21 |
Internet address |
Keywords
- Infrastructure as Code
- Ansible
- Dataset
- IaC
- Meta-model
Projects
- 2 Finished
- FWOSB103: Pattern Mining and Static Analysis for Detecting Defects in Infrastructure as Code
1/11/20 → 31/10/24
Project: Fundamental
- FWOEOS10: Automated Assistance for Developing Software in Ecosystems of the Future
De Roover, C., Mens, T., Demeyer, S. & Cleve, A.
1/01/18 → 31/12/21
Project: Fundamental
Datasets
- Andromeda: A Dataset of Ansible Galaxy Roles and Their Evolution
Opdebeeck, R. (Creator), Zerouali, A. (Creator) & De Roover, C. (Creator), figshare, 2021
DOI: 10.6084/m9.figshare.13664519
Dataset
Activities
- 1 Talk or presentation at a conference
- Andromeda: A Dataset of Ansible Galaxy Roles and Their Evolution
Ruben Opdebeeck (Speaker)
19 May 2021
Activity: Talk or presentation at a conference