Abstract
In an ongoing industry-university collaboration we are developing a language-parametric framework for mining code idioms in legacy systems. This modular framework has a pipeline architecture and a language- parametric meta representation of the artefacts used by each of its 5 components: source code importer, mining preprocessor, pattern miner, pattern matcher, and modernisation assistant. The pipeline enables reuse of its components across systems and languages, as well as for project partners to work on each of these components separately. An example is the exploration of novel pattern mining techniques independently of the languages on which they will be applied and the modernisation assistant in which they will be used. Our first results on mining Java and COBOL code are promising, even though challenges still lie ahead to make the framework and its constituting components truly scalable, customisable, and language independent.
Original language | English |
---|---|
Title of host publication | Seminar on Advanced Techniques & Tools for Software Evolution |
Number of pages | 6 |
Volume | 1 |
Publication status | Published - 2019 |
Event | 12th Seminar on Advanced Techniques & Tools for Software Evolution - Free University of Bozen-Bolzano, Bolzano, Italy Duration: 8 Jul 2019 → … Conference number: 12 http://sattose.org/2019 |
Publication series
Name | CEUR Workshop Proceedings |
---|---|
ISSN (Print) | 1613-0073 |
Conference
Conference | 12th Seminar on Advanced Techniques & Tools for Software Evolution |
---|---|
Abbreviated title | SATToSE |
Country | Italy |
City | Bolzano |
Period | 8/07/19 → … |
Internet address |
Keywords
- Pattern Mining
- Frequent Tree Mining
- Source Code Regularities