Please find my publications by type below. You can also visit my ORCID or dblp profile for a year-by-year view. Authors marked with * are equally contributing authors.
Preprints currently under review or revision
- Differential testing for machine learning: an analysis for classification algorithms beyond deep learning, Steffen Herbold, Steffen Tunkel
- What really changes when developers intend to improve their source code: a commit-level study of static metric value and static analysis warning changes, Alexander Trautsch, Johannes Erbel, Steffen Herbold, Jens Grabowski
- Are automated static analysis tools worth it? An investigation into relative warning density and external software quality, Alexander Trautsch, Steffen Herbold, Jens Grabowski
- A new perspective on the competent programmer hypothesis through the reproduction of bugs with repeated mutations, Eike Stein, Steffen Herbold, Fabian Trautsch, Jens Grabowski
- Broccoli: Bug localization with the help of text search engines, Benjamin Ledel, Steffen Herbold
- On the differences between quality increasing and other changes in open source Java projects, Alexander Trautsch, Johannes Erbel, Steffen Herbold, Jens Grabowski
- Repayment Under Flexible Loan Contracts: Evidence from Tanzania, Antonia Grohmann, Steffen Herbold, Friederike Lenel, https://dx.doi.org/10.2139/ssrn.3671344
Peer-Reviewed Journal Articles
- Exploring the relationship between performance metrics and cost saving potential of defect prediction models, Steffen Tunkel, Steffen Herbold, Empirical Software Engineering, Vol 27:145, Springer, 2022
- On the validity of pre-trained transformers for natural language processing in the software engineering domain, Julian von der Mosel, Alexander Trautsch, Steffen Herbold, IEEE Transactions on Software Engineering, 2022
- Expert Decision Support System for aeroacoustic source type identification using clustering, Armin Goudarzi, Carsten Spehr, Steffen Herbold, The Journal of the Acoustical Society of America, Vol 151:1259-1276, 2022
- Spatio-temporal mapping of soil water storage in a semi-arid landscape of Northern Ghana – A multi-tasked ensemble machine-learning approach, Kwabena A. Nketia, Amanda Ramcharan, Stephen B. Asabere, Steffen Herbold, Stefan Erasmi, Daniela Sauer, Geoderma, Vol 410, Elsevier, 2022
- Problems with with SZZ and Features: An empirical assessment of the state of practice of defect prediction data collection, Steffen Herbold*, Alexander Trautsch*, Fabian Trautsch*, Benjamin Ledel, Empirical Software Engineering,Vol 27:45, Springer, 2022
- Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects, Steffen Herbold, Tobias Haar, Empirical Software Engineering, Vol 27:45, Springer, 2022
- A Fine-grained Data Set and Analysis of Tangling in Bug Fixing Commits, Steffen Herbold, Alexander Trautsch, Benjamin Ledel, Alireza Aghamohammadi, Taher Ahmed Ghaleb, Kuljit Kaur Chahal, Tim Bossenmaier, Bhaveet Nagaria, Philip Makedonski, Matin Nili Ahmadabadi, Kristof Szabados, Helge Spieker, Matej Madeja, Nathaniel Hoy, Valentina Lenarduzzi, Shangwen Wang, Gema Rodríguez-Pérez, Ricardo Colomo-Palacios, Roberto Verdecchia, Paramvir Singh, Yihao Qin, Debasish Chakroborti, Willard Davis, Vijay Walunj, Hongjun Wu, Diego Marcilio, Omar Alam, Abdullah Aldaeej, Idan Amit, Burak Turhan, Simon Eismann, Anna-Katharina Wickert, Ivano Malavolta, Matus Sulir, Fatemeh Fard, Austin Z. Henley, Stratos Kourtzanidis, Eray Tuzun, Christoph Treude, Simin Maleki Shamasbi, Ivan Pashchenko, Marvin Wyrich, James Davis, Alexander Serebrenik, Ella Albrecht, Ethem Utku Aktas, Daniel Strüber, Johannes Erbel, Empirical Software Engineering, Springer (Accepted on 17th Oct 2021)
- Automatic source localization and spectra generation from deconvolved beamforming maps, Armin Goudarzi, Carsten Spehr, Steffen Herbold, The Journal of the Accoustical Society of America, Vol. 150(3): 1866:1882, Accoustical Society of America, 2021
- A systematic mapping study of developer social network research, Steffen Herbold, Aynur Amirfallah, Fabian Trautsch, Jens Grabowski, Journal of Systems and Software, Vol. 171, Elsevier, 2021
- On the cost and profit of software defect prediction, Steffen Herbold, IEEE Transactions on Software Engineering, Vol. 47(11):2617-2631, IEEE, 2021
- A Longitudinal Study of Static Analysis Warning Evolution and the Effects of PMD on Software Quality in Apache Open Source Projects, Alexander Trautsch, Steffen Herbold, Jens Grabowski, Empirical Software Engineering, Vol. 25: 5137-5192, Springer, 2020
- On the feasibility of automated prediction of bug and non-bug issues, Steffen Herbold, Alexander Trautsch, Fabian Trautsch, Empirical Software Engineering, Vol. 25: 5333–5369, Springer, 2020
- A Multi-Objective Anytime Rule Mining System to Ease Iterative Feedback from Domain Experts, Tobias Baum, Steffen Herbold, Kurt Schneider, Expert Systems with Applications X, Vol. 8, Elsevier, 2020
- Are Unit and Integration Test Definitions Still Valid for Modern Java Projects? An Empirical Study on Open-Source Projects, Fabian Trautsch, Steffen Herbold, Jens Grabowski, Journal of Systems and Software, Vol. 159, Elsevier, 2020
- Correction of “A Comparative Study to Benchmark Cross-project Defect Prediction Approaches”, Steffen Herbold, Alexander Trautsch, Jens Grabowski, IEEE Transactions on Software Engineering, Vol. 45(6):632-636, IEEE, 2019
- A Comparative Study to Benchmark Cross-project Defect Prediction Approaches, Steffen Herbold, Alexander Trautsch, Jens Grabowski, IEEE Transactions on Software Engineering, Vol. 44(9):811-833, IEEE, 2018
- Addressing problems with replicability and validity of repository mining studies through a smart data platform, Fabian Trautsch, Steffen Herbold, Philip Makedonski, Jens Grabowski, Empirical Software Engineering, Vol. 23(2):1036-1083, Springer, 2018
- Comments on ScottKnottESD in response to “An Empirical Comparison of Model Validation Techniques for Defect Prediction Models”, Steffen Herbold, IEEE Transactions on Software Engineering, Vol. 43(11):1091-1094, IEEE, 2017
- Global vs. Local Models for Cross-project Defect Prediction: A Replication Study, Steffen Herbold, Alexander Trautsch, Jens Grabowski, Empirical Software Engineering, Vol. 22(4):1866-1902, Springer, 2017
- Combining usage-based and model-based testing for service-oriented architectures in the industrial practice, Steffen Herbold, Patrick Harms, Jens Grabowski, International Journal on Software Tools for Technology Transfer, Vol. 19(3):309-324, Springer, 2017
- A Generalized Model of PAC Learning and its Applicability, Thomas Brodag, Steffen Herbold, Stephan Waack, RAIRO – Theoretical Informatics and Applications, Vol. 48(2):209-245, 2014
- Calculation and Optimization of Thresholds for Sets of Software Metrics, Steffen Herbold, Jens Grabowski, Stephan Waack, Empirical Software Engineering, Vol. 16(6):812-841, Springer, 2011
Pre-registered Study Protocols
- Exploring the relationship between performance metrics and cost saving potential of defect prediction models, Steffen Herbold, International Conference on Mining Software Repositories – Registered Reports (Continuity Acceptance), 2021
- Large-Scale Manual Validation of Bugfixing Changes, Steffen Herbold, Alexander Trautsch, Benjamin Ledel, International Conference on Mining Software Repositories – Registered Reports (In Principle Acceptance), 2020, https://osf.io/acnwk
Editorials
- Model-based testing as a service, Steffen Herbold, Andreas Hoffmann, International Journal on Software Tools for Technology Transfer, 19(3):271-279, Springer, 2017
- System Analysis and Modeling. Technology-Specific Aspects of Models, Jens Grabowski, Steffen Herbold, Lecture Notes in Computer Science (LNCS), Vol. 9959, Springer, 2016
Invited Articles
- Software-Fehlervorhersage: Intelligente Qualitätssicherung durch statistische Methoden, Steffen Herbold, Jens Grabowski, OBJEKTSpektrum Ausgabe Testing/2015, SIGS DATACOM, 2015
Book Chapters
- Mining Big Data for Analyzing and Simulating Collaboration Factors Influencing Software Development Decisions, Philip Makedonski, Verena Herbold, Steffen Herbold, Daniel Honsel, Jens Grabowski, Stephan Waack, Social Network Analysis: Interdisciplinary Approaches and Case Studies, CRC Press, 2017
- Deployable Capture/Replay Supported by Internal Messages, Steffen Herbold, Uwe Bünting, Jens Grabowski, Stephan Waack, Advances in Computers, Vol. 85:327-367, Elsevier, 2012
Peer Reviewed Conference and Workshop Articles
- Predicting Issue Types with seBERT, Alexander Trautsch, Steffen Herbold, 1st International Workshop on Natural Language-based Software Engineering (NLBSE) – Tool Competition, 2022
- Static source code metrics and static analysis warnings for fine-grained just-in-time defect prediction, Alexander Trautsch, Steffen Herbold, Jens Grabowski, 36th International Conference on Software Maintenance and Evolution (ICSME), 2020
- Expert Decision Support System for Aeroacoustic Classification from Deconvolved Beamforming Maps, Armin Goudarzi, Carsten Spehr, Steffen Herbold, AIAA AVIATION 2020 FORUM, 2020
- With Registered Reports Towards Large Scale Data Curation, Steffen Herbold, 42nd International Conference on Software Engineering (ICSE) – NIER Track, 2020
- The SmartSHARK Ecosystem for Software Repository Mining, Alexander Trautsch, Fabian Trautsch, Steffen Herbold, Benjamin Ledel, Jens Grabowski, 42nd International Conference on Software Engineering (ICSE) – Demonstrations Track, 2020
- Performance Tuning for Automotive Software Fault Prediction, Harald Altinger, Steffen Herbold, Friederike Schneemann, Jens Grabowski, Franz Wotawa, IEEE 24th International Conference on Software Analysis, Evolution, and Reengineering (SANER), 2017
- On the Relatively Small Impact of Deep Dependencies on Cloud Application Reliability, Xiaowei Wang, Fabian Glaser, Steffen Herbold, Jens Grabowski, 10th IEEE International Conference on Cloud Computing (CLOUD), 2017
- Hidden Markov Models for the Prediction of Developer Involvement Dynamics and Workload, Verena Honsel, Steffen Herbold, Jens Grabowski, 12th International Conference on Predictive Models and Data Analytics in Software Engineering (PROMISE), 2016
- Learning from Software Project Histories: Predictive Studies Based on Mining Software Repositories, Verena Honsel, Steffen Herbold, Jens Grabowski, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECML-PKDD) – NEKTAR Track, 2016
- Addressing Problems with External Validity of Repository Mining Studies Through a Smart Data Platform, Fabian Trautsch, Steffen Herbold, Philip Makedonski, Jens Grabowski, 13th International Conference on Mining Software Repositories (MSR), 2016
- Novel Insights on Cross Project Fault Prediction applied to Automotive Software, Harald Altinger, Steffen Herbold, Jens Grabowski, Franz Wotawa, 27th International Conference on Testing Software and Systems (ICTSS), 2015
- The MIDAS Cloud Platform for Testing SOA Applications, Steffen Herbold, Alberto De Francesco, Jens Grabowski, Patrick Harms, Lom Messan Hillah, Fabrice Kordon, Ariele-Paolo Maesano, Libero Maesano, Claudia Di Napoli, Fabio de Rosa, Martin Schneider, Nicola Tonellotto, Marc-Florian Wendland, Pierre-Henri Wuillemin, 8th IEEE International Conference on Software Testing, Verification and Validation (ICST) – Testing Tools Track, 2015
- Automated Deployment and Parallel Execution of Legacy Applications in Cloud Environments, Michael Göttsche, Fabian Glaser, Steffen Herbold, Jens Grabowski, 8th IEEE International Conference on Service Oriented Computing & Applications (SOCA), 2015
- CrossPare: A Tool for Benchmarking Cross-Project Defect Predictions, Steffen Herbold, 4th International Workshop on Software Mining (SoftMine), 2015
- Mining Software Dependency Networks for Agent-Based Simulation of Software Evolution, Verena Honsel, Daniel Honsel, Steffen Herbold, Jens Grabowski, Stephan Waack, 4th International Workshop on Software Mining (SoftMine), 2015
- Improving Security Testing With Usage-Based Fuzz Testing, Martin Schneider, Steffen Herbold, Marc-Florian Wendland, Jens Grabowski, 3rd International Workshop on Risk Assessment and Risk-driven Testing (RISK), 2015
- Intuition vs. Truth: Evaluation of Common Myths about StackOverflow Posts, Verena Honsel, Steffen Herbold, Jens Grabowski, 12th Working Conference on Mining Software Repositories (MSR) – Challenge Track, 2015
- Training data selection for cross-project defect prediction, Steffen Herbold, 9th International Conference on Predictive Models in Software Engineering (PROMISE), ACM, 2013
- AutoQUEST – Automated Quality Engineering of Event-driven Software, Steffen Herbold, Patrick Harms, 4th International Workshop on Testing Techniques & Experimentation Benchmarks for Event-driven Software (TESTBEDS), IEEE Computer Society, 2013
- A Model for Usage-based testing of Event-driven Software, Steffen Herbold, Jens Grabowski, Stephan Waack, 3rd International Workshop on Model-based Verification & Validation: From Research to Practice (MVV), IEEE Computer Society, 2011
- Improved Bug Reporting and Reproduction through Non-intrusive GUI Usage Monitoring and Automated Replaying, Steffen Herbold, Uwe Bünting, Jens Grabowski, Stephan Waack, 3rd International Workshop on Testing Techniques & Experimentation Benchmarks for Event-Driven Software (TESTBEDS), IEEE Computer Society, 2011
- Retrospective Analysis of Software Projects using k-Means Clustering, Steffen Herbold, Jens Grabowski, Helmut Neukirchen, Stephan Waack, 2nd Design for Future 2010 Workshop (DFF), 2010
- Machine Learning for Software Process Analysis, Steffen Herbold, Ph.D. Symposium at the 2nd International Conference on Software Testing, Verification, and Validation (ICST), 2009
Additional Talks at Conferences and Workshops
- Repayment Behavior under Flexible Loan Contracts, 2nd International Conference on Globalization and Development (GlaD), Göttingen, 2018 (with Frederike Lenel and Antonia Grohmann)
- Enhancing Test Models by Incorporating Monitored Usage Information, 3rd ETSI User Conferences on Advances in Automated Testing (UCAAT), Sophia-Antipolis, 2015
- Model and Inference Driven Testing of Services Architectures, 2nd ETSI User Conference on Advances in Automated Testing (UCAAT), München, 2014
- Model and Inference Driven Testing of Services Architectures, 1st ETSI User Conference on Advances in Automated Testing (UCAAT), Paris, 2013
- Berechnung und Optimierung von Grenzwerten für Mengen von Softwaremetriken, Softwareforen Leipzig – User Group Softwaretesten und Qualitätssicherung, Leipzig, 2011
- Nachweis von Feature Freezes durch Clustering, Metrikon, München, 2008
Preprint Graveyard
This section lists preprints for which we are not aware of any major flaws (e.g., wrong data, wrong conclusions, etc) but which are unpublished nevertheless (motivation, scope, or other issues) and which we do not submit anymore. Read and cite at your own peril.
- The SmartSHARK Repository Mining Data, Alexander Trautsch, Steffen Herbold, https://arxiv.org/abs/2102.11540 (This one will actually leave the graveyard soon, because we use this data for the MSR mining challenge, where it will be included in the proceedings. )
- A systematic mapping study on cross-project defect prediction, Steffen Herbold, https://arxiv.org/abs/1705.06429 (The paper is basically too long to be published. The collected literature may also be incomplete, due to the use of GoogleScholar.)