Curriculum Vitae
Senior Researcher and Lecturer
Distributed and Self-organizing Systems, TU Chemnitz, Germany
Education
2016 – 2019
Ph.D., Computer Science
Friedrich Schiller University Jena, Germany
Thesis: A Provenance-based Semantic Approach to Support Understandability, Reproducibility, and Reuse of Scientific Experiments
2011 – 2013
Master of Technology (M.Tech.), Information Technology
International Institute of Information Technology, Bangalore, India
2007 – 2011
Bachelor of Technology (B.Tech.), Computer Science and Engineering
Cochin University of Science and Technology (CUSAT), India
Research Areas
- Reproducible Research
- Data Provenance
- Scientific Data Management
- Knowledge Graphs
- Semantic Web
- Machine Learning
- Explainability (XAI)
- FAIR Data Principles
Academic Experience
2024 – present
Senior Researcher and Lecturer
Distributed and Self-organizing Systems, TU Chemnitz, Germany
2019 – 2024
Postdoctoral Researcher
Heinz Nixdorf Chair for Distributed Information Systems (FUSION), Friedrich Schiller University Jena, Germany
2016 – 2019
Doctoral Researcher
Heinz Nixdorf Chair for Distributed Information Systems (FUSION), Friedrich Schiller University Jena, Germany
Industrial Experience
Jul 2013 – Dec 2015
Member of Technical Staff II
Aruba, a Hewlett Packard Enterprise Company
Jan 2013 – Jun 2013
Graduate Technical Intern
Aruba, a Hewlett Packard Enterprise Company
Funding & Projects
2025 – 2027
Co-Principal Investigator — Jupyter4NFDI Integration Phase
2021 – 2024
Freistaat Thüringen Funding
Research project: Explainability and Reproducibility for AI
2020 – 2021
Start-up Funding — Michael Stifel Centre Jena (MSCJ)
Project: Integrating Knowledge Graphs for DL Interpretability
2020 – 2021
IMPULSE Project — Friedrich Schiller University Jena
Support programme for early and advanced postdocs. Funding code: IP 2020-10
2017
ProChance Grant — Friedrich Schiller University Jena
Promotion of scientific interaction of young female researchers
Teaching
| Semester | Course | Institution |
|---|---|---|
| SoSe 2026 | Current Trends in Web Engineering | TU Chemnitz |
| WiSe 2024/25 | XML | TU Chemnitz |
| WiSe 2024/25 | Planspiel Web Engineering | TU Chemnitz |
| SoSe 2024 | Web Engineering Seminar | TU Chemnitz |
| SoSe 2024 | Pro-/Haupt- und Forschungsseminar VSR | TU Chemnitz |
| WiSe 2023/24 | Semantic Technologies for Science | FSU Jena |
| SoSe 2021 | Management of Scientific Data | FSU Jena |
| WiSe 2020/21 | Semantic Technologies for Science | FSU Jena |
| SoSe 2020 | Management of Scientific Data | FSU Jena |
| WiSe 2019/20 | Semantic Web Technologies | FSU Jena |
| SoSe 2019 | Management of Scientific Data | FSU Jena |
| SoSe 2019 | Softwareentwicklungsprojekt (SWEP): Project Supervision | FSU Jena |
| WiSe 2018/19 | Semantic Web Technologies | FSU Jena |
| SoSe 2018 | Management of Scientific Data | FSU Jena |
| WiSe 2017/18 | Semantic Technologies for Science | FSU Jena |
Thesis & Project Supervision
PhD Theses
Waqas Ahmed
Reproducibility for AI
2021–
Jihen Amara
Integrating Knowledge Graphs for DL Interpretability
2021–
Master Theses
Hemanta Lo
The Role of Docker in Computational Reproducibility of Jupyter Notebooks from Scholarly Publications PubMed Central
2025
Jungsan Kim
Developing a Tool for Automating Reproducibility Assessments for Repositories
2025
Murad Ali
Document Question Answering using Large Language Models
2024
Badr El Haouni
Interactive web application to explain machine learning results
2024
Ashok Tanubuddi
A recommendation tool to implement and validate the reproducibility of studies
2022
Balaramakrishna Paritala
Provenance Tracking in Machine Learning Python Jupyter Notebooks
2022
Sravan Kumar Devireddy
Reproducibility of Jupyter Notebooks from publications
2022
Bachelor Theses
Dominik Kerzel
Provenance-Tracking und -Visualisierung von Maschinellen-Lern-Skripten in Jupyter Notebooks
2021
Tarek Al Mustafa
Reproducibility of Machine Learning Experiments given the provenance data
2021
Conference Organization
Proceedings Co-Chair — Mensch und Computer 2025
Co-Organizer — 2nd Workshop on Data Engineering for Data Science (DE4DS) at BTW 2025
Reproducibility Co-Chair — BTW 2023
PC Member — EKAW 2024, 2026
PC Member — Sustainable Data Analytics Workshop associated with INFORMATIK (cancelled), 2021
Organizing Committee — Werkstatt Machine Learning Summer School, 2020
Local Organizing Committee — 10th International Conference on Ecological Informatics (ICEI), 2018
Co-Organizer — Workshop "Fostering reproducible science – What data management tools can do and should do for you", 2017
Reviewing
PeerJ Computer Science, 2026
EKAW 2026, 2024
PLOS ONE, 2026, 2024
SoftwareX, 2026
F1000Research, 2026, 2024
IEEE Transactions on Cognitive Communications and Networking, 2026
Nature Communications, 2025
Journal of Biomedical Semantics, 2026, 2025
Semantic Web Journal (Guest Editorial Board), 2025
Energy and AI, 2025
IEEE Transactions on Knowledge and Data Engineering, 2025
Research Ideas and Outcomes, 2025
Engineering Applications of Artificial Intelligence, 2024
IEEE Trans. on Pattern Analysis and Machine Intelligence, 2024
ESWC (co-reviewer), 2024
Open Research Europe, 2024
Expert Systems with Applications, 2023
DE4DS Workshop (BTW), 2023
Earth Science Informatics, 2022
GigaScience Journal, 2021
Frontiers Journal, 2021
JupyterCon, 2020
Memberships
Gesellschaft für Informatik (GI), 2026–present
Arbeitskreis Data Engineering for Data Science, 2020–present
Michael Stifel Centre Jena (MSCJ), 2021–present
Featured Software, Datasets & Ontologies
LLM-assisted KG Construction Pipeline
2024
A (semi-)automatic pipeline for ontology and knowledge graph construction.
Computational Reproducibility Dataset
2024
Dataset of 27,000+ notebooks from 2,660 repositories from biomedical publications with reproducibility metrics.
FAIR Jupyter Knowledge Graph
2025
A knowledge graph encoding metadata about Jupyter notebook reproducibility at a granular level.
REPRODUCE-ME Ontology
2019
OWL ontology for end-to-end provenance representation of scientific experiments.
Reproducibility Survey
2020
Survey data on researcher practices and understanding of reproducibility across disciplines.
Selected Publications
Containing the Reproducibility Gap: Automated Repository-Level Containerization for Scholarly Jupyter Notebooks
S Samuel, D Mietchen, H Lo, M Gaedke — arXiv, 2026 · Preprint
From human experts to machines: An LLM supported approach to ontology and knowledge graph construction
VK Kommineni, B König-Ries, S Samuel — arXiv, 2024 · Preprint
Computational reproducibility of Jupyter notebooks from biomedical publications
S Samuel, D Mietchen — GigaScience, 2024 · Paper
FAIR Jupyter: a knowledge graph approach to semantic sharing and granular exploration of a computational notebook reproducibility dataset
S Samuel, D Mietchen — Transactions on Graph Data and Knowledge, 2024 · Paper
Machine Learning Pipelines: Provenance, Reproducibility and FAIR Data Principles
S Samuel, F Löffler, B König-Ries — IPAW, 2021 · Paper
Understanding experiments and research practices for reproducibility: an exploratory study
S Samuel, B König-Ries — PeerJ, 2021 · Paper
End-to-End Provenance Representation for the Understandability and Reproducibility of Scientific Experiments using a Semantic Approach
S Samuel, B König-Ries — Journal of Biomedical Semantics, 2022 · Paper
ProvBook: Provenance-based Semantic Enrichment of Interactive Notebooks for Reproducibility
S Samuel, B König-Ries — ISWC 2018 Demo Track · Paper





