Showing 1–25 of 25 results for author: Stuckenschmidt, H

  1. arXiv:2407.02112

    cs.LG cs.AI

    A Data-Centric Perspective on Evaluating Machine Learning Models for Tabular Data

    Authors: Andrej Tschalzev, Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Tabular data is prevalent in real-world machine learning applications, and new models for supervised learning of tabular data are frequently proposed. Comparative studies assessing the performance of models typically consist of model-centric evaluation setups with overly standardized data preprocessing. This paper demonstrates that such model-centric evaluations are biased, as real-world modeling…

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.01115

    cs.LG stat.ML

    Enabling Mixed Effects Neural Networks for Diverse, Clustered Data Using Monte Carlo Methods

    Authors: Andrej Tschalzev, Paul Nitschke, Lukas Kirchdorfer, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Neural networks often assume independence among input data samples, disregarding correlations arising from inherent clustering patterns in real-world datasets (e.g., due to different sites or repeated measurements). Recently, mixed effects neural networks (MENNs) which separate cluster-specific 'random effects' from cluster-invariant 'fixed effects' have been proposed to improve generalization and…

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.09639

    cs.LG cs.SI

    TGB 2.0: A Benchmark for Learning on Temporal Knowledge Graphs and Heterogeneous Graphs

    Authors: Julia Gastinger, Shenyang Huang, Mikhail Galkin, Erfan Loghmani, Ali Parviz, Farimah Poursafaei, Jacob Danovitch, Emanuele Rossi, Ioannis Koutis, Heiner Stuckenschmidt, Reihaneh Rabbany, Guillaume Rabusseau

    Abstract: Multi-relational temporal graphs are powerful tools for modeling real-world data, capturing the evolving and interconnected nature of entities over time. Recently, many novel models have been proposed for ML on such graphs, intensifying the need for robust evaluation and standardized benchmark datasets. However, the availability of such resources remains scarce and evaluation faces added complexity due t…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 27 pages, 8 figures

  4. arXiv:2404.16726

    cs.LG

    History Repeats Itself: A Baseline for Temporal Knowledge Graph Forecasting

    Authors: Julia Gastinger, Christian Meilicke, Federico Errica, Timo Sztyler, Anett Schuelke, Heiner Stuckenschmidt

    Abstract: Temporal Knowledge Graph (TKG) Forecasting aims at predicting links in Knowledge Graphs for future timesteps based on a history of Knowledge Graphs. To this day, standardized evaluation protocols and rigorous comparison across TKG models are available, but the importance of simple baselines is often neglected in the evaluation, which prevents researchers from discerning actual and fictitious progr…

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted at IJCAI 2024

  5. arXiv:2404.06267

    cs.LG cs.AI

    PGTNet: A Process Graph Transformer Network for Remaining Time Prediction of Business Process Instances

    Authors: Keyvan Amiri Elyasi, Han van der Aa, Heiner Stuckenschmidt

    Abstract: We present PGTNet, an approach that transforms event logs into graph datasets and leverages graph-oriented data for training Process Graph Transformer Networks to predict the remaining time of business process instances. PGTNet consistently outperforms state-of-the-art deep learning approaches across a diverse range of 20 publicly available real-world event logs. Notably, our approach is most prom…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 16 pages, 4 figures, To be published in: Advanced Information Systems Engineering - 36th International Conference, CAiSE 2024, Limassol, Cyprus, June 03-07, 2024, Proceedings

  6. arXiv:2309.17130

    cs.LG

    GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data

    Authors: Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Despite the success of deep learning for text and image data, tree-based ensemble models are still state-of-the-art for machine learning with heterogeneous tabular data. However, there is a significant need for tabular-specific gradient-based methods due to their high flexibility. In this paper, we propose $\text{GRANDE}$, $\text{GRA}$die$\text{N}$t-Based $\text{D}$ecision Tree $\text{E}$nsembles,…

    Submitted 12 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  7. arXiv:2309.00306

    cs.AI

    On the Aggregation of Rules for Knowledge Graph Completion

    Authors: Patrick Betz, Stefan Lüdtke, Christian Meilicke, Heiner Stuckenschmidt

    Abstract: Rule learning approaches for knowledge graph completion are efficient, interpretable and competitive with purely neural models. The rule aggregation problem is concerned with finding one plausibility score for a candidate fact which was simultaneously predicted by multiple rules. Although the problem is ubiquitous, as data-driven rule learning can result in noisy and large rulesets, it is underrepre…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: KLR Workshop@ICML2023

  8. arXiv:2307.14151

    cs.LG stat.ML

    Learning Disentangled Discrete Representations

    Authors: David Friede, Christian Reimers, Heiner Stuckenschmidt, Mathias Niepert

    Abstract: Recent successes in image generation, model-based reinforcement learning, and text-to-image generation have demonstrated the empirical advantages of discrete latent representations, although the reasons behind their benefits remain unclear. We explore the relationship between discrete latent spaces and disentangled representations by replacing the standard Gaussian variational autoencoder (VAE) wi…

    Submitted 26 July, 2023; originally announced July 2023.

  9. Planning Landmark Based Goal Recognition Revisited: Does Using Initial State Landmarks Make Sense?

    Authors: Nils Wilken, Lea Cohausz, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Goal recognition is an important problem in many application domains (e.g., pervasive computing, intrusion detection, computer games, etc.). In many application scenarios, it is important that goal recognition algorithms can recognize goals of an observed agent as fast as possible. However, many early approaches in the area of Plan Recognition As Planning require quite large amounts of computatio…

    Submitted 10 November, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Full publication: Wilken, N., Cohausz, L., Bartelt, C., Stuckenschmidt, H. (2023). Planning Landmark Based Goal Recognition Revisited: Does Using Initial State Landmarks Make Sense? In: Seipel, D., Steen, A. (eds) KI 2023: Advances in Artificial Intelligence. KI 2023. Lecture Notes in Computer Science, vol 14236. Springer, Cham. arXiv admin note: text overlap with arXiv:2301.10571

  10. arXiv:2305.03515

    cs.LG cs.AI

    GradTree: Learning Axis-Aligned Decision Trees with Gradient Descent

    Authors: Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Decision Trees (DTs) are commonly used for many machine learning tasks due to their high degree of interpretability. However, learning a DT from data is a difficult optimization problem, as it is non-convex and non-differentiable. Therefore, common approaches learn DTs using a greedy growth algorithm that minimizes the impurity locally at each internal node. Unfortunately, this greedy procedure ca…

    Submitted 12 March, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

  11. arXiv:2301.10571

    cs.AI

    Leveraging Planning Landmarks for Hybrid Online Goal Recognition

    Authors: Nils Wilken, Lea Cohausz, Johannes Schaum, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Goal recognition is an important problem in many application domains (e.g., pervasive computing, intrusion detection, computer games, etc.). In many application scenarios it is important that goal recognition algorithms can recognize goals of an observed agent as fast as possible and with minimal domain knowledge. Hence, in this paper, we propose a hybrid method for online goal recognition that co…

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: 9 pages. Presented at SPARK 2022 (https://icaps22.icaps-conference.org/workshops/SPARK/)

  12. arXiv:2301.05608

    cs.AI

    Investigating the Combination of Planning-Based and Data-Driven Methods for Goal Recognition

    Authors: Nils Wilken, Lea Cohausz, Johannes Schaum, Stefan Lüdtke, Heiner Stuckenschmidt

    Abstract: An important feature of pervasive, intelligent assistance systems is the ability to dynamically adapt to the current needs of their users. Hence, it is critical for such systems to be able to recognize those goals and needs based on observations of the user's actions and state of the environment. In this work, we investigate the application of two state-of-the-art, planning-based plan recognition…

    Submitted 13 January, 2023; originally announced January 2023.

  13. arXiv:2207.08414

    cs.LG

    Outlier Explanation via Sum-Product Networks

    Authors: Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Outlier explanation is the task of identifying a set of features that distinguish a sample from normal data, which is important for downstream (human) decision-making. Existing methods are based on beam search in the space of feature subsets. They quickly become computationally expensive, as they require running an outlier detection algorithm from scratch for each feature subset. To alleviate this…

    Submitted 18 July, 2022; originally announced July 2022.

  14. Explaining Neural Networks without Access to Training Data

    Authors: Sascha Marton, Stefan Lüdtke, Christian Bartelt, Andrej Tschalzev, Heiner Stuckenschmidt

    Abstract: We consider generating explanations for neural networks in cases where the network's training data is not accessible, for instance due to privacy or safety issues. Recently, $\mathcal{I}$-Nets have been proposed as a sample-free approach to post-hoc, global model interpretability that does not require access to training data. They formulate interpretation as a machine learning task that maps netwo…

    Submitted 10 June, 2022; originally announced June 2022.

    Journal ref: Machine Learning (2024)

  15. arXiv:2201.07670

    cs.CL cs.LG

    Top-Down Influence? Predicting CEO Personality and Risk Impact from Speech Transcripts

    Authors: Kilian Theil, Dirk Hovy, Heiner Stuckenschmidt

    Abstract: How much does a CEO's personality impact the performance of their company? Management theory posits a great influence, but it is difficult to show empirically -- there is a lack of publicly available self-reported personality data of top managers. Instead, we propose a text-based personality regressor using crowd-sourced Myers--Briggs Type Indicator (MBTI) assessments. The ratings have a high inte…

    Submitted 19 January, 2022; originally announced January 2022.

  16. arXiv:2110.05165

    cs.LG cs.AI stat.ML

    Exchangeability-Aware Sum-Product Networks

    Authors: Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Sum-Product Networks (SPNs) are expressive probabilistic models that provide exact, tractable inference. They achieve this efficiency by making use of local independence. On the other hand, mixtures of exchangeable variable models (MEVMs) are a class of tractable probabilistic models that make use of exchangeability of discrete random variables to render inference tractable. Exchangeability, which…

    Submitted 28 April, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: accepted at IJCAI 2022

  17. arXiv:2012.06006

    cs.AI cs.LG

    xRAI: Explainable Representations through AI

    Authors: Christian Bartelt, Sascha Marton, Heiner Stuckenschmidt

    Abstract: We present xRAI, an approach for extracting symbolic representations of the mathematical function a neural network was supposed to learn from the trained network. The approach is based on the idea of training a so-called interpretation network that receives the weights and biases of the trained network as input and outputs the numerical representation of the function the network was supposed to lea…

    Submitted 10 December, 2020; originally announced December 2020.

    Comments: 8 pages, 6 figures, 3 tables

  18. arXiv:2010.10024

    cs.CV

    Neural Architecture Performance Prediction Using Graph Neural Networks

    Authors: Jovita Lukasik, David Friede, Heiner Stuckenschmidt, Margret Keuper

    Abstract: In computer vision research, the process of automating architecture engineering, Neural Architecture Search (NAS), has gained substantial interest. Due to the high computational costs, most recent approaches to NAS as well as the few available benchmarks only provide limited search spaces. In this paper we propose a surrogate model for neural architecture performance prediction built upon Graph Ne…

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: camera ready version for DAGM GCPR 2020. arXiv admin note: substantial text overlap with arXiv:1912.05317

  19. arXiv:2004.04412

    cs.AI cs.LG

    Reinforced Anytime Bottom Up Rule Learning for Knowledge Graph Completion

    Authors: Christian Meilicke, Melisachew Wudage Chekol, Manuel Fink, Heiner Stuckenschmidt

    Abstract: Most of today's work on knowledge graph completion is concerned with sub-symbolic approaches that focus on the concept of embedding a given graph in a low dimensional vector space. Against this trend, we propose an approach called AnyBURL that is rooted in the symbolic space. Its core algorithm is based on sampling paths, which are generalized into Horn rules. Previously published results show that…

    Submitted 9 April, 2020; originally announced April 2020.

  20. arXiv:1912.05317

    cs.CV cs.LG

    A Variational-Sequential Graph Autoencoder for Neural Architecture Performance Prediction

    Authors: David Friede, Jovita Lukasik, Heiner Stuckenschmidt, Margret Keuper

    Abstract: In computer vision research, the process of automating architecture engineering, Neural Architecture Search (NAS), has gained substantial interest. In the past, NAS was hardly accessible to researchers without access to large-scale compute systems, due to very long compute times for the recurrent search and evaluation of new candidate architectures. The NAS-Bench-101 dataset facilitates a paradigm…

    Submitted 26 August, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

  21. arXiv:1904.06217

    cs.CL

    Political Text Scaling Meets Computational Semantics

    Authors: Federico Nanni, Goran Glavas, Ines Rehbein, Simone Paolo Ponzetto, Heiner Stuckenschmidt

    Abstract: During the last fifteen years, automatic text scaling has become one of the key tools of the Text as Data community in political science. Prominent text scaling algorithms, however, rely on the assumption that latent positions can be captured just by leveraging the information about word frequencies in documents under study. We challenge this traditional view and present a new, semantically aware…

    Submitted 14 October, 2021; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: Updated version - accepted for Transactions on Data Science (TDS)

  22. arXiv:1511.05719

    cs.AI

    Using Abduction in Markov Logic Networks for Root Cause Analysis

    Authors: Joerg Schoenfisch, Janno von Stulpnagel, Jens Ortmann, Christian Meilicke, Heiner Stuckenschmidt

    Abstract: IT infrastructure is a crucial part in most of today's business operations. High availability and reliability, and short response times to outages are essential. Thus a high amount of tool support and automation in risk management is desirable to decrease outages. We propose a new approach for calculating the root cause for an observed failure in an IT infrastructure. Our approach is based on Abdu…

    Submitted 18 November, 2015; originally announced November 2015.

  23. arXiv:1507.02456

    cs.AI

    Towards Log-Linear Logics with Concrete Domains

    Authors: Melisachew Wudage Chekol, Jakob Huber, Heiner Stuckenschmidt

    Abstract: We present $\mathcal{MEL}^{++}$ (M denotes Markov logic networks), an extension of the log-linear description logics $\mathcal{EL}^{++}$-LL with concrete domains, nominals, and instances. We use Markov logic networks (MLNs) in order to find the most probable, classified and coherent $\mathcal{EL}^{++}$ ontology from an $\mathcal{MEL}^{++}$ knowledge base. In particular, we develop a novel way to de…

    Submitted 15 July, 2015; v1 submitted 9 July, 2015; originally announced July 2015.

    Comments: StarAI2015

  24. arXiv:1304.4379

    cs.AI

    RockIt: Exploiting Parallelism and Symmetry for MAP Inference in Statistical Relational Models

    Authors: Jan Noessner, Mathias Niepert, Heiner Stuckenschmidt

    Abstract: RockIt is a maximum a-posteriori (MAP) query engine for statistical relational models. MAP inference in graphical models is an optimization problem which can be compiled to integer linear programs (ILPs). We describe several advances in translating MAP queries to ILP instances and present the novel meta-algorithm cutting plane aggregation (CPA). CPA exploits local context-specific symmetries and b…

    Submitted 30 April, 2013; v1 submitted 16 April, 2013; originally announced April 2013.

    Comments: To appear in proceedings of AAAI 2013

  25. arXiv:1208.3148

    cs.AI

    Evaluating Ontology Matching Systems on Large, Multilingual and Real-world Test Cases

    Authors: Christian Meilicke, Ondrej Sváb-Zamazal, Cássia Trojahn, Ernesto Jiménez-Ruiz, José-Luis Aguirre, Heiner Stuckenschmidt, Bernardo Cuenca Grau

    Abstract: In the field of ontology matching, the most systematic evaluation of matching systems is established by the Ontology Alignment Evaluation Initiative (OAEI), which is an annual campaign for evaluating ontology matching systems organized by different groups of researchers. In this paper, we report on the results of an intermediary OAEI campaign called OAEI 2011.5. The evaluations of this campaign ar…

    Submitted 15 August, 2012; originally announced August 2012.

    Comments: Technical Report of the OAEI 2011.5 Evaluation Campaign