GRANDE: Gradient-Based Decision Tree Ensembles for tabular data
Marton, Sascha
;
Lüdtke, Stefan
;
Bartelt, Christian
;
Stuckenschmidt, Heiner
URL:
|
https://iclr.cc/virtual/2024/poster/18457
|
Weitere URL:
|
https://openreview.net/forum?id=XEFWBxi075
|
Dokumenttyp:
|
Konferenzveröffentlichung
|
Erscheinungsjahr:
|
2024
|
Buchtitel:
|
International Conference on Learning Representations
|
Seitenbereich:
|
1-27
|
Veranstaltungstitel:
|
TheTwelfth International Conference on Learning Representations, ICLR 2024
|
Veranstaltungsort:
|
Wien, Austria
|
Veranstaltungsdatum:
|
07.-11.05.2024
|
Verlag:
|
OpenReview.net
|
Verwandte URLs:
|
|
Sprache der Veröffentlichung:
|
Englisch
|
Einrichtung:
|
Fakultät für Wirtschaftsinformatik und Wirtschaftsmathematik > Practical Computer Science II: Artificial Intelligence (Stuckenschmidt 2009-) Außerfakultäre Einrichtungen > Institut für Enterprise Systems (InES)
|
Fachgebiet:
|
004 Informatik
|
Freie Schlagwörter (Englisch):
|
tabular data , architectures . ensembles , decision trees , gradient descent
|
Abstract:
|
Despite the success of deep learning for text and image data, tree-based ensemble models are still state-of-the-art for machine learning with heterogeneous tabular data. However, there is a significant need for tabular-specific gradient-based methods due to their high flexibility. In this paper, we propose GRANDE, GRAdieNt-Based Decision Tree Ensembles, a novel approach for learning hard, axis-aligned decision tree ensembles using end-to-end gradient descent. GRANDE is based on a dense representation of tree ensembles, which affords to use backpropagation with a straight-through operator to jointly optimize all model parameters. Our method combines axis-aligned splits, which is a useful inductive bias for tabular data, with the flexibility of gradient-based optimization. Furthermore, we introduce an advanced instance-wise weighting that facilitates learning representations for both, simple and complex relations, within a single model. We conducted an extensive evaluation on a predefined benchmark with 19 classification datasets and demonstrate that our method outperforms existing gradient-boosting and deep learning frameworks on most datasets. The method is available under: https://github.com/s-marton/GRANDE
|
| Dieser Eintrag ist Teil der Universitätsbibliographie. |
Suche Autoren in
BASE:
Marton, Sascha
;
Lüdtke, Stefan
;
Bartelt, Christian
;
Stuckenschmidt, Heiner
Google Scholar:
Marton, Sascha
;
Lüdtke, Stefan
;
Bartelt, Christian
;
Stuckenschmidt, Heiner
ORCID:
Marton, Sascha ORCID: 0000-0001-8151-9223 ; Lüdtke, Stefan ; Bartelt, Christian ; Stuckenschmidt, Heiner ORCID: 0000-0002-0209-3859
Sie haben einen Fehler gefunden? Teilen Sie uns Ihren Korrekturwunsch bitte hier mit: E-Mail
Actions (login required)
|
Eintrag anzeigen |
|
|