The tree alternating optimisation (TAO) algorithm

This page collects papers describing the "tree alternating optimisation" (TAO) algorithm, a new way to train many types of decision trees (axis-aligned, oblique, etc.) and tree-based models, so that they optimise a desired objective function and regularisation term or constraint. TAO can be used by itself to learn a single tree, or as the base learner of a forest (tree ensemble). It vastly outperforms the traditional, greedy recursive partitioning algorithms such as CART or C4.5 (in isolation or as part of a forest), and allows one to learn models that strike a good balance between accuracy and interpretability.

This work has been done in collaboration with my past and present students and collaborators Magzhan Gabidolla, Kuat Gazizov, Yerlan Idelbayev, Rasul Kairgeldin and Arman Zharmagambetov.

It has been funded in part by:

NSF award IIS #2007147 (2020-2023): "The TAO algorithm: principled, efficient optimization of decision trees, forests, tree-based neural nets, and beyond".
NSF award #2228243 (2022-2023): "I-Corps: Tree-based artificial intelligence (AI) models for financial fraud detection".
NSF award IIS #1423515 (2014-2017): "Algorithms for accelerating optimization in deep learning".

Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Drawings by Lucía Carreira Mateos.

Selected presentations

Software

Coming soon.

References

Carreira-Perpiñán, M. Á. and Tavallali, P. (2018): "Alternating optimization of decision trees, with application to learning sparse oblique trees". Advances in Neural Information Processing Systems 31 (NeurIPS 2018), pp. 1211-1221.
[external link] [paper preprint] [supplementary material] [poster]
Zharmagambetov, A. and Carreira-Perpiñán, M. Á. (2020): "Smaller, more accurate regression forests using tree alternating optimization". 37th Int. Conf. Machine Learning (ICML 2020), pp. 11398-11408.
[external link] [paper preprint] [slides] [supplementary material] [video]
Carreira-Perpiñán, M. Á. and Zharmagambetov, A. (2020): "Ensembles of bagged TAO trees consistently improve over random forests, AdaBoost and gradient boosting". ACM-IMS Foundations of Data Science Conf. (FODS 2020), pp. 35-46.
[external link] [paper preprint] [slides]
Gabidolla, M., Zharmagambetov, A. and Carreira-Perpiñán, M. Á. (2020): "Boosted sparse oblique decision trees". Bay Area Machine Learning Symposium (BayLearn 2020).
[external link] [paper preprint] [video]
Carreira-Perpiñán, M. Á. and Hada, S. S. (2021): "Counterfactual explanations for oblique decision trees: exact, efficient algorithms". 35th AAAI Conf. Artificial Intelligence (AAAI 2021), pp. 6903-6911.
[external link] [paper preprint] [slides] [poster] [Python implementation (coming soon)]
Longer version: Mar. 1, 2021, arXiv:2103.01096: [external link] [paper preprint]
Zharmagambetov, A. and Carreira-Perpiñán, M. Á. (2021): "Learning a tree of neural nets". IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2021), pp. 3140-3144.
[external link] [paper preprint] [slides] [© IEEE]
Zharmagambetov, A., Gabidolla, M. and Carreira-Perpiñán, M. Á. (2021): "Improved boosted regression forests through non-greedy tree optimization". Int. Joint Conf. on Neural Networks (IJCNN 2021).
[external link] [paper preprint] [slides] [© IEEE]
Zharmagambetov, A., Hada, S. S., Gabidolla, M. and Carreira-Perpiñán, M. Á. (2021): "Non-greedy algorithms for decision tree optimization: an experimental comparison". Int. Joint Conf. on Neural Networks (IJCNN 2021).
[external link] [paper preprint] [slides] [© IEEE]
Older version: "An experimental comparison of old and new decision tree algorithms", Mar. 20, 2020, arXiv:1911.03054: [external link] [paper preprint]
Hada, S. S., Carreira-Perpiñán, M. Á. and Zharmagambetov, A. (2021): "Understanding and manipulating neural net features using sparse oblique classification trees". IEEE Int. Conf. Image Processing (ICIP 2021), pp. 3707-3711.
[external link] [paper preprint] [slides] [poster] [© IEEE]
Zharmagambetov, A. and Carreira-Perpiñán, M. Á. (2021): "A simple, effective way to improve neural net classification: ensembling unit activations with a sparse oblique decision tree". IEEE Int. Conf. Image Processing (ICIP 2021), pp. 369-373.
[external link] [paper preprint] [slides] [poster] [© IEEE]
Zharmagambetov, A., Gabidolla, M. and Carreira-Perpiñán, M. Á. (2021): "Improved multiclass Adaboost for image classification: the role of tree optimization". IEEE Int. Conf. Image Processing (ICIP 2021), pp. 424-428.
[external link] [paper preprint] [slides] [poster] [© IEEE]
Hada, S. S. and Carreira-Perpiñán, M. Á. (2021): "Exploring counterfactual explanations for classification and regression trees". Int. Workshop and Tutorial on eXplainable Knowledge Discovery in Data Mining (at ECML 2021).
[external link] [paper preprint] [slides]
Zharmagambetov, A., Gabidolla, M. and Carreira-Perpiñán, M. Á. (2021): "Softmax Tree: an accurate, fast classifier when the number of classes is large". Conf. Empirical Methods in Natural Language Processing (EMNLP 2021), pp. 10730-10745.
[external link] [paper preprint] [slides] [poster]
Zharmagambetov, A. and Carreira-Perpiñán, M. Á. (2022): "Learning interpretable, tree-based projection mappings for nonlinear embeddings". 25th Int. Conf. Artificial Intelligence and Statistics (AISTATS 2022), pp. 9550-9570.
[external link] [paper preprint] [slides] [poster] [animations] [video]
Hada, S. S. and Carreira-Perpiñán, M. Á. (2022): "Interpretable image classification using sparse oblique decision trees". IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2022), pp. 2759-2763.
[external link] [paper preprint] [slides] [poster] [© IEEE]
Gabidolla, M. and Carreira-Perpiñán, M. Á. (2022): "Pushing the envelope of gradient boosting forests via globally-optimized oblique trees". IEEE Conf. Computer Vision and Pattern Recognition (CVPR 2022), pp. 285-294.
[external link] [paper preprint] [supplementary material] [slides] [poster] [© IEEE]
Gabidolla, M., Zharmagambetov, A. and Carreira-Perpiñán, M. Á. (2022): "Improved multiclass AdaBoost using sparse oblique decision trees". Int. Joint Conf. on Neural Networks (IJCNN 2022).
[external link] [paper preprint] [slides] [© IEEE]
Hada, S. S. and Carreira-Perpiñán, M. Á. (2022): "Sparse oblique decision trees: a tool to interpret natural language processing datasets". Int. Joint Conf. on Neural Networks (IJCNN 2022).
[external link] [paper preprint] [slides] [© IEEE]
Gabidolla, M. and Carreira-Perpiñán, M. Á. (2022): "Optimal interpretable clustering using oblique decision trees". ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD 2022), pp. 400-410.
[external link] [paper preprint] [slides] [poster]
Zharmagambetov, A. and Carreira-Perpiñán, M. Á. (2022): "Semi-supervised learning with decision trees: graph Laplacian tree alternating optimization". Advances in Neural Information Processing Systems 35 (NeurIPS 2022), pp. 2392-2405.
[external link] [external link2] [paper preprint] [supplementary material] [poster] [video]
Extended abstract at the Bay Area Machine Learning Symposium (BayLearn 2022): [paper preprint] [poster]
Carreira-Perpiñán, M. Á. and Hada, S. S. (2023): "Very fast, approximate counterfactual explanations for decision forests". 37th AAAI Conf. Artificial Intelligence (AAAI 2023), pp. 6935-6943.
[external link] [paper preprint] [slides] [slides] [Python implementation (coming soon)]
Longer version: Mar. 5, 2023, arXiv:2303.02883: [external link] [paper preprint]
Hada, S. S., Carreira-Perpiñán, M. Á. and Zharmagambetov, A.: "Sparse oblique decision trees: a tool to understand and manipulate neural net features". Data Mining and Knowledge Discovery, (): (to appear).
[external link] [paper preprint]
Also as: Jan. 30, 2023, arXiv:2104.02922: [external link] [paper preprint]
Many of the figures in the publisher's version are badly messed up, with wrong labels. The arXiv paper has the correct figures.
Carreira-Perpiñán, M. Á., Gabidolla, M. and Zharmagambetov, A. (2023): "Towards better decision forests: Forest Alternating Optimization". IEEE Conf. Computer Vision and Pattern Recognition (CVPR 2023), pp. 7589-7598.
[external link] [OpenReview] [paper preprint] [supplementary material] [poster] [slides] [© IEEE]
CVPR 2023 Art Gallery videos:
- Variations on Warhol's Marilyn
- Variations on TAO
- Mimicking great paintings by oblique decision forests

An illustration of TAO in 2D

This animation for a 2D dataset with 6 classes shows the result of training an oblique decision tree of depth 3 (with 8 leaves), with random initial parameters, for one TAO iteration. The animation shows the result of optimizing (bottom-up, i.e., from the leaves towards the root) each node's parameters given the other nodes are fixed. The objective function over the whole tree decreases monotonically after each update. Note that some decision nodes are automatically pruned in the process. Animation done by Magzhan Gabidolla.

An illustration of FAO in 2D

This animation for a 2D dataset with 6 classes shows the result of training a forest of 2 oblique decision trees of depth 2 (with 4 leaves each), with random initial parameters, for 3 FAO iterations. FAO (Forest Alternating Optimization) optimizes each tree in turn using TAO, and also optimizes all leaves of all trees jointly. The objective function over the whole forest decreases monotonically after each update. Nodes can also be automatically pruned in the process (although this does not happen in this particular case). Animation done by Magzhan Gabidolla.

Miguel A. Carreira-Perpinan

Last modified: Tue Sep 19 14:11:19 PDT 2023

UC Merced | EECS | MACP's Home Page