Skip to content

Commit

Permalink
Update paper.md
Browse files Browse the repository at this point in the history
  • Loading branch information
kristinblesch authored Sep 22, 2023
1 parent 58baac9 commit 980cfc0
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion paper/paper.md
Original file line number Diff line number Diff line change
Expand Up @@ -34,7 +34,7 @@ affiliations:

# Summary

Generative modeling is a challenging task in machine learning that aims to synthesize new data which is similar to a set of given data. State of the art are computationally intense and tuning-heavy algorithms such as generative adversarial networks [@goodfellow2014;@xu2019], variational autoencoders [@kingma2014], normalizing flows [@rezende2015], diffiusion models [@ho2020] or transformers [@vaswani2017]. A much more lightweight procedure is to use an Adversarial Random Forest (ARF) [@watson2023]. ARFs achieve competitive performance in generative modeling in much faster runtime [@watson2023] and are especially useful for data that comes in a table format, i.e., tabular data. That is because ARFs are based on random forests [@breiman2001] that leverage the advantages that tree-based methods have over neural networks on tabular data (see [@grinsztajn2022]) for generative modeling. Further, as part of the procedure, ARFs give access to the estimated joint density, which is useful for several other fields of research, e.g., unsupervised machine learning. For the task of density estimation, ARFs have been demonstrated to yield remarkable results as well [@watson2023]. Hence, ARFs are a promising methodological contribution to the field of generative modeling and density estimation. To reach scholars in these fields that are predominantely based in python, and a broad audience more generally, a fast and userfriendly implementation of ARFs in python is highly desirable, which is provided by the software package `arfpy`.
Generative modeling is a challenging task in machine learning that aims to synthesize new data which is similar to a set of given data. State of the art are computationally intense and tuning-heavy algorithms such as generative adversarial networks [@goodfellow2014;@xu2019], variational autoencoders [@kingma2014], normalizing flows [@rezende2015], diffiusion models [@ho2020] or transformers [@vaswani2017]. A much more lightweight procedure is to use an Adversarial Random Forest (ARF) [@watson2023]. ARFs achieve competitive performance in generative modeling in much faster runtime [@watson2023] and are especially useful for data that comes in a table format, i.e., tabular data. That is because ARFs are based on random forests [@breiman2001] that leverage the advantages that tree-based methods have over neural networks on tabular data (see @grinsztajn2022) for generative modeling. Further, as part of the procedure, ARFs give access to the estimated joint density, which is useful for several other fields of research, e.g., unsupervised machine learning. For the task of density estimation, ARFs have been demonstrated to yield remarkable results as well [@watson2023]. Hence, ARFs are a promising methodological contribution to the field of generative modeling and density estimation. To reach scholars in these fields that are predominantely based in python, and a broad audience more generally, a fast and userfriendly implementation of ARFs in python is highly desirable, which is provided by the software package `arfpy`.

# Statement of need

Expand Down

0 comments on commit 980cfc0

Please sign in to comment.