sl3 - Pipelines for Machine Learning and Super Learning
A modern implementation of the Super Learner prediction algorithm, coupled with a general purpose framework for composing arbitrary pipelines for machine learning tasks.
Last updated 8 days ago
data-scienceensemble-learningensemble-modelmachine-learningmodel-selectionregressionstackingstatistics
9.95 score 101 stars 7 packages 756 scriptsorigami - Generalized Framework for Cross-Validation
A general framework for the application of cross-validation schemes to particular functions. By allowing arbitrary lists of results, origami accommodates a range of cross-validation applications. This implementation was first described by Coyle and Hejazi (2018) <doi:10.21105/joss.00512>.
Last updated 2 years ago
cross-validationmachine-learning
9.36 score 27 stars 10 packages 472 scripts 860 downloadshal9001 - The Scalable Highly Adaptive Lasso
A scalable implementation of the highly adaptive lasso algorithm, including routines for constructing sparse matrices of basis functions of the observed data, as well as a custom implementation of Lasso regression tailored to enhance efficiency when the matrix of predictors is composed exclusively of indicator functions. For ease of use and increased flexibility, the Lasso fitting routines invoke code from the 'glmnet' package by default. The highly adaptive lasso was first formulated and described by MJ van der Laan (2017) <doi:10.1515/ijb-2015-0097>, with practical demonstrations of its performance given by Benkeser and van der Laan (2016) <doi:10.1109/DSAA.2016.93>. This implementation of the highly adaptive lasso algorithm was described by Hejazi, Coyle, and van der Laan (2020) <doi:10.21105/joss.02526>.
Last updated 3 days ago
cross-validationlasso-regressionmachine-learning-algorithmsnonparametric-regression
9.34 score 49 stars 6 packages 374 scripts 1.6k downloadstmle3 - The Extensible TMLE Framework
A general framework supporting the implementation of targeted maximum likelihood estimators (TMLEs) of a diverse range of statistical target parameters through a unified interface. The goal is that the exposed framework be as general as the mathematical framework upon which it draws.
Last updated 8 days ago
causal-inferencemachine-learningtargeted-learningvariable-importance
7.90 score 35 stars 5 packages 304 scriptsdelayed - A Framework for Parallelizing Dependent Tasks
Mechanisms to parallelize dependent tasks in a manner that optimizes the compute resources available. It provides access to "delayed" computations, which may be parallelized using futures. It is, to an extent, a facsimile of the 'Dask' library (<https://www.dask.org/>), for the 'Python' language.
Last updated 7 months ago
parallel-computing
7.01 score 22 stars 8 packages 39 scripts 748 downloads