You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+54-11Lines changed: 54 additions & 11 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,13 +1,17 @@
1
-
# DoubleML-Serverless - Distributed Double Machine Learning with a Serverless Architecture
1
+
# DoubleML-Serverless - Distributed Double Machine Learning with a Serverless Architecture <ahref="https://docs.doubleml.org"><imgsrc="https://raw.githubusercontent.com/DoubleML/doubleml-for-py/master/doc/logo.png"align="right"width = "120" /></a>
2
2
3
3
This repo contains a prototype implementation **DoubleML-Serverless** of distributed double machine learning with a serverless infrastructure
4
4
using [AWS Lambda](https://aws.amazon.com/lambda).
5
-
A detailed discussion of this prototype can be found in the paper "Distributed Double Machine Learning with a Serverless Architecture" (Kurz, 2021).
5
+
A detailed discussion of this prototype can be found in the paper ["Distributed Double Machine Learning with a Serverless Architecture" (Kurz, 2021)](https://doi.org/10.1145/3447545.3451181).
6
6
DoubleML-Serverless is an extension for serverless cloud computing of the Python package **DoubleML**.
7
7
DoubleML is available via PyPI [https://pypi.org/project/DoubleML](https://pypi.org/project/DoubleML) and on GitHub [https://github.com/DoubleML/doubleml-for-py](https://github.com/DoubleML/doubleml-for-py).
8
-
Also see [https://docs.doubleml.org](https://docs.doubleml.org) for a detailed documentation and user guide for the DoubleML package.
8
+
The Python package DoubleML was introduced in
9
+
"DoubleML - An Object-Oriented Implementation of Double Machine Learning in Python"
10
+
([Bach et al., 2021](https://arxiv.org/abs/2104.03220))
11
+
and a detailed documentation \& user guide for the package is available at
### Deploy the corresponding serverless app to AWS Lambda using AWS SAM
37
+
### Deploy the Corresponding Serverless App to AWS Lambda using AWS SAM
34
38
35
39
To use AWS Lambda for estimating double machine learning models, a deployment in your AWS account is necessary.
36
40
The corresponding serverless application consists of the following components:
@@ -56,11 +60,11 @@ There are two options for deployment:
56
60
sam deploy --guided
57
61
```
58
62
59
-
### Estimating a partially linear regression model with double machine learning and serverless scaling using AWS Lambda
63
+
### Estimating a Partially Linear Regression Model with Double Machine Learning and Serverless Scaling Using AWS Lambda
60
64
61
65
To demonstrate the functionality of DoubleML-Serverless we revisit the Pennsylvania Reemployment Bonus experiment
62
-
and estimate the effect of provisioning a cash bonus on the unemployment duration as studied in Chernozhukov et al. (2018).
63
-
This example is also discussed in the accompanying paper to the DoubleML-Serverless package (Kurz, 2021).
66
+
and estimate the effect of provisioning a cash bonus on the unemployment duration as studied in [Chernozhukov et al. (2018)](https://doi.org/10.1111/ectj.12097).
67
+
This example is also discussed in the accompanying paper to the DoubleML-Serverless package ([Kurz, 2021](https://doi.org/10.1145/3447545.3451181)).
64
68
65
69
We first load the data using functionalities from the DoubleML package.
title = {Distributed Double Machine Learning with a Serverless Architecture},
134
+
year = {2021},
135
+
isbn = {9781450383318},
136
+
publisher = {Association for Computing Machinery},
137
+
address = {New York, NY, USA},
138
+
url = {https://doi.org/10.1145/3447545.3451181},
139
+
doi = {10.1145/3447545.3451181},
140
+
abstract = {This paper explores serverless cloud computing for double machine learning. Being based on repeated cross-fitting, double machine learning is particularly well suited to exploit the high level of parallelism achievable with serverless computing. It allows to get fast on-demand estimations without additional cloud maintenance effort. We provide a prototype Python implementation DoubleML-Serverless for the estimation of double machine learning models with the serverless computing platform AWS Lambda and demonstrate its utility with a case study analyzing estimation times and costs.},
141
+
booktitle = {Companion of the ACM/SPEC International Conference on Performance Engineering},
Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W. and Robins, J. (2018),
118
-
Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21: C1-C68. doi:[10.1111/ectj.12097](https://doi.org/10.1111/ectj.12097).
152
+
Bach, P., Chernozhukov, V., Kurz, M. S., and Spindler, M. (2021).
153
+
DoubleML - An Object-Oriented Implementation of Double Machine Learning in Python.
0 commit comments