-
Notifications
You must be signed in to change notification settings - Fork 78
Capstone Project 1
How To Add a New Column to Using a Dictionary in Pandas Data Frame ?: Pandas Tutorial
https://stackoverflow.com/questions/13445241/replacing-blank-values-white-space-with-nan-in-pandas
https://www.dataquest.io/blog/regular-expressions-data-scientists/
https://python-graph-gallery.com/all-charts/
https://stackoverflow.com/questions/14745022/how-to-split-a-column-into-two-columns https://stackoverflow.com/questions/13996302/python-rolling-functions-for-groupby-object https://stackoverflow.com/questions/13872533/plot-different-dataframes-in-the-same-figure https://stackoverflow.com/questions/51711306/filter-group-by-and-count-in-pandas https://pandas.pydata.org/pandas-docs/stable/reshaping.html https://stackoverflow.com/questions/26646191/pandas-groupby-month-and-year
https://stackoverflow.com/questions/23891575/how-to-merge-two-dataframes-side-by-side
https://seaborn.pydata.org/examples/wide_data_lineplot.html
===> Really good tutorial on how to build a logistic regression model: https://towardsdatascience.com/building-a-logistic-regression-in-python-step-by-step-becd4d56c9c8 https://stackoverflow.com/questions/14745022/how-to-split-a-column-into-two-columns
====> Random Forest
- https://towardsdatascience.com/random-forest-in-python-24d0893d51c0
- https://www.datacamp.com/community/tutorials/random-forests-classifier-python
- https://stackabuse.com/random-forest-algorithm-with-python-and-scikit-learn/
- https://machinelearningmastery.com/implement-random-forest-scratch-python/
- https://en.wikipedia.org/wiki/Random_forest
====> Boosted
-
https://nycdatascience.com/blog/meetup/featured-talk-1-kaggle-data-scientist-owen-zhang/
-
https://datascienceplus.com/extreme-gradient-boosting-with-python/
-
https://medium.com/mlreview/gradient-boosting-from-scratch-1e317ae4587d
-
http://benalexkeen.com/gradient-boosting-in-python-using-scikit-learn/
-
https://machinelearningmastery.com/gentle-introduction-gradient-boosting-algorithm-machine-learning/
Napoleon march
- https://robots.thoughtbot.com/analyzing-minards-visualization-of-napoleons-1812-march
- https://blog.ouseful.info/2017/11/28/quick-round-up-visualising-flows-using-network-and-sankey-diagrams-in-python-and-r/
- https://plotlyblog.tumblr.com/post/120532468127/how-to-analyze-data-seven-modern-remakes-of-the
Data Cleaning:
-
http://www.datasciencemadesimple.com/reshape-long-wide-pandas-python-pivot-function/
-
https://www.datacamp.com/community/tutorials/pandas-multi-index
-
https://stackoverflow.com/questions/20110170/turn-pandas-multi-index-into-column
-
https://stackoverflow.com/questions/36537945/reshape-wide-to-long-in-pandas
-
https://stackoverflow.com/questions/42477572/sort-values-method-in-pandas
-
https://stackoverflow.com/questions/16958499/sort-pandas-dataframe-and-print-highest-n-values
-
http://pandas.pydata.org/pandas-docs/version/0.17/generated/pandas.DataFrame.sort.html
-
https://stackoverflow.com/questions/19523277/renaming-column-names-in-pandas-groupby-function
-
https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.pivot_table.html
-
For figuring out how to optimize logistic regression model: https://towardsdatascience.com/logistic-regression-model-tuning-with-scikit-learn-part-1-425142e01af5
*For explaining LogisticRegression vs LogisticRegressionCV: https://stackoverflow.com/questions/46507606/what-does-the-cv-stand-for-in-sklearn-linear-model-logisticregressioncv
- Hyperparameter tuning on random forests: https://towardsdatascience.com/hyperparameter-tuning-the-random-forest-in-python-using-scikit-learn-28d2aa77dd74

