Rachel Silver

Sample Code: Spark + scikit-learn for analysis of Airbnb data

Discussion created by Rachel Silver (Employee) on Oct 26, 2016

Code example by Nick Amato to predict prices of Airbnb vacation rentals, using scikit-learn on Spark.

The Jupyter notebook in this repo contains examples that run regression estimators on the Inside Airbnb listings dataset for San Francisco. The target variable is the listing price. To speed up the hyperparameter search, the notebook uses the spark-sklearn package to distribute GridSearchCV across the nodes of a Spark cluster. Because the parameter combinations are evaluated in parallel, the search finishes sooner, and the time saved can be spent on a larger grid, which can lead to better results.
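For readers who want a feel for the search itself before opening the notebook, here is a minimal sketch. It is not taken from the notebook: the data is synthetic (a stand-in for the listings features and price), and the estimator and parameter grid are hypothetical choices for illustration. It runs with plain scikit-learn on one machine; the comments note how the spark-sklearn version differs.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in for the listing features (X) and prices (y).
# Assumption: the real notebook loads the Inside Airbnb listings instead.
X, y = make_regression(n_samples=300, n_features=8, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Hypothetical grid: 2 x 2 = 4 parameter combinations to evaluate.
param_grid = {"n_estimators": [20, 50], "max_depth": [3, None]}

# Local grid search with 3-fold cross-validation.
# With spark-sklearn, the import becomes
#     from spark_sklearn import GridSearchCV
# and an existing SparkContext `sc` is passed as the first argument:
#     GridSearchCV(sc, RandomForestRegressor(random_state=0), param_grid, cv=3)
# so that the candidate fits are spread across the cluster.
search = GridSearchCV(RandomForestRegressor(random_state=0), param_grid, cv=3)
search.fit(X_train, y_train)

best_params = search.best_params_
test_r2 = search.score(X_test, y_test)  # R^2 of the refit best model on held-out data
print("best parameters:", best_params)
print("held-out R^2:", round(test_r2, 3))
```

Each combination in the grid is fit independently of the others, which is why the work parallelizes well across Spark executors. Note that spark-sklearn was written against older scikit-learn releases, so the versions pinned in the repo may matter.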

 

GitHub - mapr-demos/spark-sklearn-airbnb-predict: Code example to predict prices of Airbnb vacation rentals, using sciki… 
