Chevron Left
Building Machine Learning Pipelines in PySpark MLlib(으)로 돌아가기

Coursera Project Network의 Building Machine Learning Pipelines in PySpark MLlib 학습자 리뷰 및 피드백

4.3
별점
49개의 평가
8개의 리뷰

강좌 소개

By the end of this project, you will learn how to create machine learning pipelines using Python and Spark, free, open-source programs that you can download. You will learn how to load your dataset in Spark and learn how to perform basic cleaning techniques such as removing columns with high missing values and removing rows with missing values. You will then create a machine learning pipeline with a random forest regression model. You will use cross validation and parameter tuning to select the best model from the pipeline. Lastly, you will evaluate your model’s performance using various metrics. A pipeline in Spark combines multiple execution steps in the order of their execution. So rather than executing the steps individually, one can put them in a pipeline to streamline the machine learning process. You can save this pipeline, share it with your colleagues, and load it back again effortlessly. Note: You should have a Gmail account which you will use to sign into Google Colab. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions....

최상위 리뷰

필터링 기준:

Building Machine Learning Pipelines in PySpark MLlib의 8개 리뷰 중 1~8

교육 기관: Andrés M

2021년 5월 7일

I never write reviews, but ... please DON´T TAKE THIS PROJECT. Terrible project, no theoretical explanation, no explanation of functions, no complete project using pipelines (only 2 lines). The installation of libraries is not correct, I spent about two days trying to install them. Poor English language, zero explanations in general. It's a shame that people with high level of education are trying to scam people, I regret 100% of paying for this. I DID NOT LEARN ANYTHING.

교육 기관: Jeremy S

2022년 1월 26일

This project gives a good overview of the basic commands of PySpark, as well as a pretty decent glimpse of the functions and methods in MLlib. It will bring you through the methods required to split your training/validation/testing data, train the model, cross validate it, and evaluate it. There is also a brief section on cleaning data, though the majority of the lesson is focused on the random forest pipeline. Note that this project will not teach you machine learning basics or Random Forest techniques whatsoever. Similarly, this project will not teach you the basics of databases or SQL. If your sole intention is to learn the MLlib of Pyspark, then this course is good, but don't expect more. Finally, the project is contained on Coursera's Python notebook service, Rhyme. This course will not show you how to set up Pyspark or any of the supporting installations or environment variables on your own computer. Similarly, you will not be able to easily download the dataset used in this project. Even "downloading" the dataset only downloads it to the virtual machine running Rhyme, though the VM has access to the internet (hint hint). For me, this project was worth the $10 USD I paid.

교육 기관: Aruparna M

2021년 2월 21일

The dataset provided was wrong. It was not the exact one that was demonstrated by the instructor!

교육 기관: 19BST035-HARI K R B B C

2020년 9월 25일

This Course is Very useful. This course big advantage is short. Read short, Learn Big.

교육 기관: Cheikh B

2021년 3월 27일

Awsome project and very good explaination thank you for this project

교육 기관: Leonardo E

2020년 11월 21일

pretty useful, actually.

교육 기관: MD R I

2020년 10월 5일

helpful project

교육 기관: Sankirna J

2022년 5월 2일

Good project to get you started with some spark code. It was very short and hands on. Instructor doesn't really talk about the library or how things work. We are expected to follow the guidelines in the companion video and basically replicate the code the instructor provides which is very clear and concise.

I would recommend this project as a quick hands-on but I did not feel like I learned a lot from it because of the very simple guided nature.