HS
2020년 5월 2일
This course provides an introduction of some important concepts and tools on a very important aspect of data science: cleaning and organizing data before any analysis. A must for any data scientist.
BE
2016년 10월 25일
This course is really a challenging and compulsory for any one who wants to be a data scientist or working in any sort of data. It teaches you how to make very palatable data-set fro ma messy data.
교육 기관: Aki T
•2019년 10월 24일
This course was excellent and fundamental in order to even start a data analysis. It sets the foundation for how to read and treat the data, which is as the instructor mentioned, often overlooked. Thank you very much for taking the time to break the cleaning process into each comprehensive pieces.
교육 기관: Nino P
•2019년 5월 24일
A bit tough course with topics of getting the data since I don't know much about file types, but cleaning part is a must do for every data scientist. dplyr and tidyverse is the base of R and nowadays I only use dplyr for my data wrangling. Highly recommendable course and specialization.
교육 기관: Sudheergouda P
•2018년 12월 31일
The course project was really helpfull in understanding how the data is presented to datascientists. Now to get the jist of the data we have to go through assembling, cleaning and cutting the data.. It was a challenged to understand the data.. assembling the data was a lot of fun in R..
교육 기관: Fernando V
•2016년 12월 14일
A great course. I mean, It has not been easy, I have spent a lot of time in front of the PC practising and doing exercises, but this time and the tools that I have learned make me much more agile and confortable with R, and I have seen the big possibilities that this language has.
교육 기관: Luis T
•2022년 7월 6일
Getting and cleaning data is a great course, the lectures are clear and detailed also the weekly quizzes and final project are challenging. The teaches how to get data from different sources csv, txt, XML, JSON, web and APIs, read the data and transform it into some tidy data.
교육 기관: Christopher L
•2017년 7월 17일
great course, I am fairly familiar with R in my line of work but this was a great opportunity to practice web-scraping. I might even switch from a dplyr-centric wrangling workflow to one centered on data.table in my personal and professional work. more compact and faster!
교육 기관: Carlos M
•2016년 12월 21일
Difficult but valuable. You will be watching the videos repeatedly and become a regular at StockOverflow but it was completely worth it. Getting, cleaning, and processing data is pretty much 80%+ of the job, this course's information is vital to any future data worker.
교육 기관: Gilvan S
•2017년 2월 11일
Excellent course. It gets through the "dirty job" of obtaining data from diverse sources (including API, web, and others), cleaning it, and transforming it into a "tidy" dataset. Highly recommended, along with the R programming course (which you should take first).
교육 기관: Scott C
•2018년 2월 17일
Good overview of what it means to get and clean your own data. Really enjoyed the final project as it challenged you to, with minimal guidance, think through what a tidy dataset really means, and figure out how to make that happen with the dataset you are provided.
교육 기관: Tim S
•2016년 3월 23일
For someone with no programming background and limited experience working with data, this was a challenging, sometimes frustrating, course. But perseverance through the struggle can end in a deep sense of satisfaction. Happily, this is how it was - quite rewarding.
교육 기관: Gbolahan
•2016년 9월 7일
Wonderful course. gets you through the basics and beyond in getting and cleaning data from diverse sources. Very well thought and explained. There is a lot to be learnt from this course, and it requires devoting a good amount of time to let the material sink in.
교육 기관: Diego A S R
•2020년 7월 4일
Good course, but needs an update. Week 2 was really difficult compared to what was explained in the lectures and regex expressions should be explained using R, it was a little hard to learn to use them directly in R. I feel that I learned a lot in this course.
교육 기관: Renzzo S S
•2020년 11월 16일
Excellent course! i learned a lot with the packages mentioned dplyr, tidyr, readr, lubridate. the swirl package is perfect to learn by doing and the assignment is very challenging and it is good because it incentivates you to research deeply and learn more.
교육 기관: Randal N
•2018년 1월 23일
Very enlightening course. It is the first course where I felt like I was actually doing something data sciency. Would recommend even as a stand alone course because I have now come to appreciate the importance of tidy data in performing successful analyses.
교육 기관: Keat C C
•2016년 11월 7일
Really can learn practical skills! I like that each sub course of data science specialisation just focus on a certain areas and takes only 4 weeks, this way I won't be overburden between work and learning, and also easier for me to absorb the new skills.
교육 기관: Waleed A
•2018년 1월 31일
Another brilliant course from Johns Hopkins University in the data science specialisation. Data preparation is a step where an analyst may spend considerable time before beginning any analysis task. I found this course useful and practical. It provided
교육 기관: Daniel M D V
•2019년 9월 3일
Excellent! From my point of view, this is the best course so far. The general concepts that are thought here can be applied to any programming language you use for data analysis. The specific R concepts really shows the power R has to manipulate data.
교육 기관: Kunal P
•2019년 12월 15일
This was one of the best class. Recommend more side reading material on data. SWIRL has a reading link but the link is not provided anywhere else on the board. Also, it would be beneficial if the links can be made clickable in lecture slides. Thanks.
교육 기관: Martin H
•2016년 8월 14일
Exellent course, which brings you to the next level of a Data Scientist.
Getting and Cleaning data principles can be used in alot of situations. I found the build up of this and the assignment at the end to be very well tought trough and important.
교육 기관: Oleksandr K
•2018년 4월 14일
Very good course and lectures. However, it would be good to have a book covering all of the material in this course. That would make work on final project much easier. In my opinion, it is impossible to finish final project in just 2 hours.
교육 기관: Kristin K
•2017년 8월 4일
This course solidified any gaps that were left from the R Programming Course and opens the world of data science to everyone in a very practical way. I really enjoyed the presentation of the material and am very happy I took the class.
교육 기관: 강인배
•2017년 6월 8일
This was so hard to me, because I didn't know anything about 'Making tidy dataset'. So, when I took a course project, I was struggling to find 'what should I do'. Comprehending raw data is so hard then you think, newbies! Be careful!
교육 기관: Jan K
•2017년 3월 7일
Covers a wide range of topics without loosing transparency. In my opinion requires more work than the other courses, but is really worth a go. You end up having a firm basis for working with data and learning more about the process.
교육 기관: Tomer E
•2020년 6월 21일
Very nice course.
helped to understand how to find sources of data (I found that extremely important), and strengthened my R skills.
It would be nice though to have the links which were shown in the slides available for the students.
교육 기관: Miguel C
•2017년 12월 20일
This is a very complete course. It covers the basics of what you have to know to adquire data from different sources and filter that data to be used in further steps of data analysis. It offered great notions on Data Mining also.