This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data. We will cover in detail the plotting systems in R as well as some of the basic principles of constructing data graphics. We will also cover some of the common multivariate statistical techniques used to visualize high-dimensional data.
제공자:
이 강좌에 대하여
배울 내용
Understand analytic graphics and the base plotting system in R
Use advanced graphing systems such as the Lattice system
Make graphical displays of very high dimensional data
Apply cluster analysis techniques to locate patterns in data
귀하가 습득할 기술
- Cluster Analysis
- Ggplot2
- R Programming
- Exploratory Data Analysis
제공자:

존스홉킨스대학교
The mission of The Johns Hopkins University is to educate its students and cultivate their capacity for life-long learning, to foster independent and original research, and to bring the benefits of discovery to the world.
강의 계획표 - 이 강좌에서 배울 내용
Week 1
This week covers the basics of analytic graphics and the base plotting system in R. We've also included some background material to help you install R if you haven't done so already.
Week 2
Welcome to Week 2 of Exploratory Data Analysis. This week covers some of the more advanced graphing systems available in R: the Lattice system and the ggplot2 system. While the base graphics system provides many important tools for visualizing data, it was part of the original R system and lacks many features that may be desirable in a plotting system, particularly when visualizing high dimensional data. The Lattice and ggplot2 systems also simplify the laying out of plots making it a much less tedious process.
Week 3
Welcome to Week 3 of Exploratory Data Analysis. This week covers some of the workhorse statistical methods for exploratory analysis. These methods include clustering and dimension reduction techniques that allow you to make graphical displays of very high dimensional data (many many variables). We also cover novel ways to specify colors in R so that you can use color as an important and useful dimension when making data graphics. All of this material is covered in chapters 9-12 of my book Exploratory Data Analysis with R.
Week 4
This week, we'll look at two case studies in exploratory data analysis. The first involves the use of cluster analysis techniques, and the second is a more involved analysis of some air pollution data. How one goes about doing EDA is often personal, but I'm providing these videos to give you a sense of how you might proceed with a specific type of dataset.
검토
- 5 stars74.16%
- 4 stars21.23%
- 3 stars3.42%
- 2 stars0.73%
- 1 star0.43%
탐구 데이터 분석의 최상위 리뷰
This is a great course. The basics are explained very clearly and very easy to understand. I highly recommend this course for those who wish to start in Data Analyst / Data Science track.
Nice course, but too much focus on "R" as a tool.... Industries don't use R as much... The course must be made more generic and independent of R - understand it is not easy to do but ....
This is the second course I have taken from Roger Peng and both were outstanding. I have a strong math background, but not much of a background in stats, but this course was very approachable for me.
Good introduction. The swirl exercises kind of reproduce the lectures though- felt like it might not have been the most efficient use of time to go over the exact same example again.
자주 묻는 질문
강의 및 과제를 언제 이용할 수 있게 되나요?
이 전문 분야를 구독하면 무엇을 이용할 수 있나요?
재정 지원을 받을 수 있나요?
궁금한 점이 더 있으신가요? 학습자 도움말 센터를 방문해 보세요.