Material Detail
Data Management, EDA, and Regression Analysis with 1969-2000 Major League Baseball Attendance
This article, created by James J. Cochran of Louisiana Tech University, describes a dataset containing Major League Baseball data from seasons 1969 through 2000 and illustrates how this data can be used as a course long project covering basic data management, the use of exploratory data analysis to "clean" data, and construction of regression models. The set contains data such as: runs scored, runs allowed, wins, losses, number of games behind the division leader and attendance. This is a great lesson for anyone interested in the statistics of baseball. The data is in .dat format.
Quality
- User Rating
- Comments
- Learning Exercises
- Bookmark Collections
- Course ePortfolios
- Accessibility Info