Datasets for Coding and Statistics Practice
Datasets Built into R Packages
- Starwars: https://github.com/tidyverse/dplyr/blob/main/data-raw/starwars.csv
- Cars: https://github.com/tidyverse/ggplot2/blob/main/data-raw/mpg.csv
- Diamonds: https://github.com/tidyverse/ggplot2/blob/main/data-raw/diamonds.csv
- Geocodes: https://github.com/tidyverse/ggplot2/blob/main/data-raw/seals.csv
- Economics: https://github.com/tidyverse/ggplot2/blob/main/data-raw/economics.csv
- Midwest towns: https://github.com/tidyverse/ggplot2/blob/main/data-raw/midwest.csv
- Housing info: https://github.com/tidyverse/ggplot2/blob/main/data-raw/tx-housing.csv
- US Presidents: https://github.com/tidyverse/ggplot2/blob/main/data-raw/presidential.csv
UCI Machine Learning Repository
- Index: https://archive.ics.uci.edu/ml/index.php
- Flowers: https://archive.ics.uci.edu/ml/datasets/Iris
- Dry Beans: https://archive.ics.uci.edu/ml/datasets/Dry+Bean+Dataset
- Wine: https://archive.ics.uci.edu/ml/datasets/Wine
U.S. Federal Datasets
- NOAA Storm Events Database: https://www.ncdc.noaa.gov/stormevents/
- Bureau of Labor Statistics: https://www.bls.gov/emp/tables/occupational-projections-and-characteristics.htm
Kaggle Datasets
- https://www.kaggle.com/datasets
- Students Exam Scores: Extended Dataset - https://www.kaggle.com/datasets/desalegngeb/students-exam-scores
- LearnPlatform COVID-19 Impact on Digital Learning - https://www.kaggle.com/competitions/learnplatform-covid19-impact-on-digital-learning/data
- Country Statistics - UNData: https://www.kaggle.com/datasets/sudalairajkumar/undata-country-profiles
- World Universities Rankings Advanced Analysis - https://www.kaggle.com/code/gpreda/world-university-rankings-advanced-analysis/report
Published Research Datasets
Other Datasets
- Florida School Accountability Reports: https://www.fldoe.org/accountability/accountability-reporting/school-grades/
- Open University Learning Analytics dataset: https://analyse.kmi.open.ac.uk/open_dataset
- Common Data Set Initiative: https://commondataset.org/
- OpenIntro datasets: https://www.openintro.org/data/
- Data Science in Education Using R book datasets and practice: https://data-edu.github.io/dataedu/