Find 3 Datasets
- Due Sep 4, 2024 by 11:59pm
- Points 50
- Submitting a text entry box
In this assignment, you are to find 3 public datasets and load them using code.
These will be the first candidates for use in your final project! Try to find datasets that are genuinely interesting to you, that you could imagine working with deeply for the rest of the class as a project. They will need to be small (<5MB), but big enough to be interesting (probably more than 100 rows, and more than 3 columns, generally).
Instructions
- Find 3 datasets on a topic you are interested in
- Get the data in CSV format <5MB (use whatever tools you like e.g. Excel -> export as CSV, Google Sheets -> export as CSV, Python)
- For each of those, fork this viz: Pengiuns Links to an external site. and replace the CSV data
- Parse the quantitative attributes into numbers
- Update `README.md` to delete all the content from before and add a new description
- Link to the original data source
Submission
- Please submit the 3 VizHub links in Canvas (as a single text entry, separated by newlines)
- Also please share them in Discord! Feel free to discuss the datasets
Grading Criteria
- There should be 3 different datasets (-20 points for each missing if you submit fewer than 3)
- -10 points if the dataset doesn't load and display the first row as JSON
- -10 points for each numeric field that you don't parse
- -20 points if a link back to the original data source is missing or broken
Good luck!