Member-only story

Different ways of getting datasets for your data science tasks

Resources for finding datasets suitable for your needs.

Parul Pandey
5 min readSep 2, 2021

While going through the list of the articles that I have written to date, I discovered that quite a few were related to the concept of acquiring datasets for data science tasks. Some of those articles are targeted at finding good dataset websites, while others look at ways to create custom datasets. This article is a compilation of the various concepts covered in different articles. One can think of it as summarizing the multiple techniques while linking back to the original articles.

1. Advanced Google Search

Image by Author

Google search is by far the most common way to search for a dataset. But did you know that you could customize the search query to get accurate results and that, too, faster? In this article, we look at three ways to optimize our search on the internet.

Link: Advanced Google Search

2. Useful sites for finding datasets for Data Analysis tasks

Image by Author

Google search is great, but there are also dedicated sites harboring good-quality datasets. This article lists five such datasets with detailed video instructions on how to access them. Do not worry; I have left out the common ones like the UCI Machine Learning Repository, Kaggle datasets, and Data.gov and instead provided you with some of the lesser-known ones.

Link: Useful sites for finding datasets for Data Analysis tasks

3. Five Real-world datasets for honing your Exploratory Data Analysis skills

Real-world datasets

--

--

Parul Pandey
Parul Pandey

Written by Parul Pandey

Principal Data Scientist @H2O.ai | Author of Machine Learning for High-Risk Applications

No responses yet

Write a response