Every beginner or fresher who wants to build a career in analytics has the same
question: "Where can I find public data sets?" They need data to test their
skills. Don't worry, because we have made it easy for you. Below are some
excellent data sources from which you can download public data sets.
1. Kaggle:
Founded in 2010, Kaggle is an online platform for data-mining and predictive
modelling competitions. Companies and researchers post their data, and
statisticians, data scientists, and data miners from all over the world compete
to produce the best models. Kaggle is the home for everything related to data
science. Through its analytics competitions, companies find talent for their
businesses. It is a unique platform where analysts can test their knowledge and
grow in their careers.
2. United States Census Bureau:
The United States Census Bureau is an agency of the U.S. Federal Statistical
System and is responsible for producing data about the American people and
economy. It collects census information once every 10 years. Once you head to
its site, click on the Topics tab at the top and you will see sub-tabs with
titles like Health, Population, Education, and many more. Each of these tabs
contains data related to that topic.
3. World Bank:
The World Bank gives all users access to its data under the Open Data Terms of
Use. You can browse the data by country or by topic, such as economy, education,
healthcare, and trade.
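The same country-and-topic lookup can also be done from code. The sketch below is a minimal, hedged example that assumes the World Bank's public v2 REST endpoint at api.worldbank.org and its two-part JSON reply (a metadata object followed by a list of records). The helper names, the India country code, and the SP.POP.TOTL population indicator are illustrative choices, not something the post prescribes, and the parser assumes annual data.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed base URL of the World Bank's public v2 API.
BASE_URL = "https://api.worldbank.org/v2"


def build_indicator_url(country, indicator, start_year, end_year):
    """Build the request URL for one indicator of one country."""
    query = urlencode(
        {"format": "json", "date": f"{start_year}:{end_year}", "per_page": 100},
        safe=":",  # keep the colon in the year range readable
    )
    return f"{BASE_URL}/country/{country}/indicator/{indicator}?{query}"


def parse_indicator_response(payload):
    """Turn the decoded JSON reply into sorted (year, value) pairs.

    The reply is assumed to be a list of [metadata, records]. Error replies
    and empty results are returned as an empty list. Years are assumed to be
    plain annual values such as "2015".
    """
    if not isinstance(payload, list) or len(payload) < 2 or payload[1] is None:
        return []
    rows = [
        (int(record["date"]), record["value"])
        for record in payload[1]
        if record["value"] is not None  # skip years with no reported value
    ]
    return sorted(rows)


def fetch_indicator(country, indicator, start_year, end_year):
    """Download and parse one indicator series (needs network access)."""
    url = build_indicator_url(country, indicator, start_year, end_year)
    with urlopen(url, timeout=30) as response:
        return parse_indicator_response(json.load(response))
```

Calling fetch_indicator("IN", "SP.POP.TOTL", 2015, 2020) would request India's total population for those years. Keeping the URL builder and the parser separate from the download step lets you check both without a network connection.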
4. UCI Machine Learning Repository:
UC Irvine is an American university. It hosts a machine learning repository
containing databases and domain theories that the machine learning community
uses for the empirical analysis of machine learning algorithms. The datasets in
this repository are freely available to the general public, which is why it is
widely used by students, educators, and researchers all over the world.
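Many of the classic files in this repository are plain comma-separated text with no header row, so you have to supply the column names yourself. The sketch below shows one way to read such a file with Python's standard library. The column names follow the well-known Iris dataset and are an assumption here, as are the helper names; check the description that ships with whichever dataset you download.

```python
import csv
import io
from collections import Counter

# Assumed column names for the Iris dataset; other datasets need their own.
IRIS_COLUMNS = [
    "sepal_length",
    "sepal_width",
    "petal_length",
    "petal_width",
    "species",
]


def read_headerless_csv(text, columns):
    """Parse comma-separated text that has no header row into dicts."""
    rows = []
    for record in csv.reader(io.StringIO(text)):
        if not record:
            continue  # skip blank lines, which often trail these files
        if len(record) != len(columns):
            raise ValueError(f"expected {len(columns)} fields, got {len(record)}")
        rows.append(dict(zip(columns, record)))
    return rows


def count_by(rows, key):
    """Count how many rows share each value of the given column."""
    return Counter(row[key] for row in rows)
```

After saving a downloaded file locally, pass its contents to read_headerless_csv(open(path).read(), IRIS_COLUMNS), then call count_by(rows, "species") for a quick check that every class is present.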
5. Amazon Web Services (AWS) Public Data Sets:
Amazon Web Services provides a centralized repository of public data sets that
anyone can access for free. As with all AWS services, users pay only for the
compute and storage they use for their own applications, while AWS hosts the
public data sets at no charge for the community.
6. Reddit:
Reddit is a social news and networking service that allows registered community
members to submit content such as text posts or direct links. In other words, it
is a collection of entries submitted by its registered users, essentially a
bulletin board system. Recently, an enormous data set containing all ~1.7
billion publicly available Reddit comments was released.
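A dump of that size cannot be loaded into memory at once, so it is usually processed one record at a time. The sketch below assumes the comments are stored as newline-delimited JSON, one comment object per line, with a "subreddit" field. Both the layout and the field name are assumptions to verify against the copy you download, and the function names are illustrative.

```python
import json
from collections import Counter


def iter_comments(lines):
    """Yield one decoded comment per non-empty line of JSON text."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        yield json.loads(line)


def top_subreddits(lines, n=3):
    """Return the n most active subreddits as (name, comment_count) pairs."""
    counts = Counter(
        comment["subreddit"]
        for comment in iter_comments(lines)
        if "subreddit" in comment  # ignore records missing the assumed field
    )
    return counts.most_common(n)
```

Because an open file object yields lines lazily, passing one to top_subreddits reads a single comment at a time rather than the whole file.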
7. Data.gov:
Data.gov is an open government data platform that allows users to access the
datasets and apps published by government departments. It is a U.S. government
website launched in late May 2009. Once you head to the site, you will see
numerous topics, such as Production of Automobiles and Resultant Pollution,
datasets in the education sector, and many more. When you click on any one of
them, you will see all the available datasets related to that topic.
8. Quandl:
Quandl is a marketplace that gives users access to millions of financial,
economic, and social datasets. In other words, Quandl is a search engine for
numerical data. It helps analysts save time, effort, and money when obtaining
financial and economic data.
These are only a few of the interesting data banks that we explored. You can
find more of what you are looking for at the following links:
- KDnuggets: http://www.kdnuggets.com/2011/02/free-public-datasets.html
- Georgia Tech FODAVA: http://fodava.gatech.edu/visual-data-analytics-data-sets
- YouTube Data: http://netsg.cs.sfu.ca/youtubedata/
- Data Science Central: http://www.datasciencecentral.com/profiles/blogs/big-data-sets-available-for-free
- Mode Analytics: https://blog.modeanalytics.com/five-public-dataset/
- Stack Exchange: http://stats.stackexchange.com/questions/7/locating-freely-available-data-samples
- Abbott Analytics: http://www.abbottanalytics.com/data-mining-resources-sets.php
- BigML blog: http://blog.bigml.com/2013/02/28/data-data-data-thousands-of-public-data-sources/
- Reddit: https://r-dir.com/reference/datasets.html
- Social Network Dataset: http://ww43.growmeme.com/overview