List of Datasets

Below is a list of datasets you can use to practice different analytical methods throughout the academy.

Here you will find a list of datasets you can download for your online practice. Below, each dataset lists the modules for which they are most appropriate. If you want to more information on a certain dataset, use the left-hand menu to go to the module of choice, and click on the sub-topic (i.e., finance, insurance, retail, other) to see the corresponding datasets.

AMEX, NYSE, and NASDAQ Stocks Histories

  • Association Rules

  • Visualisation

  • Regression

  • Time Series

Bank Customer dataset

  • Data

  • Clustering

  • Classification

Financial Tweets of Publicly Traded Companies

  • Text Analysis

Game of Thrones (battles)

  • Data

  • Association Rules

  • Clustering

  • Classification

  • Text Analysis

Home Insurance policies (2007-2012) dataset

  • Time Series

Natural Gas Prices

  • Visualisation

  • Regression

New York State Unemployment Insurance Data (2001-present)

  • Data

  • Clustering

  • Visualisation

  • Time Series

North Korea Missile Tests Database

  • Data

  • Association Rules

  • Clustering

  • Visualisation

  • Regression

  • Classification

President Trump's 56 speeches

  • Text Analysis

Retail Sales Index (internet sales)

  • Data

  • Visualisation

  • Regression

  • Time Series

S&P 500 Stock Data

  • Association Rules

  • Visualisation

  • Regression

  • Time Series

Tobacco Advertising Study (2008 and 2011)

  • Data

  • Association Rules

  • Clustering

  • Classification

Travel Insurance data from a Singapore-based company

  • Data

  • Visualisation

  • Association Rules

  • Clustering

  • Classification

Tweets about US airline sentiment

  • Text Analysis

  • Classification

  • Association Rules

US Health Insurance Coverage, 2010 and 2015

  • Data

  • Visualisation

  • Regression

  • Classification

Last updated