Decoded Datastore
  • Welcome
  • List of Datasets
  • List of Data Sources
  • Data
    • Data: Finance
    • Data: Insurance
    • Data: Retail
    • Data: Other
  • Association Rules
    • Association Rules: Finance
    • Association Rules: Insurance
    • Association Rules: Retail
    • Association Rules: Other
  • Clustering
    • Clustering: Finance
    • Clustering: Insurance
    • Clustering: Retail
    • Clustering: Other
  • Visualisation
    • Visualisation: Finance
    • Visualisation: Insurance
    • Visualisation: Retail
    • Visualisation: Other
  • Regression
    • Regression: Finance
    • Regression: Insurance
    • Regression: Retail
    • Regression: Other
  • Classification
    • Classification: Finance
    • Classification: Insurance
    • Classification: Retail
    • Classification: Other
  • Time Series Analysis
    • Time Series: Finance
    • Time Series: Insurance
    • Time Series: Retail
    • Time Series: Other
  • Text Analysis
    • Text Analysis: Finance
    • Text Analysis: Insurance
    • Text Analysis: Retail
    • Text Analysis: Other
  • SQL
  • Big Data
Powered by GitBook
On this page

Was this helpful?

Big Data

Here you will find a selection of data sources you can use for practicing the Big Data module.

PreviousSQL

Last updated 5 years ago

Was this helpful?

DATA SOURCES

The Enron email dataset contains approximately 500,000 emails generated by employees of the Enron Corporation. The emails were obtained by the Federal Energy Regulatory Commission during its investigation of Enron's collapse in 2001. More information can be found , and the complete dataset (1.4 GB) can be found on .

  • Coming Soon: a small subset of the dataset

is an online data base where you can download information on multiple financial indicators (information on indicators ) measured over the past few decades at global and individual country levels. You can choose which indicators, country(ies), and time spans you would like to use to make your own data set for personal use.

has a plethora of metadata for member and some non-member nations. The Finance tab includes downloadable data including, but not limited to: bank profitability, central government debt, financial and insurance statistics, pensions, SME financing, and more.

is a very large dataset that contains macroeconomic and financial information spanning mostly from the 12th century to present-day (one or two benchmark estimates from 1086, the year of the Domesday Book). The .xls file contains the original data from the Bank of England, including hundreds of time series data. The .csv file is an extract of several dozen headline time series from the .xls file.

here
Kaggle
The World Bank: Financial Sector Indicator
here
The Organisation for Economic and Co-operation and Development (Finance)
A Millennium of Macroeconomic Data for the UK (1086-2016)