A list of some useful Dataset to explore Machine Learning

Tabular Dataset

List of dataset

Number

Nom

Download link

Type

Industry

Target

Detail

1

churn

csv

Classification

B2B

churn

Churn usecase

2

Forecast Energy France trainset

csv

Regression

Energy

TARGET

Energy usecase

3

Forecast Energy France validation

csv

Regression

Energy

TARGET

Energy usecase

4

DNS Attacks Origins

csv

Multiclassification

Energy

Class

DNS usecase

5

Sales Forecasting

csv

Regression

Retail

Weekly_Sales

6

Songs Hits

csv

Classification

Retail

target

7

House pricing Regression - Trainset

csv

Regression

Retail

TARGET

House usecase

8

House pricing Regression - Holdout

csv

Regression

Retail

TARGET

House usecase

9

EDF Classification - Trainset

csv

Classification

Energy

TARGET

10

EDF Classification - Holdout

csv

Classification

Energy

TARGET

11

Sales Timeseries - Trainset

csv

Timeserie

Retail

Volume

12

Sales Timeseries - Holdout

csv

Timeserie

Retail

Volume

Images

Images Folders

Number

Nom

Download link

Type

Industry

1

youtube train

You tube adds Trainset

Images

Ads

2

youtube train labels

You tube adds Trainset Labels

Images Labels

Ads

3

youtube test

You tube adds Testset

Images

Ads

4

youtube test Labels

You tube adds Testset Labels

Images Labels

Ads

5

Cheezam

Cheezam Images

Images

Ads

6

Cheezam Labels

Cheezam Labels

Images Labels

Ads

NLP

NLP Folders

Number

Nom

Download link

Type

Industry

1

Netflix catalog

Netflix movies with data

NLP

entertainment

2

French candidates multiclassif trainset sample

Tweets of French Presidential Candidates 2022 ( train sample )

NLP

politics

3

French candidates multiclassif holdout

Tweets of French Presidential Candidates 2022 ( holdout )

NLP

politics

Externals models

Externals Models

Number

Nom

Download link

type

format

1

classication model

Files

Classification

onnx