MLclf

mini-imagenet dataset transformed to fit classification task or keep the format for meta-learning tasks.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

The project Machine Learning CLassiFication (MLclf)

The project is to transform the mini-imagenet dataset which is initially created for the few-shot learning (other datasets, e.g. tiny-imagenet, will come soon...) to the format that fit the classical classification task. You can also use this package to download and obtain the raw data of the mini-imagenet dataset (for few-shot learning tasks).

The original dataset includes totally 100 classes, but due to its intention to meta-learning or few-shot learning, the train/validatioin/test dataset contains different classes. They have respectively 64/16/20 classes.

In order to make the mini-imagenet dataset fit the format requirement for the classical classification task. MLclf made a proper transformation (recombination and splitting) of the original mini-imagenet dataset.

The transformed dataset is divided into train, validation and test dataset, each dataset of which includes 100 classes. Each image has the size 84x84 pixels with 3 channels.

The MLclf package can be found at: https://pypi.org/project/MLclf/

or https://github.com/tiger2017/MLclf

Welcome to create an issue to the repository of MLclf on GitHub, and I will add more datasets loading functions based on the issues.

The mini-imagenet source data can be accessed from: https://deepai.org/dataset/imagenet

How to install MLclf package:

pip install MLclf

How to use this package:

from MLclf import MLclf
import torch
import torchvision.transforms as transforms

# Download the original mini-imagenet data:
MLclf.miniimagenet_download(Download=True)


# Transform the original data into the format that fits the task for classification:
# Note: If you want to keep the data format as the same as that for the meta-learning or few-shot learning (original format), just set ratio_train=0.64, ratio_val=0.16, shuffle=False.

transform = transforms.Compose([transforms.ToTensor(), transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))])
train_dataset, validation_dataset, test_dataset = MLclf.miniimagenet_clf_dataset(ratio_train=0.6, ratio_val=0.2, seed_value=None, shuffle=True, transform=transform, save_clf_data=True)

# The dataset can be transformed to dataloader via torch: 

train_loader = torch.utils.data.DataLoader(dataset=train_dataset, batch_size=128, shuffle=True, num_workers=0)


# You can check the corresponding relations between labels and label_marks of the image data:
# (Note: The relations can be obtained after MLclf.miniimagenet_clf_dataset is called, otherwise they will be returned as None instead.)

labels_to_marks = MLclf.labels_to_marks
marks_to_labels = MLclf.marks_to_labels

You can also obtain the raw data from the downloaded pkl files:

from MLclf import MLclf

# The raw data of mini-imagenet can be also obtained via the function below:

data_raw_train, data_raw_val, data_raw_test = MLclf.miniimagenet_data_raw()

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.2.14

Jul 3, 2022

0.2.12

Jul 3, 2022

0.2.10

Jul 1, 2022

This version

0.2.9

Jul 1, 2022

0.2.8

Apr 3, 2022

0.2.7

Apr 3, 2022

0.2.6

Apr 2, 2022

0.2.5

Apr 1, 2022

0.2.3

Mar 31, 2022

0.2.1

Mar 31, 2022

0.2.0

Mar 31, 2022

0.1.0

Mar 31, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

MLclf-0.2.9.tar.gz (6.7 kB view hashes)

Uploaded Jul 1, 2022 Source

Built Distribution

MLclf-0.2.9-py3-none-any.whl (7.2 kB view hashes)

Uploaded Jul 1, 2022 Python 3

Hashes for MLclf-0.2.9.tar.gz

Hashes for MLclf-0.2.9.tar.gz
Algorithm	Hash digest
SHA256	`02b9c60360adf1d3573abc73e0545803b8c50dc28fe970bd33fbf72266cb80be`
MD5	`0492dc9e7d03b19d6862904ada5cdfb9`
BLAKE2b-256	`930e4640a0a5492f8bfbff8c250b590e34ee1431706a9971534a5940f655e0a6`

Hashes for MLclf-0.2.9-py3-none-any.whl

Hashes for MLclf-0.2.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`46857ba08d56bd12b97c29f5b5741f08be60fb5b9fcef65e87e7a3299f3bb66f`
MD5	`7676960a6d6a8b7bfe952c205888540f`
BLAKE2b-256	`c382cd41868e30496ea55157dbb2fce3d0b24553ac82da0c75b7d073429fa0fc`