/plushcap/analysis/activeloop/activeloop-hdf-5-hierarchical-data-format-5-vs-hub-creating-performant-computer-vision-datasets

HDF5 (Hierarchical Data Format 5) vs Hub. Creating performant Computer Vision datasets

What's this blog post about?

This article provides a tutorial on how to create a computer vision dataset for training a Computer Vision (CV) model using the Hierarchical Data Format version 5 (HDF5) file format and comparing it with Hub, an HDF5 alternative. The HDF5 format is popular for managing large datasets but may not be optimized for deep learning tasks. In contrast, Hub offers a deep learning-native dataset format that can integrate with machine learning frameworks like Pytorch or Tensorflow. By using the appropriate format, creating computer vision datasets can significantly impact the success of a deep learning project.

Company
Activeloop

Date published
Sept. 28, 2021

Author(s)
Margaux Masson-...

Word count
1854

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.