Add the Wake Vision dataset

Open • Ekhao opened this issue 1 year ago • 1 comment

Add Dataset

Dataset Name: Wake Vision
Issue Reference: No reference

Description

"Wake Vision" is a large, high-quality dataset of over 6 million images, exceeding current tinyML datasets in scale and diversity by 100x. Each image is annotated with whether it contains a person. The dataset also includes a comprehensive fine-grained benchmark for assessing fairness and robustness, covering perceived gender, perceived age, subject distance, lighting conditions, and depictions.

The Wake Vision labels are derived from Open Images annotations, which are licensed by Google LLC under the CC BY 4.0 license. The images are listed as having a CC BY 2.0 license. Note from Open Images: "while we tried to identify images that are licensed under a Creative Commons Attribution license, we make no representations or warranties regarding the license status of each image and you should verify the license for each image yourself."
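To illustrate how the fine-grained benchmark might be used, here is a minimal sketch that computes person-detection accuracy per subgroup (for example, per lighting condition). The record layout, the field names (`person`, `predicted`, `lighting`), and the helper `subgroup_accuracy` are assumptions made up for this example and are not the dataset's actual feature names; the records are toy values, not real Wake Vision data.

```python
from collections import defaultdict


def subgroup_accuracy(records, attribute):
    """Return {attribute value: accuracy} for binary person/no-person predictions.

    `records` is an iterable of dicts; the keys used here are hypothetical.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for rec in records:
        group = rec[attribute]
        total[group] += 1
        # Count a hit when the predicted label matches the ground-truth label.
        correct[group] += int(rec["person"] == rec["predicted"])
    return {group: correct[group] / total[group] for group in total}


# Toy, made-up records used only to exercise the helper.
records = [
    {"person": 1, "predicted": 1, "lighting": "bright"},
    {"person": 0, "predicted": 0, "lighting": "bright"},
    {"person": 1, "predicted": 0, "lighting": "dark"},
    {"person": 1, "predicted": 1, "lighting": "dark"},
]

per_lighting = subgroup_accuracy(records, "lighting")
print(per_lighting)
```

Comparing the per-subgroup values against each other is one simple way to spot a robustness gap, such as a model that does noticeably worse in dark images than in bright ones.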

Checklist

[x] Address all TODOs
[x] Add alphabetized import to subdirectory's __init__.py
[x] Run download_and_prepare successfully
[x] Add checksums file
[x] Properly cite in BibTeX format
[x] Add passing test(s)
[x] Add test data
[x] If using additional dependencies (e.g. scipy), use lazy_imports (if applicable)
[x] Add data generation script (if applicable)
[x] Lint code

May 14 '24 17:05 Ekhao

Hello @fineguy,

Thanks for assigning this pull request to yourself. Do you have an estimate for how long it will take to review it?

Jun 07 '24 12:06 Ekhao