Add the Wake Vision dataset
Add Dataset
- Dataset Name: Wake Vision
- Issue Reference: No reference
Description
"Wake Vision" is a large, high-quality dataset featuring over 6 million images, significantly exceeding the scale (100x) and diversity of current tinyML datasets. Each image is annotated with whether it contains a person. The dataset also includes a comprehensive fine-grained benchmark for assessing fairness and robustness, covering perceived gender, perceived age, subject distance, lighting conditions, and depictions. The Wake Vision labels are derived from Open Images annotations, which are licensed by Google LLC under a CC BY 4.0 license. The images are listed as having a CC BY 2.0 license. Note from Open Images: "while we tried to identify images that are licensed under a Creative Commons Attribution license, we make no representations or warranties regarding the license status of each image and you should verify the license for each image yourself."
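To illustrate how the fine-grained benchmark could be used, here is a minimal sketch that computes person / no-person accuracy per subgroup and the gap between the best and worst subgroup. It is not the dataset's evaluation code: the function name `accuracy_by_subgroup`, the record layout, and the subgroup names are assumptions made for this example.

```python
from collections import defaultdict


def accuracy_by_subgroup(records):
    """Return {subgroup: accuracy} for binary person / no-person predictions.

    Each record is a (subgroup, true_label, predicted_label) tuple. The
    subgroup names are hypothetical and are not Wake Vision field names.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for subgroup, truth, prediction in records:
        total[subgroup] += 1
        if truth == prediction:
            correct[subgroup] += 1
    return {name: correct[name] / total[name] for name in total}


# Toy data only: 1 = person present, 0 = no person.
records = [
    ("bright", 1, 1),
    ("bright", 0, 0),
    ("dark", 1, 0),
    ("dark", 1, 1),
    ("dark", 0, 0),
    ("dark", 0, 1),
]

per_group = accuracy_by_subgroup(records)
# Difference between the best and worst subgroup accuracy.
gap = max(per_group.values()) - min(per_group.values())
```

A large gap between subgroups (for example, lighting conditions) would suggest the model is less robust on the weaker subgroup.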
Checklist
- [x] Address all TODO's
- [x] Add alphabetized import to subdirectory's `__init__.py`
- [x] Run `download_and_prepare` successfully
- [x] Add checksums file
- [x] Properly cite in `BibTeX` format
- [x] Add passing test(s)
- [x] Add test data
- [x] If using additional dependencies (e.g. `scipy`), use `lazy_imports` (if applicable)
- [x] Add data generation script (if applicable)
- [x] Lint code
Hello @fineguy,
Thanks for assigning this pull request to yourself. Do you have an estimate for how long it will take to review it?