Loading and Preparing Datasets with TensorFlow Datasets (TFDS)

1. Project Title

Efficient Dataset Loading and Preprocessing using TensorFlow Datasets

2. Problem Statement and Goal of Project

Access to standardized, ready-to-use datasets is critical for deep learning research and prototyping. This project demonstrates how to load, inspect, preprocess, and prepare datasets using TensorFlow Datasets (TFDS), enabling quick experimentation without manual dataset handling.

3. Solution Approach

The notebook follows a clear workflow:

Loading datasets from TFDS – Use tfds.load() to fetch datasets with optional train/test splits.
Inspecting dataset metadata – View dataset info, features, and label mappings.
Preprocessing – Apply image resizing, normalization, and type conversion.
Batching, shuffling, and caching – Build optimized input pipelines for training.
Visualization – Display sample images with labels for verification.

4. Technologies & Libraries

From the code:

TensorFlow – Model compatibility, preprocessing, and pipeline integration.
TensorFlow Datasets (TFDS) – Dataset loading and metadata management.
Matplotlib – Visualization of images and labels.
NumPy – Optional numerical handling.

5. Description about Dataset

The notebook uses datasets from TensorFlow Datasets (TFDS) — dataset choice (e.g., MNIST, CIFAR-10) depends on tfds.load() parameters in the code. No manual dataset download or external file handling is required.

6. Installation & Execution Guide

Requirements:

pip install tensorflow tensorflow-datasets matplotlib numpy

Run the notebook:

jupyter notebook tfds.ipynb

or in JupyterLab:

jupyter lab tfds.ipynb

7. Key Results / Performance

Successfully loaded a TFDS dataset with both training and testing splits.
Visualized sample images with correct labels for dataset validation.
Built an optimized input pipeline using batching, shuffling, caching, and prefetching.

Example snippet:

train_ds, test_ds = tfds.load('mnist', split=['train', 'test'], as_supervised=True)
train_ds = train_ds.shuffle(1024).batch(32).prefetch(tf.data.AUTOTUNE)

8. Screenshots / Sample Out

Sample dataset visualization:

Image: <tf.Tensor: shape=(28, 28, 1), dtype=uint8>
Label: 7

(Accompanied by plotted image using Matplotlib in the notebook)

9. Additional Learnings / Reflections

TFDS provides instant access to a wide variety of datasets with minimal code.
Integrating TFDS with tf.data transformations ensures optimal GPU utilization.
Always inspect a dataset visually before training to verify preprocessing correctness.
TFDS is highly useful for benchmarking and educational purposes.

💡 Some interactive outputs (e.g., plots, widgets) may not display correctly on GitHub. If so, please view this notebook via nbviewer.org for full rendering.

👤 Author

Mehran Asgari Email: imehranasgari@gmail.com GitHub: https://github.com/imehranasgari

📄 License

This project is licensed under the Apache 2.0 License – see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
tfds.ipynb		tfds.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Loading and Preparing Datasets with TensorFlow Datasets (TFDS)

1. Project Title

2. Problem Statement and Goal of Project

3. Solution Approach

4. Technologies & Libraries

5. Description about Dataset

6. Installation & Execution Guide

7. Key Results / Performance

8. Screenshots / Sample Out

9. Additional Learnings / Reflections

👤 Author

📄 License

About

Uh oh!

Releases

Packages

Languages

License

imehranasgari/DL_TensorFlow_LowLevelAPI_TFDSLoader

Folders and files

Latest commit

History

Repository files navigation

Loading and Preparing Datasets with TensorFlow Datasets (TFDS)

1. Project Title

2. Problem Statement and Goal of Project

3. Solution Approach

4. Technologies & Libraries

5. Description about Dataset

6. Installation & Execution Guide

7. Key Results / Performance

8. Screenshots / Sample Out

9. Additional Learnings / Reflections

👤 Author

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages