🛍️ T5-Small Product Metadata Generator

This repository contains a fine-tuned T5-small model designed to automate product metadata generation for e-commerce platforms. It simplifies the manual, time-consuming process of product categorization by predicting structured data like brand and hierarchical category levels (L0–L4) from basic product details.

🧠 Model Overview

Base Model: T5-Small
Dataset Link: Dataset
Training: 1 epoch on ~440K examples
Validation: ~60K examples
Frameworks: PyTorch, Hugging Face Transformers, Datasets, Accelerate

For more training details, refer to the notebooks directory, which contains two .ipynb notebooks: one for training and one for dataset creation (from an existing data source).

🎯 Objective

Manual product categorization is a resource-intensive task on e-commerce platforms. The model automates it by predicting the outputs listed below from the inputs listed below.

Inputs:

Product Title
Manufacturer
Store

Outputs:

Brand
L0 to L4 Category Levels (hierarchical classification)
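
Because T5 is a text-to-text model, the three input fields have to be flattened into a single source string before tokenization. The sketch below shows one way to do that. The function name `build_prompt`, the `field: value` layout, and the ` | ` separator are assumptions made for illustration; the format actually used during training is defined in the notebooks and may differ.

```python
def build_prompt(title: str, manufacturer: str = "", store: str = "") -> str:
    """Serialize the input fields into one source string (hypothetical format)."""
    fields = [("title", title), ("manufacturer", manufacturer), ("store", store)]
    parts = []
    for name, value in fields:
        # Collapse runs of whitespace and skip fields that are empty.
        cleaned = " ".join((value or "").split())
        if cleaned:
            parts.append(f"{name}: {cleaned}")
    return " | ".join(parts)


print(build_prompt("  Acme  Wireless Mouse ", "Acme Corp", ""))
```

Whatever layout is chosen, inference has to use the same one as training, since the model only ever saw that serialization.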

The output is returned in a structured JSON format via a custom post-processing function.
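
As a rough illustration of such a post-processing step, the sketch below parses a generated string into a dictionary and serializes it as JSON. It assumes the model emits `key: value` pairs separated by `|`; that output layout and the names `to_metadata` and `to_json` are hypothetical, and the repository's own `convert_to_json.py` may work differently.

```python
import json

# Keys expected in the structured result: brand plus five category levels.
EXPECTED_KEYS = ("brand", "l0", "l1", "l2", "l3", "l4")


def to_metadata(generated: str) -> dict:
    """Parse 'brand: X | l0: Y | ...' (assumed layout) into a dict."""
    result = {key: None for key in EXPECTED_KEYS}
    for chunk in generated.split("|"):
        key, sep, value = chunk.partition(":")
        key, value = key.strip().lower(), value.strip()
        # Ignore chunks without a colon, unknown keys, and empty values.
        if sep and key in result and value:
            result[key] = value
    return result


def to_json(generated: str) -> str:
    """Return the parsed metadata as a JSON string."""
    return json.dumps(to_metadata(generated), ensure_ascii=False)


print(to_json("brand: Acme | l0: Electronics | l1: Accessories"))
```

Levels the model did not produce stay `None` (JSON `null`), so downstream code always sees the same six keys. A value that itself contains `|` would be split incorrectly under this assumed layout.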

📊 Training Snapshot

🚀 Gradio Demo

An interactive Gradio app is included for testing and demo purposes. It integrates preprocessing and postprocessing pipelines.
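
A minimal sketch of how such an app could be wired together is shown below. It is not the code in `gradio_app.py`: the helper names `make_predict` and `build_demo` are hypothetical, and the preprocessing, generation, and postprocessing steps are passed in as plain callables so the wiring can be read on its own.

```python
from typing import Callable


def make_predict(
    preprocess: Callable[[str, str, str], str],
    generate: Callable[[str], str],
    postprocess: Callable[[str], dict],
) -> Callable[[str, str, str], dict]:
    """Chain preprocessing, model generation, and postprocessing."""

    def predict(title: str, manufacturer: str, store: str) -> dict:
        prompt = preprocess(title, manufacturer, store)
        return postprocess(generate(prompt))

    return predict


def build_demo(predict: Callable[[str, str, str], dict]):
    """Build a Gradio interface with three text inputs and a JSON output."""
    import gradio as gr  # imported lazily so the helpers above work without Gradio

    return gr.Interface(
        fn=predict,
        inputs=[
            gr.Textbox(label="Product Title"),
            gr.Textbox(label="Manufacturer"),
            gr.Textbox(label="Store"),
        ],
        outputs=gr.JSON(label="Predicted metadata"),
        title="T5-Small Product Metadata Generator",
    )


# Usage (starts a local web server):
#   build_demo(make_predict(my_preprocess, my_generate, my_postprocess)).launch()
```

Keeping `predict` free of Gradio imports means the same function can be exercised in a script or a unit test without starting the UI.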

🖼️ Gradio UI Preview

🤗 Model Access

You can find and use the trained model on Hugging Face: 🔗 Hugging Face Model Link
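
Loading a fine-tuned T5 checkpoint from the Hub typically looks like the sketch below. The helper name `load_generator` is hypothetical, and the model identifier is deliberately left as a parameter because the link target is not reproduced here; substitute the actual repository ID from the model page. The 64-token generation limit is also an assumption.

```python
def load_generator(model_id: str, max_new_tokens: int = 64):
    """Return a function mapping an input string to the model's generated text."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
    model.eval()  # disable dropout for inference

    def generate(text: str) -> str:
        inputs = tokenizer(text, return_tensors="pt", truncation=True)
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
        return tokenizer.decode(output_ids[0], skip_special_tokens=True)

    return generate


# Usage (downloads the checkpoint on first call; the ID below is a placeholder):
#   generate = load_generator("<hf-username>/<model-name>")
#   print(generate("title: ... | manufacturer: ... | store: ..."))
```

The input string passed to `generate` should follow the same serialization the model was trained on.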

🧰 Tools Used

PyTorch
transformers (Hugging Face)
Datasets (Hugging Face)
Accelerate (Hugging Face)

💡 Learning Experience

This project was a great learning journey, from fine-tuning and evaluation to deployment and UI creation. There's still room for improvement, and I welcome all feedback and contributions.

🤝 Contributions

Feedback, issues, and pull requests are highly appreciated! Let’s make product metadata automation better together.


SurAyush/Title2MetaData
