Hybrid Image Tagger

A powerful and user-friendly tool that uses a hybrid approach, combining the strengths of the WD Tagger and a Vision Language Model (VLM), to generate detailed and accurate tags for images. The tool is wrapped in a Gradio UI for ease of use.

Features

Hybrid Tagging: Utilizes both WD Tagger and a VLM for comprehensive and high-quality tag generation.
Dual Channel Processing: Choose between different strategies for combining the taggers, including parallel and sequential processing.
Advanced Post-Processing: A rich set of options to refine tags, including custom replacements, trigger words, and more.
User-Friendly UI: A Gradio interface for easy configuration and use.
Batch Processing: Process multiple images concurrently with adjustable concurrency.
Smart Compression: Automatically compresses large images to optimize API usage.
User-friendly Gradio Interface:

Installation

Prerequisites

Python 3.10 or higher
An API key from a compatible AI service (e.g., OpenAI) for the VLM tagger.

Recommended Setup

It is highly recommended to use a Python virtual environment (venv) to avoid conflicts with other projects and system-wide packages.

Create a Virtual Environment

From your project's root directory, run:
```
python -m venv venv
```
Activate the Virtual Environment

The activation command depends on your operating system:
- On Windows (Command Prompt or PowerShell):
```
.\venv\Scripts\activate
```
- On macOS and Linux:
```
source venv/bin/activate
```
Your terminal prompt should now be prefixed with (venv).
Install Dependencies

With the virtual environment active, install the required packages:
```
pip install -r requirements.txt
```

Usage

To launch the application, run the following command:

python tagger.py

This will start the Gradio web UI, which you can access in your browser.

The Interface

The Gradio interface is divided into three main sections:

Upload & Configure: Upload your images and configure the tagging and post-processing settings.
Processing Status: Monitor the progress of the tagging process.
Download Results: Download the generated tags as a zip file.

Tagging Modes

WD Tagger Only: Uses only the WD Tagger.
LLM Only: Uses only the VLM tagger.
Dual Channel: Uses both taggers. You can choose between three strategies:
- Quick: Runs both taggers in parallel for each image.
- Standard: Runs the taggers sequentially for each image.
- Detailed: Runs both taggers in parallel and saves all intermediate files.

Post-Processing

A wide range of post-processing options are available to clean and refine the generated tags:

Text Formatting: Replace underscores, escape brackets, normalize spaces, remove duplicates, and sort alphabetically.
Trigger Words: Add custom prefixes and suffixes to your tags.
Advanced: Set custom text replacements, and limits for the maximum number of tags and minimum tag length.

Output Format

For each image, the tool generates a .txt file with the same name containing comma-separated tags.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
assets		assets
tagger		tagger
.gitignore		.gitignore
README.md		README.md
default_prompt.txt		default_prompt.txt
dual_channel_prompt.txt		dual_channel_prompt.txt
requirements.txt		requirements.txt
tagger.py		tagger.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Hybrid Image Tagger

Features

Installation

Prerequisites

Recommended Setup

Usage

The Interface

Tagging Modes

Post-Processing

Output Format

Contributing

About

Uh oh!

Releases

Packages

Languages

CodeBoy2006/hybrid-image-tagger

Folders and files

Latest commit

History

Repository files navigation

Hybrid Image Tagger

Features

Installation

Prerequisites

Recommended Setup

Usage

The Interface

Tagging Modes

Post-Processing

Output Format

Contributing

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages