mirror of https://github.com/vladmandic/automatic
Comprehensive overhaul of the VQA interrogation system, including:

- Prefill text support for guiding VLM responses
- Thinking mode support with tag cleanup/retention
- Dynamic prompt/task selection based on model type
- Bounding box visualization for detection results
- Debug infrastructure (SD_VQA_DEBUG env var)
- New model support: MiMo-VL, Nidum Gemma, Allura Gemma
- Model-specific prompt lists (Florence, Moondream)

| File |
|---|
| deepbooru.py |
| deepbooru_model.py |
| deepseek.py |
| interrogate.py |
| joycaption.py |
| joytag.py |
| moondream3.py |
| openclip.py |
| vqa.py |