automatic

Commit Graph

Author	SHA1	Message	Date
CalamitousFelicitousness	5193285bc7	refactor(vqa): convert to class-based singleton Refactor VQA module from module-level globals to a VQA class singleton pattern with self-contained per-model loading methods. Changes: - Add VQA class with model/processor state and detection data storage - Extract load methods for clean model pre-loading via UI - Interrogate to return string only; store detection data on instance - Add vqa_draw.py for bounding box/point annotation utilities Stub, further transfer of drawing functions to follow - Update moondream3.py to store detection data on VQA singleton - Update endpoints.py and ui_caption.py for new return type	2025-12-05 20:53:18 +00:00
CalamitousFelicitousness	2b6226b62b	feat(vqa): persist thinking mode and improve reasoning output formatting - Add interrogate_vlm_thinking_mode setting to save checkbox state - Update ui_caption to restore Thinking Mode preference on load - Add blank line before 'Answer:' label for visual separation - Remove '\n\n' replacement in clean() that stripped blank lines - Fix Qwen reasoning detection when <think> tag is in prompt, not response - Add reasoning icon to Moondream 2 and 3 model names	2025-12-05 00:00:25 +00:00
CalamitousFelicitousness	506515b018	feat(vqa): add load/unload model buttons to Caption tab - Add load_model() function to pre-load VLM into memory - Add unload_model() function to free VLM from memory - Add Load/Unload buttons to Caption tab UI	2025-12-05 00:00:25 +00:00
CalamitousFelicitousness	a90d85ddfd	feat(ui): add dynamic task selection based on VLM model - Rename "Predefined question" to "Task" - Task dropdown updates choices when model changes - Prompt placeholder updates based on selected task - Model-specific tasks: Florence-2 gets detection tasks, Moondream gets point/detect	2025-12-05 00:00:25 +00:00
CalamitousFelicitousness	4df6aa7944	fix(ui): set prefill text to empty by default	2025-12-05 00:00:25 +00:00
CalamitousFelicitousness	0d88fcd396	feat(ui): add prefill and thinking controls to Caption tab Add minimal UI controls to expose new VQA functionality: - Prefill Text input for guiding VLM responses - Thinking Mode checkbox for reasoning models - Keep Thinking Trace checkbox for output retention - Keep Prefill checkbox for output retention - Annotated Image output panel for detection visualization - Updated button handlers to pass new parameters	2025-12-05 00:00:24 +00:00
CalamitousFelicitousness	78711fb1d4	Merge branch 'dev' into patch-2	2025-10-01 20:58:58 +01:00
CalamitousFelicitousness	78820a14dc	Allow VLM temp setting temperature to 0 Allow VLM temp setting temperature to 0	2025-10-01 20:52:04 +01:00
Vladimir Mandic	cd79f92dff	add opts models_not_to_offload Signed-off-by: Vladimir Mandic <mandic00@live.com>	2025-09-19 11:21:54 -04:00
Vladimir Mandic	05dd0096c9	set default vqa model Signed-off-by: Vladimir Mandic <mandic00@live.com>	2025-09-04 08:38:29 -04:00
Vladimir Mandic	b2dbef53e5	restyled all toolbuttons to be modernui native Signed-off-by: Vladimir Mandic <mandic00@live.com>	2025-08-31 15:01:50 -04:00
Vladimir Mandic	8473bae0fc	1000 papercuts Signed-off-by: Vladimir Mandic <mandic00@live.com>	2025-05-13 21:51:33 -04:00
Vladimir Mandic	9bf6838962	update video tab Signed-off-by: Vladimir Mandic <mandic00@live.com>	2025-03-20 14:39:38 -04:00
Vladimir Mandic	dbfd59434f	add gemma3 Signed-off-by: Vladimir Mandic <mandic00@live.com>	2025-03-15 15:30:57 -04:00
Vladimir Mandic	b6990151c4	caption tab modernui support Signed-off-by: Vladimir Mandic <mandic00@live.com>	2025-02-17 10:59:22 -05:00
Vladimir Mandic	a4b3dc269e	modernize clip interrogate Signed-off-by: Vladimir Mandic <mandic00@live.com>	2025-02-16 19:37:09 -05:00
Vladimir Mandic	f3dd9b9646	vlm advanced settings and batch processing Signed-off-by: Vladimir Mandic <mandic00@live.com>	2025-02-15 14:34:28 -05:00
Vladimir Mandic	e95bd93f67	caption ui redesign Signed-off-by: Vladimir Mandic <mandic00@live.com>	2025-02-15 12:57:19 -05:00

18 Commits (3a65d561a70f60d2c67f607d2b00a944c7c427ed)