A LocalLLaMA thread highlighted a pair of relatively small open models that tackle grounding, segmentation, and OCR with architecture choices aimed at practical deployment rather than sheer scale.
#vision-language
Together AI said on March 19, 2026 that its fine-tuning service now supports tool calling, reasoning, and vision-language model training, with up to 6x higher throughput on large MoE architectures. The company's blog adds that the update supports 100B+ parameter models and datasets up to 100GB, and introduces upfront cost estimates plus live ETAs during training.
A high-engagement LocalLLaMA post on March 4, 2026 discussed Microsoft’s open-weight Phi-4-Reasoning-Vision-15B and focused on practical deployment tradeoffs for local multimodal inference.