## Models

LLMs, LVMs, VLMs which can take images as input, can be instruct-tuned etc.

#### Open
- Llava (https://github.com/haotian-liu/LLaVA)
- Qwen (https://huggingface.co/Qwen/Qwen-VL)
- CogVLM (https://github.com/THUDM/CogVLM)
- BLIP Family of models (BLIP, BLIP2, X-InstructBLIP)

#### Tiny ones
- Moondream (https://github.com/vikhyat/moondream)
- Bunny (https://github.com/BAAI-DCAI/Bunny)

#### Closed
- Yasa (reka.ai)
- gpt4 vision, gemini ultra

## Companies
- reka.ai
- OpenAI, Google