Skip to content

Commit

Permalink
Update 2024-04-04-VLM.md
Browse files Browse the repository at this point in the history
  • Loading branch information
rkuo2000 authored Jun 8, 2024
1 parent 25b2a9d commit 072f504
Showing 1 changed file with 13 additions and 1 deletion.
14 changes: 13 additions & 1 deletion _posts/2024-04-04-VLM.md
Original file line number Diff line number Diff line change
Expand Up @@ -173,8 +173,20 @@ Multimodal Language Models](https://publications.reka.ai/reka-core-tech-report.p
**InternLM-XComposer2-4KHD** could further understand 4K Resolution images.<br>
![](https://github.com/InternLM/InternLM-XComposer/raw/main/assets/4khd_radar.png)

---
### [Phi-3](https://azure.microsoft.com/en-us/blog/new-models-added-to-the-phi-3-family-available-on-microsoft-azure/)
**model:** [microsoft/Phi-3-vision-128k-instruct](https://huggingface.co/microsoft/Phi-3-vision-128k-instruct)<br>
* Phi-3-vision is a **4.2B** parameter multimodal model with language and vision capabilities.
* Phi-3-mini is a 3.8B parameter language model, available in two context lengths (128K and 4K).
* Phi-3-small is a 7B parameter language model, available in two context lengths (128K and 8K).
* Phi-3-medium is a 14B parameter language model, available in two context lengths (128K and 4K).

---
### [MiniCPM-V](https://github.com/OpenBMB/MiniCPM-V)
**model:** [openbmb/MiniCPM-Llama3-V-2_5-int4](https://huggingface.co/openbmb/MiniCPM-Llama3-V-2_5-int4)<br>
![](https://github.com/OpenBMB/MiniCPM-V/raw/main/assets/MiniCPM-Llama3-V-2.5-peformance.png)

<br>
<br>

*This site was last updated {{ site.time | date: "%B %d, %Y" }}.*

0 comments on commit 072f504

Please sign in to comment.