I wrote up a few notes about Alibaba Cloud’s impressive Apache 2 licensed Qwen2-VL vision LLM, which seems to handle tasks like handwriting OCR particularly well.
I had to link to the Internet Archive copies of their blog posts because their GitHub organization (which hosted their blog via GitHub Pages) mysteriously vanished without a trace sometime in the last 24 hours!
Qwen2-VL: To See the World More Clearly
Qwen is Alibaba Cloud's organization training LLMs. Their latest model is Qwen2-VL - a vision LLM - and it's getting some really positive buzz. Here's [a r/LocalLLaMA thread](https://www.reddit.com/r/LocalLLaMA/comments/1f4q0ag/qwen2_vl_7b_far_more_impressive_than_i_thought/) about the …
(simonwillison.net)
-
Simon Willison replied to Simon Willison
Good news: the disappearance is confirmed to be accidental. Hopefully they’ll be back soon, once GitHub unflags their account: https://twitter.com/justinlin610/status/1831489518467477529