@simon It seems that we are getting a massive discount on these LLMs. Would you still be using them if, for example, the price doubled?
Posts
-
Claude 3.5 Haiku is out, with a couple of surprises
@simon Isn't Gemini Flash free, for now at least? I mostly use it for simple things for that reason.
-
Published some notes on Docling, a rather nice MIT licensed Python PDF document / table extraction library from IBM https://simonwillison.net/2024/Nov/3/docling/
@simon Any comments on the quality of its output?
-
I added multi-modal (image, audio, video) support to my LLM command-line tool and Python library, so now you can use it to run all sorts of content through LLMs such as GPT-4o, Claude and Google Gemini
@simon I was using the following command:
> llm 'please explain what is happening in the video' -a man-in-water.mp4 -m gemini-1.5-flash-latest
Does it look like it should work?
-
I added multi-modal (image, audio, video) support to my LLM command-line tool and Python library, so now you can use it to run all sorts of content through LLMs such as GPT-4o, Claude and Google Gemini
@simon I was using a 5 MB MP4. The error just says "internal error". I downloaded the video from here: https://www.pexels.com/video/catching-and-releasing-a-big-carp-fish-in-the-lake-5538137/
-
I added multi-modal (image, audio, video) support to my LLM command-line tool and Python library, so now you can use it to run all sorts of content through LLMs such as GPT-4o, Claude and Google Gemini
@simon Does video work? I tried both Gemini Pro and Flash, but I only got an error message. Do I need a paid account to use video scraping? (Images work as expected.)