Gemini CLI is an open-source AI agent that brings Gemini directly to your terminal. Its free usage limits are generous: access to Gemini 2.5 Pro with up to 60 free model requests per minute. It also integrates Google Search and supports the Model Context Protocol.
To reduce the cost and time of OpenAI transcriptions, speed up your audio with ffmpeg before uploading it. Transcribing audio sped up 2x or 3x reduces the number of audio tokens OpenAI charges for, with barely any impact on transcription quality. In the example shown in the article, which used the gpt-4o-transcribe model, transcribing the audio at 3x speed cost 33% less than transcribing the original.
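As a rough illustration of the speed-up step, here is a minimal Python sketch that builds an ffmpeg command using the `atempo` audio filter. The helper names (`atempo_chain`, `build_ffmpeg_command`, `speed_up`) and the file names are hypothetical, and the exact ffmpeg invocation used in the article may differ. The sketch chains `atempo` stages of at most 2.0 each, an assumption made so that it also works on older ffmpeg builds that cap a single stage at 2.0.

```python
import subprocess


def atempo_chain(speed: float) -> str:
    """Return an ffmpeg audio filter string that changes tempo by `speed`.

    Each atempo stage is kept within 0.5 to 2.0, so larger or smaller
    factors are split across several chained stages.
    """
    if speed <= 0:
        raise ValueError("speed must be positive")
    stages = []
    remaining = speed
    while remaining > 2.0:   # peel off 2x stages for large speed-ups
        stages.append(2.0)
        remaining /= 2.0
    while remaining < 0.5:   # peel off 0.5x stages for large slow-downs
        stages.append(0.5)
        remaining /= 0.5
    stages.append(remaining)
    return ",".join(f"atempo={s:g}" for s in stages)


def build_ffmpeg_command(src: str, dst: str, speed: float) -> list[str]:
    """Build the argument list for ffmpeg without running it."""
    return [
        "ffmpeg",
        "-y",                          # overwrite dst if it already exists
        "-i", src,                     # input file
        "-filter:a", atempo_chain(speed),
        "-vn",                         # drop any video stream
        dst,
    ]


def speed_up(src: str, dst: str, speed: float) -> None:
    """Run ffmpeg (assumed to be on PATH) to write a sped-up copy of src."""
    subprocess.run(build_ffmpeg_command(src, dst, speed), check=True)
```

Calling `speed_up("talk.mp3", "talk_3x.mp3", 3.0)` with placeholder file names would write a 3x copy, which could then be uploaded for transcription in place of the original.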