From 67119194b5f9f7ba1f83d6b5cf52a1dc4719b286 Mon Sep 17 00:00:00 2001 From: pluja <64632615+pluja@users.noreply.github.com> Date: Sat, 21 Mar 2026 09:38:32 +0200 Subject: [PATCH] Update AI-related sections in README.md --- README.md | 25 +++++++++++++++++-------- 1 file changed, 17 insertions(+), 8 deletions(-) diff --git a/README.md b/README.md index b33b702..8c898dc 100644 --- a/README.md +++ b/README.md @@ -30,9 +30,9 @@ - [Android Launcher](#android-launcher) - [Artificial Intelligence](#artificial-intelligence) - [ChatGPT](#chatgpt) - - [Copilot](#copilot) - - [ElevenLabs Text To Speech](#elevenlabs-text-to-speech) - - [Midjourney (image generation)](#midjourney) + - [AI Coding](#ai-coding) + - [Text To Speech](#text-to-speech) + - [Image Generation](#image-generation) - [Bookmarking](#bookmarking) - [Book and web annotations](#book-and-web-annotationshighlights-management) - [Captchas](#captchas) @@ -258,24 +258,25 @@ When using cloud-based AI services, the data you input is often collected and st #### ChatGPT - [Jan](https://github.com/janhq/jan) - Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. -- [llama.cpp](https://github.com/ggerganov/llama.cpp) - Inference of Facebook's LLaMA model in pure C/C++ so it can run locally on a CPU. +- [llama.cpp](https://github.com/ggml-org/llama.cpp) - Inference of Facebook's LLaMA model in pure C/C++ so it can run locally on a CPU. - [LocalAI](https://github.com/go-skynet/LocalAI) - Self-hosted, community-driven simple local OpenAI-compatible API written in go. Can be used as a drop-in replacement for OpenAI, running on CPU with consumer-grade hardware. - [ollama](https://github.com/jmorganca/ollama) - Get up and running with Llama 2 and other large language models locally. - [PasteGuard](https://github.com/sgasser/pasteguard) - Privacy proxy for LLM APIs that masks PII and secrets before they reach cloud providers. Self-hosted, OpenAI-compatible, and restores original data in responses. - [Shimmy](https://github.com/Michael-A-Kuykendall/shimmy) - Privacy-focused AI inference server with OpenAI API compatibility, zero cloud dependencies, and local model processing. - [Tinfoil](https://tinfoil.sh/) - Verifiably private AI Chat and OpenAI-compatible inference in the cloud. Uses NVIDIA confidential computing and open source code pinned to a transparency log for end-to-end verifiability. -#### Copilot +#### AI Coding - [Continue](https://github.com/continuedev/continue) - Open-source autopilot for VS Code and JetBrains—the easiest way to code with any LLM - [Cline](https://cline.bot/) - Open source AI coding for VSCode. See every decision and use your own models. - [RooCode](https://github.com/RooCodeInc/Roo-Code) - Cline fork with some improvements. +- [OpenCode](https://github.com/anomalyco/opencode/) - The open source coding agent. Connect local models or any providers of your choice. -#### ElevenLabs Text To Speech +#### Text To Speech Go to the [Text To Speech](#text-to-speech) section. -#### Midjourney +#### Image Generation - [ComfyUI](https://github.com/comfyanonymous/ComfyUI) - ComfyUI lets you execute advanced image generation pipelines using an advanced interface. Available on Windows, Linux, and macOS. - [Stable Diffusion Web UI](https://github.com/AUTOMATIC1111/stable-diffusion-webui) - A browser interface for Stable Diffusion and other models. @@ -1438,10 +1439,18 @@ Odysee website contains some trackers and is a heavy site. You can use these alt ## Speech to Text +### Models + +- [Moonshine](https://github.com/moonshine-ai/moonshine) - Fast and accurate automatic speech recognition (ASR) for edge devices. - [OpenAI Whisper](https://github.com/openai/whisper) - Whisper is a general-purpose speech recognition model that can be run locally offline. It can transcribe audio from and to multiple languages. - [whisper.cpp](https://github.com/ggerganov/whisper.cpp) - High-performance inference of OpenAI's Whisper automatic speech recognition (ASR) model. - - [Whishper](https://whishper-docs.pages.dev/) - A whisper frontend with subtitle editing and translation capabilities. +- [ParakeetTDT](https://parakeettdt.com/) - Efficient audio transcription. Convert speech to text with unprecedented speed and accuracy using NVIDIA advanced AI speech recognition model. + +### Apps and services + - [OpenWhispr](https://github.com/OpenWhispr/openwhispr) - Voice-to-text dictation and productivity app with AI agents, meeting transcription, notes, and local/cloud speech recognition. Privacy-first and available cross-platform. Open source alternative to wisprflow. +- [Sasayaki](https://github.com/pluja/sasayaki) - Tiny android dictation app that turns speech into clear writing. +- [Speaches](https://github.com/speaches-ai/speaches) - OpenAI API-compatible server supporting streaming transcription, translation, and speech generation. [Back to top 🔝](#contents)