Nothing officially launched “Essential Voice” on Friday, April 24, 2026. This AI-powered tool is part of the broader Nothing OS 4.1 update and is designed to bridge the gap between spontaneous speech and professional writing.
Unlike traditional dictation, Essential Voice uses Google Gemini 3 Flash for cloud-based processing to deliver “finished writing” rather than a raw transcript.
1. Key Features of Essential Voice
The tool is built system-wide into the Nothing keyboard, meaning it works across all apps (WhatsApp, Gmail, Notes, etc.) and even supports third-party keyboards.
- Smart “De-stuttering”: The AI automatically identifies and removes filler words like “um,” “uh,” and “like,” as well as mid-sentence self-corrections, resulting in polished, structured text.
- Personal Mappings: You can create custom shortcuts where specific spoken phrases trigger pre-defined outputs, such as automatically inserting your home address, email, or a standard email sign-off.
- Translation Agent: Supports real-time translation for over 100 languages. You can speak in one language (e.g., Hindi or Spanish) and have the text appear instantly in another (e.g., English).
- Voice-Based Formatting: Users can dictate commands to organize text into bullet points, numbered lists, or specific templates without touching the screen.
2. Device Availability (April 2026)
Nothing is rolling out the feature in a staggered manner, starting with its most recent hardware:
| Device | Status |
| Nothing Phone (3) | Available Now (via OS 4.1 update) |
| Nothing Phone (4a) Pro | Rolling out now |
| Nothing Phone (4a) | Expected early May 2026 |
| CMF Phone 2 Pro | Available Now (Standard OS 4.1 features) |
- Access: The tool can be triggered by long-pressing the Essential Key (on supported models) or by tapping the new voice icon in the bottom-left corner of the keyboard.
3. Privacy and Technical Specs
- Processing: While it utilizes Gemini 3 Flash in the cloud, Nothing states that audio data is encrypted during transit and deleted immediately after the text is returned to the device. It does not listen in the background.
- Connectivity: Unlike standard offline dictation, Essential Voice requires an active internet connection to perform its advanced AI refining and translation.
- Speed: Because it relies on cloud processing, there is a noticeable latency of 5–20 seconds depending on the length of the message and your network speed.