ESP32-AIVoice-Z01 is an ESP32-S3 AI voice kit with dual microphones, wake word detection, noise reduction and echo cancellation

The ESP32-AIVoice-Z01 is an affordable ESP32-S3-powered AI voice kit designed for creating voice-controlled AI applications. It features Wi-Fi and Bluetooth connectivity through the ESP32-S3 SoC, a dual digital microphone array for accurate voice recognition, and an onboard amplifier. The system also implements audio algorithms for noise reduction and echo cancellation.

The ESP32-AIVoice-Z01 board supports Espressif’s WakeNet voice wake-up framework and integrates with the AiLinker open-source backend service framework to enable the connection to various large AI model services like OpenAI, ZhiPu QingYan, TongYi QianWen, and DouBao. These features make this device suitable for developing AI-powered toys, IoT devices, mobile devices, and smart home applications.

ESP32-AIVoice-Z01 Development Kit

ESP32-AIVoice-Z01 ESP32 AI voice kit specifications

  • Wireless module – ESP32-S3-WROOM-1U
    • SoC – Espressif Systems ESP32-S3 dual-core Xtensa LX7 processor
    • Memory – 8MB PSRAM
    • Storage – 16MB flash
    • Wireless – WiFi 4 and Bluetooth 5.0 connectivity with external antenna
  • Storage – MicroSD card slot
  • Audio
    • Dual digital microphone array (SNR 65dB) with PDM interface
    • 4 Ohm 2.5W power amplifier
    • I2S interface for external DAC or amplifier
    • Support for noise reduction, VAD (Voice Activity Detection), and AEC (Acoustic Echo Cancellation) algorithms
  • USB – USB-C for power and programming
  • Other interfaces
    • SPI screen interface
    • DVP camera interface (not supported in the voice version)
    • GPIO, UART, I2C, I2S, PWM
  • Misc
    • RGB LEDs
    • power toggle switch
    • Speaker connector
    • ESP Reset and BOOT buttons
    • battery connector
    • SPI interface with 1.25mm x 8PIN terminals
  • Power Management
    • 3.3V to 5V input via USB-C, <10mA in deep sleep mode
    • 3.7V lithium battery interface with battery voltage measurement
    • Battery charging current 555mA @5V
  • Dimensions – 77 x 36 x 33 mm
  • Temperature Range  – 10°C to 60°C
  • ESD Protection – Air ±8kV, Contact ±4kV
ESP32 AIVoice Z01 Development Kit GPIO interfaces
ESP32-AIVoice-Z01 development kit GPIO interfaces

The board comes with pre-trained wake word detection for hands-free operation, speech recognition for command processing, and AI-based speech synthesis (TTS) support for generating natural responses. The board can be programmed with Arduino IDE or ESP-IDF framework. Sadly all documentation is only available in Chinese right now including a Quick Start guide and Hardware guide.

ESP32 AIVoice Z01 Development Kit applications

Previously we have written about Espressif’s ESP32-S3-BOX AI development kit which is ideal for online and offline voice applications, we have also covered Banana Pi BPI-AI-Voice dev kit which is a Speech Recognition Development Kit based on MicroSemi ZL38063. M5Stack also released their AX630C-powered offline LLM module which has speech recognition features and can be used for applications like smart homes, voice assistants, and industrial control.

ESP32 AIVoice Z01 Development Kit list

The ESP32-AIVoice-Z01 ESP32 AI voice kit costs $23.02 on the YouYeeTtoo’s store. The demo below shows the solution work with both Chinese and English languages.

Share this:

Support CNX Software! Donate via cryptocurrencies, become a Patron on Patreon, or purchase goods on Amazon or Aliexpress

Radxa Orion O6 Armv9 mini-ITX motherboard
Subscribe
Notify of
guest
The comment form collects your name, email and content to allow us keep track of the comments placed on the website. Please read and accept our website Terms and Privacy Policy to post a comment.
5 Comments
oldest
newest
Hedda
Hedda
3 days ago

Better to use microWakeWord instead as then it is relativly easy to change the wakeword model -> https://github.com/kahrendt/microWakeWord

See proof-of-concept of mWW used in ESPHome firmware for Home Assistant Voice Preview Edition -> https://github.com/esphome/home-assistant-voice-pe

Jean-Luc Aufranc (CNXSoft)
Admin

I’m not sure microWakeWord supports ESP32-S3 as well as the Espressif port.

Edit: OK. I got your point. It looks like wakeword customization is more complicated with WakeNet since people may have to contact Espressif (that post was written in 2019, so I’m not sure it’s still the case)

Stuart Naylor
2 days ago

Yeah its crazy Espressif have locked in there KW model to a on site pay4 custom service.
It breaks the whole framework for most who will start developing alternatives as they blobs they provide seem very locked in to the Espressif system…

Hedda
Hedda
3 days ago

Regardless that is low-end hardware specifications compared to the new reSpeaker Lite from Seeed Studio which features XMOS xCORE MCU as dedicated DSP for best-in-class noise reduction + echo cancellation and more -> https://www.cnx-software.com/2024/08/05/respeaker-lite-voice-assistant-kit-combines-xmos-xu-316-and-esp32-s3-for-advanced-voice-processing-home-assistant-integration/

Hedda
Hedda
2 days ago

FYI,,microWakeWord was originally written for ESP32-S3 but has since been optimized to even run on some lower-end ESP32 variants. ESP32-S3 does however allow you to have several active wake-words at the same time (up to three wake words now I believe).

Boardcon EM3562 Rockchip RK3562 SBC with 8 analog camera inputs