Espressif ESP-ADF Audio Development Framework for ESP32 Supports Baidu DuerOS, and Soon Amazon Alexa, Google Assistant, etc…

Espressif Systems have been working on audio applications like Smart Speakers based on ESP32 WiSoC with hardware development kits like ESP32-LyraTD-MSC Audio Mic HDK, and I could test it with Baidu DuerOS using Mandarin language. However, at the time (February 2018), there was not much else that could be done with the hardware kit, since no corresponding ESP32 audio software development kit had been made available. This has now changes since Espressif has just released ESP-ADF Audio Development Framework on Github. The framework will support the development of audio applications for the Espressif Systems ESP32 chip such as: Music player or recorder handling MP3, AAC, WAV, OGG, AMR, SPEEX … audio formats Play music from network (HTTP), storage (SD card), Bluetooth A2DP/HFP Integration with Media services such as DLNA, Wechat, etc.. Internet Radio Voice recognition and integration with voice services such as Alexa, DuerOS, Google Assistant As we can see […]

Samsung Aurora Smart Speaker Prototype is Powered by a Smartphone, Features an Hologram-like Character

Samsung Electronics has showcased a smart / A.I. speaker prototype at SXSW 2018 trade show last month, that can be manufactured at low cost since it relies on a smartphone to provide a processor, microphone, and other equipment. The smart speaker also uses the smartphone’s screen to show an hologram to greet users. Meet Samsung “Aurora”. The smartphone is placed horizontally (blue above) in a slot in the upper part of the speaker, and a hologram-like character floats above the Bluetooth speaker included of the kit. The part on the top surface is an optical mechanism using the smartphone’s camera for image recognition. The smart speaker also charges the phone, but it’s unclear how you are supposed to answer phone calls once it is inserted in the speaker… The concept looks similar to Vinclu Gatebox which we covered in 2016, except it relies on a smartphone. Samsung is said to […]

ReSpeaker Core v2 is a 6-Mic Array Audio Development Kit Powered by Rockchip RK3229 Processor

Seeed Studio launched ReSpeaker Core board designed for voice interaction in 2016. The board was based on a Mediatek MT7688 MIPS WiSoC running OpenWrt, and came with a single built-in microphone, although a microphone array board with 7 microphones and 12 LEDs was also offered as option. The company is now back with a more powerful update of the board – called ReSpeaker Core v2 – featuring Rockchip RK3229 quad core Cortex A7 processor, running Debian Linux, and with 6 on-board microphones. Beside WiFi connectivity, the board also adds Ethernet and Bluetooth 4.0, as well as a USB hot port compared to the previous version. ReSpeaker Core v2 specifications: SoC – Rockchip RK3229 quad core Cortex A7 processor @ up to 1.5 GHz with Arm Mali-400MP2 GPU System Memory – 1GB DDR3 RAM Storage – 4GB eMMC flash, micro SD card slot Video Output – HDMI 2.0 (but drivers not […]

Rockchip Unveils RK3308 & Gemini Cortex-A35 Processors for Smart Speakers

Rockchip has posted a tweet about smart speakers, specifically about Midea AI speaker based on Rockchip RK3229, but the second image is what caught my eyes, as it reveals two new Arm Cortex A35 processor designed for “Smart AI solutions”, namely RK3308 and “Gemini” Processor. Rockchip RK3308 “mainstream” processor specifications: CPU – Quad core Cortex A35 processor GPU – TBD (maybe none?) Audio Integrated 8-ch ADC and hardware VAD (Voice Activity Detection) module Multi-channel PDM/I2S/TDM audio interface Rockchip Gemini “high-end” processor specifications: CPU – Quad core Cortex A35 processor GPU – Mali Dvalin MP2 GPU Audio – “high-end smart audio solution”; multi-channel PDM/I2S/TDM audio interface Video I/O – LCD screen & camera support Rockchip will provide support for Android Thing and Linux for the processors, as well as integration with MIC arrays boards and popular voice services such as Amazon Alexa, Google Assistant,  Baidu DuerOS, and others. Arm Cortex A35 […]

A First Look at ESP32-LyraTD-MSC Audio Mic HDK with Baidu DuerOS Assistant

Earlier this year, Espressif Systems had unveiled their ESP32-LyraTD-MSC Audio MiC HDK (Hardware Development Kit) which features an ESP32-WROOM module, a 4-mic array DSP, 3 microphones, an audio jack, and various I/Os. I received the board a couple of weeks ago, and while there’s no public information released yet, the company provided me with ESP32-LyraTD-MSC User Guide in English. Eventually, I’d expect Google Assistant and Amazon Alexa to be supported, but in the meantime I had to leverage my (lowly) Chinese language skills to get started since the kit is pre-loaded with firmware connecting to Baidu DuerOS voice assistant. ESP32-LyraTD-MSC Unboxing The kit came in a bland Espressif Systems carton box. Inside the package, I could only find one kit comprised of two boards. The bottom board read ESP32_MicrosemiDSP_Mainboard-V1, and does not show much apart from marking for connectors, headers and the power switch. While the top comes with eight […]

MicroSemi ZL38063 Audio Processor is Designed for Microphone Arrays

I’ve already been experimenting with DIY smart speakers and corresponding services for example using ReSpeaker board with Microsoft Bing Speech API, or Orange Pi Zero with Google Assistant SDK. But so far all the hardware platforms I used only came with one microphone, no microphone array that help with wake word detection in noisy environments. Last week-end, I received Espressif Audio Mic HDK, an ESP32 board with a 3-microphone array which I’ll review a in a few weeks once documentation becomes available and I clear some other items in my review list. In the meantime, I checked out the hardware, and found out the mainboard also comes with Microsemi ZL38063 audio processor specifically designed for microphone array. The chip was released las year, and can be already found in the company’s AcuEdge Development Kit for Amazon AVS, but since I’m going to use a board based on ZL38063 I’d thought […]

Mycroft Mark II Smart Speaker / Voice Assistant Works with Open Source Software (Crowdfunding)

Smart speakers have gain a lot of traction over the last few years, but many of the solutions are based on Google Assistant or Amazon Alexa voice services, with both companies likely tracking your voice searches the same way they track your online searches to provide a “personalized experience” and sell you products or server ads that match your interests. If you don’t like being tracked that way, a solution is to use an open source voice assistant such as Mycroft, and install it on a Linux computer, Raspberry Pi 3 board, or Android device. The company also introduced Mark I reference hardware platform based on Raspberry Pi 2 in 2015, and while all those hardware options should be fine for the technically inclined, but not really suited to the typical end user, and AFAIK they all lack a microphone array for better hot word detection. So Mycroft has come […]

Espressif ESP32 LyraTD MSC HDK is Designed for Smart Speakers, Wireless Audio and other Smart Home Appliances

[Update February 17, 2018:  The kit was previously referred as ESP32 LyRaTD MS1, but the company appears to have changed the name to ESP32-LyRaTD-MSC] So apparently voice command will represent 50% of all searches in the next two years, and everybody is jumping on the smart speaker bandwagon, with announcements from many companies at CES 2018, including Google’s Android Things + Assistant products‘ announcement,  NXP i.MX 8M official launch, Amazon Alexa Voice Service (AVS) development kit from Amlogic and Allwinner, and more. Espressif Systems is about to join the party with their ESP32 LyraTD MS1 HDK (Hardware development kit) that most people will likely remember as “Audio Mic HDK” that was announced on Twitter. Espressif Audio Mic HDK specifications: Wireless Module – ESP32-WROVER module with 802.11 b/g/n WiFi and Bluetooth 4.1 LE connectivity. DSP – 4-mic array chip Storage – micro SD card for audio files Audio Audio driver chip […]

EmbeddedTS embedded systems design