Google Assistant SDK Now Supports Device Actions, More Languages (French, German, Japanese)

Back in May 2017, Google released the Assistant SDK that worked on Raspberry Pi 3, and other ARM boards, essentially transforming low cost development boards into Google Home equivalent. The SDK became more popular once Google’s AIY Voice Kit was launched since it offered an easy and inexpensive way to use it with Raspberry Pi 3 board. Since all you need was a Linux board with an Internet connection, a microphone, and speaker, I tried Google Assistant SDK on one of the cheapest platform available: Orange Pi Zero Set 6 Kit including Orange Pi Zero board, but also an expansion board with built-in microphone and audio output jack, and a cute little case. I added my own pair of speakers, micro SD card, and USB power supply, and after setting up the software, I was able ask question, and get answers with female voice using the demo app. At the […]

VoltaStream AMP1 Linux Audio Board Includes a Stereo Audio Amplifier, Adds WiFi and Bluetooth

Last summer I wrote about VoltaStream ZERO an audio board powered by NXP i.MX6ULL processor, with up to 1GB RAM, a Texas Instruments DAC, and leveraging Raspberry Pi Zero form factor. The board runs a custom Linux distribution called PolyOS built with the Yocto Project, and including shairport-sync, librespot, SqueezeLite, a DLNA renderer, and more. Polyvection, the company behind the project is now back with VoltaStream AMP1 audio development board, with half the board very similar to VoltStream ZERO, and the other half featuring an audio amplifier, and a wireless module for WiFi and Bluetooth. VoltaStream AMP1 board specifications: SoC – NXP i.MX6ULL ARM Cortex-A7 processor @ 996 MHz System Memory – 512 MB DDR3 Storage – micro SD card slot Audio 1x I2S for integrated DAC and AMP, 1x I2S for GPIO access, 1x TOSLINK-IN jack Analog DAC – Texas Instruments PCM1862 (SNR 103 dB) Amplifier – ISSI IS31AP2121 […]

Intel Speech Enabling Developer Kit Works with Alexa Voice Service, Raspberry Pi 3 Board

We’ve known Intel has been working on Quark S1000 “Sue Creek” processor for voice recognition for several months. S1000 SoC is based on two Tensilica LX6 with HiFi3 DSP, some speech recognition accelerators, and up to 8x microphones interfaces which allows it to perform speech recognition locally. The solution can also be hooked to an application processor via SPI, I2S and USB (optional) when cloud based voice recognition is needed. Intel has recently introduced their Speech Enabling Developer Kit working with Amazon Alexa Voice Service (AVS) featuring a “dual DSP with inference engine” – which must be Quark S1000 – and an 8-mic array. The kit also includes a 40-pin cable to connect to the Raspberry Pi 3 board. Intel only provided basic specifications for the kit: Intel’s dual DSP with inference engine Intel 8-mic circular array High-performance algorithms for acoustic echo cancellation, noise reduction, beamforming and custom wake word […]

AMBE+2 Vocoder Promises High Voice Quality at Low (2.0 to 9.6 Kbps) Data Rates

Opus 1.2 open source audio codec was release a few months ago with the ability to deliver low power low high-quality audio bitrate for speech with bitrates as low as  12 Kbps. Digital Voice Systems (DVSI) claims to have gone even lower thanks to their AMBE+2 vocoder (Advanced MultiBand Excitation) providing high-quality speech at data rates from 2.0 to 9.6 kilobytes per second. AMBE+2 vocoder is said to outperform the company’s previous generation AMBE+ Vocoder as well as the G.729 and G.726 vocoders, while operating at only 4.0 Kbps. The vocoder is suitable for mobile radio, secure voice, satellite communication, computer telephony, digital voice and storage applications The solution can be integrated into product either using software licensing, or through Vocoder chips, and the company lists the following key benefits: Maintains speech intelligibility and speaker recognition at rates as low as 2.0 kbps Resistant to background noise and channel bit errors […]

Amlogic A111, A112 & A113 Processors are Designed for Audio Applications, Smart Speakers

Amlogic processors are mostly found in TVs and TV boxes, but the company is now apparently entering a new market with A111, A112, and A113 audio processors. I was first made aware of those new processors through Buildroot OpenLinux Release Notes V20170831.pdf document posted on their Open Linux website, where two boards with Amlogic A113D and A113X are shown. First, S400 board with the following key features/specifications: SoC – Amlogic A113D CPU System Memory – 1GB DDR3 Storage – 512MB SLC NAND flash Display I/F – MIPI interface Connectivity – Gigabit Ethernet SDIO WiFi/BT (AP6356S) Audio SPDIF_IN/SPDIF_OUT LINE_IN/LINE_OUT 2x Audio headers (MIC_Connector & SPK_Connector) USB – 1x USB 2.0 OTG Expansion – 2x PCIe ports Misc – 6x ADC Keys, IR_IN/IR_OUT, UART Interface (RS232) The second S420 board is based on A113X SoC, and comes with less features (no display, no Ethernet, no PCIe…), less memory: SoC – Amlogic A113X […]

Google Assistant News – AIY Voice Kit For Sale, Offline Support, 3rd Party Smart Speakers Announced

There’s been a lot of development related to Google Assistant in the last few days. First, Google provided an update for AIY Projects, with their AIY Projects Voice Kit now available for pre-order on Micro Center for $35 including a Raspberry Pi 3 board, making the kit virtually free, although you may also purchase it. Note that Micro Center blocks traffic originating from some countries, so I had to use Zend2 to access the site. [Update 10/09/2017: You can also get it from Seeed Studio for worldwide shipping] Google also announced the Speech Commands Dataset with 65,000 one-second long utterances of 30 short words, which they are in the process of integrating with the next release of the Voice Kit, and will allow the devices to respond to voice commands without the need for an Internet connection. So if you lose your Internet connection, or want to isolate your Voice […]

Those Charts Show The Benefits of Microphone Arrays for Hot Word Detection

Since I started looking more into smart speakers, including DIY ones such as the I made with Orange Pi Zero board + Google Assistant with a single microphone, I was told about the importance of microphone arrays, but so far, I had not seen any clear study or data about that. That changed today, as I came across a review of mic arrays by the makers of Snips Voice Platform. They tested five arrays connected to a Raspberry Pi 3 with the system, and also added a generic USB microphone to the mix. The results speak for themselves… In that experiment, they measured the rate at which a hot word was successfully detected by incrementally increasing the distance between 0.5 meters to 5 meters (16 ft), and for each distance, repeating the hot word 25 times at 3 second intervals using pre-recording to keep the voice level constant, and the […]

X-Powers AC108 is a Quad-Channel ADC Chip for Microphone Arrays

AC108-Mic-Array-Chip

X-Powers, a company better known to supply PMIC “companion” chip for Allwinner processors, also made some audio chips including AC108 is a chip specifically designed for microphone arrays with support for 4 microphones, and an I2C + I2S output interface to the host processor. Microphone arrays are particularly useful for smart speakers, and especially hot word detection (voice activity detection) as single microphone setups like I use with Orange Pi Zero, may have trouble detecting hot words like “OK Google” in noisy environments (music playing, alarm ringing…). X-Powers AC108 specifications: 108 dB dynamic range (A-weighted) @ 0 dB boost gain -90 dB THD+N (total harmonic distortion plus noise) @ 0 dB boost gain 4x programmable boost amplifiers with 0dB to 45dB in 3dB step ADC sample rates supported – 8kHz,12kHz,16kHz, 22.05kHz, 24kHz, 32kHz, 44.1kHz, 48kHz,96kHz Analog mixer and digital mixer in record data path 4x fully differential microphone inputs: MIC1P/N […]

Exit mobile version
UP 7000 x86 SBC