This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revision Both sides next revision | ||
eg-speech [2018/07/25 13:56] jsmoeller |
eg-speech [2018/07/25 14:27] jsmoeller |
||
---|---|---|---|
Line 20: | Line 20: | ||
LF: <del>Walt</del>, Jan-Simon \\ | LF: <del>Walt</del>, Jan-Simon \\ | ||
- | Voicebox: \\ | + | Nuance: Christian, Paul Purcell, Mike C., <del>Vince</del>, Arijit \\ |
- | Nuance: Christian, Paul Purcell, Mike C., Vince \\ | + | Amazon: <del>Premal<del>, Ankur, Naveen, Kamal\\ |
- | Amazon: Premal, Ankur, Naveen\\ | + | |
NTT Data MSE: Imamura \\ | NTT Data MSE: Imamura \\ | ||
- | Denso Ten: Kusakabe \\ | + | Denso Ten: <del>Kusakabe</del> \\ |
Microchip: Michael, Christian \\ | Microchip: Michael, Christian \\ | ||
- | IoT.bzh: Stephane, Fulup \\ | + | IoT.bzh: <del>Stephane</del>, Fulup \\ |
- | Konsulko: Matt P., Matt R.\\ | + | Konsulko: Matt P., <del>Matt R.</del>\\ |
+ | Myscript: Olivier \\ | ||
Line 35: | Line 35: | ||
* Not wired up to a speech or TTS engine, more of a loop back test. | * Not wired up to a speech or TTS engine, more of a loop back test. | ||
* Will send the link to Konsulko and IoT.bzh to review the API for suggestions. | * Will send the link to Konsulko and IoT.bzh to review the API for suggestions. | ||
+ | * Done. | ||
* List of AGL services available can be seen at https://git.automotivelinux.org/ | * List of AGL services available can be seen at https://git.automotivelinux.org/ | ||
- | * Need to figure out consent and privacy issues with AGL Identity Agent. | + | * Need to figure out consent and privacy issues with AGL Identity Agent. |
+ | * No update. | ||
* How to manage grammar and natural language APIs and split between services and apps? | * How to manage grammar and natural language APIs and split between services and apps? | ||
* How to integrate cloud speech applications? | * How to integrate cloud speech applications? | ||
* Example: "Find me the closest pizza place" is processed in the cloud and the location and name are returned to the ECU. How is this then transmitted to the POI and/or navi app? | * Example: "Find me the closest pizza place" is processed in the cloud and the location and name are returned to the ECU. How is this then transmitted to the POI and/or navi app? | ||
* Sample config from softmixer https://github.com/iotbzh/4a-softmixer/blob/master/conf.d/project/lua.d/smixer-test-simple.lua | * Sample config from softmixer https://github.com/iotbzh/4a-softmixer/blob/master/conf.d/project/lua.d/smixer-test-simple.lua | ||
+ | * Update 7/25: update for FF use 8-channel CSL usb dac, about to land in gerrit. | ||
* Starting a demo project internally led by Paul. End of Aug early Sep they plan to have a design document together internally and will be ready with any questions/issue. Would be a good idea to target the [[agl-distro:sep2018-f2f|Sep F2F in Santa Clara]] to resolve issues with the design. | * Starting a demo project internally led by Paul. End of Aug early Sep they plan to have a design document together internally and will be ready with any questions/issue. Would be a good idea to target the [[agl-distro:sep2018-f2f|Sep F2F in Santa Clara]] to resolve issues with the design. | ||
* Information on IRC, mail list etc is available at [[start:getting-started|Getting Started with AGL]] | * Information on IRC, mail list etc is available at [[start:getting-started|Getting Started with AGL]] | ||
Line 48: | Line 51: | ||
Amazon looking at releasing a possible API in June and starting to work with the AGL App FW. | Amazon looking at releasing a possible API in June and starting to work with the AGL App FW. | ||
* Starting to look at AGL App FW binder implementation using audio HAL as a reference. Will work with IoT.bzh on how to put the configuration together. | * Starting to look at AGL App FW binder implementation using audio HAL as a reference. Will work with IoT.bzh on how to put the configuration together. | ||
+ | * Update 7/25: reviewing above draft, will share ideas/design to run multiple engines in parallel. No timeline, yet. Will review internally and present in next call. | ||
* Would like to put together an architecture picture based on the white board drawings from February AMM to see how the API fits into AGL overall. | * Would like to put together an architecture picture based on the white board drawings from February AMM to see how the API fits into AGL overall. | ||
Microchip - AGL USB microphone front-end. Michael is working with MicroSemi on getting hardware available that is already available for Amazon Alexa. Would like to have a prototype available for the AMM in Dresden. Microchip plans to provide the HAL for the microphone. | Microchip - AGL USB microphone front-end. Michael is working with MicroSemi on getting hardware available that is already available for Amazon Alexa. Would like to have a prototype available for the AMM in Dresden. Microchip plans to provide the HAL for the microphone. | ||
+ | * Update 7/25: | ||
+ | * Received hardware from MicroSemi, Alexa stack already working. | ||
+ | * Hardware is mic+dsp connected to rpi running the stack | ||
+ | * Plan: frontend should be connected over USB, integrated with 4a (hal) and interacting with the stack | ||
+ | * Stack needs to pick-up conditioned signal (near/far/noise-cancelling) through alsa device | ||
+ | * Michrochip will provide the hal for 4a | ||
+ | * Michael: Interest to extend the API for beamforming, multiple seats, "1 channel per seat" ? | ||
+ | |||
+ | * Question from Nucance about audio streaming: | ||
+ | * esoundlib - do you need special calls to stream audio | ||
+ | * Fulup: no, reply of 4a role request is the alsa device to write to | ||
+ | * 4a-play /usr/share/4a/media/Happy_MBB_75.ogg (only script) | ||
Action item: | Action item: |