User Tools

Site Tools


eg-speech

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
eg-speech [2018/05/16 14:27]
waltminer
eg-speech [2018/08/08 20:49]
waltminer [Table]
Line 14: Line 14:
  
 ===== Meetings ===== ===== Meetings =====
 +Meetings of the Speech EG are held every other Wednesday. Meeting time is 15:00 UTC. The upcoming schedule can be found below. ​
  
-May 16, 2018+Please join my meeting from your computer, tablet or smartphone.\\  
 +https://​global.gotomeeting.com/​join/​356562157  
 + 
 +You can also dial in using your phone. \\ 
 +United States (Toll Free): 1 877 568 4106 \\  
 +United States: +1 (571) 317-3129  
 + 
 +Access Code: 356-562-157  
 + 
 +More phone numbers  
 +| Australia: +61 2 9087 3604        | Austria: +43 7 2081 5427                           | 
 +| Belgium: +32 28 93 7018           | Canada (Toll Free): 1 888 455 1389                 | 
 +| Canada: +1 (647) 497-9391 ​        | Denmark: +45 43 31 47 82                           | 
 +| Finland: +358 923 17 0568         | France: +33 170 950 594                            | 
 +| Germany: +49 692 5736 7317        | India (Toll Free): 18002669272 ​                    | 
 +| Ireland: +353 16 572 651          | Italy: +39 0 291 29 46 30                          | 
 +| Japan (Toll Free): 0 120 663 800  | Korea, Republic of (Toll Free): 00798 14 207 4914  | 
 +| Netherlands:​ +31 207 941 377      | New Zealand: +64 9 280 6302                        | 
 +| Norway: +47 21 93 37 51           | Spain: +34 932 75 2004                             | 
 +| Sweden: +46 853 527 827           | Switzerland:​ +41 225 4599 78                       | 
 +| United Kingdom: +44 330 221 0088  |                                                    | 
 + 
 + 
 + 
 + 
 +Joining from a video-conferencing room or system? \\ 
 +Dial: 67.217.95.2##​356562157 \\ 
 +Cisco devices: 356562157@67.217.95.2 \\  
 + 
 +First GoToMeeting?​ Let's do a quick system check: https://​link.gotomeeting.com/​system-check  
 + 
 +---------- 
 + 
 +==== September 5, 2018 ==== 
 +Attendees: ​ //Upcoming Meeting//  
 + 
 +==== August 22, 2018 ==== 
 +Attendees: //Upcoming Meeting//  
 + 
 + 
 +==== August 8, 2018 ==== 
 +Attendees:​ 
 + 
 +LF: Walt, <​del>​Jan-Simon</​del>​ \\ 
 +Nuance: <​del>​Christian</​del>,​ Paul Purcell, Mike C., <​del>​Vince</​del>,​ Arijit \\ 
 +Amazon: Premal, Ankur, Naveen, Kamal, Alain\\ 
 +NTT Data MSE: Imamura \\ 
 +Denso Ten: <​del>​Kusakabe</​del>​ \\ 
 +Microchip: <​del>​Michael,​ Christian</​del>​ \\ 
 +IoT.bzh: <​del>​Stephane</​del>,​ Fulup \\ 
 +Konsulko: <​del>​Matt P.</​del>,​ <​del>​Matt R.</​del>​\\ 
 +Myscript: <​del>​Olivier,​ Etienne</​del>​ \\ 
 + 
 + 
 +Notes: 
 + 
 +Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer in github (https://​github.com/​Nuance-Mobility/​agl-speech-interface) 
 +   * Not wired up to a speech or TTS engine, more of a loop back test.  
 +   * Will send the link to Konsulko and IoT.bzh to review the API for suggestions.  
 +     * Done. 
 +   * List of AGL services available can be seen at https://​git.automotivelinux.org/​ 
 +   * Need to figure out consent and privacy issues with AGL Identity Agent. 
 +     * No update. 
 +   * How to manage grammar and natural language APIs and split between services and apps?  
 +   * How to integrate cloud speech applications?​ 
 +     * Example: "Find me the closest pizza place" is processed in the cloud and the location and name are returned to the ECU.  How is this then transmitted to the POI and/or navi app? 
 +   * Sample config from softmixer https://​github.com/​iotbzh/​4a-softmixer/​blob/​master/​conf.d/​project/​lua.d/​smixer-test-simple.lua 
 +     * Update 7/25: update for FF use 8-channel CSL usb dac, about to land in gerrit. 
 +   * Starting a demo project internally led by Paul. End of Aug early Sep they plan to have a design document together internally and will be ready with any questions/​issue. Would be a good idea to target the [[agl-distro:​sep2018-f2f|Sep F2F in Santa Clara]] to resolve issues with the design.  
 +   * Information on IRC, mail list etc is available at [[start:​getting-started|Getting Started with AGL]] 
 +   * Supported hardware can be found at [[agl-distro#​supported_hardware|AGL Distribution]] 
 +   * 8/8  
 +     * Arijit received the M3 hardware and was able to get it running. Building a "Hello World" sample application using Virtual Box and M3 hardware.  
 + 
 + 
 +Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.  
 +   * Starting to look at AGL App FW binder implementation using audio HAL as a reference. Will work with IoT.bzh on how to put the configuration together.  
 +     * Update 7/25: reviewing above draft, will share ideas/​design to run multiple engines in parallel. No timeline, yet. Will review internally and present in next call. 
 +   * Would like to put together an architecture picture based on the white board drawings from February AMM to see how the API fits into AGL overall. ​  
 +   * 8/8  
 +     * Naveen presented some use cases and an architecture diagram that Amazon has been working on internally. Received good feedback from the team. Naveen and his team will update their internal wiki and present again at the next meeting. Will look into getting the info onto the AGL wiki after that.  
 + 
 +Microchip - AGL USB microphone front-end. Michael is working with MicroSemi on getting hardware available that is already available for Amazon Alexa. Would like to have a prototype available for the AMM in Dresden. Microchip plans to provide the HAL for the microphone.  
 +    * Update 7/25: 
 +      * Received hardware from MicroSemi, Alexa stack already working. 
 +      * Hardware is mic+dsp connected to rpi running the stack 
 +      * Plan: frontend should be connected over USB, integrated with 4a (hal) and interacting with the stack 
 +        * Stack needs to pick-up conditioned signal (near/​far/​noise-cancelling) through alsa device 
 +        * Michrochip will provide the hal for 4a 
 +        * Michael: Interest to extend the API for beamforming,​ multiple seats, "1 channel per seat" ? 
 + 
 +     * Question from Nucance about audio streaming:​ 
 +       * esoundlib - do you need special calls to stream audio 
 +       * Fulup: no, reply of 4a role request is the alsa device to write to 
 +       * 4a-play /​usr/​share/​4a/​media/​Happy_MBB_75.ogg (only script) 
 + 
 +Action item: 
 +   * Move the github repo into AGL git to foster collaboration - Done 
 +   *  
 + 
 + 
 +==== July 25, 2018 ==== 
 +Attendees:​ 
 + 
 +LF: <​del>​Walt</​del>,​ Jan-Simon \\ 
 +Nuance: Christian, Paul Purcell, Mike C., <​del>​Vince</​del>,​ Arijit \\ 
 +Amazon: <​del>​Premal</​del>,​ Ankur, Naveen, Kamal\\ 
 +NTT Data MSE: Imamura \\ 
 +Denso Ten: <​del>​Kusakabe</​del>​ \\ 
 +Microchip: Michael, Christian \\ 
 +IoT.bzh: <​del>​Stephane</​del>,​ Fulup \\ 
 +Konsulko: Matt P., <​del>​Matt R.</​del>​\\ 
 +Myscript: Olivier, Etienne \\ 
 + 
 + 
 +Notes: 
 + 
 +Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer in github (https://​github.com/​Nuance-Mobility/​agl-speech-interface) 
 +   * Not wired up to a speech or TTS engine, more of a loop back test.  
 +   * Will send the link to Konsulko and IoT.bzh to review the API for suggestions.  
 +     * Done. 
 +   * List of AGL services available can be seen at https://​git.automotivelinux.org/​ 
 +   * Need to figure out consent and privacy issues with AGL Identity Agent. 
 +     * No update. 
 +   * How to manage grammar and natural language APIs and split between services and apps?  
 +   * How to integrate cloud speech applications?​ 
 +     * Example: "Find me the closest pizza place" is processed in the cloud and the location and name are returned to the ECU.  How is this then transmitted to the POI and/or navi app? 
 +   * Sample config from softmixer https://​github.com/​iotbzh/​4a-softmixer/​blob/​master/​conf.d/​project/​lua.d/​smixer-test-simple.lua 
 +     * Update 7/25: update for FF use 8-channel CSL usb dac, about to land in gerrit. 
 +   * Starting a demo project internally led by Paul. End of Aug early Sep they plan to have a design document together internally and will be ready with any questions/​issue. Would be a good idea to target the [[agl-distro:​sep2018-f2f|Sep F2F in Santa Clara]] to resolve issues with the design.  
 +   * Information on IRC, mail list etc is available at [[start:​getting-started|Getting Started with AGL]] 
 +   * Supported hardware can be found at [[agl-distro#​supported_hardware|AGL Distribution]] 
 + 
 + 
 +Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.  
 +   * Starting to look at AGL App FW binder implementation using audio HAL as a reference. Will work with IoT.bzh on how to put the configuration together.  
 +     * Update 7/25: reviewing above draft, will share ideas/​design to run multiple engines in parallel. No timeline, yet. Will review internally and present in next call. 
 +   * Would like to put together an architecture picture based on the white board drawings from February AMM to see how the API fits into AGL overall. ​  
 + 
 +Microchip - AGL USB microphone front-end. Michael is working with MicroSemi on getting hardware available that is already available for Amazon Alexa. Would like to have a prototype available for the AMM in Dresden. Microchip plans to provide the HAL for the microphone.  
 +    * Update 7/25: 
 +      * Received hardware from MicroSemi, Alexa stack already working. 
 +      * Hardware is mic+dsp connected to rpi running the stack 
 +      * Plan: frontend should be connected over USB, integrated with 4a (hal) and interacting with the stack 
 +        * Stack needs to pick-up conditioned signal (near/​far/​noise-cancelling) through alsa device 
 +        * Michrochip will provide the hal for 4a 
 +        * Michael: Interest to extend the API for beamforming,​ multiple seats, "1 channel per seat" ? 
 + 
 +     * Question from Nucance about audio streaming:​ 
 +       * esoundlib - do you need special calls to stream audio 
 +       * Fulup: no, reply of 4a role request is the alsa device to write to 
 +       * 4a-play /​usr/​share/​4a/​media/​Happy_MBB_75.ogg (only script) 
 + 
 +Action item: 
 +   * Move the github repo into AGL git to foster collaboration 
 + 
 +==== July 11, 2018 ==== 
 +Attendees:​ 
 + 
 +LF: Walt, <​del>​Jan-Simon</​del>​\\ 
 +Voicebox: \\  
 +Nuance: Christian, Paul Purcell, Mike C., <​del>​Vince</​del>​ \\ 
 +Amazon: <​del>​Premal,​ Ankur,</​del>​ Naveen\\ 
 +NTT Data MSE: Imamura \\ 
 +Denso Ten: <​del>​Kusakabe</​del>​ \\ 
 +Microchip: Michael, Christian \\ 
 +IoT.bzh: <​del>​Stephane</​del>,​ Fulup \\ 
 +Konsulko: Matt P., <​del>​Matt R</​del>​.\\ 
 + 
 + 
 +Notes: 
 + 
 +Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer in github (https://​github.com/​Nuance-Mobility/​agl-speech-interface) 
 +   * Not wired up to a speech or TTS engine, more of a loop back test.  
 +   * Will send the link to Konsulko and IoT.bzh to review the API for suggestions.  
 +   * List of AGL services available can be seen at https://​git.automotivelinux.org/​ 
 +   * Need to figure out consent and privacy issues with AGL Identity Agent.  
 +   * How to manage grammar and natural language APIs and split between services and apps?  
 +   * How to integrate cloud speech applications?​ 
 +     * Example: "Find me the closest pizza place" is processed in the cloud and the location and name are returned to the ECU.  How is this then transmitted to the POI and/or navi app? 
 +   * Sample config from softmixer https://​github.com/​iotbzh/​4a-softmixer/​blob/​master/​conf.d/​project/​lua.d/​smixer-test-simple.lua 
 +   * Starting a demo project internally led by Paul. End of Aug early Sep they plan to have a design document together internally and will be ready with any questions/​issue. Would be a good idea to target the [[agl-distro:​sep2018-f2f|Sep F2F in Santa Clara]] to resolve issues with the design.  
 +   * Information on IRC, mail list etc is available at [[start:​getting-started|Getting Started with AGL]] 
 +   * Supported hardware can be found at [[agl-distro#​supported_hardware|AGL Distribution]] 
 + 
 + 
 +Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.  
 +   * Starting to look at AGL App FW binder implementation using audio HAL as a reference. Will work with IoT.bzh on how to put the configuration together.  
 +   * Would like to put together an architecture picture based on the white board drawings from February AMM to see how the API fits into AGL overall. ​  
 + 
 +Microchip - AGL USB microphone front-end. Michael is working with MicroSemi on getting hardware available that is already available for Amazon Alexa. Would like to have a prototype available for the AMM in Dresden. Microchip plans to provide the HAL for the microphone.  
 + 
 +Action item: 
 +   * Move the github repo into AGL git to foster collaboration 
 + 
 + 
 + 
 +==== June 27, 2018 ==== 
 +Attendees:​ 
 + 
 +LF: Walt, <​del>​Jan-Simon</​del>​\\ 
 +Voicebox: \\  
 +Nuance: <​del>​Christian,</​del>​ Mike C., <​del>​Vince</​del>​ \\ 
 +Amazon: Premal, Ankur, Naveen\\ 
 +NTT Data MSE: <​del>​Imamura</​del>​ \\ 
 +Denso Ten: <​del>​Kusakabe</​del>​ \\ 
 +Microchip: <​del>​Michael,​ Christian</​del>​ \\ 
 +IoT.bzh: <​del>​Stephane</​del>,​ Fulup \\ 
 +Konsulko: Matt P., Matt R.\\ 
 + 
 + 
 +Notes: 
 + 
 +Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer in github (https://​github.com/​Nuance-Mobility/​agl-speech-interface) 
 +   * Not wired up to a speech or TTS engine, more of a loop back test.  
 +   * Will send the link to Konsulko and IoT.bzh to review the API for suggestions.  
 +   * List of AGL services available can be seen at https://​git.automotivelinux.org/​ 
 +   * Need to figure out consent and privacy issues with AGL Identity Agent.  
 +   * How to manage grammar and natural language APIs and split between services and apps?  
 +   * How to integrate cloud speech applications?​ 
 +     * Example: "Find me the closest pizza place" is processed in the cloud and the location and name are returned to the ECU.  How is this then transmitted to the POI and/or navi app? 
 +   * Sample config from softmixer https://​github.com/​iotbzh/​4a-softmixer/​blob/​master/​conf.d/​project/​lua.d/​smixer-test-simple.lua 
 + 
 + 
 +Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.  
 +   * Starting to look at AGL App FW binder implementation using audio HAL as a reference. Will work with IoT.bzh on how to put the configuration together.  
 +   * Would like to put together an architecture picture based on the white board drawings from February AMM to see how the API fits into AGL overall. ​  
 + 
 +Voicebox was acquired by Nuance so they will probably not be participating as a separate entity.  
 + 
 + 
 +==== June 7, 2018 ==== 
 +Attendees:​ 
 + 
 +LF: Walt, Jan-Simon\\ 
 +Voicebox: \\  
 +Nuance: Christian, <​del>​Mike C., Vince</​del>​ \\ 
 +Amazon: \\ 
 +NTT Data MSE: Imamura \\ 
 +Denso Ten: Kusakabe \\ 
 +Microchip: Michael, Christian \\ 
 +IoT.bzh: Stephane, Fulup \\ 
 +Konsulko: Matt P., Matt R.\\ 
 + 
 + 
 +Notes: 
 + 
 +Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer in github (https://​github.com/​Nuance-Mobility/​agl-speech-interface) 
 +   * Not wired up to a speech or TTS engine, more of a loop back test.  
 +   * Will send the link to Konsulko and IoT.bzh to review the API for suggestions.  
 +   * List of AGL services available can be seen at https://​git.automotivelinux.org/​ 
 +   * Need to figure out consent and privacy issues with AGL Identity Agent.  
 +   * How to manage grammar and natural language APIs and split between services and apps?  
 +   * How to integrate cloud speech applications?​ 
 +     * Example: "Find me the closest pizza place" is processed in the cloud and the location and name are returned to the ECU.  How is this then transmitted to the POI and/or navi app? 
 +   * Sample config from softmixer https://​github.com/​iotbzh/​4a-softmixer/​blob/​master/​conf.d/​project/​lua.d/​smixer-test-simple.lua 
 + 
 + 
 +Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.  
 +   * No one joined.  
 + 
 + 
 +Face-to-Face meeting planned for June 19 in Tokyo.  
 + 
 + 
 +==== May 30, 2018 ==== 
 +Attendees:​ 
 + 
 +LF: Walt, Jan-Simon\\ 
 +Voicebox: \\  
 +Nuance: Christian, Mike C., Vince \\ 
 +Amazon: \\ 
 +NTT Data MSE:  \\ 
 +Denso Ten: Kusakabe \\ 
 +Microchip: Michael, Christian \\ 
 +IoT.bzh: Stephane \\ 
 +Qt Company: 
 + 
 +Notes: 
 + 
 +Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer in github (https://​github.com/​Nuance-Mobility/​agl-speech-interface) 
 +   * Not wired up to a speech or TTS engine, more of a loop back test.  
 +   * Will send the link to Konsulko and IoT.bzh to review the API for suggestions.  
 +   * List of AGL services available can be seen at https://​git.automotivelinux.org/​ 
 +   * Need to figure out consent and privacy issues with AGL Identity Agent.  
 +   * How to manage grammar and natural language APIs and split between services and apps?  
 +   * How to integrate cloud speech applications?​ 
 +     * Example: "Find me the closest pizza place" is processed in the cloud and the location and name are returned to the ECU.  How is this then transmitted to the POI and/or navi app? 
 + 
 +Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.  
 +   * No one joined.  
 + 
 +Video conference during the Lorient F2F meeting on June 7  
 + 
 +Face-to-Face meeting planned for June 19 in Tokyo.  
 + 
 + 
 +-------- 
 + 
 +==== May 16, 2018 ====
 Attendees: Attendees:
 LF: Walt\\ LF: Walt\\
 Voicebox: \\  Voicebox: \\ 
 Nuance: Christian \\ Nuance: Christian \\
-Amazon: Premal +Amazon: Premal ​\\ 
-NTT Data MSE: Imamura +NTT Data MSE: Imamura ​\\ 
-Denso Ten: Kusakabe+Denso Ten: Kusakabe ​\\
 Qt Company: Alistair Qt Company: Alistair
  
eg-speech.txt · Last modified: 2021/11/23 15:57 by wminer