User Tools

Site Tools


eg-speech

Speech Expert Group

Goals for Expert Group

  • Create a standardized set of speech recognition APIs that app developers can use regardless of underlying speech engine
  • Natural language or grammar tree based
  • On board or cloud based speech
  • Text to Speech API
  • Signal processing for noise reduction and echo cancellation (future roadmap)
  • Grammar development tools
  • Amazon has an open API that could be used as a starting point
  • Goal is to have a straw man by the AMM in Tokyo, Feb. 20-21 with work completed by ALS in June.
  • Next step is to set up a call with the engineers from all three companies who will be working on this.

Meetings

July 11, 2018

Attendees:

LF: Walt, Jan-Simon
Voicebox:
Nuance: Christian, Paul Purcell, Mike C., Vince
Amazon: Premal, Ankur, Naveen
NTT Data MSE: Imamura
Denso Ten: Kusakabe
Microchip: Michael, Christian
IoT.bzh: Stephane, Fulup
Konsulko: Matt P., Matt R.

Notes:

Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer in github (https://github.com/Nuance-Mobility/agl-speech-interface)

  • Not wired up to a speech or TTS engine, more of a loop back test.
  • Will send the link to Konsulko and IoT.bzh to review the API for suggestions.
  • List of AGL services available can be seen at https://git.automotivelinux.org/
  • Need to figure out consent and privacy issues with AGL Identity Agent.
  • How to manage grammar and natural language APIs and split between services and apps?
  • How to integrate cloud speech applications?
    • Example: “Find me the closest pizza place” is processed in the cloud and the location and name are returned to the ECU. How is this then transmitted to the POI and/or navi app?
  • Starting a demo project internally led by Paul. End of Aug early Sep they plan to have a design document together internally and will be ready with any questions/issue. Would be a good idea to target the Sep F2F in Santa Clara to resolve issues with the design.
  • Information on IRC, mail list etc is available at Getting Started with AGL
  • Supported hardware can be found at AGL Distribution

Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.

  • Starting to look at AGL App FW binder implementation using audio HAL as a reference. Will work with IoT.bzh on how to put the configuration together.
  • Would like to put together an architecture picture based on the white board drawings from February AMM to see how the API fits into AGL overall.

Microchip - AGL USB microphone front-end. Michael is working with MicroSemi on getting hardware available that is already available for Amazon Alexa. Would like to have a prototype available for the AMM in Dresden. Microchip plans to provide the HAL for the microphone.

Action item:

  • Move the github repo into AGL git to foster collaboration

June 27, 2018

Attendees:

LF: Walt, Jan-Simon
Voicebox:
Nuance: Christian, Mike C., Vince
Amazon: Premal, Ankur, Naveen
NTT Data MSE: Imamura
Denso Ten: Kusakabe
Microchip: Michael, Christian
IoT.bzh: Stephane, Fulup
Konsulko: Matt P., Matt R.

Notes:

Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer in github (https://github.com/Nuance-Mobility/agl-speech-interface)

  • Not wired up to a speech or TTS engine, more of a loop back test.
  • Will send the link to Konsulko and IoT.bzh to review the API for suggestions.
  • List of AGL services available can be seen at https://git.automotivelinux.org/
  • Need to figure out consent and privacy issues with AGL Identity Agent.
  • How to manage grammar and natural language APIs and split between services and apps?
  • How to integrate cloud speech applications?
    • Example: “Find me the closest pizza place” is processed in the cloud and the location and name are returned to the ECU. How is this then transmitted to the POI and/or navi app?

Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.

  • Starting to look at AGL App FW binder implementation using audio HAL as a reference. Will work with IoT.bzh on how to put the configuration together.
  • Would like to put together an architecture picture based on the white board drawings from February AMM to see how the API fits into AGL overall.

Voicebox was acquired by Nuance so they will probably not be participating as a separate entity.

June 7, 2018

Attendees:

LF: Walt, Jan-Simon
Voicebox:
Nuance: Christian, Mike C., Vince
Amazon:
NTT Data MSE: Imamura
Denso Ten: Kusakabe
Microchip: Michael, Christian
IoT.bzh: Stephane, Fulup
Konsulko: Matt P., Matt R.

Notes:

Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer in github (https://github.com/Nuance-Mobility/agl-speech-interface)

  • Not wired up to a speech or TTS engine, more of a loop back test.
  • Will send the link to Konsulko and IoT.bzh to review the API for suggestions.
  • List of AGL services available can be seen at https://git.automotivelinux.org/
  • Need to figure out consent and privacy issues with AGL Identity Agent.
  • How to manage grammar and natural language APIs and split between services and apps?
  • How to integrate cloud speech applications?
    • Example: “Find me the closest pizza place” is processed in the cloud and the location and name are returned to the ECU. How is this then transmitted to the POI and/or navi app?

Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.

  • No one joined.

Face-to-Face meeting planned for June 19 in Tokyo.

May 30, 2018

Attendees:

LF: Walt, Jan-Simon
Voicebox:
Nuance: Christian, Mike C., Vince
Amazon:
NTT Data MSE:
Denso Ten: Kusakabe
Microchip: Michael, Christian
IoT.bzh: Stephane
Qt Company:

Notes:

Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer in github (https://github.com/Nuance-Mobility/agl-speech-interface)

  • Not wired up to a speech or TTS engine, more of a loop back test.
  • Will send the link to Konsulko and IoT.bzh to review the API for suggestions.
  • List of AGL services available can be seen at https://git.automotivelinux.org/
  • Need to figure out consent and privacy issues with AGL Identity Agent.
  • How to manage grammar and natural language APIs and split between services and apps?
  • How to integrate cloud speech applications?
    • Example: “Find me the closest pizza place” is processed in the cloud and the location and name are returned to the ECU. How is this then transmitted to the POI and/or navi app?

Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.

  • No one joined.

Video conference during the Lorient F2F meeting on June 7

Face-to-Face meeting planned for June 19 in Tokyo.


May 16, 2018

Attendees: LF: Walt
Voicebox:
Nuance: Christian
Amazon: Premal
NTT Data MSE: Imamura
Denso Ten: Kusakabe
Qt Company: Alistair

Notes:

Nuance still discussing internally about releasing their API. Christian working with AGL App FW and working with writing an AGL Service layer.

Amazon looking at releasing a possible API in June and starting to work with the AGL App FW.

Video conference during the Lorient F2F meeting on June 7

Face-to-Face meeting planned for June 19 in Tokyo.

Feb 14, 2018

Attendees:
LF: Walt, Dan
Voicebox: Andrew
Nuance: Christian, Mike
Amazon: John

Notes:

  • Amazon still internally discussing making their API available for AGL.
  • Amazon SDKs that are available publicly require an agreement with Amazon to access
    • Alexa Voice Service Device SDK
    • Alexa Skills Kit
  • Nuance interface still discussing internally. Christian made a presentation that he will send around with Nuance's ideas for the API.
  • Walt will use the minutes from these calls to lead an AMM session next week that updates the community on the EG's work.

Feb 6, 2018

Attendees:
LF: Walt, Jan-Simon
Voicebox: Andrew and Adam
Nuance: Christian, Mike
Amazon: None

Notes:

  • No word from Amazon on the availability of their API as well the link to what is publicly.
  • Nuance interface still discussing internally. Still waiting for Amazon proposal.
  • Walt will follow up with Amazon about their API. Schedule a follow up call for Tuesday, Feb 13
  • Question about how AGL handles app creation and installation

Kick off Meeting Jan 29, 2018

Attendees: Walt, Jan-Simon, Mike Chachich, John Scumniotales, Andrew Fairly, Vince Iannotti, Christian Benien (attending AMM)

Agenda for kick off meeting

  • Review expert group goals as captured above
  • Proposal for speech recognition API or TTS API from Amazon?
  • Meeting schedule (biweekly? what time?)
  • Developer commitment from EG members.
  • Reviewed agenda and notes from CES
  • John said there is ongoing internal Amazon about releasing their code. There is already an open source version or public API version. Need to find out definitively what they are talking about as the release.
    • John will send a link to what is publicly available now
    • Should wrap up internal discussions end of this week. (Feb 2)
  • Michael may have something they can release. Will discuss internally once we see what Amazon has.
  • Attending AMM: Nuance: Christen, Amazon: Shitaro and Sanjay. VBT: TBD
  • Reserve time at AMM on Thursday (1 hour)
  • Follow up Feb 6
eg-speech.txt · Last modified: 2018/07/11 15:06 by waltminer