Voice & AI

This section covers the phone-side bootstrap for cloud ASR/TTS, the offline speech features on the glasses, streaming AI chat, and how these pieces fit together end to end.

Pages

Reading order

  1. Phone bootstrap: where the hosted voice services negotiate credentials.
  2. Glass lane: contrasts the offline wake-word lane with the streamed ASR lane.
  3. AI Chat: ties microphone capture, cloud ASR responses, and playback together.

Capability wiring

| Feature | Responsibility | Typical host screen |
| --- | --- | --- |
| Offline wake-word | Local hot-word triggers | HomeActivity |
| Offline TTS | Local narration | SendMessageActivity |
| Cloud ASR | Dictation-grade speech recognition | SendMessageActivity |
| Cloud TTS | Hosted speech synthesis | Provisioned phone init; stubs on glass |
| AI Chat | LLM streamed answers | SdkMediaActivity |
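
As a rough illustration, the wiring above can be mirrored in code as a small lookup. The sketch below is Java; the `VoiceCapability` enum and its `hostedBy` helper are hypothetical names invented for this example and are not SDK types. Only the feature names and host strings are taken from the table.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical registry mirroring the capability table; not an SDK type. */
enum VoiceCapability {
    OFFLINE_WAKE_WORD("HomeActivity"),
    OFFLINE_TTS("SendMessageActivity"),
    CLOUD_ASR("SendMessageActivity"),
    // Cloud TTS has no dedicated screen: it is set up during phone init.
    CLOUD_TTS("Provisioned phone init; stubs on glass"),
    AI_CHAT("SdkMediaActivity");

    /** The "Typical host screen" column, copied verbatim from the table. */
    final String host;

    VoiceCapability(String host) {
        this.host = host;
    }

    /** Returns every capability whose host column equals the given name. */
    static List<VoiceCapability> hostedBy(String hostName) {
        List<VoiceCapability> matches = new ArrayList<>();
        for (VoiceCapability capability : values()) {
            if (capability.host.equals(hostName)) {
                matches.add(capability);
            }
        }
        return matches;
    }
}
```

A lookup such as `VoiceCapability.hostedBy("SendMessageActivity")` makes it easy to see that one screen hosts both offline TTS and cloud ASR, which is worth keeping in mind when reading the individual pages.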

End-to-end mental model

```mermaid
sequenceDiagram
    participant U as User
    participant G as Glass client
    participant A as ASR service
    participant AI as AI Chat
    participant T as TTS playback

    U->>G: Speak
    G->>A: Begin speech decode
    A-->>G: Partial/final hypotheses
    G->>AI: Forward user question text
    AI-->>G: Incremental narration tokens
    G->>T: Render reply as audible speech
    T-->>U: Playback
```
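
The same flow can be sketched in code to make the hand-offs concrete. This is a minimal Java sketch under stated assumptions: `AsrService`, `ChatService`, `TtsPlayback`, and `GlassClient` are hypothetical interfaces and classes named after the diagram participants, and their method signatures are invented for illustration rather than taken from the SDK.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/** Hypothetical ASR lane: reports partial and final hypotheses for one utterance. */
interface AsrService {
    void decode(byte[] audio, Consumer<String> onPartial, Consumer<String> onFinal);
}

/** Hypothetical chat lane: streams reply tokens for one question. */
interface ChatService {
    void ask(String question, Consumer<String> onToken);
}

/** Hypothetical playback lane: speaks one chunk of text. */
interface TtsPlayback {
    void speak(String text);
}

/** Glass-side coordinator that follows the arrows of the sequence diagram. */
final class GlassClient {
    private final AsrService asr;
    private final ChatService chat;
    private final TtsPlayback tts;

    /** Partial hypotheses seen so far, kept only for on-screen display. */
    final List<String> partials = new ArrayList<>();

    GlassClient(AsrService asr, ChatService chat, TtsPlayback tts) {
        this.asr = asr;
        this.chat = chat;
        this.tts = tts;
    }

    /** User speaks: decode, forward the final text, and speak each reply token. */
    void onUserSpeech(byte[] audio) {
        asr.decode(
            audio,
            partials::add,                                  // partial hypothesis: display only
            finalText -> chat.ask(finalText, tts::speak));  // final hypothesis: ask, then play back
    }
}
```

Only the final hypothesis is forwarded to the chat lane here, so the model never sees a half-recognised question. The sketch speaks each token as it arrives purely to keep the example short; a real client would more likely buffer tokens into phrases before handing them to playback.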