Voice & AI
Phone-side cloud ASR/TTS bootstrap, offline glass speech features, AI streaming chat, plus how pieces stitch together end-to-end.
Pages
Reading order
- Phone bootstrap — where hosted voice services negotiate credentials.
- Glass lane — contrasts offline wakewords vs streamed ASR lanes.
- AI Chat — ties microphones, cloud ASR responses, playback together.
Capability wiring
| Feature | Responsibility | Typical host screen |
|---|---|---|
| Offline wake-word | Local hot-word triggers | HomeActivity |
| Offline TTS | Local narration | SendMessageActivity |
| Cloud ASR | Dictation-grade dictation | SendMessageActivity |
| Cloud TTS | Hosted speech synthesis | Provisioned phone init; stubs on glass |
| AI Chat | LLM streamed answers | SdkMediaActivity |
End-to-end mental model
mermaid
sequenceDiagram
participant U as User
participant G as Glass client
participant A as ASR service
participant AI as AI Chat
participant T as TTS playback
U->>G: Speak
G->>A: Begin speech decode
A-->>G: Partial/final hypotheses
G->>AI: Forward user question text
AI-->>G: Incremental narration tokens
G->>T: Render reply as audible speech
T-->>U: Playback