mirror of https://github.com/GoogleChrome/chrome-extensions-samples.git synced 2026-03-26 13:19:49 +07:00

Files

Sebastian Benz 85f721f5a9 Improve Gemini samples (#1611 )

* Migrate to latest version of the language model

* Update readme to better describe the sample.

* More readme updates

* Consistent API naming and format lists

2026-01-15 13:38:00 +01:00

assets

add audio scribe sample (#1475 )

2025-05-20 08:37:50 +02:00

demo-chat-app

Add linter fixes for new AI samples (#1484 )

2025-06-02 13:46:10 +02:00

background.js

add audio scribe sample (#1475 )

2025-05-20 08:37:50 +02:00

bridge.js

add audio scribe sample (#1475 )

2025-05-20 08:37:50 +02:00

manifest.json

Add Prompt API trial tokens for multimodal extension samples (#1597 )

2026-01-06 13:51:57 +00:00

override-createobject-url.js

add audio scribe sample (#1475 )

2025-05-20 08:37:50 +02:00

README.md

Improve Gemini samples (#1611 )

2026-01-15 13:38:00 +01:00

sidepanel.html

add audio scribe sample (#1475 )

2025-05-20 08:37:50 +02:00

sidepanel.js

Fix prompt format for image description in prompt array (#1516 )

2025-07-29 08:42:39 +02:00

README.md

Audio-Scribe: Transcribe audio messages with Chrome's multimodal Prompt API

This sample demonstrates how to use Chrome's built-in AI APIs to transcribe audio messages directly in the browser. It uses:

Prompt API with multimodal audio input (Gemini Nano) for on-device speech-to-text transcription

Overview

Audio-Scribe adds a side panel that automatically transcribes audio messages from chat applications. When activated, it:

Monitors the page for audio blobs created via URL.createObjectURL.
Detects audio content and sends it to Gemini Nano for transcription.
Streams the transcribed text in real-time to the side panel.
Works with messaging apps like WhatsApp Web that use blob URLs for audio messages.

Running this extension

Clone this repository.
Load this directory in Chrome as an unpacked extension.
Open a chat app in the browser, for example https://web.whatsapp.com/. You can also run the included demo chat app:
```
npx serve demo-chat-app
```
Open the Audio-Scribe side panel by clicking the extension icon or pressing Alt+A.
Play or load audio messages in the chat - they will be automatically transcribed in the side panel.