Skip to content

Conversation

cynthiajoan
Copy link
Collaborator

Description

Replace this paragraph with a description of what this PR is doing. If you're modifying existing behavior, describe the existing behavior, how this PR is changing it, and what motivated the change.

Related Issues

Replace this paragraph with a list of issues related to this PR from the issue database. Indicate, which of these issues are resolved or fixed by this PR. Note that you'll have to prefix the issue numbers with flutter/flutter#.

Checklist

Before you create this PR confirm that it meets all requirements listed below by checking the relevant checkboxes ([x]).
This will ensure a smooth and quick review process. Updating the pubspec.yaml and changelogs is not required.

  • I read the Contributor Guide and followed the process outlined there for submitting PRs.
  • My PR includes unit or integration tests for all changed/updated/fixed behaviors (See Contributor Guide).
  • All existing and new tests are passing.
  • I updated/added relevant documentation (doc comments with ///).
  • The analyzer (melos run analyze) does not report any problems on my PR.
  • I read and followed the Flutter Style Guide.
  • I signed the CLA.
  • I am willing to follow-up on review comments in a timely manner.

Breaking Change

Does your PR require plugin users to manually update their apps to accommodate your change?

  • Yes, this is a breaking change.
  • No, this is not a breaking change.

@cynthiajoan
Copy link
Collaborator Author

/gemini summarize

Copy link

This pull request introduces bidirectional transcription capabilities for Firebase AI. Key changes include:

  • New Configuration: Adds AudioTranscriptionConfig and integrates inputAudioTranscription and outputAudioTranscription into LiveGenerationConfig to enable transcription for both input and output audio streams.
  • Transcription Data Model: Introduces a Transcription class to represent transcription text and its completion status.
  • Live Server Content: Extends LiveServerContent to include inputTranscription and outputTranscription fields, allowing the live server to send transcription updates.
  • Example App Updates: The example application (bidi_page.dart) has been updated to display these new transcription messages in real-time, including logic to append new transcription segments to existing messages. The text field in MessageData was made mutable to facilitate these updates.

This feature enhances the live generation experience by providing real-time text representations of both user input and model output audio.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant