A real-time multimodal communication interface built with React and TypeScript, leveraging Google's Generative AI services for context-aware AI interactions.
- 💬 Real-time Communication: WebSocket-based audio streaming and processing
- 🎧 Advanced Audio: Web Audio API integration with volume metering and visualization
- 🤖 AI Integration: Seamless connection with Google's Generative Language API
- 📹 Media Capture: Support for webcam and screen capture
- 🔄 Real-time Processing: Media stream multiplexing and audio worklet processing
- 🎛️ Modular Design: Flexible component architecture with side panels and control trays
- 🛠️ Tool Integration: Support for tool calls and responses
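The volume metering mentioned in the feature list can be illustrated with a root-mean-square (RMS) calculation over one block of samples. This is a minimal sketch, not the project's actual meter: the name `rmsLevel` is hypothetical, and it assumes samples arrive as floats in the range [-1, 1], which is how the Web Audio API delivers them.

```typescript
// Hypothetical helper (not taken from this repository): RMS level of one
// block of audio samples. Assumes samples are floats in [-1, 1].
function rmsLevel(samples: Float32Array): number {
  if (samples.length === 0) return 0; // avoid dividing by zero on empty blocks
  let sumOfSquares = 0;
  for (let i = 0; i < samples.length; i++) {
    sumOfSquares += samples[i] * samples[i];
  }
  return Math.sqrt(sumOfSquares / samples.length);
}

// A square wave at half amplitude has an RMS level of 0.5.
console.log(rmsLevel(new Float32Array([0.5, -0.5, 0.5, -0.5]))); // 0.5
```

A value near 0 means silence and a value near 1 means a signal at full scale, so the result can drive a meter or pulse animation directly.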
- React + TypeScript
- Web Audio API
- WebSocket API
- Google Generative AI SDK
src/
├── components/ # React components (Altair, AudioPulse, etc.)
├── contexts/ # React context providers
├── hooks/ # Custom hooks for media and API
├── lib/ # Core utilities and audio processing
│ └── worklets/ # Audio worklet processors
└── [other config files]
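As an illustration of the kind of work an audio worklet processor typically does before audio is streamed, the sketch below buffers float samples into fixed-size chunks of 16-bit PCM. It rests on assumptions: the class name `Pcm16Chunker`, the 16-bit PCM format, and the chunk size are all hypothetical and are not taken from the files in `lib/worklets/`. In a real worklet, the `push` logic would run inside `AudioWorkletProcessor.process()`.

```typescript
// Hypothetical chunker (not from this repository): collects float samples,
// converts them to 16-bit PCM, and emits fixed-size chunks via a callback.
class Pcm16Chunker {
  private readonly chunkSize: number;
  private readonly onChunk: (chunk: Int16Array) => void;
  private readonly buffer: Int16Array;
  private writeIndex = 0;

  constructor(chunkSize: number, onChunk: (chunk: Int16Array) => void) {
    this.chunkSize = chunkSize;
    this.onChunk = onChunk;
    this.buffer = new Int16Array(chunkSize);
  }

  push(samples: Float32Array): void {
    for (let i = 0; i < samples.length; i++) {
      // Clamp to [-1, 1], then scale to the signed 16-bit range.
      const clamped = Math.max(-1, Math.min(1, samples[i]));
      this.buffer[this.writeIndex] = clamped < 0 ? clamped * 0x8000 : clamped * 0x7fff;
      this.writeIndex++;
      if (this.writeIndex === this.chunkSize) {
        // Emit a copy so the internal buffer can be reused safely.
        this.onChunk(this.buffer.slice());
        this.writeIndex = 0;
      }
    }
  }
}

// Usage: collect chunks of two samples each.
const chunks: Int16Array[] = [];
const chunker = new Pcm16Chunker(2, (chunk) => chunks.push(chunk));
chunker.push(new Float32Array([1, -1, 0.5]));
chunker.push(new Float32Array([0]));
```

Emitting fixed-size chunks keeps each outgoing message a predictable size, at the cost of holding back any samples that do not yet fill a chunk.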
- Clone the repository
- Install dependencies: `npm install`
- Configure your Google API key
- Start the development server: `npm start`

Copyright 2024 Google LLC
Licensed under the Apache License, Version 2.0