Blog
Notes
Projects
Hire Me
Dark mode
Light mode
Blog
Notes
Projects
Dark mode
Light mode
Home
❯
notes
❯
AI ML
❯
Generative AI
❯
Voice Multimodal
Folder: notes/AI-ML/Generative-AI/Voice-Multimodal
9 items under this folder.
Mar 01, 2026
Audio Processing - How ML Models Understand Sound
Mar 01, 2026
Voice LLM Fundamentals - Audio & Core Concepts
Mar 01, 2026
Inference Optimization - Speed & Cost Reduction
Mar 01, 2026
Voice LLM Resources - Tools, Libraries & References
Mar 01, 2026
Speech Language Models - Voice LLM Architecture
Mar 01, 2026
Speech-to-Speech Models - Real-Time Voice Interaction
Mar 01, 2026
Text-to-Speech (TTS) Models - Architecture & Implementation
Mar 01, 2026
Voice Agent Evaluation & Testing
Mar 01, 2026
Voice Agents Deployment - Infrastructure & Cost