InterviewAI

SpeechDojo 2025 | Project

Summary

InterviewAI is a real-time, voice-based AI interview agent that simulates professional interview settings. It uses conversational AI to give instant feedback, helping users practice and refine their communication skills. Built on a modern web stack, the tool supports natural interactions through live speech transcription and dynamic response generation. By replicating the pressure and flow of real interviews, it lets candidates prepare more effectively for high-stakes situations. The goal is to bridge the gap between practice and performance and to make interview preparation more interactive.

Technical Overview

  • Integrated OpenAI Realtime API with WebRTC to deliver low-latency speech-to-text transcription and conversational LLM-driven feedback during live interview simulations.
  • Designed modular audio processing pipelines (WavRecorder, WavStreamPlayer) for real-time PCM16 audio streaming, dynamic playback interruption, and adaptive input handling.
  • Implemented voice activity detection (VAD) and push-to-talk controls for natural human–AI dialogue flow, reducing latency and improving conversational alignment with LLM responses.
  • Optimized system architecture across a React.js front end and a Node.js back end, supporting scalable LLM inference integration and robust session management for multi-user deployments.
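
To make the PCM16 streaming and voice activity detection points above more concrete, here is a minimal TypeScript sketch. It is not the project's actual code: the internals of WavRecorder and the VAD method used are not described here, so the names floatToPcm16, rms, and EnergyVad, the energy-threshold approach, and the default threshold and hangover values are all assumptions chosen for illustration.

```typescript
// Hypothetical helper: convert Float32 samples in [-1, 1] (the Web Audio
// convention) to 16-bit signed PCM.
function floatToPcm16(samples: Float32Array): Int16Array {
  const out = new Int16Array(samples.length);
  for (let i = 0; i < samples.length; i++) {
    // Clamp first so out-of-range input cannot wrap around.
    const s = Math.max(-1, Math.min(1, samples[i]));
    // Negative values scale by 32768 and positive by 32767,
    // which covers the full int16 range.
    out[i] = s < 0 ? Math.round(s * 0x8000) : Math.round(s * 0x7fff);
  }
  return out;
}

// Root-mean-square energy of a PCM16 frame, normalised to [0, 1].
function rms(frame: Int16Array): number {
  if (frame.length === 0) return 0;
  let sum = 0;
  for (let i = 0; i < frame.length; i++) {
    const v = frame[i] / 0x8000;
    sum += v * v;
  }
  return Math.sqrt(sum / frame.length);
}

// Assumed tuning knobs; the values are illustrative, not measured.
interface VadOptions {
  threshold: number;      // RMS level above which a frame counts as speech
  hangoverFrames: number; // quiet frames tolerated before speech ends
}

// Simple energy-threshold VAD with a hangover period, so short pauses
// between words do not end the turn immediately.
class EnergyVad {
  private readonly opts: VadOptions;
  private speaking = false;
  private quietFrames = 0;

  constructor(opts: VadOptions = { threshold: 0.02, hangoverFrames: 10 }) {
    this.opts = opts;
  }

  // Feed one frame; returns true while the speaker is considered active.
  process(frame: Int16Array): boolean {
    if (rms(frame) >= this.opts.threshold) {
      this.speaking = true;
      this.quietFrames = 0;
    } else if (this.speaking) {
      this.quietFrames++;
      if (this.quietFrames >= this.opts.hangoverFrames) {
        this.speaking = false;
      }
    }
    return this.speaking;
  }
}
```

In a pipeline like the one described, each captured buffer would pass through floatToPcm16 before streaming, and the result of EnergyVad.process could decide when to interrupt playback or end the user's turn. A push-to-talk mode would bypass the detector and use the button state instead.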

Project in progress; more updates coming soon!

Skills

Conversational AI, Human-AI Interaction, Next.js, React.js, Node.js, LLMs, Speech-to-Text, TypeScript, OpenAI Realtime API, WebRTC, Realtime Audio Processing
InterviewAI — Om Upadhyay