PubSpeaker

PubSpeaker is a speech analysis and feedback application designed to help users improve their public speaking and pronunciation skills. The app transcribes spoken input into text, analyzes pronunciation, grammar, and fluency, and highlights errors or areas for improvement. A history feature lets users revisit past results, rereading transcripts or replaying their recorded speech for self-evaluation.

Client Michael Rodriguez

Sector Education

Duration 6 weeks

Project Overview

We delivered a comprehensive speech analysis experience: accurate speech-to-text transcription, phoneme-level pronunciation evaluation, grammar and fluency analysis, clear visual error highlighting, audio playback for self-review, and a structured history system for tracking past results. The result is a complete skill-development tool that gives users practical, data-driven feedback on their public speaking and supports continuous improvement.

Our Approach

PubSpeaker integrates speech-to-text transcription, phoneme-level pronunciation analysis, and grammar evaluation to deliver clear, actionable feedback that helps users improve their public speaking skills.

How I Did It

Speech Transcription & Alignment

Integrated OpenAI Whisper for speech-to-text transcription and Montreal Forced Aligner (MFA) for phoneme-level alignment to accurately analyze pronunciation.
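
The two tools can be chained roughly as follows. This is a minimal sketch, not the project's actual code: the function names, the "base" Whisper model, and the "english_us_arpa" dictionary and acoustic model are assumptions chosen for illustration. It assumes the openai-whisper package and the MFA command-line tool are installed.

```python
from pathlib import Path


def transcribe(audio_path: str, model_name: str = "base") -> str:
    """Return Whisper's transcript for one recording (hypothetical helper)."""
    import whisper  # imported lazily; needs the openai-whisper package

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    return result["text"].strip()


def write_transcript(audio_path: str, transcript: str, corpus_dir: str) -> Path:
    """Save the transcript where MFA looks for it.

    MFA pairs each audio file with a same-named .lab (or .txt) file in the
    corpus directory; copying the audio file there is left to the caller.
    """
    corpus = Path(corpus_dir)
    corpus.mkdir(parents=True, exist_ok=True)
    lab_path = corpus / (Path(audio_path).stem + ".lab")
    lab_path.write_text(transcript, encoding="utf-8")
    return lab_path


def mfa_align_command(
    corpus_dir: str,
    output_dir: str,
    dictionary: str = "english_us_arpa",      # assumed pretrained dictionary
    acoustic_model: str = "english_us_arpa",  # assumed pretrained model
) -> list[str]:
    """Build the `mfa align` invocation that writes TextGrid alignments."""
    return ["mfa", "align", str(corpus_dir), dictionary, acoustic_model, str(output_dir)]
```

The command list can be passed to subprocess.run; MFA then writes one TextGrid per recording, with word and phone tiers that give the start and end time of each phoneme.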

Accuracy & Error Measurement

Applied Word Error Rate (WER) and text alignment techniques to evaluate transcription accuracy and identify pronunciation and fluency issues.
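
For reference, WER is the number of word substitutions, deletions, and insertions needed to turn the reference text into the transcript, divided by the number of reference words. The sketch below computes it with a standard word-level edit distance. The function name and the lowercase, whitespace-based tokenization are assumptions for illustration, not necessarily what the project used.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (row-by-row Levenshtein distance).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            cur.append(min(
                prev[j] + 1,         # deletion: reference word was not spoken
                cur[j - 1] + 1,      # insertion: extra word in the transcript
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = cur
    return prev[-1] / len(ref)


print(word_error_rate("the quick brown fox", "the quick brown dog"))  # 0.25
```

Because insertions count as errors, WER can exceed 1.0 when the transcript is much longer than the reference.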

Feedback & System Design

Designed a modular system that transforms transcription and alignment results into clear visual feedback, with audio replay and history tracking for continuous improvement.
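
One way such a design could look is sketched below: a highlighting step that compares the expected script with the transcript, and a small history store that keeps each result alongside its audio file. All names here (SpeechResult, History, highlight_errors), the bracket markers, and the assumption that an expected script is available are hypothetical; the real system's modules and data model may differ.

```python
from dataclasses import dataclass, field
from difflib import SequenceMatcher


def highlight_errors(expected: str, spoken: str) -> str:
    """Mark words that differ from the expected script with brackets."""
    exp, spk = expected.split(), spoken.split()
    out = []
    matcher = SequenceMatcher(a=exp, b=spk, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            out.extend(spk[j1:j2])
        elif tag == "delete":
            # Words in the script that never appear in the transcript.
            out.extend(f"[missing: {w}]" for w in exp[i1:i2])
        else:
            # "replace" or "insert": words spoken differently or added.
            out.extend(f"[{w}]" for w in spk[j1:j2])
    return " ".join(out)


@dataclass
class SpeechResult:
    audio_path: str   # kept so the recording can be replayed later
    transcript: str
    highlighted: str


@dataclass
class History:
    entries: list = field(default_factory=list)

    def add(self, audio_path: str, expected: str, transcript: str) -> SpeechResult:
        result = SpeechResult(audio_path, transcript, highlight_errors(expected, transcript))
        self.entries.append(result)
        return result


history = History()
entry = history.add("take1.wav", "the quick brown fox", "the quick brown dog")
print(entry.highlighted)  # the quick brown [dog]
```

Keeping the highlighting function separate from storage means the visual layer can change (colors, tooltips, phoneme-level detail) without touching how past results are saved.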

Let's Work Together

Have a project in mind? I'd love to hear about it and discuss how we can collaborate to create something amazing.