Technical Architecture Deep-Dive for Educators
Spring Boot · React · Azure Cognitive Services · Multi-LLM AI · Enterprise Security
Built for Thousands of Concurrent Learners — The technical foundation powering the platform's core features
Spring Boot 3.3.10 on Java 21 — reactive programming for real-time educational interactions, WebSocket support for live classroom sessions.
99.9% availability through intelligent fallback. Azure OpenAI for production quality, ZhiPuAI GLM-4-Flash for cultural context and backup.
AWS S3-compatible storage (DigitalSpaces). Presigned URLs with 60-minute expiry for learner data isolation and secure access.
Beyond Correct/Incorrect — Understanding how students speak, not just what they say
Azure Cognitive Services drives the pronunciation engine — breaking speech into individual phonemes and comparing against native speaker models.
Detects specific sound issues like /θ/ vs /s/ confusion in "think" vs "sink." Uses IPA for universal sound representation — the same system used in professional PTE, IELTS, and DSE preparation.
For discrete vocabulary testing with HundredMark grading system integration. Quick pass/fail with granular score breakdown.
For longer passages or presentation scripts. Configurable with 3000ms end silence timeout for younger learners. Captures fluency and pacing over entire paragraphs.
Color-coded phoneme feedback — students see exactly which sounds need work
Understanding how students communicate, not just what they know — Psychometric profiling through AI speech analysis
Weighted ipsative scoring across 10 forced-choice ranking dimensions analyzes communication patterns and cognitive styles, creating a personalized learner profile.
Students categorized into five profiles using weighted voting:
Underused tones highlighted for targeted practice assignment
From Live Translation captures to AI-generated flashcards — with video context and Cambridge Dictionary definitions
Binary system implements "desirable difficulties" — separating mastered vocabulary from active learning to prevent cognitive overload
From classroom debates to remote study sessions — AI handles moderation while teachers monitor analytics
Voice biometrics (speaker_id differentiation) analyzes group discussions on shared devices. Each participant receives individual pronunciation scores and logic assessment.
For remote learning, the system processes audio streams to determine optimal AI intervention points (join_at_seconds). AI generates contextual Socratic responses as audio streams.
Real-time analytics reveal which students dominate discussions versus those who need encouragement
Handling real-world classroom conditions — noise, accented speech, and imperfect recordings
Converts browser WebM recordings to Azure-compatible WAV (PCM 16-bit, 16kHz mono). Duration analysis via ffprobe supports HKDSE timing standards.
On-device VAD using RMS (energy) and ZCR (zero-crossing rate) analysis. Adaptive noise floor from first ~1 second, then exponential smoothing (α=0.02).
300ms pre/post padding preserves natural pauses. 400ms minimum segment threshold removes noise artifacts while keeping meaningful content.
Understanding student intent, not just exact words — Multi-algorithm ensemble for flexible, natural language matching
Weights favor semantic meaning and phrase structure over exact character matching — students who rearrange words or use synonyms still receive credit.
Protecting student data and assessment integrity — PCI DSS compliant payment processing and enterprise authentication
Google reCAPTCHA v2/v3 for assessment integrity. Prevents bots from corrupting AI personalization data.
Spring Security with JWT tokens. Role-based permissions: students, teachers, administrators.
3D Secure authentication for international cards. CustomerSession API. PCI DSS compliant.
Microsoft Teams webhook alerts for real-time operational notifications. Theme-colored by severity.
Institutional billing with purchase orders. Bulk subscription management for school-wide deployments. Usage-based pricing for per-minute Live Translation. GDPR and COPPA compliance for educational settings with minors.
Role-based access ensures teachers see assigned students only. Admin views aggregate school data without accessing individual profiles. Parent consent required for under-13 accounts. Data export and deletion capabilities.
Seamless integration with existing educational infrastructure — Thinkific LMS connectivity and flexible subscription management
REST API integration with robust pagination (200 items/page) and automatic retry with exponential backoff for HTTP 429 rate limiting.
Complete lifecycle management with automatic billing, trial-to-paid conversion tracking, and configurable pricing models.
From the Phoneme to the Paragraph
• AI Conversation Partner 24/7
• Phoneme-level pronunciation feedback
• Video flashcard vocabulary system
• Personalized learning pathways
• Live Translation classroom support
• Discussion heatmap analytics
• SBA preparation modules
• Real-time progress dashboards
• Thinkific LMS integration
• Enterprise security (PCI, GDPR)
• Scalable 99.9% SLA infrastructure
• Bulk subscription management
Think & Speak — See Change AI English · SeeChange Education · www.thinkandspeak.com