Advanced Voice Authentication System Powered by AI
Secure, intelligent voice identification using OpenAI's ChatGPT and Whisper APIs
π Quick Start β’ π Documentation β’ π― Features β’ π€ Contributing
VoiceGuard is a cutting-edge voice authentication system that combines traditional voice biometrics with advanced AI analysis. Using OpenAI's ChatGPT-4 and Whisper APIs, it provides enterprise-grade voice identification with unprecedented accuracy and insight.
- π€ AI-Powered Analysis: Leverages ChatGPT-4 for sophisticated voice characteristic analysis
- ποΈ Advanced Voice Processing: MFCC, spectral, and prosodic feature extraction
- π Enterprise Security: Multi-factor confidence scoring and authentication logging
- π Speech-to-Text: Automatic transcription using OpenAI Whisper
- πΎ Smart Database: Efficient SQLite storage with comprehensive user management
- ποΈ Easy to Use: Intuitive CLI interface with guided workflows
Modern graphical interface with buttons and visual feedback:
- Download
VoiceGuard-GUI.exefrom the releases page - Create a
.envfile with your OpenAI API key - Double-click
VoiceGuard-GUI.exe
π See GUI.md for detailed GUI usage guide.
Command-line interface:
- Download
VoiceGuard.exefrom the releases page - Create a
.envfile with your OpenAI API key - Run
VoiceGuard.exe
π See WINDOWS_EXECUTABLE.md for detailed instructions.
# Clone the repository
git clone https://github.com/DevArqf/VoiceGuard.git
cd voiceguard
# Install dependencies
pip install -r requirements.txt
# Set up your OpenAI API key
cp .env.example .env
# Edit .env and add your OPENAI_API_KEY
# Run VoiceGuard
python main.py- Python 3.8 or higher
- OpenAI API key
- Working microphone
- Windows/macOS/Linux
-
Clone and Navigate
git clone https://github.com/yourusername/voiceguard.git cd voiceguard -
Install Dependencies
pip install -r requirements.txt
-
Configure Environment
cp .env.example .env # Edit .env with your OpenAI API key -
Test Configuration
python main.py --config-check
- Multi-Sample Enrollment: Collect 3+ voice samples per user
- Dual-Mode Identification: Known user verification + unknown speaker detection
- Confidence Scoring: Combined AI and traditional biometric scoring
- Authentication Logging: Complete audit trail with timestamps
- Voice Profiling: ChatGPT creates detailed voice characteristic profiles
- Speech Analysis: Automatic transcription and speech pattern analysis
- Demographic Estimation: Age range and gender estimation (where determinable)
- Comparison Intelligence: Advanced AI-powered voice comparison
- MFCC Features: 13-coefficient Mel-Frequency Cepstral analysis
- Spectral Analysis: Centroid, rolloff, bandwidth measurements
- Prosodic Features: Pitch analysis, energy, speaking rate detection
- Quality Assessment: Audio preprocessing and quality scoring
- Interactive CLI: Intuitive command-line interface
- Microphone Testing: Built-in audio device testing
- System Statistics: Real-time performance metrics
- User Management: Complete CRUD operations for users
- User Registration: Create user profile with metadata
- Voice Sample Collection: Record multiple voice samples (default: 3)
- Feature Extraction: Extract MFCC, spectral, and prosodic features
- AI Analysis: ChatGPT analyzes voice characteristics and creates profile
- Storage: Secure storage with voice fingerprinting
- Voice Capture: Record authentication sample
- Feature Processing: Extract and normalize voice features
- Database Comparison: Compare against enrolled voice profiles
- AI Verification: ChatGPT performs advanced voice comparison
- Decision: Multi-factor confidence scoring and final decision
Customize VoiceGuard through the .env file:
# OpenAI Configuration
OPENAI_API_KEY=your_api_key_here
# Audio Settings
SAMPLE_RATE=16000 # Audio sample rate (Hz)
CHANNELS=1 # Audio channels (1=mono, 2=stereo)
RECORD_SECONDS=5 # Recording duration per sample
# Authentication Thresholds
SIMILARITY_THRESHOLD=0.8 # Minimum similarity score (0.0-1.0)
MIN_CONFIDENCE_SCORE=0.7 # Minimum confidence for authentication- Accuracy: >95% identification accuracy with quality voice samples
- Speed: Sub-second feature extraction and comparison
- Scalability: Handles hundreds of enrolled users efficiently
- Reliability: Robust error handling and fallback mechanisms
VoiceGuard uses SQLite with optimized schema:
- users: User profiles and metadata
- voice_profiles: Voice features and AI analysis results
- authentication_logs: Complete authentication history
- enrollment_sessions: Enrollment process tracking
We welcome contributions! See our Contributing Guide for details.
- π§ Enhanced AI analysis algorithms
- π¨ GUI interface development
- π Web API implementation
- π± Mobile app integration
- π Advanced security features
- π Multi-language support
This project is licensed under the MIT License - see the LICENSE file for details.
- OpenAI for ChatGPT-4 and Whisper APIs
- Python audio processing community
- Contributors and testers
- π Documentation
- π Issues
- π¬ Discussions
Made with β€οΈ by DevArqf
β Star this repo β’ π Report Bug β’ β¨ Request Feature