Skip to content

πŸ›‘οΈ Advanced Voice Authentication System using OpenAI ChatGPT & Whisper APIs. Secure voice biometric identification with AI-powered analysis, multi-sample enrollment, and enterprise-grade authentication logging. Python-based with SQLite database.

License

Notifications You must be signed in to change notification settings

DevArqf/VoiceGuard

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

11 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ›‘οΈ VoiceGuard

VoiceGuard Logo

Python OpenAI License Contributions

Advanced Voice Authentication System Powered by AI

Secure, intelligent voice identification using OpenAI's ChatGPT and Whisper APIs

πŸš€ Quick Start β€’ πŸ“– Documentation β€’ 🎯 Features β€’ 🀝 Contributing


🌟 What is VoiceGuard?

VoiceGuard is a cutting-edge voice authentication system that combines traditional voice biometrics with advanced AI analysis. Using OpenAI's ChatGPT-4 and Whisper APIs, it provides enterprise-grade voice identification with unprecedented accuracy and insight.

🎯 Key Highlights

  • πŸ€– AI-Powered Analysis: Leverages ChatGPT-4 for sophisticated voice characteristic analysis
  • πŸŽ™οΈ Advanced Voice Processing: MFCC, spectral, and prosodic feature extraction
  • πŸ”’ Enterprise Security: Multi-factor confidence scoring and authentication logging
  • πŸ“ Speech-to-Text: Automatic transcription using OpenAI Whisper
  • πŸ’Ύ Smart Database: Efficient SQLite storage with comprehensive user management
  • πŸŽ›οΈ Easy to Use: Intuitive CLI interface with guided workflows

πŸš€ Quick Start

Option 1: Windows GUI Application (No Python Required) ⭐ Recommended

Modern graphical interface with buttons and visual feedback:

  1. Download VoiceGuard-GUI.exe from the releases page
  2. Create a .env file with your OpenAI API key
  3. Double-click VoiceGuard-GUI.exe

πŸ“– See GUI.md for detailed GUI usage guide.

Option 2: Windows Console Application (No Python Required)

Command-line interface:

  1. Download VoiceGuard.exe from the releases page
  2. Create a .env file with your OpenAI API key
  3. Run VoiceGuard.exe

πŸ“– See WINDOWS_EXECUTABLE.md for detailed instructions.

Option 3: Run from Source (Python Required)

# Clone the repository
git clone https://github.com/DevArqf/VoiceGuard.git
cd voiceguard

# Install dependencies
pip install -r requirements.txt

# Set up your OpenAI API key
cp .env.example .env
# Edit .env and add your OPENAI_API_KEY

# Run VoiceGuard
python main.py

πŸ› οΈ Installation

Prerequisites

  • Python 3.8 or higher
  • OpenAI API key
  • Working microphone
  • Windows/macOS/Linux

Step-by-Step Setup

  1. Clone and Navigate

    git clone https://github.com/yourusername/voiceguard.git
    cd voiceguard
  2. Install Dependencies

    pip install -r requirements.txt
  3. Configure Environment

    cp .env.example .env
    # Edit .env with your OpenAI API key
  4. Test Configuration

    python main.py --config-check

🎯 Features

πŸ” Authentication Features

  • Multi-Sample Enrollment: Collect 3+ voice samples per user
  • Dual-Mode Identification: Known user verification + unknown speaker detection
  • Confidence Scoring: Combined AI and traditional biometric scoring
  • Authentication Logging: Complete audit trail with timestamps

🧠 AI-Powered Analysis

  • Voice Profiling: ChatGPT creates detailed voice characteristic profiles
  • Speech Analysis: Automatic transcription and speech pattern analysis
  • Demographic Estimation: Age range and gender estimation (where determinable)
  • Comparison Intelligence: Advanced AI-powered voice comparison

🎡 Voice Processing

  • MFCC Features: 13-coefficient Mel-Frequency Cepstral analysis
  • Spectral Analysis: Centroid, rolloff, bandwidth measurements
  • Prosodic Features: Pitch analysis, energy, speaking rate detection
  • Quality Assessment: Audio preprocessing and quality scoring

πŸ’» User Experience

  • Interactive CLI: Intuitive command-line interface
  • Microphone Testing: Built-in audio device testing
  • System Statistics: Real-time performance metrics
  • User Management: Complete CRUD operations for users

πŸ“Š How It Works

Enrollment Process

  1. User Registration: Create user profile with metadata
  2. Voice Sample Collection: Record multiple voice samples (default: 3)
  3. Feature Extraction: Extract MFCC, spectral, and prosodic features
  4. AI Analysis: ChatGPT analyzes voice characteristics and creates profile
  5. Storage: Secure storage with voice fingerprinting

Authentication Process

  1. Voice Capture: Record authentication sample
  2. Feature Processing: Extract and normalize voice features
  3. Database Comparison: Compare against enrolled voice profiles
  4. AI Verification: ChatGPT performs advanced voice comparison
  5. Decision: Multi-factor confidence scoring and final decision

πŸ”§ Configuration

Customize VoiceGuard through the .env file:

# OpenAI Configuration
OPENAI_API_KEY=your_api_key_here

# Audio Settings
SAMPLE_RATE=16000          # Audio sample rate (Hz)
CHANNELS=1                 # Audio channels (1=mono, 2=stereo)
RECORD_SECONDS=5           # Recording duration per sample

# Authentication Thresholds
SIMILARITY_THRESHOLD=0.8   # Minimum similarity score (0.0-1.0)
MIN_CONFIDENCE_SCORE=0.7   # Minimum confidence for authentication

πŸ“ˆ Performance

  • Accuracy: >95% identification accuracy with quality voice samples
  • Speed: Sub-second feature extraction and comparison
  • Scalability: Handles hundreds of enrolled users efficiently
  • Reliability: Robust error handling and fallback mechanisms

πŸ—ƒοΈ Database Schema

VoiceGuard uses SQLite with optimized schema:

  • users: User profiles and metadata
  • voice_profiles: Voice features and AI analysis results
  • authentication_logs: Complete authentication history
  • enrollment_sessions: Enrollment process tracking

🀝 Contributing

We welcome contributions! See our Contributing Guide for details.

Areas for Contribution

  • 🧠 Enhanced AI analysis algorithms
  • 🎨 GUI interface development
  • 🌐 Web API implementation
  • πŸ“± Mobile app integration
  • πŸ”’ Advanced security features
  • 🌍 Multi-language support

πŸ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

πŸ™ Acknowledgments

  • OpenAI for ChatGPT-4 and Whisper APIs
  • Python audio processing community
  • Contributors and testers

πŸ“ž Support


Made with ❀️ by DevArqf

⭐ Star this repo β€’ πŸ› Report Bug β€’ ✨ Request Feature

About

πŸ›‘οΈ Advanced Voice Authentication System using OpenAI ChatGPT & Whisper APIs. Secure voice biometric identification with AI-powered analysis, multi-sample enrollment, and enterprise-grade authentication logging. Python-based with SQLite database.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Packages

No packages published