PrivacyGuard : Your On-device ML-powered phishing defense with privacy-first P2P threat sharing and homograph detection.

PrivacyGuard is an intelligent browser extension that protects users from phishing attacks and malicious websites using advanced machine learning, heuristic analysis, and homograph detection. Built with privacy-first principles, all analysis happens locally in your browser—no data ever leaves your device.

Demo Link

Youtube

✨ Key Features

🤖 AI-Powered Detection

Custom TensorFlow.js Model: Trained on 100k+ samples with 88.8% accuracy
16 URL Features: Real-time analysis of lexical patterns and suspicious traits
On-Device Processing: Complete privacy - no data sent to servers
Sub-500ms Analysis: Fast threat assessment

🚦 Smart Alert System

🔴 Red Alert: Full-page warnings for high-risk sites (>75 risk score)
🟡 Yellow Alert: Non-intrusive notifications for suspicious sites (30-75 score)
🟢 Green Status: Silent monitoring for safe sites (<30 score)

🛡️ Multi-Layer Protection

Homograph Detection: Catches Unicode/Punycode spoofing attacks
Heuristic Analysis: Flags suspicious URL patterns and forms
P2P Intelligence: Community-driven threat sharing (mock implementation)
Smart Whitelisting: Learn from your browsing preferences

🎯 Live Demonstrations

Landing Page Interface

)

Alert System in Action

Alert Type	Visual Example	Trigger Conditions
🔴 High Risk		ML Score >75, Homograph attacks, Known phishing
🟡 Caution	[Add yellow alert screenshot]	ML Score 30-75, Suspicious patterns
🟢 Safe		ML Score <30, Whitelisted domains

Console Output & Model Analysis

📊 Performance & Accuracy

Trained and validated using comprehensive datasets:

📈 Model Performance

Dataset Sources:
├── Primary: github.com/ebubekirbbr/dephides (~100k samples)
└── Secondary: IEEE DataPort phishing dataset (validation)

Results:
├── Accuracy: 88.88%

⚡ Runtime Performance

Analysis Speed: 450ms average per URL
Memory Usage: ~15MB additional browser memory
CPU Impact: <2% during analysis
Model Loading: 1.8s (cached after first load)

🛠️ Technical Architecture

Core Components

PrivacyGuard/
├── js/
│   ├── content.js          # Main analysis engine & alert system
│   ├── tf.min.js          # TensorFlow.js runtime
│   └── tfjs_model/        # Trained ML model files
├── popup/
│   ├── popup.html         # Extension interface
│   ├── popup.js           # UI logic & controls
│   └── popup.css          # Styling
├── manifest.json          # Extension configuration
└── icons/                 # Extension icons

Detection Pipeline

┌─────────────────────────────────────────────────────────────┐
│                    Browser Extension                        │
├─────────────────────────────────────────────────────────────┤
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────────────┐  │
│  │   Content   │  │   Popup     │  │    Background       │  │
│  │   Script    │  │   Interface │  │    Service          │  │
│  │             │  │             │  │                     │  │
│  │ • ML Model  │  │ • Risk      │  │ • Storage Mgmt      │  │
│  │ • Heuristics│  │   Display   │  │ • Settings          │  │
│  │ • Homograph │  │ • Controls  │  │ • P2P Simulation    │  │
│  │ • Alerts    │  │ • Analytics │  │                     │  │
│  └─────────────┘  └─────────────┘  └─────────────────────┘  │
└─────────────────────────────────────────────────────────────┘
                              │
┌─────────────────────────────────────────────────────────────┐
│                 Detection Engines                           │
├─────────────────────────────────────────────────────────────┤
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────────────┐  │
│  │    ML       │  │  Heuristic  │  │     Homograph       │  │
│  │   Engine    │  │   Analysis  │  │     Detection       │  │
│  │             │  │             │  │                     │  │
│  │ • 16 URL    │  │ • HTTPS     │  │ • Punycode          │  │
│  │   Features  │  │   Check     │  │ • Mixed Scripts     │  │
│  │ • TF.js     │  │ • Form      │  │ • Confusables       │  │
│  │   Model     │  │   Detection │  │ • Unicode Analysis  │  │
│  └─────────────┘  └─────────────┘  └─────────────────────┘  │
└─────────────────────────────────────────────────────────────┘

Alert Implementation with Shadow DOM

// Isolated CSS to prevent conflicts
const createAlert = (riskData) => {
    const container = document.createElement('div');
    const shadow = container.attachShadow({mode: 'closed'});
    
    shadow.innerHTML = `
        <style>
            .privacy-guard-alert {
                position: fixed; z-index: 2147483647;
                font-family: -apple-system, BlinkMacSystemFont, sans-serif;
                /* Fully isolated styles */
            }
        </style>
        ${getAlertHTML(riskData)}
    `;
    
    document.body.appendChild(container);
};

🚀 Quick Setup & Installation

1. Clone Repository

git clone https://github.com/yourusername/PrivacyGuard.git
cd PrivacyGuard

2. Install in Chrome

Open chrome://extensions/
Enable Developer Mode (top right)
Click "Load unpacked"
Select the PrivacyGuard folder
Pin the extension icon for easy access

3. Test the Extension

# Serve test files locally
python -m http.server 8000

# Test URLs:
http://localhost:8000/college.html     # Yellow alert (HTTP)
http://www.xn--pypal-4ve.com/         # Red alert (Homograph)
https://www.google.com                # Green (Safe)

🧪 Advanced Testing & Console Commands

View Analysis Data

// Check whitelist
chrome.storage.local.get('privacyGuardWhitelist', console.log);

// View P2P data
chrome.storage.local.get(['privacyGuardP2PUserPhishing', 'privacyGuardP2PUserSafe'], console.log);

// Manual analysis
analyzeCurrentURL().then(console.log);

Mock P2P Network Testing

// Add to phishing list
 const p2pSettings = await new Promise(resolve => {
        chrome.storage.local.get([P2P_ENABLED_KEY, P2P_USER_CONFIRMED_SAFE_KEY, P2P_USER_CONFIRMED_PHISHING_KEY], result => resolve(result));
    });

🔬 Dataset & Model Training

Training Data Sources

Primary Dataset: github.com/ebubekirbbr/dephides
- ~100,000 balanced samples (legitimate + phishing URLs)
- Combined from PhishTank, Tranco rankings, academic sources
Validation Dataset: Tranco
- Curated academic dataset for cross-validation
- Used for performance benchmarking

Google Colab Training Results

Feature Engineering

# 16 lexical features extracted from URLs
const featureDescriptions = [
  "1. length",
  "2. hostname_length",
  "3. path_length",
  "4. query_length",
  "5. num_dots",
  "6. num_hyphens",
  "7. num_at",
  "8. num_question_marks",
  "9. num_equals",
  "10. num_underscore",
  "11. num_percent",
  "12. num_slash",
  "13. has_https",
  "14. has_ip",
  "15. num_digits",
  "16. num_let_

🎮 Interactive Demo Scenarios

Scenario 1: Safe Browsing

Visit: https://www.google.com
Result: 🟢 Green status, no alerts
Popup: Shows low risk score, clean analysis

Scenario 2: Suspicious HTTP Site

Visit: http://localhost:8000/college.html
Result: 🟡 Yellow corner alert appears
Action: Choose "Trust", "Block", or "Details"

Scenario 3: Homograph Attack

Visit: http://www.xn--pypal-4ve.com/ (fake PayPal)
Result: 🔴 Full-page red warning blocks access
Reason: Punycode homograph detection triggered

Scenario 4: ML-Detected Phishing

Visit: Your phishing.html test file
Result: 🔴 Red alert based on ML model + heuristics
Details: High-risk features identified and scored

📈 Recent Updates & Improvements

v1.0.0 - Latest Release

✅ Fixed ML Score Inversion: Corrected risk calculation bug
✅ Enhanced Shadow DOM: Complete CSS isolation for alerts
✅ Improved Homograph Detection: Better Unicode analysis
✅ Performance Optimization: Faster model loading and inference
✅ Enhanced P2P Mock: More realistic community simulation

Key Bug Fixes

Alert positioning issues on responsive sites
Memory leaks in model tensor operations
Edge cases in URL feature extraction
Improved error handling for malformed URLs

🛣️ Roadmap & Future Features

🔄 Next Release (v1.1)

Enhanced ML Model: Retrain with larger, more diverse dataset
Dark Mode Support: UI themes for better user experience
Advanced Analytics: Detailed threat statistics and trends
Export/Import Settings: Backup and sync user preferences

⚠️ Known Limitations

Technical Constraints

Model Size: ~2MB addition to extension size
Feature Scope: Currently limited to lexical URL features
Browser Support: Optimized for Chromium-based browsers
P2P System: Currently mock implementation using local storage

Detection Limitations

Sophisticated Attacks: May miss advanced social engineering
Content-based Phishing: Limited analysis of page content beyond forms
Zero-day Threats: Effectiveness depends on training data coverage
Language Support: Homograph detection primarily covers Latin scripts

User Experience

False Positives: ~5.8% rate may require manual whitelisting
Alert Fatigue: Balance between security and usability
Performance: Slight delays possible on resource-constrained devices

🤝 Contributing

We welcome contributions! Here's how to get involved:

🐛 Report Issues

Use GitHub Issues for bugs and feature requests
Include browser version, extension version, and reproduction steps
Screenshots of alerts/console output are helpful

💻 Code Contributions

# Fork the repository
git clone https://github.com/yourusername/PrivacyGuard.git
cd PrivacyGuard

# Create feature branch
git checkout -b feature/your-feature-name

# Make changes and test thoroughly
# Submit pull request with detailed description

📚 Documentation

Improve README sections
Add code comments and examples
Create user guides and tutorials

📄 License & Credits

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Datasets: ebubekirbbr/dephides and Tranco
ML Framework: TensorFlow.js team for browser-based ML capabilities
UI Framework: Minimal custom CSS with Shadow DOM for isolation
Community: Beta testers and security researchers who provided feedback

📞 Support & Contact

🆘 Need Help?

Issues: Report bugs on GitHub Issues
Discussions: Join GitHub Discussions

📧 Contact

Email: [email protected]
Twitter: @AdityaPat_
LinkedIn: Aditya Pattanayak

⭐ Star this repository if PrivacyGuard helps keep you safe online! ⭐

Built with 💙 for safer browsing | Protecting privacy while fighting phishing

[🐛 Report Issues

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
css		css
icons		icons
js		js
ml_training		ml_training
popup		popup
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bfg-1.15.0.jar.REMOVED.git-id		bfg-1.15.0.jar.REMOVED.git-id
college.html		college.html
git-filter-repo.py		git-filter-repo.py
learn-more.css		learn-more.css
learn-more.html		learn-more.html
learn-more.js		learn-more.js
manifest.json		manifest.json
package.json		package.json
phising.html		phising.html

License

AdityaP700/PrivacyGuard

Folders and files

Latest commit

History

Repository files navigation

PrivacyGuard : Your On-device ML-powered phishing defense with privacy-first P2P threat sharing and homograph detection.

Demo Link

✨ Key Features

🤖 AI-Powered Detection

🚦 Smart Alert System

🛡️ Multi-Layer Protection

🎯 Live Demonstrations

Landing Page Interface

Alert System in Action

Console Output & Model Analysis

📊 Performance & Accuracy

📈 Model Performance

⚡ Runtime Performance

🛠️ Technical Architecture

Core Components

Detection Pipeline

Alert Implementation with Shadow DOM

🚀 Quick Setup & Installation

1. Clone Repository

2. Install in Chrome

3. Test the Extension

🧪 Advanced Testing & Console Commands

View Analysis Data

Mock P2P Network Testing

🔬 Dataset & Model Training

Training Data Sources

Google Colab Training Results

Feature Engineering

🎮 Interactive Demo Scenarios

Scenario 1: Safe Browsing

Scenario 2: Suspicious HTTP Site

Scenario 3: Homograph Attack

Scenario 4: ML-Detected Phishing

📈 Recent Updates & Improvements

v1.0.0 - Latest Release

Key Bug Fixes

🛣️ Roadmap & Future Features

🔄 Next Release (v1.1)

⚠️ Known Limitations

Technical Constraints

Detection Limitations

User Experience

🤝 Contributing

🐛 Report Issues

💻 Code Contributions

📚 Documentation

📄 License & Credits

License

Acknowledgments

📞 Support & Contact

🆘 Need Help?

📧 Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages