Description
AlmaWeb AI Visitor Analytics
Two sides of AI traffic, one powerful plugin.
With the rise of AI platforms like ChatGPT, Claude, Gemini, and Perplexity, your website faces two new realities:
1. AI bots scraping your content for training and indexing
2. Real human visitors discovering your site through AI tools and clicking through to visit
AlmaWeb AI Visitor Analytics is the only WordPress plugin that tracks BOTH:
AI Bot Detection
Identify bots from major AI companies (OpenAI, Google, Meta), SEO crawlers (Ahrefs, Semrush), aggressive scrapers, and even “stealth” bots hiding behind standard User-Agents.
AI Referrer Tracking (NEW in v1.1.0)
Track real human visitors coming FROM AI platforms! When someone asks ChatGPT a question and clicks a link to your site, you will see it. Understand which AI tools are sending you traffic, which pages they land on, and how diverse your AI traffic sources are.
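As a rough illustration of how a referred visit could be recognized, the sketch below checks the HTTP referrer host and falls back to a utm_source parameter on the landing URL. The host table and the function name detect_ai_source are assumptions made for this example, not the plugin's actual list or code (the plugin itself is written for WordPress; Python is used here only to keep the sketch short).

```python
from urllib.parse import urlparse, parse_qs

# Hypothetical mapping of referrer hosts to AI platforms (illustrative only).
AI_REFERRER_HOSTS = {
    "chatgpt.com": "ChatGPT",
    "chat.openai.com": "ChatGPT",
    "claude.ai": "Claude",
    "gemini.google.com": "Gemini",
    "perplexity.ai": "Perplexity",
    "www.perplexity.ai": "Perplexity",
}


def detect_ai_source(referrer, landing_url):
    """Return the AI platform that referred this visit, or None."""
    # First choice: the host of the HTTP referrer header, if one was sent.
    host = (urlparse(referrer).hostname or "").lower() if referrer else ""
    if host in AI_REFERRER_HOSTS:
        return AI_REFERRER_HOSTS[host]

    # Fallback: a utm_source query parameter on the landing URL.
    query = parse_qs(urlparse(landing_url).query)
    utm_source = query.get("utm_source", [""])[0].lower()
    return AI_REFERRER_HOSTS.get(utm_source)
```

A visit with an empty referrer but a landing URL ending in ?utm_source=chatgpt.com would still be attributed to ChatGPT under these assumptions.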
Why AlmaWeb AI Visitor Analytics?
In a world where AIs massively consume AND distribute web content, it is crucial to know:
- Who’s accessing your content – Precisely identify each AI bot (GPTBot, ClaudeBot, Google-Extended, etc.)
- How much data is being extracted – Measure the real impact on your bandwidth and resources
- Which pages are being targeted – Discover what content interests AIs the most
- Hidden bots – Detect even bots that mask their identity with standard User-Agents
Key Features
🔍 Multi-Level Detection
- Level 1 – User-Agent: Detects 190+ AI bot signatures (OpenAI, Anthropic, Google, Meta, Amazon, Apple, etc.)
- Level 2 – IP Range: Checks the visitor’s IP address against official provider IP ranges (OpenAI, etc.) to identify masked bots
- Level 3 – Stealth Detection: Detects invisible bots like ChatGPT Atlas that use standard browser User-Agents
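The three levels can be pictured as a layered check: try the User-Agent first, then fall back to the source IP, and flag a browser-like User-Agent coming from a provider range as invisible. The sketch below is a minimal illustration under stated assumptions: the signature table, the range table, the classify function, and the returned field names are all hypothetical, and the CIDR shown is a documentation placeholder rather than a real provider range.

```python
import ipaddress

# Hypothetical sample tables; the plugin ships its own, far larger library.
UA_SIGNATURES = {
    "gptbot": ("GPTBot", "OpenAI"),
    "claudebot": ("ClaudeBot", "Anthropic"),
    "google-extended": ("Google-Extended", "Google"),
}
# Placeholder range from RFC 5737 (documentation addresses), NOT a real OpenAI range.
PROVIDER_RANGES = {"OpenAI": ["192.0.2.0/24"]}


def classify(user_agent, ip):
    """Return a detection record for a bot, or None for an ordinary visitor."""
    ua = user_agent.lower()

    # Level 1: substring match against known bot signatures.
    for needle, (bot, family) in UA_SIGNATURES.items():
        if needle in ua:
            return {"bot": bot, "family": family,
                    "method": "user-agent", "invisible": False}

    # Levels 2 and 3: the User-Agent named no known bot, so check the source IP.
    addr = ipaddress.ip_address(ip)
    for family, cidrs in PROVIDER_RANGES.items():
        if any(addr in ipaddress.ip_network(cidr) for cidr in cidrs):
            # A browser-like User-Agent from a provider range counts as stealth.
            looks_like_browser = ua.startswith("mozilla/")
            return {"bot": f"{family} (unidentified)", "family": family,
                    "method": "ip-range", "invisible": looks_like_browser}

    return None
```

Running the cheap string match first means the IP lookup is only needed for requests that did not announce themselves.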
📊 Comprehensive Dashboard
- Real-time statistics (human visits vs bots)
- Evolution charts with Chart.js
- Distribution by AI family (OpenAI, Google, Anthropic, etc.)
- Top 10 most active bots
- Top 10 most scraped pages
- Comparison with previous period
- Business KPIs: growth rate, peak hours, intent distribution
📋 Detailed Logs
- Advanced filtering by bot type, IP, date
- Real-time search
- Full CSV export
- Complete details: User-Agent, IP, URL, detection method, AI family, bot intent
- Special badge for “invisible” bots
⚙️ Flexible Configuration
- Enable/disable tracking
- Bad bot blocking option (disabled by default)
- Configurable data retention (90 days by default)
- Automatic daily cleanup of old data
- Exclude logged-in users (optional)
🔒 Privacy Respectful
- No data sent to third-party services
- All data stays in your WordPress database
- GDPR compliant
- No cookies or client-side tracking
- Only downloads public data (IP ranges)
Detected Bots
The plugin detects over 190 different bots in the following categories:
AI Bots (AI Training & Inference):
OpenAI (GPTBot, ChatGPT-User, OAI-SearchBot, Operator), Anthropic (ClaudeBot, Claude-Web), Google AI (Google-Extended, Gemini, NotebookLM), Meta AI, Amazon Bedrock, Apple Intelligence, Perplexity, Mistral, xAI (Grok), Cohere, Character.AI, Stability AI, and 100+ others.
Search Engines:
Google, Bing, Yahoo, DuckDuckGo, Yandex, Baidu, etc.
SEO Crawlers:
Ahrefs, Semrush, Moz, Majestic, etc.
Scrapers & Malicious Bots:
Content scrapers, email collectors, attack bots, etc.
Advanced Stealth Bot Detection
Some AI bots like ChatGPT Atlas use standard browser User-Agents (Chrome, Safari) to go unnoticed. AlmaWeb AI Visitor Analytics detects them by verifying their IP addresses against official AI provider IP ranges.
Example:
A visitor with User-Agent “Mozilla/5.0 Chrome/…” from an IP in the OpenAI range (AS401518) will be identified as “ChatGPT Atlas” and marked with an “INVISIBLE” badge in the logs.
Performance
- Minimal impact: Optimized to not slow down your site
- Smart cache: IP ranges are cached for 24h
- Optimized queries: Indexes on all important columns
- Automatic cleanup: Daily deletion of old data
Protect your content and understand your traffic like never before.
Official Plugin Page
Visit the official plugin page for additional resources and support:
https://almaweb.fr/ai-visitor-analytics
Privacy & Data Protection
AlmaWeb AI Visitor Analytics takes your privacy seriously:
Data Storage
- All data is stored locally in your WordPress database
- No external services receive any data from your site
- You have full control over data retention (configurable from 1 to 365 days)
- Automatic daily cleanup removes old data based on your settings
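The retention rule amounts to deleting every record older than a cutoff. The sketch below shows that idea against an SQLite table; the table name visits, the column visited_at (Unix seconds), and the function purge_old_visits are assumptions for illustration and do not reflect the plugin's real schema or code.

```python
import sqlite3

SECONDS_PER_DAY = 86400


def purge_old_visits(conn, retention_days, now):
    """Delete rows older than the retention window and return how many were removed.

    `now` is the current time in Unix seconds; `visited_at` is assumed to be
    stored in the same unit.
    """
    if not 1 <= retention_days <= 365:
        raise ValueError("retention_days must be between 1 and 365")
    cutoff = now - retention_days * SECONDS_PER_DAY
    cur = conn.execute("DELETE FROM visits WHERE visited_at < ?", (cutoff,))
    conn.commit()
    return cur.rowcount
```

Scheduling a job like this once a day keeps the table bounded by roughly retention days × daily visits.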
What Data is Collected
The plugin collects the following information for each visit:
* User-Agent string
* IP address
* Requested URL
* HTTP referrer
* Request method (GET, POST, etc.)
* HTTP headers (Accept, Accept-Language, Accept-Encoding)
* Timestamp
This data is used solely for bot detection and traffic analysis on your own site.
GDPR Compliance
- No cookies are set by this plugin
- No client-side tracking or JavaScript-based analytics
- IP addresses are stored for bot identification purposes only
- You can configure data retention to comply with your privacy policy
- Users can request data deletion through standard WordPress data export/erasure tools
What This Plugin Does NOT Do
- Does NOT send your data to external servers
- Does NOT track end-user behavior for advertising
- Does NOT use third-party analytics services
- Does NOT set cookies
- Does NOT require user consent (server-side logging only)
Third-Party Services
This plugin may connect to the following third-party services only to download publicly available bot IP ranges:
OpenAI IP Ranges
- GPTBot: https://openai.com/gptbot.json
- ChatGPT-User: https://openai.com/chatgpt-user.json
- SearchBot: https://openai.com/searchbot.json
Purpose: Verify IP addresses to detect OpenAI bots, including stealth bots like ChatGPT Atlas
Data sent: None – the plugin only downloads public IP range information
Caching: IP ranges are cached for 24 hours to minimize requests
Privacy Policy: https://openai.com/privacy
Important Notes
- No data from your site is ever sent to these services
- The plugin only downloads publicly available IP range lists
- These requests happen server-side (not from visitors’ browsers)
- IP ranges are cached locally for 24 hours
- If the external service is unavailable, the plugin continues to work using User-Agent detection
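The caching and fallback behavior described in these notes could look roughly like the sketch below. RangeCache, its parameters, and the injected fetch callable are hypothetical names; fetch stands in for whatever downloads and parses a published range list, and no network access happens in the sketch itself.

```python
import time

CACHE_TTL_SECONDS = 24 * 60 * 60  # 24 hours, matching the cache lifetime above


class RangeCache:
    """Cache a downloaded prefix list for a fixed time.

    `fetch` is any callable returning a list of CIDR strings; it is injected
    so the sketch stays free of real network calls.
    """

    def __init__(self, fetch, ttl=CACHE_TTL_SECONDS, clock=time.time):
        self._fetch = fetch
        self._ttl = ttl
        self._clock = clock
        self._prefixes = []
        self._fetched_at = None

    def get(self):
        now = self._clock()
        fresh = self._fetched_at is not None and now - self._fetched_at < self._ttl
        if not fresh:
            try:
                self._prefixes = self._fetch()
                self._fetched_at = now
            except Exception:
                # Source unavailable: keep the previous list (possibly empty),
                # so detection can continue on User-Agent matching alone.
                pass
        return self._prefixes
```

An empty list simply means no IP-based matches, which is how the User-Agent-only fallback falls out of this design.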
Bundled Libraries
Chart.js v4.5.1 (bundled locally)
* Used for dashboard visualizations
* No external connections required
* Loaded only on admin pages
* License: MIT
All third-party code is included in the plugin and does not require external CDN connections.
Support & Documentation
Getting Help
If you need help with the plugin, here are your options:
- Documentation: Check the plugin settings page for inline help and tooltips
- WordPress Support Forum: Post your questions on the WordPress.org support forum
- Official Page: Visit https://almaweb.fr/ai-visitor-analytics for contact and support
Reporting Bugs
If you find a bug, please report it on the WordPress.org support forum with:
* WordPress version
* PHP version
* Steps to reproduce the issue
* Any error messages you’re seeing
You can also contact us directly through our official page at https://almaweb.fr/ai-visitor-analytics
Feature Requests
We welcome feature suggestions! Please submit them on the WordPress.org support forum or contact us through https://almaweb.fr/ai-visitor-analytics
Contributing
This plugin is open source and welcomes contributions!
Translation
The plugin is available in French and English. To add a new language:
- Copy languages/almaweb-ai-visitor-analytics.pot
- Translate using Poedit or a similar tool
- Save as almaweb-ai-visitor-analytics-{locale}.po and compile to .mo
- Contact us through https://almaweb.fr/ai-visitor-analytics to submit your translation
Credits
Developed by: Yohan Ziri
Company: AlmaWeb (https://almaweb.fr)
License: GPLv3 or later
Special Thanks:
* Chart.js for beautiful visualizations
* The WordPress community for feedback and support
* All users who help improve the bot detection library
Installation
- Upload the almaweb-ai-visitor-analytics folder to your /wp-content/plugins/ directory.
- Activate the plugin through the ‘Plugins’ menu in WordPress.
- Go to the new ‘AI Analytics’ menu to see the dashboard and logs.
FAQ
Does this plugin slow down my site?
No, AlmaWeb AI Visitor Analytics is designed to be lightweight. Analysis is done server-side and optimized for minimal impact on page load times. IP ranges are cached for 24 hours and all database queries use indexes for optimal performance.
What types of bots can it detect?
The plugin detects over 190 different bots, including:
* AI Bots: GPTBot, ClaudeBot, Google-Extended, Gemini, Meta-ExternalAgent, Amazonbot, Applebot-Extended, Perplexity, Mistral, Grok, and many others
* Search Engines: Googlebot, Bingbot, Yahoo Slurp, DuckDuckBot, Yandex, Baidu
* SEO Crawlers: AhrefsBot, SemrushBot, MJ12bot (Majestic), DotBot (Moz)
* Malicious scrapers: Content scraping bots, email collectors, etc.
* Stealth bots: Bots that mask their identity (detected by IP verification)
How does stealth bot detection work?
Some AI bots like ChatGPT Atlas use standard browser User-Agents to hide. The plugin detects them by checking if the visitor’s IP address belongs to official AI provider IP ranges (OpenAI, etc.). This information is downloaded from official sources and cached locally.
Can I block bad bots?
Yes, an option to block bots identified as malicious is available in settings (disabled by default). You can enable it if you want to prevent access to aggressive scrapers while allowing legitimate AI bots and search engines through.
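A blocking decision of this kind can be sketched as a small function that logs a blocked request before rejecting it with a 403. Everything named here (FirewallSettings, response_status, the intent and family fields) is hypothetical and only illustrates the idea; it is not the plugin's code.

```python
from dataclasses import dataclass, field


@dataclass
class FirewallSettings:
    enabled: bool = False  # blocking is off unless explicitly enabled
    blocked_intents: set = field(default_factory=set)
    blocked_families: set = field(default_factory=set)


def response_status(bot, settings, log):
    """Return the HTTP status for a request.

    `bot` is a dict with 'name', 'intent' and 'family' keys, or None for a
    visitor that was not classified as a bot.
    """
    if bot is None or not settings.enabled:
        return 200
    blocked = (bot["intent"] in settings.blocked_intents
               or bot["family"] in settings.blocked_families)
    if blocked:
        # Record the request first, then reject it.
        log.append({**bot, "is_blocked": True})
        return 403
    return 200
```

With blocked_intents set to {"malicious"}, for example, search engines and AI bots with other intents would still receive a normal response.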
Is the plugin GDPR compliant?
Yes, the plugin is designed to respect privacy:
* All data remains in your WordPress database
* No data is sent to external services
* No cookies or client-side JavaScript tracking
* IP addresses are stored only for bot analysis
* You control data retention duration (90 days by default)
Is data sent to external servers?
No. The plugin only downloads public information (bot IP ranges) from official sources (openai.com). No data from your site is ever sent externally. Everything is stored locally in your WordPress database.
How much database space does the plugin use?
It depends on your traffic. On average, each recorded visit takes about 1-2 KB. For a site with 1000 bot visits per day, that’s about 90-180 MB over 90 days (default retention period). Automatic daily cleanup removes old data.
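The estimate is simply visits per day × retention days × size per record. A quick check of the arithmetic, assuming 1 MB = 1000 KB (the function name is made up for this example):

```python
def estimate_storage_mb(visits_per_day, retention_days, kb_per_visit):
    """Approximate table size in MB, taking 1 MB = 1000 KB."""
    return visits_per_day * retention_days * kb_per_visit / 1000


low = estimate_storage_mb(1000, 90, 1)   # 1000 visits/day, 90 days, 1 KB each
high = estimate_storage_mb(1000, 90, 2)  # same traffic at 2 KB each
print(low, high)  # 90.0 180.0
```

Halving the retention period halves the footprint, which is the main lever on a busy site.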
Can I export the data?
Yes, the logs page includes a CSV export function that allows you to export all filtered data for external analysis (Excel, Google Sheets, BI tools, etc.).
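For readers who want to picture what a filtered export involves, here is a minimal sketch that writes only the rows matching a filter. The column names and the export_csv function are assumptions for illustration; the plugin's actual export columns may differ.

```python
import csv
import io

# Hypothetical column set, loosely based on the log details listed above.
FIELDS = ["timestamp", "ip", "user_agent", "url",
          "detection_method", "ai_family", "intent"]


def export_csv(rows, family=None):
    """Return CSV text for `rows`, optionally keeping only one AI family."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, extrasaction="ignore")
    writer.writeheader()
    for row in rows:
        if family is None or row.get("ai_family") == family:
            writer.writerow(row)
    return buf.getvalue()
```

The resulting text opens directly in Excel or Google Sheets.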
Does the plugin work with multilingual sites?
Yes, the plugin is fully translated and compatible with multilingual sites. It comes with French and English translations, and you can add your own translations.
How is this different from classic analytics tools (Google Analytics, Matomo)?
Classic analytics tools focus on human traffic and often ignore bots. AlmaWeb AI Visitor Analytics does the opposite: it specializes in detecting and analyzing AI bots, scrapers, and crawlers. It complements your existing analytics tools by giving you visibility into non-human traffic.
How can I know if my content is being used to train AIs?
The plugin identifies AI training bots (like GPTBot, Google-Extended, etc.) and shows you exactly which pages they visit, how often, and how much data they extract. This allows you to make informed decisions about blocking or allowing these bots.
Does the plugin support high-traffic sites?
Yes, the plugin is optimized for performance with indexes on all important columns, a caching system, and automatic cleanup of old data. It has been tested on sites receiving thousands of visits per day.
Changelog
1.2.0 (2026-01-30)
- NEW: Firewall – Block bots by intention (training, scraping) or by family (ByteDance, Meta, etc.)
- Blocked bots are logged before being rejected (403)
- New BLOCKED badge in logs view
- Database schema updated with is_blocked column
1.1.2 (2026-01-05)
- Fixed database error in AI Authority Score calculation
- Corrected missing table_name variable in get_intention_counts() function
1.1.1 (2026-01-05)
- Complete English translation of all plugin strings
- Full French translation with updated .po and .mo files
- Fixed missing translations in dashboard, settings, and logs pages
- Plugin now fully bilingual (English/French)
1.1.0 (2025-12-20)
- NEW: AI Referrer Tracking – Track real human visitors coming FROM AI platforms (ChatGPT, Claude, Perplexity, etc.)
- AI Sources dashboard with traffic distribution chart
- Diversity score to measure dependency on AI traffic sources
- Landing pages analysis for AI-referred visits
- Detection of UTM parameters and referrer headers from AI platforms
- Session tracking for AI-referred visitors
- Improved dashboard with Bot Intentions chart
- Updated Chart.js to v4.5.1
- WordPress.org compliance improvements
1.0.0 (2024-12-20)
- Initial release
- Detection of 190+ AI bots (OpenAI, Anthropic, Google, Meta, Amazon, etc.)
- SEO scrapers and malicious bots detection
- User-Agent and IP Range detection
- JSON-based bot library for easy updates
- Real-time dashboard with statistics by AI family
- Stacked evolution chart with Chart.js
- Comparison with previous period
- Top AI bots and most scraped pages
- Configurable tracking and data retention
- Automatic log cleanup with customizable retention period




