| name | haveibeenpwned |
| description | HaveIBeenPwned API Documentation - Check if email accounts or passwords have been compromised in data breaches |
Have I Been Pwned API Skill
Expert assistance for integrating the Have I Been Pwned (HIBP) API v3 to check for compromised accounts, passwords, and data breaches. This skill provides comprehensive guidance for building security tools, breach notification systems, and password validation features.
When to Use This Skill
This skill should be triggered when:
- Checking if emails/accounts appear in data breaches - "check if this email was pwned"
- Validating password security - "check if password is in breach database"
- Building breach notification systems - "notify users about compromised accounts"
- Implementing password validation - "prevent users from choosing pwned passwords"
- Querying stealer logs - "check if credentials were stolen by malware"
- Integrating HIBP into authentication flows - "add breach checking to login"
- Monitoring domains for compromised emails - "track breaches affecting our domain"
- Working with the HIBP API - any questions about authentication, rate limits, or endpoints
Quick Reference
1. Basic Account Breach Check
import requests
def check_account_breaches(email, api_key):
"""Check if an account appears in any breaches"""
headers = {
'hibp-api-key': api_key,
'user-agent': 'MyApp/1.0'
}
url = f'https://haveibeenpwned.com/api/v3/breachedaccount/{email}'
response = requests.get(url, headers=headers)
if response.status_code == 200:
return response.json() # List of breach objects
elif response.status_code == 404:
return [] # Account not found in breaches
else:
response.raise_for_status()
# Usage
breaches = check_account_breaches('user@example.com', 'your-api-key')
print(f"Found in {len(breaches)} breaches")
2. Password Breach Check (k-Anonymity)
import hashlib
import requests
def check_password_pwned(password):
"""Check if password appears in breaches using k-anonymity"""
# Hash password with SHA-1
sha1_hash = hashlib.sha1(password.encode('utf-8')).hexdigest().upper()
prefix = sha1_hash[:5]
suffix = sha1_hash[5:]
# Query API with first 5 characters only
url = f'https://api.pwnedpasswords.com/range/{prefix}'
response = requests.get(url)
# Parse response for matching suffix
hashes = (line.split(':') for line in response.text.splitlines())
for hash_suffix, count in hashes:
if hash_suffix == suffix:
return int(count) # Times password appears in breaches
return 0 # Password not found
# Usage
count = check_password_pwned('password123')
if count > 0:
print(f"⚠️ Password found {count} times in breaches!")
3. Get All Breaches in System
import requests
def get_all_breaches(domain=None):
"""Retrieve all breaches, optionally filtered by domain"""
url = 'https://haveibeenpwned.com/api/v3/breaches'
params = {'domain': domain} if domain else {}
headers = {'user-agent': 'MyApp/1.0'}
response = requests.get(url, headers=headers, params=params)
return response.json()
# Usage - no authentication required
breaches = get_all_breaches()
print(f"Total breaches: {len(breaches)}")
# Filter by domain
adobe_breaches = get_all_breaches(domain='adobe.com')
4. Monitor for New Breaches
import requests
import time
def monitor_latest_breach(check_interval=3600):
"""Poll for new breaches every hour"""
last_breach_name = None
while True:
url = 'https://haveibeenpwned.com/api/v3/latestbreach'
headers = {'user-agent': 'MyApp/1.0'}
response = requests.get(url, headers=headers)
if response.status_code == 200:
breach = response.json()
if breach['Name'] != last_breach_name:
print(f"🆕 New breach: {breach['Title']}")
print(f" Accounts affected: {breach['PwnCount']:,}")
last_breach_name = breach['Name']
time.sleep(check_interval)
5. Domain-Wide Breach Search
import requests
def search_domain_breaches(domain, api_key):
"""Search for all breached emails in a verified domain"""
headers = {
'hibp-api-key': api_key,
'user-agent': 'MyApp/1.0'
}
url = f'https://haveibeenpwned.com/api/v3/breacheddomain/{domain}'
response = requests.get(url, headers=headers)
if response.status_code == 200:
results = response.json()
# Returns: {"alias1": ["Adobe"], "alias2": ["Adobe", "Gawker"]}
total_affected = len(results)
print(f"Found {total_affected} compromised accounts")
return results
else:
response.raise_for_status()
6. Check Pastes for Account
import requests
def check_pastes(email, api_key):
"""Check if email appears in any pastes"""
headers = {
'hibp-api-key': api_key,
'user-agent': 'MyApp/1.0'
}
url = f'https://haveibeenpwned.com/api/v3/pasteaccount/{email}'
response = requests.get(url, headers=headers)
if response.status_code == 200:
pastes = response.json()
for paste in pastes:
print(f"{paste['Source']}: {paste['Title']}")
print(f" Date: {paste['Date']}")
print(f" Emails found: {paste['EmailCount']}")
return pastes
elif response.status_code == 404:
return [] # No pastes found
7. Enhanced Password Check with Padding
import hashlib
import requests
def check_password_secure(password):
"""Check password with padding to prevent inference attacks"""
sha1_hash = hashlib.sha1(password.encode('utf-8')).hexdigest().upper()
prefix = sha1_hash[:5]
suffix = sha1_hash[5:]
headers = {'Add-Padding': 'true'}
url = f'https://api.pwnedpasswords.com/range/{prefix}'
response = requests.get(url, headers=headers)
# Parse response, ignore padded entries (count=0)
for line in response.text.splitlines():
hash_suffix, count = line.split(':')
if hash_suffix == suffix and int(count) > 0:
return int(count)
return 0
8. Handle Rate Limiting
import requests
import time
def api_call_with_retry(url, headers, max_retries=3):
"""Make API call with automatic retry on rate limit"""
for attempt in range(max_retries):
response = requests.get(url, headers=headers)
if response.status_code == 429:
# Rate limited - wait and retry
retry_after = int(response.headers.get('retry-after', 2))
print(f"Rate limited, waiting {retry_after}s...")
time.sleep(retry_after)
continue
return response
raise Exception("Max retries exceeded")
9. Check Subscription Status
import requests
def get_subscription_info(api_key):
"""Retrieve API subscription details and limits"""
headers = {
'hibp-api-key': api_key,
'user-agent': 'MyApp/1.0'
}
url = 'https://haveibeenpwned.com/api/v3/subscription/status'
response = requests.get(url, headers=headers)
if response.status_code == 200:
info = response.json()
print(f"Plan: {info['SubscriptionName']}")
print(f"Rate limit: {info['Rpm']} requests/minute")
print(f"Valid until: {info['SubscribedUntil']}")
return info
10. Stealer Logs Search
import requests
def check_stealer_logs(email, api_key):
"""Check if credentials appear in info stealer malware logs"""
headers = {
'hibp-api-key': api_key,
'user-agent': 'MyApp/1.0'
}
url = f'https://haveibeenpwned.com/api/v3/stealerlogsbyemail/{email}'
response = requests.get(url, headers=headers)
if response.status_code == 200:
domains = response.json() # List of website domains
print(f"Credentials found for {len(domains)} websites")
return domains
elif response.status_code == 404:
return [] # Not found in stealer logs
# Requires Pwned 5+ subscription
Key Concepts
Authentication
- API Key Format: 32-character hexadecimal string
- Header:
hibp-api-key: {your-key} - User-Agent Required: Must set valid user-agent header (returns 403 if missing)
- Test Key:
00000000000000000000000000000000for integration testing
k-Anonymity Model
The Pwned Passwords API uses k-anonymity to protect user privacy:
- Client hashes password locally with SHA-1
- Sends only first 5 characters of hash to API
- API returns ~800 matching hash suffixes
- Client checks locally if full hash matches
This ensures the actual password never leaves your system.
Rate Limiting
- Varies by subscription tier: Pwned 5 = 1,000 requests/minute
- HTTP 429 response when exceeded with
retry-afterheader - Pwned Passwords API: No rate limit
- Best practice: Implement exponential backoff on 429 responses
Breach Model Attributes
Key fields in breach objects:
- Name: Unique identifier (e.g., "Adobe")
- Title: Human-readable name
- BreachDate: When breach occurred (ISO 8601)
- PwnCount: Total compromised accounts
- DataClasses: Types of data exposed (emails, passwords, etc.)
- IsVerified: Breach authenticity confirmed
- IsSensitive: Excluded from public searches
Response Codes
| Code | Meaning |
|---|---|
| 200 | Success - data found |
| 404 | Not found (account not in breaches) |
| 401 | Unauthorized (invalid API key) |
| 403 | Forbidden (missing user-agent) |
| 429 | Rate limit exceeded |
Reference Files
This skill includes comprehensive API documentation in references/:
- other.md - Complete HIBP API v3 reference with all endpoints, authentication, and usage examples
The reference file contains:
- All API endpoints - Breaches, pastes, passwords, stealer logs
- Request/response formats - Headers, parameters, JSON structures
- Authentication details - API key setup and usage
- Rate limiting information - Subscription tiers and retry strategies
- Test accounts - Pre-configured test data for integration
- Code examples - Real-world implementation patterns
Use view to read the reference file when you need detailed information about specific endpoints or advanced features.
Working with This Skill
For Beginners
Start by understanding the core concepts:
- Password checking - Use Pwned Passwords API (no authentication required)
- Account breaches - Requires API key from haveibeenpwned.com
- k-Anonymity - Learn how password hashing protects privacy
Begin with Quick Reference examples #1 (breach check) and #2 (password check).
For Integration Projects
Focus on:
- Authentication setup - Get API key and configure headers
- Rate limiting - Implement retry logic (example #8)
- Error handling - Handle 404, 401, 429 responses properly
- User experience - Provide clear messaging about breach exposure
Review Quick Reference examples #5 (domain search) and #9 (subscription info).
For Production Systems
Consider:
- Caching - Store breach results to reduce API calls
- Background processing - Check breaches asynchronously
- Monitoring - Track new breaches with latest breach endpoint (example #4)
- Privacy - Never log passwords, use k-anonymity model
- Compliance - Follow attribution requirements (CC BY 4.0)
For Security Tools
Advanced patterns:
- Stealer logs - Check malware-stolen credentials (example #10)
- Domain monitoring - Track all compromised accounts in your organization
- Paste monitoring - Alert on email exposure in public pastes (example #6)
- Padding - Use response padding to prevent inference attacks (example #7)
Common Patterns
Pattern 1: Sign-up Password Validation
# Prevent users from choosing compromised passwords
def validate_signup_password(password):
count = check_password_pwned(password)
if count > 0:
return False, f"This password appears in {count} data breaches"
return True, "Password is secure"
Pattern 2: Breach Notification System
# Notify users when their account appears in new breach
def notify_affected_users():
latest = get_latest_breach()
affected_users = query_users_in_breach(latest['Name'])
for user in affected_users:
send_notification(user, latest)
Pattern 3: Compliance Check
# Verify all domain accounts for compliance reporting
def domain_security_audit(domain, api_key):
breached = search_domain_breaches(domain, api_key)
report = {
'total_accounts': len(breached),
'affected_accounts': breached,
'timestamp': datetime.now()
}
return report
API Endpoints Summary
Authenticated Endpoints (Require API Key)
GET /breachedaccount/{account}- Check account breachesGET /pasteaccount/{account}- Check pastesGET /breacheddomain/{domain}- Domain-wide searchGET /subscribeddomains- List verified domainsGET /subscription/status- Check subscriptionGET /stealerlogsbyemail/{email}- Stealer logs by emailGET /stealerlogsbywebsitedomain/{domain}- Stealer logs by siteGET /stealerlogsbyemaildomain/{domain}- Stealer logs by email domain
Public Endpoints (No Authentication)
GET /breaches- All breaches in systemGET /breach/{name}- Single breach detailsGET /latestbreach- Most recent breachGET /dataclasses- List of data typesGET https://api.pwnedpasswords.com/range/{prefix}- Password check
Testing
Test Accounts
Use these on domain hibp-integration-tests.com:
account-exists@- Has breaches and pastesmultiple-breaches@- Three different breachesspam-list-only@- Only spam-flagged breachstealer-log@- In stealer logsopt-out@- No results (opted out)
Test API Key
Use 00000000000000000000000000000000 for integration testing.
Best Practices
- Always set User-Agent - Required header, returns 403 without it
- Use HTTPS only - API requires TLS 1.2+
- Implement retry logic - Handle 429 rate limits gracefully
- Cache breach data - Reduce API calls for frequently checked accounts
- Never log passwords - Use k-anonymity model, hash locally
- Provide attribution - Link to haveibeenpwned.com (CC BY 4.0 license)
- Handle 404 gracefully - "Not found" is good news for users
- Use padding for passwords - Add
Add-Padding: trueheader
Resources
Official Links
- API Documentation: https://haveibeenpwned.com/API/v3
- Get API Key: https://haveibeenpwned.com/API/Key
- Dashboard: https://haveibeenpwned.com/DomainSearch
Community Tools
- PwnedPasswordsDownloader (GitHub) - Download full password database
- Integration libraries available for Python, JavaScript, Go, C#, and more
Acceptable Use
Permitted:
- Security tools and breach notifications
- Password validation in authentication systems
- Compliance and security audits
- Educational and research purposes
Prohibited:
- Targeting or harming breach victims
- Denial-of-service attacks
- Circumventing security measures
- Misrepresenting data source
- Automating undocumented APIs
Violations may result in API key revocation or IP blocking.
Notes
- Breach data licensed under Creative Commons Attribution 4.0
- Pwned Passwords has no licensing requirements
- CORS only supported for unauthenticated endpoints
- Never expose API keys in client-side code
- Service tracks 917+ breaches as of API documentation date