Skip to main content

Extract Emails
from HTML

Paste raw HTML source code and extract every email address — including those hidden in mailto: links, form fields, data attributes, and inline scripts.

Open HTML Extractor

Why HTML Extraction Is Different

Email addresses in HTML source code aren't always visible on the rendered page. They hide in attributes, encoded entities, and script blocks. A simple text scan misses them — but our engine doesn't.

Mailto: Links

Emails embedded in href="mailto:user@example.com" attributes are extracted even when they include query parameters like ?subject= or ?cc=.

URL-Encoded Characters

Some sites encode the @ symbol as %40 in URLs. The extractor decodes these before matching, catching emails that plain regex would miss.

HTML Entity Encoding

Webmasters sometimes encode emails as HTML entities (e.g., @ for @) to deter bots. Our engine handles entity decoding during extraction.

False Positive Filtering

HTML contains many @-like patterns that aren't emails: image filenames (icon@2x.png), CSS selectors, and version strings. Our engine filters these out.

How to Extract Emails from HTML

1

View Page Source

In your browser, right-click on any page and select "View Page Source" (or press Ctrl+U / Cmd+Option+U). This shows the raw HTML markup of the page.

2

Copy the HTML

Select all the HTML source code (Ctrl+A / Cmd+A) and copy it. You can copy the entire page or just specific sections — like the footer, contact page, or team directory markup.

3

Paste into the Extractor

Open the Email Extractor, select the Paste Text tab, and paste the HTML source code. Click Extract.

4

Review & Export

The extractor shows all found emails with validation status. Emails from mailto: links are captured alongside those in visible text. Use domain filters if needed, then export as CSV, TXT, or copy to clipboard.

Where Emails Hide in HTML

These are the most common locations where email addresses appear in HTML source code — many of which are invisible on the rendered page.

Mailto: href attributes — The most common pattern. Links like <a href="mailto:contact@company.com">.
Hidden form field values — Contact forms sometimes pre-fill the recipient email in a hidden input: <input type="hidden" value="info@company.com">.
Data attributes — Custom data attributes like data-email="user@company.com" used by JavaScript widgets.
Schema.org structured data — JSON-LD blocks often contain contact email properties that aren't rendered visually.
HTML comments — Developers sometimes leave email addresses in comments: <!-- Contact: dev@company.com -->.
Meta tags — Author or contact meta tags: <meta name="author" content="user@company.com">.

Use Cases for HTML Email Extraction

Competitor Research

View the source of competitor websites to find team email addresses, partnership contacts, and support addresses that may be hidden from the visible page.

Web Development Audits

Developers can audit their own sites to find accidentally exposed email addresses in source code, form values, or comments before they go live.

Email Newsletter Archives

HTML newsletter archives contain sender and reply-to addresses in the markup. Paste the source to extract all referenced emails from a campaign.

Saved Web Pages

If you've saved web pages as HTML files for offline reading, paste the source code to quickly extract any contact emails from the saved content.

Frequently Asked Questions

What's the difference between pasting HTML and pasting plain text?

When you paste raw HTML source code, the extractor finds emails in mailto: links, href attributes, hidden form fields, data attributes, and inline JavaScript — not just visible text. Plain text mode only scans the visible characters.

Can the extractor handle obfuscated emails in HTML?

Yes. The engine decodes URL-encoded characters (%40 for @), HTML entities (&#64;), and extracts emails from mailto: links even when spread across attributes. It also filters out false positives like image filenames and CSS class names.

Does it execute JavaScript on the page?

No. The paste mode processes raw HTML as text — it does not render or execute scripts. If emails are generated dynamically by JavaScript, you would need to use View Source in your browser first to capture the rendered HTML, or use our URL Fetch mode.

Will it extract emails from embedded iframes?

Only if the iframe source HTML is included in what you paste. The tool processes the text you provide — it does not fetch external resources or render nested documents.

How does it handle mailto: links?

The extractor specifically looks for mailto: patterns in href attributes and extracts the email address, stripping any query parameters like ?subject= or ?cc=. This catches emails that might not appear in the visible page text.

Can I paste the HTML of an entire website?

Yes. You can paste the full HTML source of a page — even large pages with thousands of lines. The extractor processes it entirely in your browser with no size limit imposed by our servers.

Start extracting
in seconds.

No account needed. No credit card. Paste text or enter a URL and get results instantly.

Paste text to extract instantly
Fetch emails from any URL
Export as CSV or TXT