What This Tool Does
Paste any block of HTML — a full page source, a navigation template, a blog article, or an API response — and the extractor scans for every <a href="..."> element. For each link it surfaces the href URL, the visible anchor text (stripped of inner HTML tags), and whether the link is internal or external. You can copy individual URLs, copy the whole list, or export everything as a CSV file for spreadsheet analysis.
How to Use
- Paste your HTML markup into the input panel.
- Click Extract Links to scan the markup and generate the results table.
- Review each row: anchor text, full URL, and internal vs external classification.
- Use Copy URL on any row to grab that individual link.
- Use Copy All URLs for a line-by-line list, or Copy as CSV for anchor + URL pairs.
Frequently Asked Questions
How is this useful for SEO auditing?
Link auditing is a core part of on-page SEO. You need to verify that internal links point to the right pages, that anchor text is descriptive rather than generic ("click here"), and that external links don't point to low-quality or broken destinations. Pasting a page's source code here gives you an instant inventory without needing to crawl the live site or open browser devtools. You can export the results to a spreadsheet and bulk-check status codes or anchor text patterns across an entire site section.
How is this different from browser devtools?
Browser devtools show the rendered DOM after JavaScript has executed, which means dynamically injected links also appear. This tool works on raw HTML source, so it only finds links baked into the markup. That distinction matters for auditing server-rendered templates, CMS exports, email HTML, or static HTML files where you want to inspect what's literally in the code rather than what the browser renders at runtime. It also lets you process HTML you copied from a source viewer or downloaded as a file without needing a live browser session.
What counts as an internal link versus an external link?
The tool classifies a link as internal if the href starts with a forward slash (/about), a relative path (../page), or another non-absolute reference that doesn't begin with http or //. A link is classified asexternal if the href starts with http:// or https://, meaning it points to a fully qualified URL on another domain. This is a syntactic check on the raw href value — it doesn't resolve URLs or detect cross-domain redirects.
What about JavaScript links like href="javascript:void(0)"?
The extractor captures any <a> tag with an href attribute regardless of its value, so javascript:void(0), #, and mailto: links will all appear in the results. They won't count as external links (they don't start with http) and they won't count as standard internal links either. You can spot them easily in the URL column and filter them manually. If you're auditing for broken or non-navigational links, look for these patterns in the table and decide whether they should be replaced with real URLs.