Can it pull links from HTML source code?

Yes. The tool reads the address out of each href and also finds any plain-text links in the same paste, then merges them into one list.

Why are some addresses missing from my results?

Links written without http:// or www. are skipped because they cannot be told apart from filenames. Add a protocol to the ones you want captured, then paste again.

How do I get only the unique domains?

Turn on domains-only mode. The tool strips the protocol, the www, and the path from each URL, then deduplicates the hostnames.

Does it clean up punctuation stuck to a link?

Yes. Trailing commas, periods, and closing brackets that copy along with a URL are removed automatically so the address stays valid.

Is my pasted text private?

Completely. Extraction is pure client-side JavaScript, so nothing you paste is sent to any server.

Can I use this for a link audit on a large export?

Yes. Paste the export, keep deduplicate on, and optionally switch to domains-only to see every destination the links point to in one short list.

URL Extractor

Paste any text or HTML and extract every URL, deduplicated, sorted, ready to copy or download.

Remove duplicates Sort A-Z Domains only

✔ 100% Free✔ No Signup✔ No Watermark✔ Unlimited Use

Updated 2026-07-05 · Built and maintained by the MakeToolz team.

Pull Every Link Out of Any Text or HTML

This free URL extractor (link extractor) finds every web address in whatever you paste, plain text, HTML source, markdown, chat logs, emails or documents. It captures both full https:// URLs and bare www. links, cleans trailing punctuation, and outputs a tidy one-per-line list.

SEOs use it to extract URLs from sitemaps and SERP exports; developers pull links from logs; researchers collect references from articles. A domains-only mode reduces the list to unique hostnames for quick source analysis.

How to Use the URL Extractor

1
Paste your text, HTML source or log content into the box.
2
Toggle options: deduplicate, sort, or reduce to domains only.
3
Click Extract URLs.
4
Copy the list or download it as a .txt file.

Why Use MakeToolz's URL Extractor?

Text, HTML & markdown

Finds links in href attributes, plain prose, markdown and logs alike.

Smart cleanup

Trailing commas, periods and brackets that stick to pasted URLs are stripped automatically.

Domains-only mode

Collapse thousands of URLs into a unique hostname list in one click.

100% in-browser

Your text is processed locally and never uploaded.

Handles big inputs

Extract from very large pastes instantly.

Free & unlimited

No account, no limits.

How a URL Extractor Reads Links Out of Text

A URL extractor finds web addresses by pattern. It scans your text for the shape of a link: a protocol like https:// followed by a domain and path, or a bare www. host. When your paste is raw HTML, links usually sit inside hrefs, the address part of an anchor tag. The extractor pulls the address out of each href and out of any plain-text link in the same paste, so you get one clean list whether the source was code or prose.

After matching, the tool trims the punctuation that tends to stick to a link when you copy it, like a trailing comma, period, or closing bracket. That small cleanup step is what separates a usable list from one full of broken addresses.

Who Needs to Extract Links and When

SEOs run link audits: they paste a page or a crawl export and pull every URL to check for broken paths, mixed protocols, or off-brand destinations. Developers extract links from logs to see what a script requested. Researchers gather references out of an article. Anyone scraping text they already have often needs the links separated from the surrounding words before the data is useful.

The common thread is that links are buried. They live inside sentences, inside markup, inside chat transcripts. Reading them out by hand is slow and error prone, so a pattern-based pull saves real time.

What Gets Captured and What Does Not

Input in your text	Captured?	Why
https://example.com/page	Yes	Full protocol and host
www.example.com	Yes	Bare www host is recognized
<a href="https://example.com">	Yes	Address pulled from the href
example.com (no prefix)	No	Ambiguous with filenames, skipped on purpose
mailto: links	No	Those are addresses, not web URLs

Benefits and Limits

The benefit is a tidy, deduplicated list in one click, processed locally so nothing you paste is uploaded. Domains-only mode collapses thousands of full URLs into a short list of unique hostnames, which is the fastest way to answer "where do all these links point?" during a link audit.

The main limit is that bare addresses without a protocol are skipped. A string like report.pdf or my.file looks the same as example.com to a pattern, so including those would flood your results with false matches. Adding http:// in front of the ones you want fixes this before you paste.

Common Mistakes to Avoid

One mistake is expecting the tool to fetch pages. It reads links out of text you provide; it does not crawl the live web, so you paste the source yourself. Another is pasting HTML where a link wraps across two lines, which can split the address; join those lines first. A third is forgetting to switch on deduplicate, which leaves the same URL repeated many times in a large export.

Tips for Cleaner Link Lists

Use domains-only mode for audits when you care about destinations, not exact paths.
Keep deduplicate on for crawl exports, which repeat the same URLs constantly.
To pull email addresses from the same paste, run it through the email extractor as well.
If you want to size the source before pasting, the word counter tells you how much text you are dealing with.

Frequently Asked Questions

How do I extract all links from a web page?

View the page source (Ctrl+U), copy everything, paste it here and click Extract URLs. Every link in the HTML, including those in href attributes, appears in the output list.

Does it find links without http:// prefix?

Yes, bare www. links are captured too. Links written without either prefix (like example.com) are ambiguous with filenames, so those are intentionally skipped to avoid false matches.

Can I get just the unique domains?

Tick "Domains only" before extracting, the tool strips protocols, www and paths, then deduplicates the hostnames.

Is my data private?

Completely. Extraction is pure client-side JavaScript; nothing is sent to any server.

Related Free Tools

📧 Email ExtractorPull all email addresses (or domains) out of any text. 🔢 Word CounterLive word, character, sentence & reading-time counts. 🔤 Case ConverterConvert text between 7 letter cases instantly.

More Extractors & Validators

Phone Number Extractor Phone Number Validator

Browse all text & utility tools →