URL Extractor
Paste any text or HTML and extract every URL, deduplicated, sorted, ready to copy or download.
โ 100% Freeโ No Signupโ No Watermarkโ Unlimited Use
Pull Every Link Out of Any Text or HTML
This free URL extractor (link extractor) finds every web address in whatever you paste, plain text, HTML source, markdown, chat logs, emails or documents. It captures both full https:// URLs and bare www. links, cleans trailing punctuation, and outputs a tidy one-per-line list.
SEOs use it to extract URLs from sitemaps and SERP exports; developers pull links from logs; researchers collect references from articles. A domains-only mode reduces the list to unique hostnames for quick source analysis.
How to Use the URL Extractor
- 1Paste your text, HTML source or log content into the box.
- 2Toggle options: deduplicate, sort, or reduce to domains only.
- 3Click Extract URLs.
- 4Copy the list or download it as a .txt file.
Why Use MakeToolz's URL Extractor?
Text, HTML & markdown
Finds links in href attributes, plain prose, markdown and logs alike.
Smart cleanup
Trailing commas, periods and brackets that stick to pasted URLs are stripped automatically.
Domains-only mode
Collapse thousands of URLs into a unique hostname list in one click.
100% in-browser
Your text is processed locally and never uploaded.
Handles big inputs
Extract from very large pastes instantly.
Free & unlimited
No account, no limits.
How a URL Extractor Reads Links Out of Text
A URL extractor finds web addresses by pattern. It scans your text for the shape of a link: a protocol like https:// followed by a domain and path, or a bare www. host. When your paste is raw HTML, links usually sit inside hrefs, the address part of an anchor tag. The extractor pulls the address out of each href and out of any plain-text link in the same paste, so you get one clean list whether the source was code or prose.
After matching, the tool trims the punctuation that tends to stick to a link when you copy it, like a trailing comma, period, or closing bracket. That small cleanup step is what separates a usable list from one full of broken addresses.
Who Needs to Extract Links and When
SEOs run link audits: they paste a page or a crawl export and pull every URL to check for broken paths, mixed protocols, or off-brand destinations. Developers extract links from logs to see what a script requested. Researchers gather references out of an article. Anyone scraping text they already have often needs the links separated from the surrounding words before the data is useful.
The common thread is that links are buried. They live inside sentences, inside markup, inside chat transcripts. Reading them out by hand is slow and error prone, so a pattern-based pull saves real time.
What Gets Captured and What Does Not
| Input in your text | Captured? | Why |
|---|---|---|
| https://example.com/page | Yes | Full protocol and host |
| www.example.com | Yes | Bare www host is recognized |
| <a href="https://example.com"> | Yes | Address pulled from the href |
| example.com (no prefix) | No | Ambiguous with filenames, skipped on purpose |
| mailto: links | No | Those are addresses, not web URLs |
Benefits and Limits
The benefit is a tidy, deduplicated list in one click, processed locally so nothing you paste is uploaded. Domains-only mode collapses thousands of full URLs into a short list of unique hostnames, which is the fastest way to answer "where do all these links point?" during a link audit.
The main limit is that bare addresses without a protocol are skipped. A string like report.pdf or my.file looks the same as example.com to a pattern, so including those would flood your results with false matches. Adding http:// in front of the ones you want fixes this before you paste.
Common Mistakes to Avoid
One mistake is expecting the tool to fetch pages. It reads links out of text you provide; it does not crawl the live web, so you paste the source yourself. Another is pasting HTML where a link wraps across two lines, which can split the address; join those lines first. A third is forgetting to switch on deduplicate, which leaves the same URL repeated many times in a large export.
Tips for Cleaner Link Lists
- Use domains-only mode for audits when you care about destinations, not exact paths.
- Keep deduplicate on for crawl exports, which repeat the same URLs constantly.
- To pull email addresses from the same paste, run it through the email extractor as well.
- If you want to size the source before pasting, the word counter tells you how much text you are dealing with.
People Also Ask
How do I extract all links from a web page?
Open the page source with Ctrl+U, copy everything, paste it in, and extract. Every link in the HTML, including those inside href attributes, appears in the output.
Can it pull links from HTML source code?
Yes. The tool reads the address out of each href and also finds any plain-text links in the same paste, then merges them into one list.
Why are some addresses missing from my results?
Links written without http:// or www. are skipped because they cannot be told apart from filenames. Add a protocol to the ones you want captured, then paste again.
How do I get only the unique domains?
Turn on domains-only mode. The tool strips the protocol, the www, and the path from each URL, then deduplicates the hostnames.
Does it clean up punctuation stuck to a link?
Yes. Trailing commas, periods, and closing brackets that copy along with a URL are removed automatically so the address stays valid.
Is my pasted text private?
Completely. Extraction is pure client-side JavaScript, so nothing you paste is sent to any server.
Can I use this for a link audit on a large export?
Yes. Paste the export, keep deduplicate on, and optionally switch to domains-only to see every destination the links point to in one short list.