Last week, one of our sales team members showed me a spreadsheet with 4,000 contacts they'd purchased from a data vendor. Reply rate after two weeks of outreach? 0.3%. Bounce rate? North of 12%. That list cost real money and produced almost nothing.
Most lead lists in 2026 are dead on arrival. , based on 31 million emails sent in 2025, reports an average cold email sequence reply rate of just 4.5% — and that's the average, meaning plenty of campaigns sit well below that. Meanwhile, says the typical seller spends only 40% of their workweek actually selling; the other 60% goes to admin, research, and — you guessed it — prospecting.
So if you're going to spend that time building a list, it had better be one that gets replies. This guide walks through the entire 2026 workflow: defining your ICP, finding leads beyond LinkedIn, building a proper template, verifying data so your bounce rate doesn't tank your sender reputation, scoring leads before you hit send, and keeping everything fresh over time. And I've organized it by budget, so you can start for $0 today.
- Difficulty: Beginner
- Time Required: ~2-3 hours for your first 50-100 leads
- What You'll Need: Chrome browser, , a Google Sheet or spreadsheet, and your ICP written down
What Is a Lead List (and Why Do Most Lists Fail)?

A lead list is a structured dataset of potential buyers — people and companies you want to reach out to. It typically includes person-level fields (name, job title, email, phone, LinkedIn URL) and company-level fields (industry, size, revenue, location). It's the foundation of any outbound sales effort.
Where most teams go wrong: they confuse a lead list with a contact dump. A useful lead list answers why this company and why this person, right now. A purchased list from a random vendor often only answers "here's an email address that may or may not still exist." The difference in outcomes is massive.
Lead lists also fit into a broader lifecycle. A lead is someone who might fit your market. An MQL (marketing qualified lead) has shown some fit or engagement. An SQL (sales qualified lead) is ready for direct follow-up. An opportunity is an active deal. Your lead list is the top of this funnel — and if the top is full of junk, everything downstream suffers.
The most common reasons lead lists fail:
- Stale data: that at least . That means nearly a quarter of your contacts go bad every twelve months.
- Wrong contacts: Role-based emails (info@, sales@) instead of direct contacts. Vague titles like "Staff" that tell you nothing about decision-making authority.
- No targeting criteria: Volume masquerading as strategy. As one forum user put it, "A lot of times, we confuse volume with quality."
- No verification: found that say less than half their CRM data is accurate and complete, and .
- Volume-over-quality mindset: shows that campaigns targeting 21-50 recipients average , while campaigns with 501+ recipients average just . Smaller and tighter beats bigger and sloppier.
The Lead List Template: What Your Spreadsheet Should Actually Look Like
I've reviewed dozens of "how to build a lead list" guides, and here's what bugs me: they all say "include contact info, firmographics, and a lead score" — but none of them actually show you what the spreadsheet looks like. So here's the artifact everyone's missing.

Recommended Starter Columns
Your lead list spreadsheet should have these columns from Day 1:
| Column | What Goes In It | Good Data | Bad Data |
|---|---|---|---|
| Full Name | Person's real name | "Jordan Lee" | "Sales Team" |
| Job Title | Specific current role | "VP of Sales" | "Staff" |
| Company | Legal or trade name | "Acme Logistics" | "Acme?" |
| Industry | Standardized category | "B2B SaaS" | "Tech-ish" |
| Company Size | Employee band | "51-200" | Blank |
| Direct business email | "jordan@acme.com," verified | "info@acme.com" | |
| Phone | Direct or main phone, formatted | E.164 format | Mixed local formats |
| LinkedIn URL | Profile or company page | Complete URL | Search result URL |
| Lead Source | Where record came from | "G2 category page, May 2026" | "Internet" |
| Intent Signal | Why now | "Hiring 3 SDRs," "new funding" | Blank |
| Lead Score | Numeric prioritization | 70/100 with rules | Gut feel |
| Last Contacted | Outreach date | "2026-05-26" | "Recently" |
| Notes | Relevant context | "Uses Shopify Plus" | Long unstructured paste |
Sample Lead List (Anonymized)
Here's what a filled-in list actually looks like — 10 rows covering different personas:
| Full Name | Job Title | Company | Industry | Size | Source | Intent Signal | Score | |
|---|---|---|---|---|---|---|---|---|
| Alex M. | VP Sales | Mid-market SaaS vendor | SaaS | 201-500 | direct verified | G2 category | Hiring AEs | 78 |
| Priya S. | Head of Ops | DTC apparel brand | Ecommerce | 51-200 | direct verified | Shopify showcase | Expanding fulfillment | 72 |
| Marcus T. | Founder | Local agency | Professional services | 11-50 | direct verified | Clutch | New reviews | 66 |
| Elena R. | Revenue Ops Manager | Cybersecurity startup | SaaS | 51-200 | catch-all flagged | Conference speakers | Series A | 61 |
| Ben C. | Owner | HVAC contractor | Local services | 11-50 | main office email | Google Business | High review volume | 48 |
| Mina K. | Director of Partnerships | Marketplace company | Ecommerce | 201-500 | direct verified | Event agenda | Sponsoring event | 74 |
| Diego P. | Real Estate Broker | Regional brokerage | Real estate | 11-50 | direct verified | Association directory | New office page | 58 |
| Sarah N. | Customer Support Lead | B2B software company | SaaS | 51-200 | role-based removed | Capterra | Low support reviews | 44 |
| Omar A. | IT Manager | Manufacturing firm | Manufacturing | 501-1000 | direct verified | Company team page | ERP migration mention | 69 |
| Lena W. | Growth Marketing Lead | Fintech startup | SaaS | 51-200 | direct verified | Product Hunt | New launch | 71 |
What Each Column Means (and What "Good" Data Looks Like)
A few columns deserve extra explanation:
- Job Title: "VP of Sales" tells you this person has budget authority. "Staff" tells you nothing. Always aim for specific titles that indicate decision-making power or influence.
- Email: A personal business email (jordan@acme.com) is gold. A role-based address (sales@acme.com) is almost worthless for cold outreach — it lands in a shared inbox nobody monitors.
- Lead Source: This is the column most people skip, and it's the one that matters most over time. Tracking where each lead came from tells you which channels produce replies, not just rows. "G2 category page, May 2026" is useful. "Internet" is not.
- Intent Signal: This is the "why now" column. A company that just raised a Series A, posted three SDR job listings, or launched a new product is a hotter lead than one sitting quietly. If you can't find an intent signal, the lead might not be worth prioritizing.
How Thunderbit's AI Suggest Fields Builds Your Template for You
One of the things I'm most proud of with : you don't have to guess which columns to create. When you open Thunderbit on any prospect-rich page — a directory listing, a company team page, a conference speaker list — and click "AI Suggest Fields," the AI reads the page and auto-generates the right column names and data types. If the page has names, emails, titles, and company info, Thunderbit suggests exactly those columns.
This is especially useful for beginners who stare at a blank spreadsheet and think, "What fields should I even capture?" Thunderbit answers that question for you, based on the actual data available on the source page. Then you click "Scrape," and export directly to , Excel, Airtable, or Notion.
How to Define Your Ideal Customer Profile (ICP) Before You Build a Lead List

The single biggest mistake I see teams make — and I've made it myself — is building a list before defining who belongs on it. You end up with a spreadsheet full of names and no clarity on why any of them should care about your product.
Your ICP is the description of the company and person most likely to buy from you, get value from your product, and stick around. It's not a persona exercise; it's a targeting filter.
ICP Components
| ICP Item | Prompt | Example |
|---|---|---|
| Industry | Which categories have the problem? | B2B SaaS, ecommerce, professional services |
| Company Size | Which bands can buy now? | 51-500 employees |
| Geography | Where can we sell and support well? | U.S., Canada, UK |
| Revenue Range | What revenue band fits? | $5M-$100M ARR |
| Buyer Titles | Who owns the pain/budget? | VP Sales, RevOps, Head of Ops |
| Trigger | What makes timing urgent? | Hiring, funding, migration, poor reviews |
| Pain Point | What problem do they feel? | Manual list building, stale data, slow enrichment |
| Disqualifier | Who should be excluded? | Students, hobby businesses, competitors |
Practical exercise: Look at your best 5-10 existing customers. What do they have in common? Industry? Size? Title of the person who signed the contract? Write down 3-5 shared traits. That's your ICP draft.
Firmographics vs. Demographics: Which Matter More?
Firmographics are company-level data: industry, size, revenue, location. Demographics (in the B2B context) are person-level data: title, seniority, function, department. For B2B lead lists, firmographics narrow the company and demographics narrow the person. You need both. A perfect company with the wrong contact is a wasted row. A perfect contact at the wrong company is equally useless.
One more thing worth noting: , analyzing , found average decision-making units around . So a good lead list often includes more than one contact per target account — but not so many that your outreach becomes spammy.
Beyond LinkedIn: Where to Find Leads on Websites, Directories, and Social Media
A content gap that surprised me when I researched this topic: 5 out of 6 top-ranking "how to build a lead list" articles funnel readers toward LinkedIn Sales Navigator as the primary lead source. And sure, Sales Navigator is powerful. But it's also expensive ( for Core), and real users frequently complain about export limitations, bloated interfaces, and scraping headaches.

The 2026 reality is that leads live everywhere — not just on LinkedIn. Company websites, industry directories, event pages, review sites, and even social media profiles are rich sources of contact data, often fresher and more specific than what you'll find in a paid database.
Lead Source Comparison
| Lead Source | Best For | Method | Cost |
|---|---|---|---|
| Company websites / About pages | Niche B2B, local services, agencies | Visit team/contact pages, extract names/emails/phones | Free |
| Industry directories (Clutch, G2, Yelp) | Service-based leads, vertical ecosystems | Filter by category/location, scrape listings | Free to low-cost |
| Event attendee/speaker lists | High-intent B2B prospects | Conference agendas, sponsor pages, webinar registrants | Free to paid event access |
| Review sites (G2, Capterra, Google Business) | SaaS and local businesses | Browse categories, extract company contacts | Free |
| Social media (Instagram, X) | B2C, personal brands, local businesses | Public bios, business pages | Free |
| Google site: operators | Long-tail discovery, targeted contact pages | Advanced search queries | Free |
| LinkedIn (basic) | Professional search | Manual search, public profiles | Free |
| LinkedIn Sales Navigator | Mature outbound teams | Advanced filters, saved leads, TeamLink | $99+/mo |
Useful Directories by Vertical
| Vertical | Sources Worth Scraping |
|---|---|
| SaaS | G2, Capterra, Product Hunt, SaaSworthy, AWS Marketplace, Chrome Web Store categories |
| Ecommerce | Shopify stores/showcases, BuiltWith lists, Klaviyo partner directories |
| Real estate | Realtor directories, brokerage office pages, local MLS public pages, chamber directories |
| Professional services | Clutch, DesignRush, UpCity, GoodFirms, local bar/accounting directories |
| Local business | Google Business results, Yelp, Yellow Pages, BBB, local chamber pages |
| Events | Sponsor/exhibitor pages, speaker lists, agenda pages, webinar landing pages |
Google Advanced Search Operators for Lead Finding
These are free and surprisingly powerful. A few examples:
site:clutch.co/agencies "B2B SaaS" "United States"— finds agency listings on Clutch filtered by category and locationsite:company.com ("email" OR "contact") "VP Sales"— finds contact pages on a specific company's site mentioning a VP of Salesintitle:"sponsors" "SaaS" "2026" "conference"— finds conference sponsor pages for SaaS events in 2026site:g2.com/categories "sales engagement" "mid-market"— finds G2 category pages for mid-market sales tools
documents exact-match quotes and operators like site:, so you can verify the syntax there.
How to Scrape Contacts from Any Website with an AI Web Scraper
This is where Thunderbit fits naturally into the workflow. For any of the sources above — a Clutch directory, a company team page, a conference speaker list — the process is the same:
- Open the page in Chrome with the installed.
- Click "AI Suggest Fields." Thunderbit's AI reads the page and suggests columns like Name, Email, Phone, Title, Company.
- Review the suggested fields, add or remove as needed.
- Click "Scrape."
- Export to Google Sheets, Excel, Airtable, or Notion.
The key advantage is that Thunderbit works on messy, non-standardized sites where no pre-built scraper template would cover the layout. The AI reads each page fresh, adapting to whatever structure it encounters. Thunderbit's free add 1-click extraction from any page — unlimited on the free tier.
Using Subpage Scraping to Enrich Your Lead List
A workflow I use often: scrape a directory listing page first (e.g., a list of companies on Clutch), then use Thunderbit's Subpage Scraping to visit each company's individual page and pull additional data — emails, phone numbers, headcount, tech stack, descriptions.
This turns a basic directory list into an enriched, research-ready lead list without any manual clicking. You go from "here are 50 company names" to "here are 50 companies with contact emails, team sizes, and descriptions" in one automated pass. If you want to learn more about , we've covered that in depth.
How to Build a Lead List Step by Step (The 2026 Workflow)
Below is the full workflow, organized for a non-technical sales or ops person who wants to build a real lead list today.
Step 1: Nail Down Your ICP
Before you open any tool, write down your ICP criteria (refer back to the ICP section above). Industry, company size, geography, buyer titles, triggers, and disqualifiers. This takes 15-30 minutes and saves hours of wasted scraping.
Step 2: Pick Your Lead Sources
Based on your ICP and budget, choose 2-3 lead sources from the comparison table. My recommendation: start with the free sources first. If you're targeting SaaS companies, try G2 category pages and company team pages. If you're targeting local businesses, start with Google Business results and Yelp. Layer in paid sources like Sales Navigator only when you've exhausted what's free.
Step 3: Extract Leads Using AI Scraping or Manual Search
For each source, here's the extraction method:
- Websites and directories: Use Thunderbit's AI web scraper. Open the page, click "AI Suggest Fields," review columns, click "Scrape." For popular sites, Thunderbit has that auto-configure fields.
- LinkedIn: Use Sales Navigator search and export, or Thunderbit for .
- Google: Run advanced search operators, then scrape the results pages or visit the individual pages.
Export options: Google Sheets, Excel, Airtable, Notion, CSV, JSON.
Step 4: Verify and Clean Your Data
This step is not optional. I'll cover the full verification workflow in the dedicated section below, but the short version: remove role-based emails, deduplicate, run through a verification tool, flag catch-all domains, and re-verify before each campaign.
Step 5: Score and Prioritize Your Leads
Apply a simple scoring model (detailed below) before you start outreach. This ensures you're contacting the highest-value leads first, not just whoever happens to be at the top of the spreadsheet.
Step 6: Export to Your CRM or Outreach Tool
Move the cleaned, scored list into your CRM (HubSpot, Salesforce, Pipedrive) or outreach platform (lemlist, Mailshake, Apollo). Thunderbit exports directly to Sheets, Airtable, and Notion, which can sync with CRMs via native integrations or Zapier.
Step 7: Launch Outreach and Track Results
Personalize your outreach based on the data you've collected. Mention the prospect's industry, reference their intent signal ("I noticed you're hiring SDRs"), and tailor the value proposition to their pain point. Track reply rates, bounces, and conversions — and feed that data back into your ICP and scoring model for the next round.
Budget-First: How to Build a Lead List from $0 to Enterprise
The number one pain point I hear from early-stage founders and small sales teams: "How do I build a lead list without paying for expensive tools?" It's a fair question. ZoomInfo contracts start in the five figures annually. Sales Navigator is $99+/month. Apollo and Lusha have free tiers but gate the good stuff behind paywalls.
The honest answer: you can get surprisingly far for free. But scaling requires some investment. Here's how to think about it tier by tier.
| Tier | Cost | Methods | Tools |
|---|---|---|---|
| Free ($0) | $0 | Google operators, manual LinkedIn, company websites, Thunderbit free tier (6 pages + free email/phone extractors) | Thunderbit Free, Google, LinkedIn basic |
| Low-cost (<$50/mo) | $0-50 | AI scraping at scale, basic enrichment, email verification | Thunderbit Starter/Pro, Hunter Starter ($34/mo), Bouncer/NeverBounce PAYG |
| Mid-range ($50-200/mo) | $50-200 | Sales Navigator, richer filters, CRM integration | Sales Navigator Core (~$99/mo), Apollo paid, Lusha |
| Enterprise ($200+/mo) | $200+ | Intent data, enrichment suites, compliance workflows | ZoomInfo (quote-based), Cognism (quote-based), Clearbit |
Pricing as of May 2026 — verify current rates before purchasing.
What You Can Accomplish for Free (and Where You'll Hit Limits)
With Thunderbit's free tier (6 pages of AI scraping per month), the free Email Extractor and Phone Number Extractor (unlimited, 1-click), Google search operators, and basic LinkedIn search, a solo founder can realistically build a 50-100 lead list in an afternoon. I've watched people on our team do it.
Where you'll hit limits: volume (pages per month on the free tier), enrichment depth (you won't have intent data or tech stack info without paid tools), and email verification at scale (free verification tools cap at small volumes). When those limits start pinching, that's when it makes sense to move to the low-cost tier — unlock subpage scraping, bulk scraping, pagination, and scheduled scrapers.
The Bounce Rate Fix: A Data Verification Workflow That Actually Works

I've seen forum posts from users reporting from purchased lists. That's not just wasted effort — it's actively dangerous. High bounce rates tank your sender reputation, which means even your good emails start landing in spam.
say a healthy bounce rate is below 2%, and below 1% is great. found that nearly half of senders report bounce rates in the 2-5% range, while . If you're above 5%, your sender reputation is in trouble.
The verification workflow I recommend:
- Remove role-based emails: Delete any info@, sales@, support@, admin@ addresses unless you're intentionally targeting shared inboxes (rare for cold outreach).
- Remove formatting errors: Duplicates, typos, missing domains, dead domains. A quick sort and filter in your spreadsheet catches most of these.
- Run through an email verification tool: Hunter, ZeroBounce, NeverBounce, Bouncer, or Kickbox. These tools ping the mail server to check if the inbox exists without sending an email.
- Flag or remove catch-all domains: that catch-all addresses are a major risky category — they accept mail at the server level without proving a specific inbox exists. If you can't verify the individual inbox, flag the record and treat it as lower confidence.
- Re-verify before each campaign: Data decays fast. If your list is more than 30-90 days old, run verification again before sending.
- Send small batches first: Watch bounce and complaint rates on the first 50-100 sends. Scale only if quality holds.
Why Your Lead Source Affects Data Quality
Not all lead data is created equal. An email pulled from a company's public team page — where the person intentionally listed their contact info — is typically fresher and more accurate than an email from an aggregator database that hasn't been updated in months.
This is one of the reasons I believe in scraping from live public pages rather than relying solely on static databases. Because Thunderbit's AI reads the actual website fresh each time (not a stale database), the extracted emails and phone numbers tend to be current. The Phone Number Extractor also reformats numbers to E.164 standard, which reduces downstream format errors when importing to CRMs.
Scraping from fresh sources is not a substitute for verification — but it does mean you start with cleaner raw material.
Pre-Campaign Checklist
Before you hit "send" on any campaign:
- [ ] All emails verified within the last 30 days
- [ ] No role-based addresses (info@, sales@) in the send list
- [ ] No duplicates
- [ ] Bounce rate from last campaign reviewed
- [ ] Opt-out/unsubscribe mechanism in place
- [ ] Suppression list synced (honor all previous opt-outs)
Build-Then-Score: A Simple Lead Scoring Model for Small Teams
Every guide I've read says "prioritize your leads" — and then moves on without telling you how.

If you're a solo founder or a three-person sales team, you don't need Salesforce Einstein or a predictive scoring engine. You need a spreadsheet column with a transparent formula.
The Scoring Framework
| Signal | Points | Example |
|---|---|---|
| Matches ICP industry | +20 | SaaS, mid-market |
| Company size fit | +10 | 51-500 employees |
| Decision-maker title | +15 | VP Sales, Head of Ops |
| Clear intent signal | +15 | Hiring, funding, tool migration |
| Email verified | +10 | Passed verification |
| Direct source quality | +10 | Company page, event speaker page |
| Engaged with your content | +10 | Downloaded guide, webinar attendee |
| Catch-all / unverified email | -10 | Risky verification status |
| Role-based email | -10 | info@, sales@ |
| Generic title (no clear role) | -5 | "Staff" |
Worked Example
Lead A: VP Sales at a 120-person SaaS company, hiring SDRs, email verified, sourced from a company careers/team page.
Score: 20 (industry) + 10 (size) + 15 (title) + 15 (intent) + 10 (verified) + 10 (source) = 80 → Prioritize for outreach this week.
Lead B: "Staff" at a 5-person hobby business, role-based email, no intent signal.
Score: 0 + 0 + 0 + 0 + 0 - 10 (role-based) - 5 (generic title) = -15 → Skip or remove.
This can live as a simple formula in Google Sheets. Something like:
1=IF(D2="SaaS",20,0)+IF(AND(E2>=51,E2<=500),10,0)+IF(REGEXMATCH(B2,"VP|Head|Director|Founder"),15,0)+IF(J2<>"",15,0)+IF(K2="Verified",10,IF(K2="Catch-all",-10,0))
No Salesforce required.
How to Use AI to Label and Score Leads During Scraping
One of the features my team built into Thunderbit that I find genuinely useful for scoring: Field AI Prompts. When you're configuring your scrape, you can add a prompt to any column — for example, "Classify this lead's seniority as Decision-Maker, Influencer, or Individual Contributor based on the job title and page context."
Thunderbit labels the data during extraction, not after. So when you export to Sheets, the seniority classification, company type, or industry tag is already there — ready to plug into your scoring formula. This cuts out the manual tagging step that makes scoring feel like a chore.
You can also use Subpage Scraping to enrich the original listing data: scrape a directory first, then visit each company's page to pull headcount, funding status, or tech stack — all of which feed into your scoring model.
When to Revisit and Update Your Scores
Lead scores aren't set-and-forget. Re-score monthly, or after any major campaign. If a lead replies positively, their score changes (they're now an active conversation, not a cold lead). If an email bounces, adjust accordingly. If a company that was hiring six months ago has since laid off staff, the intent signal has shifted.
How to Keep Your Lead List Fresh (Automation and Maintenance)
A lead list is not a one-time project.
I've already mentioned that . Contacts change jobs, companies pivot, emails go stale. If you build a great list in May and don't touch it until October, a significant chunk of it is already dead.
Maintenance Cadence
| Task | Frequency | Why |
|---|---|---|
| Verify emails | Before every campaign (or at least monthly) | Prevent hard bounces |
| Deduplicate contacts | Weekly during active prospecting | Avoid duplicate outreach |
| Refresh intent signals | Monthly | Hiring/funding/reviews change quickly |
| Update company firmographics | Quarterly or semi-annually | Size, revenue, and tech stack drift |
| Suppression list sync | Daily or real-time | Honor opt-outs and reduce complaints |
| Source performance review | Monthly | Find which channels create replies, not just rows |
Setting Up a Scheduled Scrape for Ongoing Lead Generation
This is where Thunderbit's Scheduled Scraper comes in. Instead of manually re-visiting directories every month, you can set up a recurring scrape. The setup is simple: describe the time interval in plain language (e.g., "every Monday at 8am"), input the website URLs, and click "Schedule." Thunderbit's AI transforms your words into a schedule and runs the scrape automatically, exporting fresh results to your connected Google Sheet or Airtable base.
Use cases I've seen work well:
- A sales team re-scrapes a Clutch category page monthly to catch new agencies entering the market.
- An ecommerce ops team monitors a competitor directory weekly for new product listings.
- A SaaS founder refreshes a G2 category page before each monthly outbound batch to find newly listed companies.
Thunderbit's cloud mode can , so even large directories get refreshed quickly. For more on setting this up, check out our guide on .
A Quick Note on Compliance and Data Privacy
I'll keep this brief because it's not the focus of this guide, but it's essential.
- CAN-SPAM (U.S.): Applies to all commercial email, including B2B. The each separate violating email can trigger penalties up to . Requirements: accurate headers, non-deceptive subject lines, valid postal address, clear opt-out, and honor opt-outs within 10 business days.
- GDPR (EU/UK): Named business emails can be personal data. The B2B marketing must not conceal identity, must provide a valid opt-out, and must respect objections.
- CCPA/CPRA (California): Emphasizes notice, purpose limitation, data minimization, and consumer rights. The has the latest details.
- Google and Yahoo sender rules: bulk senders to keep spam rates below 0.30%, authenticate with SPF/DKIM/DMARC, and support one-click unsubscribe. .
Bottom line: scrape only publicly available data, avoid login walls without permission, always include an opt-out mechanism, maintain suppression lists, and verify local legal requirements. Thunderbit scrapes publicly available pages — users are responsible for how they use the data.
Conclusion and Key Takeaways
The 2026 lead list workflow isn't about finding more names — it's about building a smaller, fresher, verified, source-aware outreach dataset that actually gets replies.
Here's the full workflow in summary:
- Define your ICP before touching any tool.
- Choose 2-3 lead sources — start free (directories, company pages, Google operators) before paying for databases.
- Extract leads with AI scraping — Thunderbit's 2-click process works on virtually any public page.
- Build a proper template with source tracking, intent signals, and scoring columns.
- Verify and clean — remove role-based emails, deduplicate, run verification, flag catch-alls.
- Score and prioritize — use a transparent spreadsheet model, not gut feel.
- Export to CRM/outreach — personalize based on the data you've collected.
- Track outcomes — bounces, replies, conversions, by lead source.
- Refresh continuously — re-verify before campaigns, re-scrape high-value sources on a schedule.
The data backs this up: by nearly 3x on reply rates. A 200-lead verified list will almost always outperform a 5,000-contact stale database.
Ready to build your first list? gives you 6 pages of AI scraping per month, unlimited free email and phone extraction, and export to Google Sheets or Excel. That's enough to build your first 50-100 leads this afternoon.
FAQs
How many leads should be on my first lead list?
Start with 50-100 well-targeted, verified leads rather than thousands of unqualified contacts. Hunter's data shows that campaigns with smaller, tighter recipient lists (21-50) average 6.2% reply rates — nearly triple the rate of campaigns with 501+ recipients. Quality compounds; volume dilutes.
Should I buy a lead list or build my own?
Building your own is almost always better. Purchased lists carry higher risk: stale data, spam traps, opaque sourcing, and compliance exposure. Self-built lists using AI scraping and manual research yield fresher, more relevant data because you're pulling from live public pages where contacts are current. If you do buy, demand transparency on collection date, verification date, consent basis, and refresh process.
What's the best free way to build a lead list?
Combine Google advanced search operators (site:, intitle:, exact-match queries) with Thunderbit's free tier — 6 pages of AI scraping per month plus unlimited free email and phone extraction — and basic LinkedIn search. This combination covers company pages, directories, event lists, and professional profiles without spending a dollar.
How often should I update my lead list?
Re-verify emails before every campaign, especially if the list is more than 30 days old. Do a full refresh — re-scrape sources, update firmographics, remove dead leads — at least quarterly. ZeroBounce reports that at least 23% of an email list decays within one year, so "set it and forget it" is a recipe for rising bounce rates.
What's a good reply rate for cold outreach from a lead list?
Based on 2025-2026 benchmarks: 3-5% positive reply rate is good, 5-8% is strong, and 8%+ is excellent. The single biggest factor is list quality — targeting, verification, and personalization. A well-built list with verified emails, clear intent signals, and personalized messaging will consistently outperform a larger list with generic contacts and boilerplate copy.
Learn More
