How to Build a Lead List That Gets Replies (2026 Workflow)

Last Updated on May 26, 2026
AI Summary
Build high-converting lead lists in 2026 with a workflow focused on verification, intent signals, and targeted scraping instead of bulk data buys.

Last week, one of our sales team members showed me a spreadsheet with 4,000 contacts they'd purchased from a data vendor. Reply rate after two weeks of outreach? 0.3%. Bounce rate? North of 12%. That list cost real money and produced almost nothing.

Most lead lists in 2026 are dead on arrival. , based on 31 million emails sent in 2025, reports an average cold email sequence reply rate of just 4.5% — and that's the average, meaning plenty of campaigns sit well below that. Meanwhile, says the typical seller spends only 40% of their workweek actually selling; the other 60% goes to admin, research, and — you guessed it — prospecting.

So if you're going to spend that time building a list, it had better be one that gets replies. This guide walks through the entire 2026 workflow: defining your ICP, finding leads beyond LinkedIn, building a proper template, verifying data so your bounce rate doesn't tank your sender reputation, scoring leads before you hit send, and keeping everything fresh over time. And I've organized it by budget, so you can start for $0 today.

  • Difficulty: Beginner
  • Time Required: ~2-3 hours for your first 50-100 leads
  • What You'll Need: Chrome browser, , a Google Sheet or spreadsheet, and your ICP written down

What Is a Lead List (and Why Do Most Lists Fail)?

data-quality-dashboard.webp

A lead list is a structured dataset of potential buyers — people and companies you want to reach out to. It typically includes person-level fields (name, job title, email, phone, LinkedIn URL) and company-level fields (industry, size, revenue, location). It's the foundation of any outbound sales effort.

Where most teams go wrong: they confuse a lead list with a contact dump. A useful lead list answers why this company and why this person, right now. A purchased list from a random vendor often only answers "here's an email address that may or may not still exist." The difference in outcomes is massive.

Lead lists also fit into a broader lifecycle. A lead is someone who might fit your market. An MQL (marketing qualified lead) has shown some fit or engagement. An SQL (sales qualified lead) is ready for direct follow-up. An opportunity is an active deal. Your lead list is the top of this funnel — and if the top is full of junk, everything downstream suffers.

The most common reasons lead lists fail:

  • Stale data: that at least . That means nearly a quarter of your contacts go bad every twelve months.
  • Wrong contacts: Role-based emails (info@, sales@) instead of direct contacts. Vague titles like "Staff" that tell you nothing about decision-making authority.
  • No targeting criteria: Volume masquerading as strategy. As one forum user put it, "A lot of times, we confuse volume with quality."
  • No verification: found that say less than half their CRM data is accurate and complete, and .
  • Volume-over-quality mindset: shows that campaigns targeting 21-50 recipients average , while campaigns with 501+ recipients average just . Smaller and tighter beats bigger and sloppier.

The Lead List Template: What Your Spreadsheet Should Actually Look Like

I've reviewed dozens of "how to build a lead list" guides, and here's what bugs me: they all say "include contact info, firmographics, and a lead score" — but none of them actually show you what the spreadsheet looks like. So here's the artifact everyone's missing.

data-quality-comparison.webp

Your lead list spreadsheet should have these columns from Day 1:

ColumnWhat Goes In ItGood DataBad Data
Full NamePerson's real name"Jordan Lee""Sales Team"
Job TitleSpecific current role"VP of Sales""Staff"
CompanyLegal or trade name"Acme Logistics""Acme?"
IndustryStandardized category"B2B SaaS""Tech-ish"
Company SizeEmployee band"51-200"Blank
EmailDirect business email"jordan@acme.com," verified"info@acme.com"
PhoneDirect or main phone, formattedE.164 formatMixed local formats
LinkedIn URLProfile or company pageComplete URLSearch result URL
Lead SourceWhere record came from"G2 category page, May 2026""Internet"
Intent SignalWhy now"Hiring 3 SDRs," "new funding"Blank
Lead ScoreNumeric prioritization70/100 with rulesGut feel
Last ContactedOutreach date"2026-05-26""Recently"
NotesRelevant context"Uses Shopify Plus"Long unstructured paste

Sample Lead List (Anonymized)

Here's what a filled-in list actually looks like — 10 rows covering different personas:

Full NameJob TitleCompanyIndustrySizeEmailSourceIntent SignalScore
Alex M.VP SalesMid-market SaaS vendorSaaS201-500direct verifiedG2 categoryHiring AEs78
Priya S.Head of OpsDTC apparel brandEcommerce51-200direct verifiedShopify showcaseExpanding fulfillment72
Marcus T.FounderLocal agencyProfessional services11-50direct verifiedClutchNew reviews66
Elena R.Revenue Ops ManagerCybersecurity startupSaaS51-200catch-all flaggedConference speakersSeries A61
Ben C.OwnerHVAC contractorLocal services11-50main office emailGoogle BusinessHigh review volume48
Mina K.Director of PartnershipsMarketplace companyEcommerce201-500direct verifiedEvent agendaSponsoring event74
Diego P.Real Estate BrokerRegional brokerageReal estate11-50direct verifiedAssociation directoryNew office page58
Sarah N.Customer Support LeadB2B software companySaaS51-200role-based removedCapterraLow support reviews44
Omar A.IT ManagerManufacturing firmManufacturing501-1000direct verifiedCompany team pageERP migration mention69
Lena W.Growth Marketing LeadFintech startupSaaS51-200direct verifiedProduct HuntNew launch71

What Each Column Means (and What "Good" Data Looks Like)

A few columns deserve extra explanation:

  • Job Title: "VP of Sales" tells you this person has budget authority. "Staff" tells you nothing. Always aim for specific titles that indicate decision-making power or influence.
  • Email: A personal business email (jordan@acme.com) is gold. A role-based address (sales@acme.com) is almost worthless for cold outreach — it lands in a shared inbox nobody monitors.
  • Lead Source: This is the column most people skip, and it's the one that matters most over time. Tracking where each lead came from tells you which channels produce replies, not just rows. "G2 category page, May 2026" is useful. "Internet" is not.
  • Intent Signal: This is the "why now" column. A company that just raised a Series A, posted three SDR job listings, or launched a new product is a hotter lead than one sitting quietly. If you can't find an intent signal, the lead might not be worth prioritizing.

How Thunderbit's AI Suggest Fields Builds Your Template for You

One of the things I'm most proud of with : you don't have to guess which columns to create. When you open Thunderbit on any prospect-rich page — a directory listing, a company team page, a conference speaker list — and click "AI Suggest Fields," the AI reads the page and auto-generates the right column names and data types. If the page has names, emails, titles, and company info, Thunderbit suggests exactly those columns.

This is especially useful for beginners who stare at a blank spreadsheet and think, "What fields should I even capture?" Thunderbit answers that question for you, based on the actual data available on the source page. Then you click "Scrape," and export directly to , Excel, Airtable, or Notion.

How to Define Your Ideal Customer Profile (ICP) Before You Build a Lead List

icp-filtering-process.webp

The single biggest mistake I see teams make — and I've made it myself — is building a list before defining who belongs on it. You end up with a spreadsheet full of names and no clarity on why any of them should care about your product.

Your ICP is the description of the company and person most likely to buy from you, get value from your product, and stick around. It's not a persona exercise; it's a targeting filter.

ICP Components

ICP ItemPromptExample
IndustryWhich categories have the problem?B2B SaaS, ecommerce, professional services
Company SizeWhich bands can buy now?51-500 employees
GeographyWhere can we sell and support well?U.S., Canada, UK
Revenue RangeWhat revenue band fits?$5M-$100M ARR
Buyer TitlesWho owns the pain/budget?VP Sales, RevOps, Head of Ops
TriggerWhat makes timing urgent?Hiring, funding, migration, poor reviews
Pain PointWhat problem do they feel?Manual list building, stale data, slow enrichment
DisqualifierWho should be excluded?Students, hobby businesses, competitors

Practical exercise: Look at your best 5-10 existing customers. What do they have in common? Industry? Size? Title of the person who signed the contract? Write down 3-5 shared traits. That's your ICP draft.

Firmographics vs. Demographics: Which Matter More?

Firmographics are company-level data: industry, size, revenue, location. Demographics (in the B2B context) are person-level data: title, seniority, function, department. For B2B lead lists, firmographics narrow the company and demographics narrow the person. You need both. A perfect company with the wrong contact is a wasted row. A perfect contact at the wrong company is equally useless.

One more thing worth noting: , analyzing , found average decision-making units around . So a good lead list often includes more than one contact per target account — but not so many that your outreach becomes spammy.

Beyond LinkedIn: Where to Find Leads on Websites, Directories, and Social Media

A content gap that surprised me when I researched this topic: 5 out of 6 top-ranking "how to build a lead list" articles funnel readers toward LinkedIn Sales Navigator as the primary lead source. And sure, Sales Navigator is powerful. But it's also expensive ( for Core), and real users frequently complain about export limitations, bloated interfaces, and scraping headaches.

ai-data-sources-table.webp

The 2026 reality is that leads live everywhere — not just on LinkedIn. Company websites, industry directories, event pages, review sites, and even social media profiles are rich sources of contact data, often fresher and more specific than what you'll find in a paid database.

Lead Source Comparison

Lead SourceBest ForMethodCost
Company websites / About pagesNiche B2B, local services, agenciesVisit team/contact pages, extract names/emails/phonesFree
Industry directories (Clutch, G2, Yelp)Service-based leads, vertical ecosystemsFilter by category/location, scrape listingsFree to low-cost
Event attendee/speaker listsHigh-intent B2B prospectsConference agendas, sponsor pages, webinar registrantsFree to paid event access
Review sites (G2, Capterra, Google Business)SaaS and local businessesBrowse categories, extract company contactsFree
Social media (Instagram, X)B2C, personal brands, local businessesPublic bios, business pagesFree
Google site: operatorsLong-tail discovery, targeted contact pagesAdvanced search queriesFree
LinkedIn (basic)Professional searchManual search, public profilesFree
LinkedIn Sales NavigatorMature outbound teamsAdvanced filters, saved leads, TeamLink$99+/mo

Useful Directories by Vertical

VerticalSources Worth Scraping
SaaSG2, Capterra, Product Hunt, SaaSworthy, AWS Marketplace, Chrome Web Store categories
EcommerceShopify stores/showcases, BuiltWith lists, Klaviyo partner directories
Real estateRealtor directories, brokerage office pages, local MLS public pages, chamber directories
Professional servicesClutch, DesignRush, UpCity, GoodFirms, local bar/accounting directories
Local businessGoogle Business results, Yelp, Yellow Pages, BBB, local chamber pages
EventsSponsor/exhibitor pages, speaker lists, agenda pages, webinar landing pages

Google Advanced Search Operators for Lead Finding

These are free and surprisingly powerful. A few examples:

  • site:clutch.co/agencies "B2B SaaS" "United States" — finds agency listings on Clutch filtered by category and location
  • site:company.com ("email" OR "contact") "VP Sales" — finds contact pages on a specific company's site mentioning a VP of Sales
  • intitle:"sponsors" "SaaS" "2026" "conference" — finds conference sponsor pages for SaaS events in 2026
  • site:g2.com/categories "sales engagement" "mid-market" — finds G2 category pages for mid-market sales tools

documents exact-match quotes and operators like site:, so you can verify the syntax there.

How to Scrape Contacts from Any Website with an AI Web Scraper

This is where Thunderbit fits naturally into the workflow. For any of the sources above — a Clutch directory, a company team page, a conference speaker list — the process is the same:

  1. Open the page in Chrome with the installed.
  2. Click "AI Suggest Fields." Thunderbit's AI reads the page and suggests columns like Name, Email, Phone, Title, Company.
  3. Review the suggested fields, add or remove as needed.
  4. Click "Scrape."
  5. Export to Google Sheets, Excel, Airtable, or Notion.

The key advantage is that Thunderbit works on messy, non-standardized sites where no pre-built scraper template would cover the layout. The AI reads each page fresh, adapting to whatever structure it encounters. Thunderbit's free add 1-click extraction from any page — unlimited on the free tier.

Using Subpage Scraping to Enrich Your Lead List

A workflow I use often: scrape a directory listing page first (e.g., a list of companies on Clutch), then use Thunderbit's Subpage Scraping to visit each company's individual page and pull additional data — emails, phone numbers, headcount, tech stack, descriptions.

This turns a basic directory list into an enriched, research-ready lead list without any manual clicking. You go from "here are 50 company names" to "here are 50 companies with contact emails, team sizes, and descriptions" in one automated pass. If you want to learn more about , we've covered that in depth.

How to Build a Lead List Step by Step (The 2026 Workflow)

Below is the full workflow, organized for a non-technical sales or ops person who wants to build a real lead list today.

Step 1: Nail Down Your ICP

Before you open any tool, write down your ICP criteria (refer back to the ICP section above). Industry, company size, geography, buyer titles, triggers, and disqualifiers. This takes 15-30 minutes and saves hours of wasted scraping.

Step 2: Pick Your Lead Sources

Based on your ICP and budget, choose 2-3 lead sources from the comparison table. My recommendation: start with the free sources first. If you're targeting SaaS companies, try G2 category pages and company team pages. If you're targeting local businesses, start with Google Business results and Yelp. Layer in paid sources like Sales Navigator only when you've exhausted what's free.

For each source, here's the extraction method:

  • Websites and directories: Use Thunderbit's AI web scraper. Open the page, click "AI Suggest Fields," review columns, click "Scrape." For popular sites, Thunderbit has that auto-configure fields.
  • LinkedIn: Use Sales Navigator search and export, or Thunderbit for .
  • Google: Run advanced search operators, then scrape the results pages or visit the individual pages.

Export options: Google Sheets, Excel, Airtable, Notion, CSV, JSON.

Step 4: Verify and Clean Your Data

This step is not optional. I'll cover the full verification workflow in the dedicated section below, but the short version: remove role-based emails, deduplicate, run through a verification tool, flag catch-all domains, and re-verify before each campaign.

Step 5: Score and Prioritize Your Leads

Apply a simple scoring model (detailed below) before you start outreach. This ensures you're contacting the highest-value leads first, not just whoever happens to be at the top of the spreadsheet.

Step 6: Export to Your CRM or Outreach Tool

Move the cleaned, scored list into your CRM (HubSpot, Salesforce, Pipedrive) or outreach platform (lemlist, Mailshake, Apollo). Thunderbit exports directly to Sheets, Airtable, and Notion, which can sync with CRMs via native integrations or Zapier.

Step 7: Launch Outreach and Track Results

Personalize your outreach based on the data you've collected. Mention the prospect's industry, reference their intent signal ("I noticed you're hiring SDRs"), and tailor the value proposition to their pain point. Track reply rates, bounces, and conversions — and feed that data back into your ICP and scoring model for the next round.

Budget-First: How to Build a Lead List from $0 to Enterprise

The number one pain point I hear from early-stage founders and small sales teams: "How do I build a lead list without paying for expensive tools?" It's a fair question. ZoomInfo contracts start in the five figures annually. Sales Navigator is $99+/month. Apollo and Lusha have free tiers but gate the good stuff behind paywalls.

The honest answer: you can get surprisingly far for free. But scaling requires some investment. Here's how to think about it tier by tier.

TierCostMethodsTools
Free ($0)$0Google operators, manual LinkedIn, company websites, Thunderbit free tier (6 pages + free email/phone extractors)Thunderbit Free, Google, LinkedIn basic
Low-cost (<$50/mo)$0-50AI scraping at scale, basic enrichment, email verificationThunderbit Starter/Pro, Hunter Starter ($34/mo), Bouncer/NeverBounce PAYG
Mid-range ($50-200/mo)$50-200Sales Navigator, richer filters, CRM integrationSales Navigator Core (~$99/mo), Apollo paid, Lusha
Enterprise ($200+/mo)$200+Intent data, enrichment suites, compliance workflowsZoomInfo (quote-based), Cognism (quote-based), Clearbit

Pricing as of May 2026 — verify current rates before purchasing.

What You Can Accomplish for Free (and Where You'll Hit Limits)

With Thunderbit's free tier (6 pages of AI scraping per month), the free Email Extractor and Phone Number Extractor (unlimited, 1-click), Google search operators, and basic LinkedIn search, a solo founder can realistically build a 50-100 lead list in an afternoon. I've watched people on our team do it.

Where you'll hit limits: volume (pages per month on the free tier), enrichment depth (you won't have intent data or tech stack info without paid tools), and email verification at scale (free verification tools cap at small volumes). When those limits start pinching, that's when it makes sense to move to the low-cost tier — unlock subpage scraping, bulk scraping, pagination, and scheduled scrapers.

The Bounce Rate Fix: A Data Verification Workflow That Actually Works

email-validation-workflow.webp

I've seen forum posts from users reporting from purchased lists. That's not just wasted effort — it's actively dangerous. High bounce rates tank your sender reputation, which means even your good emails start landing in spam.

say a healthy bounce rate is below 2%, and below 1% is great. found that nearly half of senders report bounce rates in the 2-5% range, while . If you're above 5%, your sender reputation is in trouble.

The verification workflow I recommend:

  1. Remove role-based emails: Delete any info@, sales@, support@, admin@ addresses unless you're intentionally targeting shared inboxes (rare for cold outreach).
  2. Remove formatting errors: Duplicates, typos, missing domains, dead domains. A quick sort and filter in your spreadsheet catches most of these.
  3. Run through an email verification tool: Hunter, ZeroBounce, NeverBounce, Bouncer, or Kickbox. These tools ping the mail server to check if the inbox exists without sending an email.
  4. Flag or remove catch-all domains: that catch-all addresses are a major risky category — they accept mail at the server level without proving a specific inbox exists. If you can't verify the individual inbox, flag the record and treat it as lower confidence.
  5. Re-verify before each campaign: Data decays fast. If your list is more than 30-90 days old, run verification again before sending.
  6. Send small batches first: Watch bounce and complaint rates on the first 50-100 sends. Scale only if quality holds.

Why Your Lead Source Affects Data Quality

Not all lead data is created equal. An email pulled from a company's public team page — where the person intentionally listed their contact info — is typically fresher and more accurate than an email from an aggregator database that hasn't been updated in months.

This is one of the reasons I believe in scraping from live public pages rather than relying solely on static databases. Because Thunderbit's AI reads the actual website fresh each time (not a stale database), the extracted emails and phone numbers tend to be current. The Phone Number Extractor also reformats numbers to E.164 standard, which reduces downstream format errors when importing to CRMs.

Scraping from fresh sources is not a substitute for verification — but it does mean you start with cleaner raw material.

Pre-Campaign Checklist

Before you hit "send" on any campaign:

  • [ ] All emails verified within the last 30 days
  • [ ] No role-based addresses (info@, sales@) in the send list
  • [ ] No duplicates
  • [ ] Bounce rate from last campaign reviewed
  • [ ] Opt-out/unsubscribe mechanism in place
  • [ ] Suppression list synced (honor all previous opt-outs)

Build-Then-Score: A Simple Lead Scoring Model for Small Teams

Every guide I've read says "prioritize your leads" — and then moves on without telling you how.

lead-scoring-list-maintenance.webp

If you're a solo founder or a three-person sales team, you don't need Salesforce Einstein or a predictive scoring engine. You need a spreadsheet column with a transparent formula.

The Scoring Framework

SignalPointsExample
Matches ICP industry+20SaaS, mid-market
Company size fit+1051-500 employees
Decision-maker title+15VP Sales, Head of Ops
Clear intent signal+15Hiring, funding, tool migration
Email verified+10Passed verification
Direct source quality+10Company page, event speaker page
Engaged with your content+10Downloaded guide, webinar attendee
Catch-all / unverified email-10Risky verification status
Role-based email-10info@, sales@
Generic title (no clear role)-5"Staff"

Worked Example

Lead A: VP Sales at a 120-person SaaS company, hiring SDRs, email verified, sourced from a company careers/team page.
Score: 20 (industry) + 10 (size) + 15 (title) + 15 (intent) + 10 (verified) + 10 (source) = 80 → Prioritize for outreach this week.

Lead B: "Staff" at a 5-person hobby business, role-based email, no intent signal.
Score: 0 + 0 + 0 + 0 + 0 - 10 (role-based) - 5 (generic title) = -15 → Skip or remove.

This can live as a simple formula in Google Sheets. Something like:

1=IF(D2="SaaS",20,0)+IF(AND(E2>=51,E2&lt;=500),10,0)+IF(REGEXMATCH(B2,"VP|Head|Director|Founder"),15,0)+IF(J2&lt;>"",15,0)+IF(K2="Verified",10,IF(K2="Catch-all",-10,0))

No Salesforce required.

How to Use AI to Label and Score Leads During Scraping

One of the features my team built into Thunderbit that I find genuinely useful for scoring: Field AI Prompts. When you're configuring your scrape, you can add a prompt to any column — for example, "Classify this lead's seniority as Decision-Maker, Influencer, or Individual Contributor based on the job title and page context."

Thunderbit labels the data during extraction, not after. So when you export to Sheets, the seniority classification, company type, or industry tag is already there — ready to plug into your scoring formula. This cuts out the manual tagging step that makes scoring feel like a chore.

You can also use Subpage Scraping to enrich the original listing data: scrape a directory first, then visit each company's page to pull headcount, funding status, or tech stack — all of which feed into your scoring model.

When to Revisit and Update Your Scores

Lead scores aren't set-and-forget. Re-score monthly, or after any major campaign. If a lead replies positively, their score changes (they're now an active conversation, not a cold lead). If an email bounces, adjust accordingly. If a company that was hiring six months ago has since laid off staff, the intent signal has shifted.

How to Keep Your Lead List Fresh (Automation and Maintenance)

A lead list is not a one-time project.

I've already mentioned that . Contacts change jobs, companies pivot, emails go stale. If you build a great list in May and don't touch it until October, a significant chunk of it is already dead.

Maintenance Cadence

TaskFrequencyWhy
Verify emailsBefore every campaign (or at least monthly)Prevent hard bounces
Deduplicate contactsWeekly during active prospectingAvoid duplicate outreach
Refresh intent signalsMonthlyHiring/funding/reviews change quickly
Update company firmographicsQuarterly or semi-annuallySize, revenue, and tech stack drift
Suppression list syncDaily or real-timeHonor opt-outs and reduce complaints
Source performance reviewMonthlyFind which channels create replies, not just rows

Setting Up a Scheduled Scrape for Ongoing Lead Generation

This is where Thunderbit's Scheduled Scraper comes in. Instead of manually re-visiting directories every month, you can set up a recurring scrape. The setup is simple: describe the time interval in plain language (e.g., "every Monday at 8am"), input the website URLs, and click "Schedule." Thunderbit's AI transforms your words into a schedule and runs the scrape automatically, exporting fresh results to your connected Google Sheet or Airtable base.

Use cases I've seen work well:

  • A sales team re-scrapes a Clutch category page monthly to catch new agencies entering the market.
  • An ecommerce ops team monitors a competitor directory weekly for new product listings.
  • A SaaS founder refreshes a G2 category page before each monthly outbound batch to find newly listed companies.

Thunderbit's cloud mode can , so even large directories get refreshed quickly. For more on setting this up, check out our guide on .

A Quick Note on Compliance and Data Privacy

I'll keep this brief because it's not the focus of this guide, but it's essential.

  • CAN-SPAM (U.S.): Applies to all commercial email, including B2B. The each separate violating email can trigger penalties up to . Requirements: accurate headers, non-deceptive subject lines, valid postal address, clear opt-out, and honor opt-outs within 10 business days.
  • GDPR (EU/UK): Named business emails can be personal data. The B2B marketing must not conceal identity, must provide a valid opt-out, and must respect objections.
  • CCPA/CPRA (California): Emphasizes notice, purpose limitation, data minimization, and consumer rights. The has the latest details.
  • Google and Yahoo sender rules: bulk senders to keep spam rates below 0.30%, authenticate with SPF/DKIM/DMARC, and support one-click unsubscribe. .

Bottom line: scrape only publicly available data, avoid login walls without permission, always include an opt-out mechanism, maintain suppression lists, and verify local legal requirements. Thunderbit scrapes publicly available pages — users are responsible for how they use the data.

Conclusion and Key Takeaways

The 2026 lead list workflow isn't about finding more names — it's about building a smaller, fresher, verified, source-aware outreach dataset that actually gets replies.

Here's the full workflow in summary:

  1. Define your ICP before touching any tool.
  2. Choose 2-3 lead sources — start free (directories, company pages, Google operators) before paying for databases.
  3. Extract leads with AI scraping — Thunderbit's 2-click process works on virtually any public page.
  4. Build a proper template with source tracking, intent signals, and scoring columns.
  5. Verify and clean — remove role-based emails, deduplicate, run verification, flag catch-alls.
  6. Score and prioritize — use a transparent spreadsheet model, not gut feel.
  7. Export to CRM/outreach — personalize based on the data you've collected.
  8. Track outcomes — bounces, replies, conversions, by lead source.
  9. Refresh continuously — re-verify before campaigns, re-scrape high-value sources on a schedule.

The data backs this up: by nearly 3x on reply rates. A 200-lead verified list will almost always outperform a 5,000-contact stale database.

Ready to build your first list? gives you 6 pages of AI scraping per month, unlimited free email and phone extraction, and export to Google Sheets or Excel. That's enough to build your first 50-100 leads this afternoon.

Try AI Web Scraper for Lead Lists

FAQs

How many leads should be on my first lead list?

Start with 50-100 well-targeted, verified leads rather than thousands of unqualified contacts. Hunter's data shows that campaigns with smaller, tighter recipient lists (21-50) average 6.2% reply rates — nearly triple the rate of campaigns with 501+ recipients. Quality compounds; volume dilutes.

Should I buy a lead list or build my own?

Building your own is almost always better. Purchased lists carry higher risk: stale data, spam traps, opaque sourcing, and compliance exposure. Self-built lists using AI scraping and manual research yield fresher, more relevant data because you're pulling from live public pages where contacts are current. If you do buy, demand transparency on collection date, verification date, consent basis, and refresh process.

What's the best free way to build a lead list?

Combine Google advanced search operators (site:, intitle:, exact-match queries) with Thunderbit's free tier — 6 pages of AI scraping per month plus unlimited free email and phone extraction — and basic LinkedIn search. This combination covers company pages, directories, event lists, and professional profiles without spending a dollar.

How often should I update my lead list?

Re-verify emails before every campaign, especially if the list is more than 30 days old. Do a full refresh — re-scrape sources, update firmographics, remove dead leads — at least quarterly. ZeroBounce reports that at least 23% of an email list decays within one year, so "set it and forget it" is a recipe for rising bounce rates.

What's a good reply rate for cold outreach from a lead list?

Based on 2025-2026 benchmarks: 3-5% positive reply rate is good, 5-8% is strong, and 8%+ is excellent. The single biggest factor is list quality — targeting, verification, and personalization. A well-built list with verified emails, clear intent signals, and personalized messaging will consistently outperform a larger list with generic contacts and boilerplate copy.

Learn More

Shuai Guan
Shuai Guan
CEO at Thunderbit | AI Data Automation Expert Shuai Guan is the CEO of Thunderbit and a University of Michigan Engineering alumnus. Drawing on nearly a decade of experience in tech and SaaS architecture, he specializes in turning complex AI models into practical, no-code data extraction tools. On this blog, he shares unfiltered, battle-tested insights on web scraping and automation strategies to help you build smarter, data-driven workflows.When he's not optimizing data workflows, he applies the same eye for detail to his passion for photography.

Try Thunderbit

Scrape leads & other data in just 2-clicks. Powered by AI.

Get Thunderbit It's free
Extract Data using AI
Easily transfer data to Google Sheets, Airtable, or Notion
PRODUCT HUNT#1 Product of the Week