Back to Guides

    Building a Target List with Data Tools

    How to use data tools to find acquisition targets without drowning in junk

    6 min read

    Key Takeaways

    • Data tools do not find deals. They give you a list. Your process turns the list into conversations
    • Start with your target company profile, then build filters that match it. Tight filters beat massive exports
    • Use North American Industry Classification System ("NAICS") codes as the base because NAICS is more detailed than the older Standard Industrial Classification ("SIC") system, especially in services
    • Private company estimates are noisy. Treat revenue and employee counts as filters, not facts
    • Contact data decays fast. Plan for meaningful cleanup and validation each time you run a list

    There are two ways buyers waste time with databases. The first is pulling a list of 5,000 companies and never calling anyone. The second is treating database fields like audited financials.

    The right mindset is simple: a data tool is a prospecting engine. It helps you create a clean, prioritized list of companies that might fit your target company profile. Then you do the real work.

    What Data Tools Provide (and What They Don't)

    Most tools in this category (D&B Hoovers, ZoomInfo, Apollo, PitchBook for larger deals, and similar) try to provide company name, address, website, industry classification in NAICS and sometimes SIC, employee count (estimated or self-reported), revenue (estimated), and key contacts including names, titles, emails, and phone numbers.

    That is enough to start direct outreach. It is not enough to underwrite a deal.

    These tools have different strengths. A simple way to think about it: some tools are stronger on firmographics (company-level data), while others are stronger on contacts (people-level data). For acquisition sourcing, you need both. You need firmographic filters to match your target company profile, and you need an actual owner or decision-maker contact path. If your tool is weak on owner contacts, your output list will look great and perform poorly.

    Building Filters That Match Your Target Company Profile

    Start with industry. Use NAICS codes, then layer keywords. NAICS is a modern classification system that recognizes more businesses than SIC did, especially in services. That extra specificity helps when you are hunting small operators that are easy to miss in broad categories. A practical note: owner-operated businesses are often miscoded, so do not rely on one NAICS code. Use a small cluster.

    Filter geography by state or metro area, radius from a zip code if available, and distinguish between "headquarters" versus "location" entries (important for multi-location businesses).

    For size, self-funded search filters usually include employee count as a proxy for scale, revenue as a proxy for deal size, and sometimes years in business as a proxy for stability. Treat size estimates as noisy. If your filter says "$3M to $10M revenue," do not assume the company is actually in that band. Use it to avoid obviously wrong targets.

    When deciding what fields to capture, do not export the entire universe of columns. Capture what you will actually use. Minimum fields include company name, website, location (city, state), NAICS code and a plain-English industry tag, employee count estimate, revenue estimate, owner or primary contact name, title, email and phone if available, and data source (which tool, which date). Add fields that help outreach: a "personalization hook" (one line you can reference), notes from quick research, and last touch date with next touch date.

    Cleaning and Enriching the List

    Your list is not real until it is clean. You will see duplicates across parent and subsidiary entries, slight naming differences, and multi-location listings. Pick a single canonical record per target. Remove non-targets: franchises or corporate chains (if you are not targeting them), businesses outside your geography even if the HQ is in it, and businesses with no realistic contact path.

    Segment by priority. Do not treat every target equally. Create tiers: Tier 1 for perfect fit (outreach first), Tier 2 for decent fit (outreach after Tier 1), and Tier 3 for long shots. This keeps you from wasting your best outreach energy on your worst targets.

    B2B contact data decays quickly. A common benchmark is that B2B data can decay at roughly 20% to 30% per year. That means a meaningful portion of emails and phone numbers you pull today will be wrong next year, and some will be wrong today. Do not fight this. Design around it: validate emails where possible, use multiple contact paths (email, phone, LinkedIn, letter), and track bounces and update the list as you go.

    Data tools will miss key context. An enrichment pass is where you add the things that make outreach work: who actually owns the business, what they actually do (in plain English), any succession signals (founder, long-tenured owner, leadership gap), and a real personalization hook. You can do this manually for Tier 1, and lightly for Tier 2. This is also where AI can help, but only if you feed it the right sources.

    Common Mistakes

    The patterns that derail list-building are predictable: pulling a massive list and never touching it, over-filtering so hard you end up with 20 targets total, under-filtering so everything looks like a target, trusting revenue estimates, not capturing enough context to personalize outreach, and not tracking touches and follow-ups.

    This guide does not cover cold outreach scripts (see cold outreach guide), what to ask once an owner responds (see seller call guide), or pricing and tool selection (changes often and depends on your workflow).

    What to Do Next

    The list is the raw material. The process you run on top of it determines whether you book calls or burn leads.

    A well-built list should give you enough signal to prioritize and enough context to personalize. If you find yourself staring at 500 names with no idea where to start, go back and tighten your filters or add an enrichment pass.

    Once the list exists, you have two jobs: work it efficiently, and reach out in a way that does not sound like spam.

    Read next:


    Sources

    • U.S. Bureau of Labor Statistics: Industry Classification Overview (NAICS, differences vs SIC)
    • U.S. Bureau of Labor Statistics: What Is NAICS?
    • U.S. Census Bureau: NAICS reference files and tools
    • Leadspace: The Cost of Data Decay to Your Business (B2B data decay benchmark discussion)
    • HubSpot: Database Decay Simulation (email database decay estimate)
    • Marquette University Library guide: Private company information (why estimates are often all you can find)