Introduction
In the fast-moving logistics and construction ecosystem, having access to updated business directories can accelerate sales, marketing, and vendor outreach efforts. When a client approached Actowiz Solutions with the goal of creating a comprehensive dataset of asphalt and dirt moving trucking companies in the Southeastern United States, the project required precision, geolocation targeting, and efficient scraping from structured directories such as Google Yellow Pages.
This case study outlines how Actowiz Solutions helped the client collect accurate business information across Florida, Georgia, Alabama, South Carolina, North Carolina, and Tennessee, and organized the data into a clean, spreadsheet-friendly format.
Client Objective
The client was searching for a reliable and experienced web scraping provider to:
- Extract trucking company information from Google Yellow Pages.
- Focus exclusively on asphalt and dirt movers.
- Collect the following fields:
- Business Name
- Address
- Phone Number
- Other relevant business details (website, ratings, hours)
- Email (if available)
- Organize all data into separate columns in a spreadsheet.
- Ensure geo-targeting across 6 U.S. states: FL, GA, AL, SC, NC, and TN.
The client wanted to use this data for B2B outreach, sales prospecting, and regional supplier network expansion.
Challenges in Extracting Regional Business Listings
Despite the structured layout of Google Yellow Pages, scraping this type of business data came with challenges:
- Category-specific filtering: There was no single label for “asphalt and dirt movers,” so custom keyword-based search filters had to be implemented.
- Location scoping: Multiple cities and ZIP codes across six states had to be covered systematically.
- Duplicate entries: Yellow Pages often shows the same business across nearby locations.
- Captcha and anti-bot systems: Throttling mechanisms had to be bypassed legally and respectfully.
- Consistency: All extracted data had to be standardized (e.g., phone numbers, street addresses).
Actowiz Solutions’ Approach
Actowiz initiated a multi-phase scraping strategy to ensure precision, speed, and compliance:
Phase 1: Keyword Query Testing - Customized search strings like “asphalt hauling company,” “dirt movers,” and “trucking for construction” were used to identify Yellow Page listings.
Phase 2: Geo-Segmented Data Collection - Each target state was split into metropolitan regions (e.g., Miami, Orlando, Atlanta, Nashville). - Scraping was done state-wise and city-wise using geo-coordinates.
Phase 3: Structured Parsing and Cleaning - Business names, phone numbers, and addresses were parsed using XPath/CSS selectors. - All unstructured fields like business descriptions were cleaned using NLP techniques to filter relevant companies.
Phase 4: De-duplication and Formatting - Duplicate records were removed using fuzzy match algorithms. - Final datasets were arranged into clean spreadsheets with column headers.
Technology Stack & Tools Used
- Scrapy (Python framework) for structured crawling
- Selenium for handling dynamic JavaScript pages
- Pandas for data cleaning and structuring
- Proxy Rotation APIs to avoid IP bans
- Google Maps API for verifying address geolocation
- Regular Expressions + Named Entity Recognition (NER) for filtering job-specific services
Data Fields & Formatting
The final dataset was organized in an Excel-compatible .CSV file with the following fields:
Column Name |
Description |
Business Name |
Full name of the trucking company |
Address |
Full street address with ZIP code |
City |
Extracted from full address |
State |
One of the six states (FL, GA, etc.) |
Phone Number |
In standard format (e.g., (555) 123-4567) |
Website |
Official URL if available |
Email Address |
If publicly listed |
Business Description |
Extracted keywords like ‘asphalt’, ‘dirt hauling’ |
Ratings |
If listed on Yellow Pages |
Working Hours |
Business hours (if listed) |
Sample Dataset Preview
Business Name |
Address |
City |
State |
Phone |
Website |
Description |
Southern Asphalt Haulers |
432 Industrial Blvd |
Orlando |
FL |
(407) 555-7832 |
www.southernasphalt.com |
Asphalt & dirt hauling |
RedClay Dirt Movers |
1221 Highway 92 E |
Fayetteville |
GA |
(770) 555-2211 |
www.redclaymovers.com |
Dirt, rock, and sand hauling |
Tennessee Haul & Dump |
987 Route 19 |
Nashville |
TN |
(615) 555-4433 |
N/A |
Excavation and trucking |
Delivery Timeline & Quality Assurance
The entire project was completed in under 10 business days, broken into:
- Day 1–2: Keyword and location strategy finalized
- Day 3–6: Data scraping and storage
- Day 7–8: Data cleaning, de-duplication, formatting
- Day 9: Manual QA + random sample checks
- Day 10: Final delivery and client walkthrough
QA protocols included: - Checking 50 random records per state - Spot verification of addresses on Google Maps - Formatting verification for phone and emails
Client Impact & Outcomes
950+ trucking businesses were captured across the 6 states.
92% of listings included valid phone numbers.
78% had associated business descriptions.
Over 300 companies had websites, 90+ had public emails.
Outcomes achieved: - Client launched a B2B outreach campaign targeting construction trucking partners. - Internal CRM enriched with clean regional data. - Enabled faster expansion into the Southeastern U.S. market.
The client remarked on the: - High accuracy and consistency of the data - Well-labeled spreadsheet that could be imported into Salesforce - Quick turnaround time without compromising on quality
Conclusion
This case study demonstrates the power of regional, niche-focused scraping services delivered by Actowiz Solutions. By blending smart keyword filters, precise geo-targeting, robust scraping infrastructure, and manual QA, Actowiz enabled the client to unlock a critical dataset for asphalt and dirt moving companies across the Southeast U.S.
Whether you’re in construction, logistics, local business analytics, or B2B lead generation, our ability to scrape structured business listings from directories like Google Yellow Pages provides a competitive advantage.
Need help building your own targeted business datasets? Reach out to Actowiz Solutions today!