See how Actowiz Solutions scraped and organized current Indian government schemes across healthcare, education, agriculture, and business sectors.
Location: Mumbai, India
Industry: Public Data Analytics & Policy Research
Objective: To collect, validate, and structure current Central and State Government schemes across key sectors — Healthcare, Education, Agriculture, and Business Development — from verified government sources and portals.
The client required an up-to-date, accurate, and comprehensive dataset for both Central and State-specific welfare programs. The focus was on ongoing schemes only, excluding outdated or discontinued ones.
Public sector data is often scattered across multiple portals, PDFs, and departmental websites. For India, this fragmentation occurs between:
Actowiz Solutions' mandate was clear:
| Source | Description | Type |
|---|---|---|
| india.gov.in | Official government directory of Central schemes | Central |
| mygov.in | Citizen engagement and scheme announcements | Central |
| pib.gov.in | Official Press Information Bureau releases | Central |
| State Government Portals | Schemes per state (maharashtra.gov.in, tamilnadugov.in, up.gov.in, etc.) | State |
| Ministry Sites | Agriculture, MSME, Education, Health, Finance, Women & Child Development | Sectoral |
| News & Gazette Updates | Scheme launches and updates | Cross-source validation |
| Component | Tools Used |
|---|---|
| Web Scraping | Python (Scrapy + BeautifulSoup + Requests-HTML) |
| Dynamic Rendering | Playwright (for JavaScript-heavy sites) |
| Data Parsing | Regex, Pandas |
| Data Storage | MySQL + CSV + JSON |
| Validation Engine | Rule-based filters for "active" schemes |
| Dashboard Visualization | Power BI / Tableau |
| Automation | Cron jobs for weekly updates |
[ Government Websites (Central & State) ]
↓
[ Scrapy Spider + Playwright Automation ]
↓
[ HTML & PDF Parsing (Titles, Descriptions, URLs) ]
↓
[ NLP-based Keyword Categorization (Healthcare / Education / etc.) ]
↓
[ Validation & Deduplication ]
↓
[ Structured Export (CSV, JSON, MySQL) ]
↓
[ Power BI Dashboard Visualization ]
| Field | Description |
|---|---|
| Scheme Name | Official scheme title |
| Type | Central / State |
| State / Ministry | Applicable entity |
| Category | Healthcare, Education, Agriculture, Business |
| Description | Summary of benefits |
| Target Group | Farmers, Students, Entrepreneurs, Women, MSMEs, etc. |
| Launch Year | Year of introduction |
| Current Status | Active / Merged / Suspended |
| Official Link | Source URL for validation |
| Scheme Name | Type | Sector | Target Group | Description | Source |
|---|---|---|---|---|---|
| Ayushman Bharat Pradhan Mantri Jan Arogya Yojana | Central | Healthcare | Low-income families | Health insurance coverage up to ₹5 lakh per family per year. | https://pmjay.gov.in |
| PM-KISAN Samman Nidhi | Central | Agriculture | Small & marginal farmers | Direct income support of ₹6,000 annually in three installments. | https://pmkisan.gov.in |
| Startup India Seed Fund Scheme | Central | Business | Startups / Entrepreneurs | Early-stage funding support for startups across sectors. | https://startupindia.gov.in |
| Samagra Shiksha Abhiyan | Central | Education | School students | Integrated education scheme for holistic school development. | https://education.gov.in |
| Mahatma Jyotirao Phule Jan Arogya Yojana | State (Maharashtra) | Healthcare | Residents of Maharashtra | Free healthcare for families below income threshold. | https://jeevandayee.gov.in |
| Rythu Bandhu Scheme | State (Telangana) | Agriculture | Farmers | Investment support for each crop season at ₹10,000/acre. | https://rythubandhu.telangana.gov.in |
| Sector | Total Schemes (Approx.) |
|---|---|
| Healthcare | 42 |
| Education | 38 |
| Agriculture | 55 |
| Business & Industry | 33 |
| Women & Child Development | 20 |
| Skill & Employment | 27 |
Insight: Agriculture and Healthcare remain the most active sectors with the highest number of ongoing initiatives in 2024–2025.
Actowiz Solutions built custom data validation modules to ensure reliability:
| Metric | Achieved |
|---|---|
| Total Schemes Extracted | 215+ (across 24 states and 1 UT) |
| Verified Active Schemes | 180+ |
| Central Schemes | 95 |
| State Schemes | 85 |
| Average Update Cycle | Weekly (Automated) |
| Data Accuracy | 98.6% validated |
| Phase | Duration | Activities |
|---|---|---|
| Discovery & Source Mapping | 2 Days | Identified verified central & state sources |
| Scraper Development | 5 Days | Built Scrapy + Playwright hybrid crawler |
| Data Extraction & Cleaning | 3 Days | Parsed, validated, and normalized data |
| QA & Output Formatting | 2 Days | Validated schema and removed duplicates |
| Dashboard Setup | 2 Days | Visualization in Power BI |
| Total Duration | ~12 Days | End-to-end delivery |
“Actowiz Solutions turned a difficult, fragmented research task into a structured data system. Their scraping and validation accuracy were exceptional, and we now have an updated dashboard tracking all active schemes weekly.”
— Head of Policy Analytics, Mumbai-based Consultancy
Scraping limited to publicly accessible .gov.in and .nic.in domains.
No personal or confidential data collected.
Compliant with Indian IT Act and data usage policies.
Data used strictly for public research and analytics.
Actowiz Solutions follows ethical web scraping practices and ensures data accuracy and compliance at every stage.
This case study demonstrates how Actowiz Solutions transformed the complex, decentralized landscape of Indian government schemes into an organized, real-time dataset for decision-makers.
Through cutting-edge scraping technology, data validation, and classification, Actowiz Solutions helped the client gain:
With automation and NLP-driven classification, the client now maintains a live, weekly-updated dashboard of verified schemes — ensuring informed decisions and transparent analytics.
Our web scraping expertise is relied on by 4,000+ global enterprises including Zomato, Tata Consumer, Subway, and Expedia — helping them turn web data into growth.
Watch how businesses like yours are using Actowiz data to drive growth.
From Zomato to Expedia — see why global leaders trust us with their data.
Backed by automation, data volume, and enterprise-grade scale — we help businesses from startups to Fortune 500s extract competitive insights across the USA, UK, UAE, and beyond.
We partner with agencies, system integrators, and technology platforms to deliver end-to-end solutions across the retail and digital shelf ecosystem.
Tivanon Tyre Data Extraction enables real-time pricing transparency and competitive benchmarking, helping automotive businesses optimize strategy and profits.
How a $50M+ consumer electronics brand used Actowiz MAP monitoring to detect 800+ violations in 30 days, achieving 92% resolution rate and improving retailer satisfaction by 40%.

Track UK Grocery Products Daily Using Automated Data Scraping across Morrisons, Asda, Tesco, Sainsbury’s, Iceland, Co-op, Waitrose, and Ocado for insights.
Whether you're a startup or a Fortune 500 — we have the right plan for your data needs.