KK-DATA avatar KK-DATA

Building a WhatsApp Number Filtering Pipeline: From Number Generation to Active Detection Complete Guide

筛号系统 Whatsapp筛号 kkdata 出海营销

How to Build a WhatsApp Number Screening Pipeline? A Complete Guide from Number Generation to Active User Detection

Overseas marketing and community operation teams face large volumes of WhatsApp numbers every day. How can you quickly determine which numbers are valid, which are recently active, and which can be used for precise retargeting? A number screening system with a WhatsApp pipeline can seamlessly connect number preparation, detection, deduplication, and export, completely eliminating the inefficiency and waste of manual verification. This article, based on the KK-DATA platform, will guide you step by step in building a complete WA screening pipeline.

What is a WhatsApp Screening Pipeline?

A WhatsApp screening pipeline refers to the complete automated process that starts from number data preparation, goes through batch detection (valid, active, wsid, etc.), and ends with deduplication and export. Unlike a single random screening task, the pipeline emphasizes three characteristics:

  • Repeatability: The same batch of numbers can be subjected to different detection dimensions (e.g., first test validity, then filter for activity).
  • Modularity: Number generation, screening detection, deduplication, and export are independent and can be flexibly combined.
  • Traceability: Each number comes with detection labels, facilitating subsequent marketing segmentation.

Simply put, the pipeline lets you complete the entire process of “generate numbers → detect registration status → identify active users → export wsid” at one go, without having to manually download multiple files and stitch them together. The KK-DATA console is designed around this pipeline, from global number generation to multi-platform screening, and then to a data deduplication warehouse, forming a closed loop.

Why Do Overseas Marketing Teams Need a Dedicated WA Screening Pipeline?

Teams without a pipeline often encounter three pain points:

  1. Inconsistent number quality: Numbers collected from public sources may have over 30% not registered on WhatsApp. Sending messages directly wastes a lot of time and resources.
  2. Low-efficiency manual verification: Adding contacts one by one on your phone or using third-party small tools to verify can process at most a few hundred numbers per day, completely unable to support the need for thousands or tens of thousands of leads.
  3. Wasted balance from repeated detections: Submitting the same batch of numbers for screening multiple times leads to repeated charges. Without a deduplication mechanism, every penny spent could be wasted.

A screening system with a WhatsApp pipeline directly solves these problems: batch verification of validity and activity, automatic deduplication to avoid repeated charges, and one task that exports valid numbers, activity tags, and wsid. Notably, KK-DATA uses a pay-per-number model (no subscription plans), so you only pay for what you use, allowing even small teams to run a pipeline at low cost.

Three Core Stages in Designing a WhatsApp Screening Pipeline

The pipeline can be simplified into three stages, each with corresponding operation interfaces and key points.

Number Data Preparation (Generate or Import)

The starting point is the list of numbers. You can obtain them in two ways:

  • Random generation: Use a random number generation tool to create numbers by country, prefix, or custom rules. KK-DATA supports random generation for numbers in 240+ countries and regions, and generation is free.
  • Import CSV: If you already have a target number database (e.g., obtained via crawling or purchase), upload a CSV file directly. Pay attention to format requirements (generally a single column of numbers, with or without international dialing codes).

Tip: When running the pipeline for the first time, test with a small batch (e.g., 1000 numbers) to confirm detection types and formatting are correct before submitting the full volume.

WhatsApp Screening Detection

This is the core of the pipeline. After selecting WhatsApp as the platform, you need to check the detection types:

  • Valid number detection: Determines whether the number is registered on WhatsApp (i.e., “WA activated”).
  • Activity detection: Determines whether the number has been online within a recent period (e.g., 7 days, 15 days, 30 days).
  • wsid export: Retrieves the WhatsApp user’s unique identifier for subsequent message delivery and retargeting.

KK-DATA supports up to approximately 1 million numbers per task. You can freely combine detection types. For example, you can check only valid + active, or valid + wsid, or all three.

Result Export and Data Management

After detection, results are exported in CSV or TXT format. Each number comes with detection status (valid/invalid, active/inactive), wsid field, etc. Additionally, the platform’s built-in data deduplication warehouse automatically records the numbers already detected in this batch, so that when you submit the same numbers again, you won’t be charged again.

These three stages form a complete WhatsApp screening pipeline. Now let’s dive into the practical steps.

How to Build a WhatsApp Screening Pipeline Step by Step (Practical Guide)

The following steps are based on the KK-DATA console. You can log in at https://app.kkdata.cc/ and follow along.

Step 1: Prepare the Number List in the Console

After logging in, go to the “Global Number Generation” module. If you want to test WhatsApp users in Indonesia, select country “Indonesia”, set the quantity to generate (e.g., 5000 numbers), and click Generate. The generated numbers will be saved in your number list.

If you already have an existing CSV of numbers, go directly to the “Number Management” upload feature and upload the file. Note: The file should be in plain text format, one number per line, preferably with international dialing codes (e.g., 628123456789).

Step 2: Create a WhatsApp Screening Task

On the “Screening Task” page, click “New Task”, select the platform as WhatsApp. Then check the required detection types.

Detection Type Description

  • Valid: Only checks if the number is registered on WhatsApp. Hit rate is about 30%–70% (depending on data source quality).
  • Active: On top of valid, further judges whether there has been recent online behavior. Recommended to choose a 7-day or 15-day window, suitable for marketing scenarios that need quick reach.
  • wsid: Exports the user’s unique identifier for precise message routing. Note: wsid export requires additional authorization and has a different price; see real-time prices in the console.

It is recommended to check “Enable number deduplication”, so the system will automatically compare with the data deduplication warehouse to avoid charging for already-detected numbers.

Step 3: View Estimated Cost and Submit the Task

Before submission, the interface will display the estimated cost. This is because KK-DATA charges per number for each detection type, with different unit prices for each type. If the balance is insufficient, the task cannot be submitted. You need to recharge first via USDT (TRC20), with a minimum of about 50 USDT. The balance is credited instantly, no waiting.

After confirming the cost and number list are correct, click Submit. The task starts executing, and you can view its progress in the task list.

Step 4: View Results and Export Data

When the task is complete, you will receive a Telegram notification (you need to bind notifications in advance). In the console, go to the task details to see the summary statistics: number of valid numbers, number of active numbers, wsid export count, etc. Click “Export Results”, choose CSV or TXT format, and download.

In the exported file, each number will contain fields like the following (example):

NumberValidActive (7 days)wsid
628123456789YesYes3A…
628987654321YesNoEmpty

This gives you a set of high-quality data ready for WhatsApp marketing, without needing to organize manually.

What Is the Difference Between “Valid”, “Active”, and “wsid” in WhatsApp Screening?

These three concepts correspond to different business goals:

  • Valid number: The number is registered on WhatsApp. This is the most basic filter to avoid sending messages to non-existent numbers. Suitable for first-round cleaning.
  • Active number: Users who have been online within a specified window. Higher activity usually leads to higher response and conversion rates. Suitable for promotional scenarios that need immediate feedback.
  • wsid: The internal identifier of a WhatsApp user system. After exporting wsid, even if the number changes or the app is uninstalled, you can still target the user precisely for retargeting. However, wsid export typically involves a more advanced detection method and has a slightly higher cost.

You can choose combinations based on your marketing rhythm: For cold-start acquisition, just detecting whether the number is valid is sufficient; for second-touch or precise targeting, it’s advisable to add active detection and wsid export.

How to Use the Data Deduplication Warehouse to Avoid Wasting Balance on Repeated Detections?

The data deduplication warehouse is an often overlooked but extremely cost-saving feature of the pipeline. Its principle is: when you submit a batch of numbers for screening, the system automatically stores this batch’s numbers and detection results in the warehouse. When any subsequent task encounters the same numbers again, regardless of which detection types you check, the system will prioritize reading existing results from the warehouse and will not charge you again.

For example: you first detect 5,000 numbers for “valid + active”, paying for 5,000 numbers. A week later, you want to detect the same batch for “wsid”. You only need to submit the same number list and check “wsid”. The system first matches the warehouse records; it only charges for the wsid part that hasn’t been detected yet (if the numbers already had valid and active detected previously, this time you are only charged for the wsid detection of those 5,000 numbers). If you enable deduplication, the system will even inform you which numbers and results already exist in the warehouse, avoiding double submission.

Operation path: When creating a new task, check “Enable data deduplication”. The warehouse data is retained by default, no manual management needed.

WhatsApp Screening Pipeline Checklist and Best Practices

To ensure smooth operation of the pipeline, we recommend checking the following items before each execution:

  • Have you topped up enough balance? Minimum about 50 USDT, pay as you go.
  • Are the number formats correct? Preferably with international dialing codes, no spaces or parentheses.
  • Have you selected the correct platform (WhatsApp)?
  • Do the detection types match your current marketing goals? Refer to the “valid/active/wsid selection matrix” above.
  • Have you enabled data deduplication? To avoid repeated charges.
  • Have you tested with a small batch first? Run 100–500 numbers to confirm the results are correct before submitting the full volume.
  • Have you bound Telegram notifications? So you can be notified in real time when tasks are completed.

Efficiency Tip

A single WhatsApp screening task can simultaneously export valid numbers, activity tags, and wsid, without needing multiple operations. Combined with the data deduplication warehouse, repeated numbers incur no additional costs. With a pay-per-number model, even small batches are cost-effective, suitable for testing and iteration.

Frequently Asked Questions

Q: How far back can WhatsApp screening detect activity?

A: Currently, it supports custom activity windows, typically offering options for 7 days, 15 days, or 30 days of recent activity data. Specific options can be seen when submitting a task (based on the actual KK-DATA console interface). We recommend choosing a window based on your marketing rhythm: 7 days for quick acquisition, 30 days for brand exposure.

Q: What is the use of exporting wsid?

A: wsid is a WhatsApp user identifier, which can be used for precise marketing (e.g., message delivery, secondary verification, preventing duplicate additions). You can bind wsid in subsequent messaging tools or CRMs to ensure targeting the same user account, even if the other party changes their phone number.

Q: What happens to a submitted task if my balance is insufficient?

A: On KK-DATA, a task can only be created if the balance is sufficient before submission. If the balance is insufficient, you cannot submit a new task. Tasks running will not be interrupted due to balance changes; fees are deducted from the balance after the task is completed. So ensure your balance covers the estimated cost before execution.

Q: What is the maximum number of numbers per task?

A: Currently, a single WhatsApp screening task supports up to about 1 million numbers. If you have more than 1 million numbers, you can submit them in batches, each batch within 1 million. The data deduplication warehouse will automatically avoid duplicate charges.

Q: Can randomly generated numbers be used directly for screening? Are there additional charges?

A: Number generation is completely free and does not consume your balance. Generated numbers will not automatically trigger screening; you need to manually create a screening task to be charged per number. You can generate numbers unlimitedly and submit them for detection at any time.


That wraps up the complete guide for building a WhatsApp screening pipeline. From number preparation to detection, deduplication, and export, each step has clear operation points. If you are looking for a mature, transparent, subscription-free screening system, why not log into the console and run a pipeline yourself: 👉 Log in to the console to start screening; if you encounter issues, you can contact customer service at https://t.me/kkdata_robot; detailed operation instructions can be found in the documentation.

Related Articles

Comprehensive Analysis of thshxt Number Screening System: How Overseas Teams Choose Telegram/WhatsApp Number Filtering Platforms

What is the thshxt number screening system? This article provides a comprehensive comparison of mainstream general-purpose screening tools, covering number filtering, active detection, gender identification, and global number generation for Telegram, WhatsApp, iMessage, and more. Overseas marketing teams can refer to the selection criteria in this article to evaluate the capability differences of platforms like kkdata.cc and make better decisions. FAQ included.

thdata alternatives comprehensive guide: core function comparison and number screening system selection (2025)

Looking for thdata alternatives? This article deeply evaluates the core dimensions of number screening system selection, comparing Telegram/WhatsApp number screening, number generation, data deduplication, and billing models. Includes KK-DATA hands-on experience and migration recommendations to help overseas teams make wise choices.

2026 Best Practices for Number Screening Systems: A Complete Guide from Selection to Pipeline Operation

In 2026, competition for overseas customer acquisition intensifies. How to efficiently filter valid numbers using a number screening system? This article summarizes best practices for number screening systems, covering multi-platform screening for Telegram/WhatsApp/iMessage, global number generation, automatic data deduplication, and SOP pipeline construction, helping you reduce invalid costs and improve ROI. It is an essential customer acquisition guide for overseas teams.