Ensuring Accuracy in Data Records Reconciliation between CGSN and IN for 2G/3G and 4G Networks

Ensuring Accuracy in Data Records Reconciliation between CGSN and IN for 2G/3G and 4G Networks

Introduction

In the telecom industry, data usage records are logged by various network nodes, such as SGSN/GGSN in 2G/3G networks and SGW/PGW in 4G networks. These records, known as data session EDRs (Event Detail Records), capture critical information about data sessions, including the volume of data used, session duration, and charging details. Meanwhile, the Intelligent Network (IN) records the billing details associated with these data sessions. Reconciliation between these two sources is essential to ensure accuracy in billing, revenue assurance, and network management. In this blog post, we will explore the importance of reconciling these data records, the challenges involved, and how Big Data tools like Apache Spark can streamline this process.

Why Data Records Reconciliation is Important

  1. Accuracy in Data Billing: Each data session, whether in a 2G/3G or 4G network, must be accurately billed to the customer. Discrepancies between the volume of data recorded by the CGSN and the charges recorded by the IN can lead to billing errors, causing revenue loss and customer dissatisfaction.
  2. Revenue Assurance: Ensuring that all data usage is correctly captured and billed is crucial for preventing revenue leakage. Reconciliation helps identify missing, duplicated, or incorrect records, allowing operators to correct discrepancies proactively.
  3. Network Performance Monitoring: Reconciliation can also provide insights into network performance by comparing the expected usage (as recorded by SGSN/GGSN or SGW/PGW) with the actual charges. This helps operators in network planning and optimization.

How to Achieve Data Records Reconciliation

  1. Matching Using MSISDN, IMSI, and Timestamp:
  • MSISDN and IMSI are unique subscriber identifiers that link data sessions across network and billing systems.
  • The timestamp is a crucial attribute that captures the start and end times of a session. Matching records based on MSISDN, IMSI, and timestamp helps in accurately linking data usage records from CGSN and IN.
  1. Using a Unique Correlation ID:
  • Some systems generate a unique correlation ID for each data session, linking records between the network and billing nodes seamlessly. This ID makes reconciliation straightforward by directly associating each data session with its corresponding billing record.
  • However, in many instances, this unique ID is not available, complicating the reconciliation process.

Challenges in Data Records Reconciliation

  1. Absence of a Unique Correlation ID:
  • When there is no unique ID linking records between CGSN and IN, operators must rely on MSISDN, IMSI, and timestamp for matching. This approach is prone to errors, especially when dealing with sessions that start and stop frequently or overlap.
  1. Time Synchronization Issues:
  • Even a minor time discrepancy between SGSN/GGSN (or SGW/PGW) and IN can lead to unmatched records. These discrepancies can arise due to differences in system clocks, network delays, or processing times.
  • To address this, operators often use a time window to match records, where sessions are considered correlated if their timestamps fall within a predefined range, such as ±10 seconds.
  1. Handling High Volumes of Intermediate Records:
  • Data sessions often generate multiple intermediate records, especially during long or fragmented sessions. These records need to be consolidated into a single session record before reconciliation.
  • For 4G networks, SGW/PGW may generate separate records for different parts of a session, further complicating the consolidation process.

Leveraging Big Data Tools for Efficient Reconciliation

  1. Using Apache Spark:
  • Apache Spark’s distributed processing capabilities are ideal for handling large volumes of data records from both CGSN and IN. It allows for efficient matching of records based on multiple keys like MSISDN, IMSI, and timestamp.
  • Spark’s in-memory processing reduces latency, enabling near real-time reconciliation, which is critical for maintaining billing accuracy and revenue assurance.
  1. Consolidating Intermediate Records:
  • Spark can aggregate multiple intermediate records into a single session based on MSISDN and IMSI, while applying business rules to filter out duplicates and handle overlaps.
  • For example, all records with the same MSISDN and IMSI within a session can be grouped together, and their data volume and duration summed to create a consolidated record.
  1. Handling Time Differences:
  • Spark’s window functions allow for flexible time-based grouping and aggregation. A time window can be defined to match records with slight timestamp differences, accounting for system clock discrepancies between CGSN and IN.
  • This helps in accurately correlating records, even when exact timestamps do not match.
  1. Scaling with Data Growth:
  • As data usage continues to grow, the volume of EDRs from SGSN/GGSN and SGW/PGW increases exponentially. Spark’s ability to scale horizontally by adding more nodes to the cluster ensures that reconciliation processes can keep pace with the growing data volumes without compromising performance.

Conclusion

Reconciliation of data records between SGSN/GGSN (or SGW/PGW) and IN is crucial for accurate billing, revenue assurance, and network management. Despite the challenges such as the absence of a unique correlation ID, time synchronization issues, and high volumes of intermediate records, big data tools like Apache Spark provide a robust solution. Spark’s distributed processing, in-memory computation, and advanced aggregation capabilities enable efficient and scalable reconciliation, ensuring data integrity and billing accuracy.

In the next blog post, we will provide a step-by-step guide on implementing a Spark-based data records reconciliation pipeline, complete with code examples and best practices. Stay tuned.

Core and Intelligent Network Elements: Why Reconciliation is Essential for Telecom Operators

In modern telecom networks, core network elements and intelligent network (IN) elements play critical roles in managing services like voice, SMS, and data. Understanding how these elements interact is vital for ensuring accurate revenue assurance, which is a key challenge for telecom operators. This post explains the functions of key network components and highlights the importance of reconciliation for managing revenues effectively.

Core Network Elements

  1. Mobile Switching Center (MSC):
    • Role: The MSC is responsible for managing voice calls and SMS in both 2G and 3G networks. It handles the switching and routing of these services, ensuring smooth communication between mobile users and other network types such as the PSTN (Public Switched Telephone Network).
    • Key Functions:
      • Call setup and teardown.
      • Mobility management, including location updates and handovers.
      • Generation of Call Detail Records (CDRs) for billing purposes.
      • SMS handling between mobile subscribers and external messaging systems.
  2. Serving GPRS Support Node (SGSN):
    • Role: The SGSN is part of the packet-switched (PS) domain in 2G and 3G networks. It is responsible for the delivery of data services, including internet access, over GPRS and UMTS. The SGSN manages the mobility of users and keeps track of their data usage.
    • Key Functions:
      • Session management for data services.
      • Authentication and mobility management within the PS domain.
      • Collecting data usage statistics for billing.
      • Forwarding data sessions to the Gateway GPRS Support Node (GGSN).
  3. Gateway GPRS Support Node (GGSN):
    • Role: The GGSN is the gateway between the mobile network’s PS domain (managed by the SGSN) and external packet data networks, such as the internet. It assigns IP addresses to mobile users and routes data between the mobile network and the external network.
    • Key Functions:
      • Management of IP addresses for mobile devices.
      • Billing data generation for data services.
      • Acting as a liaison between mobile data networks and external packet networks.
  4. Serving Gateway (SGW) and Packet Data Network Gateway (PGW):
    • Role: In 4G LTE networks, the SGW and PGW replace the SGSN and GGSN, respectively, to handle data traffic. The SGW routes and forwards packets between the eNodeB (base station) and external networks. The PGW connects the LTE network to external packet data networks, such as the internet, managing IP address assignment and handling services like billing and policy enforcement.
    • Key Functions:
      • SGW: Forwarding data traffic within the LTE network and between the LTE network and external networks.
      • PGW: Managing mobile user IP traffic, including routing, IP address assignment, and applying policies for billing.

Intelligent Network (IN) Elements

  1. Intelligent Network (IN):
    • Role: The IN manages value-added services like prepaid billing, call forwarding, and other real-time services. The IN interacts with the MSC and other core network elements to provide these services while keeping track of real-time user balances and other account information.
    • Key Functions:
      • Real-time prepaid charging.
      • Execution of customized services like VPNs and number portability.
      • Supplementary service control (e.g., balance inquiries, top-ups).

Why Reconciliation is Crucial for Revenue Assurance

Telecom operators must ensure that the records generated by core network elements like the MSC, SGSN, GGSN, SGW, and PGW match the real-time billing information recorded by IN systems. Any discrepancies between these systems can lead to significant revenue leakage. Here’s why reconciliation is essential:

  1. Accurate Billing: Core elements such as the MSC and SGSN/GGSN generate CDRs for voice, SMS, and data usage. These records must be reconciled with billing information from IN systems, especially for real-time services like prepaid charging. If a mismatch occurs, operators risk revenue loss, either by overcharging or undercharging customers.
  2. Revenue Assurance: Discrepancies between the traffic data captured by core network elements and the charging data in the IN can lead to missed revenue opportunities. For instance, if the IN fails to register a data session or voice call correctly, the operator may not capture the associated revenue.
  3. Fraud Prevention: Reconciling data from the core and IN networks also helps detect and prevent fraud, such as bypassing prepaid billing systems or unauthorized use of services. By correlating voice, SMS, and data usage records across systems, operators can identify unusual patterns indicative of fraud, like SIM box fraud or call bypass.
  4. Ensuring Service Quality: Reconciliation not only ensures proper billing but also enables operators to monitor and maintain the quality of services. By comparing the data from different systems, operators can ensure that services are delivered according to the promised quality and detect any service anomalies.

Conclusion

For telecom operators focused on revenue assurance, reconciling data between core network elements (MSC, SGSN, GGSN, SGW, PGW) and intelligent network elements (IN) is crucial. This process ensures accurate billing, prevents revenue loss, and helps detect fraud, all while maintaining high service quality. Implementing robust reconciliation processes using Big Data Analytics can help operators stay ahead of potential revenue leakage and improve operational efficiency.