The Importance of CDR Reconciliation between MSC and IN for Voice and SMS

The Importance of CDR Reconciliation between MSC and IN for Voice and SMS

Call Detail Records (CDRs) serve as a fundamental building block for tracking, billing, and analyzing various communication services like voice calls and SMS. These records are generated by different network elements such as the Mobile Switching Center (MSC) and the Intelligent Network (IN). However, discrepancies between these records can lead to serious issues, including revenue leakage, fraud, and data inconsistencies. This blog post explores why CDR reconciliation between MSC and IN is crucial, how the process can be implemented for both voice (MO) and SMS (MO) services, the challenges encountered, and how Big Data tools like Apache Spark can significantly enhance the efficiency and accuracy of this process.

Why CDR Reconciliation is Important?

CDR reconciliation is not just a technical exercise; it is a crucial component of an effective revenue assurance process. Ensuring that all CDRs from different network nodes match perfectly is essential for several reasons. From maximizing revenue capture to ensuring compliance, a reliable reconciliation process can prevent numerous issues that telecom operators face on a daily basis. Let’s delve into the key reasons why this process is indispensable:

  • Revenue Assurance: Reconciliation ensures that all billable events are captured and billed correctly. Discrepancies between MSC and IN CDRs can lead to revenue leakage, where services are provided but not billed accurately.
  • Financial Integrity: Reconciliation validates the consistency of call records across different network elements, ensuring that the data used for billing, analysis, and reporting is accurate, thereby safeguarding financial integrity.
  • Operational Efficiency: Identifying missing or duplicate records during reconciliation helps in detecting errors in the network, such as incomplete calls or unbilled events, which could lead to potential revenue loss and operational inefficiencies.
  • Regulatory Compliance: Telecom operators must often report accurate traffic data to regulatory authorities. CDR reconciliation helps in ensuring the accuracy of these reports, thereby meeting regulatory requirements and avoiding penalties.

Achieving CDR Reconciliation for Voice (MO) and SMS (MO)

Implementing an effective CDR reconciliation process requires a systematic approach to match records from MSC and IN. While the goal is to ensure that both sets of records align perfectly, the actual process involves several steps and considerations, especially when dealing with multiple fields like msisdn, imsi, caller, called, timestamp, and sometimes a unique correlation ID. Here’s how you can achieve a robust reconciliation process for voice and SMS services:

Reconciliation Process for Voice (MO) between MSC and IN:

  • Identify Key Fields: The main fields to consider for reconciliation are msisdn, imsi, caller, called, timestamp, and a unique correlation ID.
  • Intermediate CDRs Aggregation: MSC often generates intermediate CDRs for the same call, especially for long-duration calls that are split into smaller segments. These intermediate CDRs must be aggregated based on caller, called, and timestamp to create a complete call record before reconciling with IN CDRs.
  • Unique Correlation ID: Ideally, there should be a unique identifier that links the CDRs from MSC and IN for the same call. This ID ensures that the call records can be matched accurately across the two systems.
  • Timestamp Matching: In cases where a unique correlation ID is not available, we can use a combination of fields like caller, called, and timestamp to identify matching records. However, time differences between MSC and IN systems can pose challenges.

Reconciliation Process for SMS (MO) between MSC and IN:

  • Identify Key Fields: Similar to voice, important fields for SMS reconciliation include msisdn, imsi, sender, receiver, timestamp, and a unique correlation ID (if available).
  • Aggregation of Long SMS Messages: For SMS, particularly long messages that are split into multiple segments (e.g., concatenated SMS), it is important to aggregate these segments into a single CDR based on sender, receiver, and timestamp before proceeding with reconciliation. This ensures that the entire message is accounted for correctly, avoiding partial billing or data discrepancies.
  • Matching Criteria: After aggregation, it is crucial to match the sender and receiver numbers along with the timestamp to accurately reconcile the SMS records between MSC and IN.

Challenges in CDR Reconciliation

Despite the critical need for CDR reconciliation, the process is often challenging. From the absence of a unique correlation ID to time discrepancies between different network elements, telecom operators often face significant obstacles in achieving accurate and consistent CDR reconciliation. Understanding these challenges is essential to developing strategies to mitigate them. Here are some of the most common issues faced during the reconciliation process:

  • Lack of Unique Correlation ID: Many times, there is no unique correlation ID between MSC and IN, making it difficult to directly match the records. In such cases, using a combination of msisdn, caller, called, and timestamp becomes necessary.
  • Intermediate CDRs for Voice Calls: For long-duration calls, MSC generates multiple intermediate CDRs. These need to be aggregated to form a single complete call record before any reconciliation can be performed with IN CDRs. Failure to do so can result in mismatches and inaccuracies.
  • Segmented SMS Messages: Long SMS messages that are split into multiple segments must be aggregated before reconciliation. Missing or incomplete segments can lead to discrepancies between MSC and IN records.
  • Time Difference Between MSC and IN: Even a slight time difference between the CDR generation in MSC and IN can cause mismatches. For instance, if the MSC and IN systems are not synchronized to the same time source, the same event may appear at slightly different times, complicating the reconciliation process.
  • Handling Duplicate Records: Sometimes, multiple CDRs are generated for the same event, especially in cases of call setup failures or retries. These duplicates must be filtered out to avoid false discrepancies.
  • High Data Volume: The large volume of CDRs generated daily in a telecom network can make reconciliation a resource-intensive and time-consuming process.

How Big Data Tools Like Apache Spark Can Help

With the rise of Big Data technologies, the telecom industry now has powerful tools to handle large-scale data reconciliation more efficiently. Apache Spark, in particular, stands out for its ability to process massive datasets in parallel, making it an excellent choice for CDR reconciliation. Leveraging Spark can not only speed up the reconciliation process but also provide additional capabilities like anomaly detection and real-time analytics. Here’s how Spark can transform the reconciliation process:

  • Scalability and Speed: Spark can handle large volumes of data efficiently with its in-memory processing capabilities, making it ideal for processing millions of CDRs quickly.
  • Distributed Processing: Spark’s distributed architecture allows it to parallelize the reconciliation process across multiple nodes, significantly speeding up the comparison of records from MSC and IN.
  • Data Transformation: Spark provides powerful APIs for data manipulation, making it easier to preprocess the CDR data (e.g., filtering out duplicates, handling missing values, aligning timestamps, and aggregating intermediate CDRs or segmented SMS messages) before reconciliation.
  • Advanced Analytics: Spark’s integration with MLlib allows for the application of machine learning techniques to detect anomalies or patterns in CDR data that could indicate issues such as revenue leakage or fraud.
  • Automated Matching: Spark can be used to automate the matching process based on multiple criteria, such as caller, called, timestamp, and imsi, and flagging records that do not match for further analysis.

Conclusion

CDR reconciliation between MSC and IN is a vital process for telecom operators, enabling them to maintain accurate billing, detect fraud, and ensure compliance. Although challenges like the absence of a unique correlation ID and time differences exist, these can be mitigated with proper data processing and advanced tools like Apache Spark. By adopting such big data solutions, telecom operators can not only streamline the reconciliation process but also gain valuable insights to drive business decisions.

 

Ensuring Accuracy in Data Records Reconciliation between CGSN and IN for 2G/3G and 4G Networks

Ensuring Accuracy in Data Records Reconciliation between CGSN and IN for 2G/3G and 4G Networks

Introduction

In the telecom industry, data usage records are logged by various network nodes, such as SGSN/GGSN in 2G/3G networks and SGW/PGW in 4G networks. These records, known as data session EDRs (Event Detail Records), capture critical information about data sessions, including the volume of data used, session duration, and charging details. Meanwhile, the Intelligent Network (IN) records the billing details associated with these data sessions. Reconciliation between these two sources is essential to ensure accuracy in billing, revenue assurance, and network management. In this blog post, we will explore the importance of reconciling these data records, the challenges involved, and how Big Data tools like Apache Spark can streamline this process.

Why Data Records Reconciliation is Important

  1. Accuracy in Data Billing: Each data session, whether in a 2G/3G or 4G network, must be accurately billed to the customer. Discrepancies between the volume of data recorded by the CGSN and the charges recorded by the IN can lead to billing errors, causing revenue loss and customer dissatisfaction.
  2. Revenue Assurance: Ensuring that all data usage is correctly captured and billed is crucial for preventing revenue leakage. Reconciliation helps identify missing, duplicated, or incorrect records, allowing operators to correct discrepancies proactively.
  3. Network Performance Monitoring: Reconciliation can also provide insights into network performance by comparing the expected usage (as recorded by SGSN/GGSN or SGW/PGW) with the actual charges. This helps operators in network planning and optimization.

How to Achieve Data Records Reconciliation

  1. Matching Using MSISDN, IMSI, and Timestamp:
  • MSISDN and IMSI are unique subscriber identifiers that link data sessions across network and billing systems.
  • The timestamp is a crucial attribute that captures the start and end times of a session. Matching records based on MSISDN, IMSI, and timestamp helps in accurately linking data usage records from CGSN and IN.
  1. Using a Unique Correlation ID:
  • Some systems generate a unique correlation ID for each data session, linking records between the network and billing nodes seamlessly. This ID makes reconciliation straightforward by directly associating each data session with its corresponding billing record.
  • However, in many instances, this unique ID is not available, complicating the reconciliation process.

Challenges in Data Records Reconciliation

  1. Absence of a Unique Correlation ID:
  • When there is no unique ID linking records between CGSN and IN, operators must rely on MSISDN, IMSI, and timestamp for matching. This approach is prone to errors, especially when dealing with sessions that start and stop frequently or overlap.
  1. Time Synchronization Issues:
  • Even a minor time discrepancy between SGSN/GGSN (or SGW/PGW) and IN can lead to unmatched records. These discrepancies can arise due to differences in system clocks, network delays, or processing times.
  • To address this, operators often use a time window to match records, where sessions are considered correlated if their timestamps fall within a predefined range, such as ±10 seconds.
  1. Handling High Volumes of Intermediate Records:
  • Data sessions often generate multiple intermediate records, especially during long or fragmented sessions. These records need to be consolidated into a single session record before reconciliation.
  • For 4G networks, SGW/PGW may generate separate records for different parts of a session, further complicating the consolidation process.

Leveraging Big Data Tools for Efficient Reconciliation

  1. Using Apache Spark:
  • Apache Spark’s distributed processing capabilities are ideal for handling large volumes of data records from both CGSN and IN. It allows for efficient matching of records based on multiple keys like MSISDN, IMSI, and timestamp.
  • Spark’s in-memory processing reduces latency, enabling near real-time reconciliation, which is critical for maintaining billing accuracy and revenue assurance.
  1. Consolidating Intermediate Records:
  • Spark can aggregate multiple intermediate records into a single session based on MSISDN and IMSI, while applying business rules to filter out duplicates and handle overlaps.
  • For example, all records with the same MSISDN and IMSI within a session can be grouped together, and their data volume and duration summed to create a consolidated record.
  1. Handling Time Differences:
  • Spark’s window functions allow for flexible time-based grouping and aggregation. A time window can be defined to match records with slight timestamp differences, accounting for system clock discrepancies between CGSN and IN.
  • This helps in accurately correlating records, even when exact timestamps do not match.
  1. Scaling with Data Growth:
  • As data usage continues to grow, the volume of EDRs from SGSN/GGSN and SGW/PGW increases exponentially. Spark’s ability to scale horizontally by adding more nodes to the cluster ensures that reconciliation processes can keep pace with the growing data volumes without compromising performance.

Conclusion

Reconciliation of data records between SGSN/GGSN (or SGW/PGW) and IN is crucial for accurate billing, revenue assurance, and network management. Despite the challenges such as the absence of a unique correlation ID, time synchronization issues, and high volumes of intermediate records, big data tools like Apache Spark provide a robust solution. Spark’s distributed processing, in-memory computation, and advanced aggregation capabilities enable efficient and scalable reconciliation, ensuring data integrity and billing accuracy.

In the next blog post, we will provide a step-by-step guide on implementing a Spark-based data records reconciliation pipeline, complete with code examples and best practices. Stay tuned.

Core and Intelligent Network Elements: Why Reconciliation is Essential for Telecom Operators

In modern telecom networks, core network elements and intelligent network (IN) elements play critical roles in managing services like voice, SMS, and data. Understanding how these elements interact is vital for ensuring accurate revenue assurance, which is a key challenge for telecom operators. This post explains the functions of key network components and highlights the importance of reconciliation for managing revenues effectively.

Core Network Elements

  1. Mobile Switching Center (MSC):
    • Role: The MSC is responsible for managing voice calls and SMS in both 2G and 3G networks. It handles the switching and routing of these services, ensuring smooth communication between mobile users and other network types such as the PSTN (Public Switched Telephone Network).
    • Key Functions:
      • Call setup and teardown.
      • Mobility management, including location updates and handovers.
      • Generation of Call Detail Records (CDRs) for billing purposes.
      • SMS handling between mobile subscribers and external messaging systems.
  2. Serving GPRS Support Node (SGSN):
    • Role: The SGSN is part of the packet-switched (PS) domain in 2G and 3G networks. It is responsible for the delivery of data services, including internet access, over GPRS and UMTS. The SGSN manages the mobility of users and keeps track of their data usage.
    • Key Functions:
      • Session management for data services.
      • Authentication and mobility management within the PS domain.
      • Collecting data usage statistics for billing.
      • Forwarding data sessions to the Gateway GPRS Support Node (GGSN).
  3. Gateway GPRS Support Node (GGSN):
    • Role: The GGSN is the gateway between the mobile network’s PS domain (managed by the SGSN) and external packet data networks, such as the internet. It assigns IP addresses to mobile users and routes data between the mobile network and the external network.
    • Key Functions:
      • Management of IP addresses for mobile devices.
      • Billing data generation for data services.
      • Acting as a liaison between mobile data networks and external packet networks.
  4. Serving Gateway (SGW) and Packet Data Network Gateway (PGW):
    • Role: In 4G LTE networks, the SGW and PGW replace the SGSN and GGSN, respectively, to handle data traffic. The SGW routes and forwards packets between the eNodeB (base station) and external networks. The PGW connects the LTE network to external packet data networks, such as the internet, managing IP address assignment and handling services like billing and policy enforcement.
    • Key Functions:
      • SGW: Forwarding data traffic within the LTE network and between the LTE network and external networks.
      • PGW: Managing mobile user IP traffic, including routing, IP address assignment, and applying policies for billing.

Intelligent Network (IN) Elements

  1. Intelligent Network (IN):
    • Role: The IN manages value-added services like prepaid billing, call forwarding, and other real-time services. The IN interacts with the MSC and other core network elements to provide these services while keeping track of real-time user balances and other account information.
    • Key Functions:
      • Real-time prepaid charging.
      • Execution of customized services like VPNs and number portability.
      • Supplementary service control (e.g., balance inquiries, top-ups).

Why Reconciliation is Crucial for Revenue Assurance

Telecom operators must ensure that the records generated by core network elements like the MSC, SGSN, GGSN, SGW, and PGW match the real-time billing information recorded by IN systems. Any discrepancies between these systems can lead to significant revenue leakage. Here’s why reconciliation is essential:

  1. Accurate Billing: Core elements such as the MSC and SGSN/GGSN generate CDRs for voice, SMS, and data usage. These records must be reconciled with billing information from IN systems, especially for real-time services like prepaid charging. If a mismatch occurs, operators risk revenue loss, either by overcharging or undercharging customers.
  2. Revenue Assurance: Discrepancies between the traffic data captured by core network elements and the charging data in the IN can lead to missed revenue opportunities. For instance, if the IN fails to register a data session or voice call correctly, the operator may not capture the associated revenue.
  3. Fraud Prevention: Reconciling data from the core and IN networks also helps detect and prevent fraud, such as bypassing prepaid billing systems or unauthorized use of services. By correlating voice, SMS, and data usage records across systems, operators can identify unusual patterns indicative of fraud, like SIM box fraud or call bypass.
  4. Ensuring Service Quality: Reconciliation not only ensures proper billing but also enables operators to monitor and maintain the quality of services. By comparing the data from different systems, operators can ensure that services are delivered according to the promised quality and detect any service anomalies.

Conclusion

For telecom operators focused on revenue assurance, reconciling data between core network elements (MSC, SGSN, GGSN, SGW, PGW) and intelligent network elements (IN) is crucial. This process ensures accurate billing, prevents revenue loss, and helps detect fraud, all while maintaining high service quality. Implementing robust reconciliation processes using Big Data Analytics can help operators stay ahead of potential revenue leakage and improve operational efficiency.