Open access peer-reviewed chapter - ONLINE FIRST

Data Compression Strategies for Use in Advanced Metering Infrastructure Networks

By Chih-Wei Hsu and Sun-Yuan Hsieh

Submitted: June 16th 2020Reviewed: November 10th 2020Published: December 19th 2020

DOI: 10.5772/intechopen.94974

Downloaded: 95


Internet of Things technology has advanced rapidly. For example, numerous sensors can be deployed in a city to collect a variety of data, and such data can be used to monitor the city’s situation. A possible application of such data is smart metering implemented by power suppliers for their consumers; smart metering involves installing a multiplicity of smart meters that, in conjunction with data centers, form a smart grid. Because a smart gird must collect and send data automatically, the establishment of advanced metering infrastructure (AMI) constitutes the primary step to establishing a smart grid. However, problems remain in smart metering: data traffic from smart meters flows rapidly at a huge volume, resulting in bandwidth bottlenecks. Thus, this chapter proposes some data compression technologies as well as a novel scheme for reducing the communication data load in AMI architectures.


  • smart grid
  • smart meter
  • concentrator
  • compression
  • advanced metering infrastructure
  • data traffic

1. Introduction

The Internet of Things (IoT) connects a multiplicity of devices to make life convenient. Smart grids constitute one implementation of IoT, and many countries have launched smart grids to develop integrated energy supply systems. A smart grid incorporates automation, bidirectional communication, and advanced sensor measurement systems to streamline interactions between the client and power supplier [1, 2, 3, 4]. In contrast to traditional grids, smart grids enable power suppliers to distribute power efficiently and to control the use of power during a given period [5]. Furthermore, the data collected from a smart grid enable automatic billing.

The analysis of energy consumption data has many advantages for both consumers and suppliers: Consumers can track their energy usage, particularly as it varies with the seasons. Power suppliers can monitor how power is utilized across the distribution grid, which can help them formulate power management and energy-saving measures [6, 7, 8]. The establishment of advanced metering infrastructure (AMI) constitutes the first step to constructing such an intelligent power gird.

AMI is central to a smart grid system and enables the system to automatically monitor usage. Figure 1 illustrates the architecture of a typical AMI, indicating that it comprises several concentrators and smart meters that are connected to a meter data management system (MDMS) [9]. The smart meters send data to the concentrators, and the concentrators send the data to the MDMS. The AMI architecture requires high-quality and high-speed networks for providing stable service and efficient monitoring, and it comprises two such networks, namely a local area network (LAN) and wide area network (WAN). The LAN connects the concentrators and smart meters and leverages cutting-edge communication technologies such as power-line communication and the Zigbee specification. The WAN connects the concentrators and MDMS and constitutes the most important part of the AMI system; this network supports a host of high-speed technologies such as broadband-over-power-line technology, 3G, and long-term evolution [10].

Figure 1.

AMI structure.

Nonetheless, smart meters have a very large data frequency—with a small packet being generated every 15–60 min—that will exceed the capacity of existing communication technologies [11, 12]; this makes data difficult to transmit. Furthermore, smart grids face challenges in storing these large volumes of data. To solve these two problems, data size in both communication and storage should be reduced. This chapter introduces some novel approaches to doing so.

2. Preliminaries

Smart grids have expanded considerably, and many organizations have noticed the advantages engendered by smart meters. Thus, research on smart grids has similarly expanded. Specifically, smart grids enable the automatic distribution of power on the part of suppliers and provide usage data, which can be used to develop various applications.

Smart meters constantly upload data to the MDMS. Such data pertain to, for example, interval data readings, meter remote disconnections, meter remote reconnections, and meter firmware patches. The volume of such data is considerably large, potentially causing network collapse. The reduction of packet size in a smart grid addresses this problem, and such a reduction has become a major topic of research. Thus, this section introduces some basic strategies that are currently adopted to solve this problem.

2.1 Basic method for data reduction in an AMI system

Among existing methods, the conventional approach is combining the messages sent by multiple meters to reduce either protocol overhead or the frequency of transmission. In this approach of message concatenation, energy consumption data, control signal messages, and firmware update packages are combined. This scheme entails a lower transmission volume because the packet header is reduced, where multiple messages can be sent in a header. However, one issue in this scheme is where messages ought to be concatenated.

2.1.1 Meter side

Message concatenation at the meter side results in a greater reduction in data size than does that at the concentrator side. Data size reduction is greater at the meter side primarily because a meter generates data within a certain period; thus, a longer interval between instances of meter-to-concentrator data transmission yields disproportionately large savings in data size. However, most meters contain low-performance hardware, hampering this method. Consequently, meters are either overloaded or unable to compress the data.

2.1.2 Concentrator side

Messages can also be concatenated at the concentrator side prior to transmission. This is the appropriate site for message concatenation [13, 14] because concentrators have more powerful hardware and process data more efficiently than meters do. In concentrator-side concatenation, meters send packages to the concentrator, the concentrator subsequently aggregates the data into one package, and the compressed message is finally sent to the MDMS. Concentrator-side concatenation not only reduces the volume of messages but also stabilizes the system.

However, despite the advantages of concentrator-side concatenation, existing devices on the market are incapable of supporting such concatenation. Currently, concentrators on the market are capable of executing only simple integrations, and these products all follow the PRIME standard of WAN communication [15].

3. Novel approaches

A message generated by a smart meter contains a 40–60-B header within the packet. Concentrators can collect messages sent from the meter side according to a Poisson process [16] before combining these messages to minimize possible traffic. The aforementioned procedure comprises two steps, namely combination and compaction, which are detailed as follows [17].

3.1 Combination

Combination entails combining messages into a larger packet to solve the problem of protocol overhead during communication. Figure 2 illustrates how such combination reduces traffic. Combining meter messages on smart grids is efficient because each piece of data generated by a meter have a size of only a few bytes. Concentrators combine messages that have been sent by meters in a specified period, after which they compact the messages.

Figure 2.

Concept underlying the combination procedure.

Figure 3 illustrates the operating procedures of a typical combination algorithm. Specifically, the concentrators receive meter messages, place the messages into a combination queue, combine the messages in the queue once some predetermined quantity of messages have been accumulated, and finally compact the messages into a package [17]. This method enables solving the protocol overhead problem. The data compaction scheme is detailed as follows.

Figure 3.

Combination procedure.

3.2 Compaction

Compaction reduces the size of headers in large-scale AMI architecture, which is very helpful for reducing data volume. Messages can be compacted through either full compaction (FC) or loose compaction (LC) [17].

3.2.1 Full compaction

Figure 4 illustrates the operating procedures of an FC algorithm. Specifically, any piece of data received from the meter is stored in Qc, and the usage in the current period is calculated by determining the difference between the message in the current period and the message in the last period; this process is performed until Qcis empty. Subsequently, before the usage data are stored in Qt, the calculated value should be normalized; this can specifically be achieved by multiplying the value by 103to convert any floating-point number into an integer. Once all messages are in Qt, the concentrators can start calculating the sum of all values in Qt. Once the FC procedure is complete, the concentrators obtain the total usage from the meter data, which helps power suppliers monitor the usage of an area to optimize power delivery.

Figure 4.

FC procedure.

To illustrate the FC procedure, Table 1 lists some energy usage values in an area. First, all values are input into Qc. Second, the difference between the previous usage and current usage is calculated for each period. Third, the values obtained from these calculations (212.880, 323.769, 388.112, 46.115, and 610.762 in this example) are input into Qt. Finally, these values in Qtare summed to indicate the net difference in energy usage (1581.638 in this example).

Meter NumberPrevious UsageCurrent UsageDifference

Table 1.

Example meter data from an area.

An Example of meter data.

3.2.2 Loose compaction

LC differs from FC in that LC is recoverable, whereas FC is not. Figure 5 illustrates operating procedures of an LC algorithm. First, the concentrators obtain meter data by dequeuing Qcand subtracting the meter value for the current period from the meter value for the previous period. Second, similar to second step of the FC algorithm, the value obtained in the first step is multiplied by 103for normalization and subsequently input into Qt. Third, LC sorts values in Qtin descending order. Fourth, the positive difference between a value and its lower-valued neighbor is computed and then input into Ql; the initial, lowest value in Qtthus remains unchanged. For example, if abcdis input into Qt, then abacbdcis input into Ql. Finally, the values returned by the LC algorithm constitute the compacted data.

Figure 5.

LC flow.

To illustrate the LC procedure, consider the values in Table 1. First, the LC algorithm takes the difference between past and present usages and inputs these differences into Qc. In this example, these differences can be listed as follows: [212.880, 323.769, 388.112, 46.115, 610.762]. This list is then sorted in descending order as [46.112, 212.880, 323.769, 388.112, 610.762] and input into Qt. The LC algorithm then dequeues each piece of data from Qtand calculates the positive difference between a value and its lower-valued neighbor. The list of such differences is as follows: [46.112, 166.768, 110.889, 64.343, 222.650]. This list is input into Qland returned.

After compacting the data, the concentrators send the data to the MDMS. When the MDMS receives the data, it executes the LC recovery (LCR) algorithm to extract the raw data. Figure 6 illustrates the LCR algorithm, which is the reverse of the LC algorithm. First, the LCR algorithm inputs the received data into Ql; the algorithm then recovers those values prior to subtraction from a lower-valued neighbor. These recovered values are then input into Qt. Second, the LCR algorithm denormalizes each value in Qtand adds to it the value of the usage in the previous period. Finally, the MDMS outputs all values into Qcand returns them.

Figure 6.

LC recovery procedure.

Consider the values for the example presented in Table 1. First, the LCR algorithm recovers the compacted data; specifically, the MDMS inputs [46.112, 166.768, 110.889, 64.343, 222.650] into Ql. Subsequently, the LCR algorithm recovers the values prior to subtraction from a lower-valued neighbor; specifically, the MDMS obtains [46.112, 212.880, 323.769, 388.112, 610.762] and inputs this list into Qt. Finally, the LCR algorithm denormalizes the data by multiplying each value by 103; specifically, the MDMS obtains the list of raw data [46.112, 212.880, 323.769, 388.112, 610.762].

3.2.3 Result analysis

First of all, all of messages sent from meters were encoded in ASCII. FC and LC compaction performance are provided in this section. The Figure 7 shows the compression ratio of the FC and LC method. The vertical scale label present the total bytes used before compaction and the horizontal scale label is the number of meter messages compacted when compaction. As the Figure 7 shows, FC has better performance than LC. However, FC does not support recovery, so it can achieve the highly compaction ratio. On the contrary, LC is the trade-off algorithm when recovery function is required and have demand of compression. If the system only need the sum of the energy usage, then FC would be the proper way for compressing the data.

Figure 7.

FC and LC analysis.

4. Compression technologies

The choice of compression algorithm has become a critical issue because most IoT devices have hardware limitations, particularly in their low-power-consumption microprocessors. The best compression algorithm provides the best compression ratio given the hardware specifications of an IoT device [18, 19]. This section discusses two well-known approaches to data compression [20].

4.1 Lempel–Ziv–Markov chain algorithm

The Lempel–Ziv–Markov chain algorithm (LZMA) is a member of the LZ-family, and it is based on the LZ77 algorithm, which uses a dictionary-based scheme. The LZMA yields excellent compression ratios without being demanding on hardware, making it suited to IoT environments by virtue of its exclusive dictionary structure [19].

4.2 Prediction by partial matching

Prediction by partial matching (PPM) can predict a subsequent pattern using present or previous symbols. Moreover, PPM can be used with a Markov model to construct a compression algorithm. For example, the RAR algorithm, developed by Eugene Roshal and Alexander Roshal, uses PPM and the Lempel–Ziv–Storer–Szymanski algorithm to achieve impressive compression [20].

5. Conclusions

Smart grids will be key infrastructure considering the rapid developments in IoT technology. This chapter presents data compression techniques, such as combination and compaction, developed for reducing the communication data load in a smart grid. These techniques not only reduce the frequency between instances of transmission but also considerably reduce data volume. Moreover, FC and LC can be used in different situation. FC provides extremely compact ratio for the user who have less bandwidth for transport the meter data. LC can be used when the system requires the raw data of the energy usage and having sufficient network bandwidth. In addition, some well known data compression techniques are also introduce in the chapter. Proposed algorithm can be implement with the compression technique provided above to decrease the volume of the meter data. Also, this chapter explains recent advances in smart grid technology. Readers can build on the aforementioned algorithms to formulate novel contributions of their own.

Download for free

chapter PDF

© 2020 The Author(s). Licensee IntechOpen. This chapter is distributed under the terms of the Creative Commons Attribution 3.0 License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite and reference

Link to this chapter Copy to clipboard

Cite this chapter Copy to clipboard

Chih-Wei Hsu and Sun-Yuan Hsieh (December 19th 2020). Data Compression Strategies for Use in Advanced Metering Infrastructure Networks [Online First], IntechOpen, DOI: 10.5772/intechopen.94974. Available from:

chapter statistics

95total chapter downloads

More statistics for editors and authors

Login to your personal dashboard for more detailed statistics on your publications.

Access personal reporting

We are IntechOpen, the world's leading publisher of Open Access books. Built by scientists, for scientists. Our readership spans scientists, professors, researchers, librarians, and students, as well as business professionals. We share our knowledge and peer-reveiwed research papers with libraries, scientific and engineering societies, and also work with corporate R&D departments and government entities.

More About Us