Abstract
Introduction
With technological advances of on-board scientific instruments, there has been a boom in the volume of data collected by satellites 1 . In particular, hyperspectral imagers or synthetic aperture radars on earth observation satellites generate data at tens of Gbps 2 . Traditionally, a source satellite has had to carry imagery or sensor data until the availability of a direct ink to a ground station, which takes tens of minutes. In order to make this volume of data available for use on the ground in near real time, a satellite network is required to forward the data. Typically, the network comprises a number of Low Earth Orbit (LEO) satellites, which include the source nodes, and Geostationary Earth Orbit (GEO) satellites (see Figure 1). With the existence of Inter-Satellite Links (ISLs), it is possible to find at least one route for any source to a ground sink.

A network for data collection consists of a Geostationary Earth Orbit (GEO) satellite and Low Earth Orbit (LEO) satellites connected via ISLs. The GEO satellite can relay data from LEO satellites to the ground station.
This satellite network is expected to serve a variety of data collection missions (e.g. disaster surveillance, weather forecast) that demand high throughput and reasonable end-to-end delays. However, while this network increases connectivity, it does not necessarily yield desirable delivery performance. There are several factors that affect the performance. First, multiple sources are generating traffic simultaneously, which imposes a great demand on the network. Second, transmission paths have to be built under high link variability with limited nodal power. Consequently, a path’s throughput is easily deteriorated by emerging bottlenecks during transmission, since link capacities vary as nodes move. Third, due to the incompatible input and output rates, some nodes’ buffers occasionally suffer overload, which might induce packet loss. In particular, the traffic pattern of many-to-one or many-to-several data collection makes nodes near to the sink liable to congestion. In light of these factors, one can design a routing algorithm that dynamically adjusts paths based on link capacities and enhances it with a load-balanced scheme3,4. But according to our empirical results 5 , this type of solution breaks the internal connection between path selection and node resources (e.g power, buffer), which deflates the network throughput region. Therefore, optimization of routing together with resource allocation is critical for improving delivery performance.
In this paper, we are interested in the design of routing algorithms on satellite networks that achieve long-term throughput optimality as well as desirable end-to-end delays. We formulate the throughput objective in terms of the mean flow rates provided to the data collection. For end-to-end delays, we introduce a cost function that describes the total amount of resources used by all flows in the network. Specifically, we add up weighed traffic loads on all links, of which the weight measures the distance of the next hop to the destination. In this way, end-to-end delays can be tightly regulated. Our goal is to optimize the objectives under resources constraints. Basically, we consider three types of resources: links, power, and queues. During transmission, the existence of a link is constrained by the network connectivity, and its capacity is decided by the transmitting power, which obeys a nodal power limit. Meanwhile, all queues’ lengths should be bounded for network stability. Then, we solve this optimization problem by exploring three arguments: exogenous arrival load, link transmitting load, and link transmitting power, and derive our algorithms that connect path selection and resource allocation. Compared with the well-known backpressure-type technique6,7, our algorithms manage to achieve a balance that preserves throughput optimality with reasonable end-to-end delay characteristics. Our contributions are summed as follows.
We develop a utility maximization framework that seamlessly embeds a distance factor, which supports general metrics (e.g. hop, geographic distance, link weight). This framework decouples routing decision from link variability by allocating power dynamically for activated links.
We propose a geographic-location-aware backpressure algorithm derived from the framework for satellite networks. This solution not only maintains the long-term optimality of backpressure policy, but also brings down loops and unnecessary exploring of routes to a great extent.
We further define and exploit transmission opportunities missed by the backpressure-type algorithm. In order to seize these opportunities, we adaptively modify backpressure (backlog differential) according to the trend of a neighbor’s backlog change. This enables us to pump packets faster to the sink direction. Also, a related loop reduction scheme is introduced considering local minima.
Finally, we develop a simulation platform and demonstrate that traditional backpressure algorithms fail to meet the requirement of data collection on satellite networks. Our algorithms are able to deliver packets in time, and scale well in all scenarios tested. We also show that there is a tradeoff between power consumption and end-to-end delays on the selection of the GEO relaying.
The rest of the paper is organized as follows. In Section ‘Related work’, we briefly review related works. Section ‘System model’ introduces our system model for the hybrid satellite network. We propose our framework and derive the geographic (geographic-location-aware) backpressure algorithm in Section ‘Throughput-optimal and delay-aware cross-layer algorithm’. Then in Section ‘Opportunistic transmission beyond backpressure’, we upgrade our solution by reducing transmission opportunities missed. The evaluations in different scenarios are conducted on our simulation platform in Section ‘Evaluation’. The conclusion is given in the final section.
Related work
Maximize throughput for satellite networks
There has been a long-standing interest8–10 in maximizing data transfer for satellite networks. Most work focuses on the design of constellation parameters11,12 and advocates the utilization of ISLs13,14. For the remainder, there are efforts on developing routing and scheduling algorithms8,9,15 for maximizing throughput. These proposals are either heuristic attempts or evaluated only in simplified simulation scenarios. None of them backtrack to a concrete theoretical framework. Also, we notice that a group of work3,4,16 enhances routing algorithms for satellite networks with load balancing schemes. Their throughput performance relies on passive route adjustment to congestion or traffic prediction. Moreover, the idealized traffic distribution and topology assumptions limit the algorithm scalability. Besides, there are routing algorithms17,18 designed for the above hybrid satellite network. Regardless of throughput performance, few consider to decouple link variability from routing decisions for better responsiveness to traffic load. Therefore, these drawbacks motivate us to seek an alternative approach grounded in network throughput optimization theory.
Backpressure algorithm
The backpressure algorithm, also known as ‘MaxWeight’, has brought theoretical prosperity6,7 for backpressure-based stochastic optimization. The improvement of its framework is still continuing 19 . Basically, it uses the maximum backlog differentials as link weights. With a maximum-weight matching, it can schedule and route any input traffic within the network capacity region stably 20 . It provably maximizes throughput, which can be derived by either Lyapunov drift 6 or Lagrange duality 21 .
However, poor performance in delivery rate and end-to-end delay is the Achilles’ heel of the backpressure-type algorithms, due to loops or extensive exploring of routes. To restrict detours, researchers6,21,22 incorporate the shortest path concept into the traditional backpressure algorithm. Enhanced backpressure routing is proposed by Neely 6 via a shortest path bias, which is used in a heuristic manner. In Ying, Shakkottai, Reddy et al. 22 , the routing problem is formulated by minimizing the average number of hops between sources and destinations. Accordingly, the queue structure on each node is expanded for hop information. Though its delay improvement is significant, in a dynamic topology it has to frequently reconstruct queues based on the updates of the employed shortest path algorithm. Similarly, in Bui, Srikant and Stolyar 21 , a shorter path is preferred by minimizing total link rate. Its routing decision is affected by backlog differential minus a threshold. But the selection of the threshold is non-trivial especially in a dynamic network. To address the loop problem, in Xiong, Li, Eryilmaz et al. 23 , the authors define the running average net rate of commodity traffic traversing a link, and restrict network topology by eliminating links with zero net rate. In this way, route construction is loop-free. But unnecessary detours still exist.
Essentially, the backpressure algorithm requires gradients to pump flows. When input traffic is low or packets spread out in the network, flows easily get stuck due to low gradients. Thus, several algorithms are proposed to maintain the gradients for flows, in order to reduce delays. In Moeller, Sridharan, Krishnamachari et al. 20 , Backpressure Collection Protocol (BCP) uses finite queues and drops packets when a queue is full. Then the discards are placed into the corresponding underlying virtual queue so as to keep the gradient. Cooperating with the Last-In-First-Out (LIFO) queue policy, the later packets are sent over the existing gradient. The utility-delay tradeoff analysis of LIFO backpressure is given in Huang, Moeller, Neely et al. 24 . In Athanasopoulou, Bui, Ji et al. 25 , shadow queues are proposed to activate links in decoupled routing and scheduling. To maintain gradients, the arrival rate of a shadow queue is enlarged by a certain coefficient of the actual rate. However, from our experimental results, we find that those transforms of gradients are not flexible enough in a dynamic network.
Besides, under the Network Utility Maximum (NUM) framework, authors try to directly incorporate end-to-end delays into either the constraints or the objective26,27, which are approximately calculated based on queue theory. However, it cannot effectively capture delay characteristics when routes overlap. In Li and Eryilmaz 28 , the deadline constraint per packet is considered, which again changes the queuing discipline. We notice that there is recent work29–31 closer to ours. In Hu, Yuanan, Dongming et al. 29 , for satellite networks, utility optimal scheduling is proposed. In Jiao, Tian, Zhang et al. 30 , to improve delay and energy efficiency, GRAdient-assisted energy-efficient backPrEssure (GRAPE) uses the hop distance to the sink as the gradient, and combines it with nodes’ residual energy. Also, the combination of geographic and backpressure routing is explicitly proposed in Núñez-Martínez, Baranda and Mangues-Bafalluy 31 . However, the differences are that we focus on the dynamics of satellite networks and provide a general theoretical framework based on NUM. The location-aware-backpressure routing is a natural outcome from the framework. Moreover, none of above work exploits the transmission chances missed by the backpressure-type algorithm.
System model
Consider a multihop satellite network comprising one GEO, multiple LEO nodes, and one ground station (sink). A GEO node can connect to the sink anytime and concurrently support
Assume that time is slotted and a satellite’s position is fixed within a slot. Define the network topology at slot
Connectivity constraint and link length
Let
where
where
where
For ISLs between LEO nodes, assume that a LEO node only contacts its immediate neighbors if any: intra-orbit and inter-orbit neighbors. The lengths of the two types of ISLs are
where
Power constraint and link capacity
The total power of node
Let
where
where
where
Traffic model and flow constraint
Source traffic
At the beginning of slot
The vector of exogenous arrival numbers is denoted as
Traffic over a link
At slot
The vector of
Then, the flow conservation constraint is given as
Objective
Given above constraints, we aim to develop a cross-layer solution that controls
where
For the other two metrics, we assign undelivered packets (e.g. trapped in loop) with high end-to-end delay penalties, and focus on the minimization of end-to-end delays. Note that it is not feasible to formulate the end-to-end delay as a function of the arrival and service processes for a large-scale multi-hop network
23
. Thus, we utilize link rate
where
Throughput-optimal and delay-aware cross-layer algorithm
In this section, we first propose our throughput-optimal and delay-aware framework, combing utility maximization and cost minimization. Then, we decompose our original problem into rate control, routing, and power allocation. Accordingly, we develop a cross-layer solution for satellite networks, which maintains the throughput optimality and has desirable delay performance.
Throughput-optimal and delay-aware framework
Cost function
In order to regulate end-to-end delays in a tight fashion, we define
where
The intuition behind the cost function is that, while the link rate is a resource cost of the transmission, the distance to the sink is deemed as a delay cost after the transmission. Because, if defined properly, the distance usually indicates how long packets will linger in the network. For example, in a static network, the minimum hop count estimates the duration monotonously. But, in a dynamic network, the hop count easily becomes invalid as the topology changes. Thus, we will discuss the distance definition for satellite networks in the algorithm.
4.1.2 Problem formulation
Under the constraints given in the system model, we control the triple
Let
Cross-layer decomposition
To solve problem (11), we assign a Lagrange multiplier
We can take
Because
where
and a routing problem
At each iteration of equation (14), by solving equations (15) and (16), we can obtain the joint corresponding rate control and routing policy.
Given the link capacity in equation (3),
where
Cross-layer control policy
The cross-layer control policy is the solution to the problems decomposed above, and provides a guideline of designing the corresponding algorithm. Assume that each node maintains a queue for each destination
From equations (14) and (18), we know that
The policy needs to guarantee the network stability, which means the queue length for any destination inside a node must be finite
4.3.1 Rate control
For the problem in equation (15), based on the queue length for destination
The rate for a destination is decided by the queue length of the destination at source nodes, which controls input load within the network capacity.
Routing
For the problem in equation (16), its solution resembles the traditional backpressure algorithm and
Then, for multiple destinations and contactable neighbors, we repeat the above selection from
and the transmitting rate of link
Power allocation
We first transform the problem in equation (17) into
and assign Lagrange multipliers
Because the problem in equation (23) satisfies Slater’s condition, Karush–Kuhn–Tucker conditions give
From the above gradient equation, we can derive
By eliminating the slack variable
Then, we solve
From equation (24), we can see that the power of each link is assigned according to link weight and channel state.
Geographic backpressure algorithm
We present our algorithm based on above cross-layer control policy.
Basic procedure
Briefly, each node controls its exogenous input at the transport layer according to equation (19), in which we choose
Geographic backpressure
Differently from the traditional backpressure algorithm, we introduce a distance factor as a cost in equation (9) into the backlog differential calculation. In Athanasopoulou, Bui, Ji et al.
25
, an
Nevertheless,
In fact, to select the next hop, a node only compares values within its current neighbor set. Therefore, instead of absolute distances, we use the relative rank of neighbors’ distances
where
Opportunistic transmission beyond backpressure
In this section, based on the geographic backpressure algorithm, we further exploit transmission opportunities missed by backpressure-type algorithms. First, we employ three examples to motivate the utilization of missed transmission chances. Then, we design a scheme to recognize these opportunities and accordingly enhance our proposed algorithm. In addition, a related loop reduction scheme is introduced considering local minima.
Motivation
First, consider the last hop in the network. Traditional backpressure algorithms, even at the direct neighbor of a sink, do not guarantee forwarding packets straight to the destination. In our scenario, the sink node is a ground station. Recall that the last hop in our scenario could be a downlink with limited duration. Hence, it is expensive to have packets lingering around a sink. In order to push packets faster, neighbor
However, taking a closer check on other hops, we find that the distance bias is not always effective to push packets. We demonstrate this problem in Figure 2. At the beginning slot

An example of the geographic backpressure algorithm (
Before answering the question, we give another example to show the necessity of seizing the opportunity. As shown in Figure 3, node 2 and node 3 are sources that are relayed by node 1 (e.g. GEO node) to the sink with a high-capacity link, which is common in our scenario. Because the aggregated rate of link 2-1 and link 3-1 does not exceed that of link 1-S. Ideally, sources can transmit at their full rates. But from the resulting table on the right, it not only confirms the existence of missed chances, but also reveals unfairness to the low-rate source (i.e. node 3). That means a low-rate source is more likely to miss opportunities, and its delay would grow significantly as the number of flows (partly) sharing its path increases. However, if node 3 takes missed chances, its backlog can stabilize sooner than node 2, say from slot 4. Therefore, it is important to devise a scheme to capture the underutilized capacity beyond positive backpressure.

An example of transmission opportunities missed by backpressure-type algorithms. Node 2 and node 3 are injected at 2 packets/slot and 1 packets/slot, respectively. The capacities of link 2-1, link 3-1 and link 1-S are 2 packets/slot, 1 packets/slot, and 3 packets/slot, respectively. The table on the right gives the backlog changes of three nodes in the first eight slots.
Opportunity-aware scheme
To answer the question, we first define a slot set with missed transmission opportunities. For node
Basically, it describes that at slot
In above examples, we can see that they all have a fluent downstream in common. The fluency results from either nearby sink nodes or high-capacity links. As shown in the first example, a negative backlog can be seen as a credit to a sink node, which eliminates lingering packages. Similarly, we should give credits to those nodes that can forward packets quickly. Let
Consequently, even if the backlog difference is zero, a negative
where

First eight slots by using assigned credit for link activation. The total throughput is increased by 25% from 16 packets to 20 packets.
Loop reduction scheme
Distance bias can effectively avoid unnecessary detours, thus potentially cutting down loops. However, in a dynamic network, there could be special cases such as isolation and local minima. Also, concerning multiple random sources, it is possible that a neighbor node could send back packets just transferred by the current node, due to their backpressure variation. Therefore, loops still exist and hinder further improvement on delays. Note that loops are also waste of resource in our scenario since links between satellites are limited and expensive (e.g. pointing and adjusting).
However, we are not going to design a sophisticated method to eliminate loops. Rather, we prefer to use a simple mechanism to lower the impact of loops to an acceptable extent. Through experiments we find most loops are between two neighbors. To reduce such one-hop loops, we record the node ID of the last hop, say node
These two schemes only need incremental change based on Algorithm 1. We refer to enhanced Algorithm 1 as ‘Geographic Backpressure Plus’.
Evaluation
To validate the feasibility of the proposed algorithms for data collection on satellite networks, we conduct three groups of simulations on the ns-2 network simulator. We mainly examine the delivery rate, end-to-end delay, and energy consumption of the algorithms. We use a hybrid constellation mentioned and corresponding parameters are listed in Table 1. The LEO constellation uses polar orbits, in which there are no ISLs across the seam. A LEO node can communicate with the GEO node with an elevation larger than 15°. The GEO satellite can relay at most 20 LEO nodes simultaneously through ISLs. We adjust the locations of the GEO node and the ground sink node to ensure the existence of a direct link between them. Nodes periodically send probes to detect and maintain neighbors. All links are orthogonalized. Link capacity
Hybrid constellation parameters
We select 10 LEO satellites as source nodes. Half source nodes need to send packets across the seam. Similarly to Chen, Liu and Hu
32
, source traffic follows an
Without ISLs to the GEO node
We first consider a scenario without GEO relaying, which increases the difficulty of routing. We compare our algorithms with other algorithms: traditional backpressure (Traditional BP), float-queue-based backpressure (i.e. BCP), and pure geographic (Geographic). Specifically, the threshold of buffer size for BCP is 10 KB. We use the terms
Delivery rate is an important test to apply a backpressure-type algorithm on satellite networks. A low delivery rate could lead to a failure of recovering a sensor image. It is considered a waste for such resource-limited networks. From Figure 5a we observe that distance bias is able to maintain 100% delivery rate, though Geographic BP is a little bit lower. This is because Geographic BP does not prevent one-hop loops, and a small number of packets is trapped due to non-positive backpressure. Nevertheless, Geographic BP’s delivery rates are steadily higher than those of Traditional BP and BCP. Without distance bias, neither of them can completely deliver all packets. Their delivery rates drop as traffic load gets lighter.

Delivery rate, average number of hops, average end-to-end delay, Cumulative Density Function (CDF) of end-to-end delays of traditional backpressure, Backpressure Collection Protocol (BCP), pure geographic, geographic backpressure, and geographic backpressure plus, as traffic load varies.
Then, we examine the average number of hops and average end-to-end delay of delivered packets. The former metric reflects how diversely a backpressure-type algorithm chooses a path, compared with the path formed by Geographic routing, which has the lowest average number of hops, as expected. In Figure 5b, the values of Geographic BP and Geographic BP Plus both raise marginally from Geographic routing, while those of Traditional BP and BCP are much higher and increase as traffic load decreases.
However, in view of queuing delay, fewer hops do not necessarily mean a better end-to-end performance. Therefore, the latter metric depicts whether the combination of backpressure and geographic routing helps to reduce the delay. In Figure 5c, the trends of Traditional BP and BCP keep consistent with those in Figure 5b, which indicates that the number of hops dominates end-to-end delays. Meanwhile, the significant growth of Geographic routing’s delays, from 12 slots to over 150 slots, proves that the over-utilization of a shorter path does not scale well under high traffic load. In contrast, Geographic BP and Geographic BP Plus both grow slowly but no more than 85 slots. In order to show their difference on delay performance, we give the CDF of end-to-end delays when
Last, we measure total power consumption as the overhead of each algorithm. From Figure 6, Traditional BP and BCP similarly have the highest power consumption, while Geographic BP and Geographic routing keep the lowest value. Because each slot Geographic BP Plus seizes more transmission opportunities beyond positive-backpressure-activation policy, it has slightly higher power consumption than Geographic BP. But, with the better end-to-end performance, we consider the extra consumption a worthwhile cost.

Power consumption of traditional backpressure, BCP, pure geographic, geographic backpressure, and geographic backpressure plus as traffic load varies.
To sum up, the traditional backpressure-type routing algorithm and its modified version without distance bias are not suitable for satellite networks. At the same time, the complement approach is necessary to refine the geographic-location-aware backpressure algorithm, in order to improve the end-to-end performance. Next, we further evaluate the proposed algorithms in another two scenarios, which are important for data collection missions.
With ISLs to the GEO node
Generally, the relay of a GEO node can reduce hops and benefit end-to-end delays. But the long ISL distance would increase link cost. Thus, there exists a tradeoff between delay and power consumption for a path to go through GEO or not. The traditional backpressure algorithm only takes a GEO neighbor as a normal candidate, and is unable to dynamically change its rank. Meanwhile, the pure geographic algorithm prefers a GEO node and easily leads to congestion. Still they are not qualified in this scenario. Hence, we only give the comparison of Geographic BP and Geographic BP Plus.
In Figure 7a, it illustrates the end-to-end delays distributions of two algorithms with or without the GEO node when

The minimum value, maximum value, 25th, 50th, and 75th percentile of end-to-end delays, and power consumption of geographic backpressure and geographic backpressure plus with or without GEO as traffic load equals 0.4 and 1.0.
Without ISLs among LEO nodes
Finally, we consider a scenario that is the opposite of the first one. ISLs among LEO nodes are removed and four LEO nodes with persistent ISLs to the GEO are selected as source nodes. Hence, packets have to be sent through the GEO node. We plot the end-to-end delay distribution and power consumption in Figure 8a and Figure 8b, respectively. The two algorithms are also compared with or without ISLs among LEO nodes as

The minimum value, maximum value, 25th, 50th, and 75th percentile of end-to-end delays, and power consumption of geographic backpressure and geographic backpressure plus with or without ISLs among LEO nodes as traffic load equals 0.4 and 1.0.
Conclusion
Data collection on satellite networks is a challenging task that requires compatible throughput and delay performance. In this paper, we model the designing of the data collection algorithm as a cross-layer control optimization problem. It essentially tackles the tradeoff between the two metrics and achieves a balance that maintains throughput optimality with reasonable delay characteristics.
To achieve this, we unify throughput utility and delay cost into the objective. By decomposition, we solve the problem and derive the rate-control, routing, and power allocation policies, which provide provable throughput optimality. Then, we specify the delay cost function and develop our geographic-location-aware backpressure algorithm. Moreover, considering scarcity of link resource, we exploit opportunities missed by backpressure-type algorithm to accelerate transmissions. As a result, the end-to-end delay is regulated in a tight fashion without sacrificing throughput optimality. In future work, we will optimize the energy consumption of special nodes in the network, such as GEO satellites, to extend our power allocation policy.
