Abstract
Keywords
1. Introduction
Path planning algorithms aim to find a sequence of valid states, called a path, that connects a start to a goal. Sampling-based planners, such as Probabilistic Roadmaps (PRM; Kavraki et al., 1996), find paths by randomly sampling valid states and connecting nearby states when these local connections are valid. The resulting structure can be viewed as a graph embedded in a state space, where each vertex represents a valid state and each edge a sequence of valid states connecting two vertices. Multiple planning problems can be solved by adding starts and goals to this embedded graph and then finding a path between them with a graph-search algorithm.
A single planning problem is often solved more efficiently with incremental sampling-based planners, such as Rapidly-exploring Random Trees (RRT; LaValle and Kuffner Jr 1999, 2001) and its asymptotically optimal variant, RRT* (Karaman and Frazzoli 2010, 2011). These planners build a search tree of valid paths rooted at the start by incrementally sampling and connecting states when these local connections are valid. This avoids having to specify the sampling resolution
Best-first graph-search algorithms, such as Dijkstra’s algorithm (Dijkstra 1959), can search problems more efficiently by ordering their search on
The efficient search order of A* is combined with the incremental sampling of RRT* in informed sampling-based planners, such as Batch Informed Trees (BIT*; Gammell et al., 2015, 2020). This improves planning performance, but only when effective cost heuristics are available. A heuristic is most effective when it is both accurate and computationally inexpensive to evaluate relative to other search operations. Such heuristics may not exist for some problems, because they are inaccurate for a given obstacle configuration or computationally expensive due to complex optimization objectives, or may not be admissible, which is often required for theoretical performance guarantees of informed planners.
Problem-specific information not expressible as admissible cost heuristics can be exploited by more advanced informed graph-search algorithms, such as Anytime Explicit Estimation Search (AEES; Thayer et al., 2012) and Anytime Multi-Heuristic A* (A-MHA*; Natarajan et al., 2019). These algorithms decouple search order from solution quality guarantees, which allows them to balance search efficiency with anytime performance. This is especially important for robotic systems that operate under hard time constraints.
This paper presents techniques to inexpensively calculate accurate, admissible, and problem-specific heuristics and exploit them with sampling-based planning algorithms. This is achieved with an asymmetric bidirectional search that considers different information in the forward and reverse searches. These two searches continuously inform each other by sharing complementary information in both directions. Algorithm 1 provides a conceptual overview of this approach.
The reverse search calculates heuristics for the current sampling-based approximation of a planning problem. It exploits problem-specific information implicit in the observed distribution of valid states by combining
The forward search finds valid paths in the current sampling-based approximation of a planning problem. It does this effectively by exploiting the accurate, problem-specific heuristics calculated by the reverse search. The forward search informs the reverse search when invalid edges were used to calculate the heuristic, causing the reverse search to update the heuristic. The forward search is computationally expensive because it performs full collision detection and edge cost evaluation, but focused on connections likely to yield a solution by the calculated heuristics (Figure 1). An illustration of the search trees constructed by RRT*, FMT*, BIT*, AIT*, and EIT* to find an initial solution when optimizing path length (a–e) and obstacle clearance (f–j). The start and goal are represented by a black dot and circle, respectively. Sampled states are represented by small black dots. State space obstacles are indicated with gray rectangles. The initial solutions are shown in yellow and the search trees constructed to find them are shown in black. Any edge in these search trees that is not part of the initial solution delayed finding it. RRT* randomly explores the state space and fully evaluates many edges that are not part of the initial solution for both objectives (a, f). FMT* orders its search with increasing cost-to-come of the vertices and not only fully evaluates many edges that are not part of the initial solution for both objectives but also fails to find a solution when optimizing obstacle clearance because its connection strategy depends on edge cost (b, g). BIT* orders its search on the total potential solution cost according to an admissible cost heuristic and evaluates fewer edges that are not part of the initial solution (c) but only if an informative admissible cost heuristic is available (h). AIT* calculates and exploits a problem-specific heuristic and evaluates still fewer edges that are not part of the initial solution (d) but only when an informative admissible cost heuristic can be calculated for the optimization objective (i). EIT* calculates and exploits cost and effort heuristics and evaluates few edges even when an admissible cost heuristic cannot be calculated for the objective (e, j).
This paper presents two almost-surely asymptotically optimal sampling-based planning algorithms informed by an asymmetric bidirectional search, Adaptively Informed Trees (AIT*) and Effort Informed Trees (EIT*). AIT* calculates an increasingly accurate, admissible cost heuristic with its reverse search and exploits this heuristic with its forward search. This results in fast initial solution times even when the admissible cost heuristic available
EIT* builds on AIT* by calculating an additional cost and effort heuristic with its reverse search and exploiting all three heuristics with its forward search. This results in fast initial solution times even when no admissible cost heuristic is available. The full details of EIT* are presented in Section 3.3.
The benefits of simultaneously calculating and exploiting adaptive heuristics are demonstrated on 12 problems in abstract, robotic, and biomedical domains in Section 4. All domains are tested with the path-length objective, where informative admissible heuristics are available
1.1. Statement of Contributions
This paper expands on ideas first published as Strub and Gammell (2020a). It makes the following specific contributions: • Presents EIT* as an extension of AIT* to optimization objectives that are computationally expensive or difficult to approximate with an admissible • Proves the almost-sure asymptotic optimality of these algorithms by building on results from the path-planning and graph-search literature (Section 3.3). • Demonstrates the effectiveness of these algorithms across multiple domains and optimization objectives, including problems of robotic manipulation and problems with continuous goal regions (Section 4).
2. Background
This section first formally defines the optimal path planning problem (Section 2.1) and then reviews related techniques from the literature on using heuristics in sampling-based planners and graph-search algorithms (Section 2.2).
2.1. Problem Definition
There are two widely studied versions of the path planning problem. The
The Optimal Path Planning Problem (Karaman and Frazzoli 2011).
Asymmetric bidirectional search 1 2 3 4 5 6 7 8 Sampling-based planners are often evaluated probabilistically as a function of the number of samples over all possible realizations of a distribution. Algorithms whose probability of solving the feasible path planning problem approaches one as the number of samples approaches infinity are called
Almost-sure asymptotic optimality (Karaman and Frazzoli 2011). Sampling-based planners can also use deterministic sampling strategies (e.g., Branicky et al., 2001; LaValle et al., 2004), which can result in deterministic optimality guarantees (Janson et al., 2018). The finite-time properties of asymptotically optimal planners are analyzed by Dobson and Bekris (2013), Janson et al. (2018), and Tsao et al. (2020). Formally analyzing sampling-based planners requires assumptions about the path planning problem (e.g., Gammell and Strub 2021). The analysis of AIT* and EIT* builds on the probabilistic results of Karaman and Frazzoli (2011) and makes the same assumptions (Sections 2.1.1–2.1.4).
2.1.1. State space assumption
The state space of the planning problem is assumed to be an open,
2.1.2. Cost function assumptions
Let
The cost of any path,
It is also assumed that only trivial paths consisting of a single state can have zero cost,
2.1.3. Obstacle Assumption
The obstacle configuration of the optimal path planning problem is assumed to allow for a valid path from the start to the goal that remains a fixed distance,
2.1.4. Optimal Solution Assumption
At least one solution of the optimal path planning problem,
2.2. Literature Review
Almost-surely asymptotically optimal planning is a popular area of research (Gammell and Strub 2021). This section focuses on sampling-based and graph-search techniques to calculate and/or exploit heuristics to improve performance. Sampling-based planners that use heuristics to guide the sampling and/or order the search are reviewed in Section 2.2.1. Approaches that calculate and exploit accurate cost heuristics for graph-search algorithms are reviewed in Section 2.2.2 and algorithms that use effort heuristics in Section 2.2.3. Using heuristics in sampling-based planning has parallels with lazy collision detection, which is reviewed in Section 2.2.4.
2.2.1. Heuristics in Sampling-Based Planning
Sampling-based planning algorithms can improve their performance by using heuristics to bias their sampling and guide their search.
RRT-Connect (Kuffner Jr and LaValle 2000) builds on RRT by growing two trees, one rooted in the start and one in the goal state. These trees each explore the state space around them, but are also guided towards each other with a
Heuristically-Guided RRT (hRRT; Urmson and Simmons 2003) and Generalized Bidirectional RRT (GBRRT; Nayak and Otte 2021) bias the growth of their trees with cost heuristics. hRRT uses
Informed RRT* (Gammell et al., 2014, 2018) builds on RRT* by using an admissible cost heuristic to ensure that only states that can improve the current solution are processed. This improves RRT*’s convergence rate and retains its almost-sure asymptotic optimality, but does not guide the search with the heuristic, does not improve the accuracy of the heuristic as the search progresses, and does not provide any benefits until an initial solution is found. Kunz et al. (2016) and Yi et al. (2018) extend informed sampling to kinodynamic systems. Joshi and Tsiotras (2019) present a variant of Informed RRT* that uses previous collision detection results and available information in the graph structure to guide the search.
RRT# (Arslan and Tsiotras 2013, 2015) builds on RRT* by ensuring that all samples are optimally connected to the search tree after each iteration. It does this efficiently by using an admissible cost heuristic to update the connections of suboptimally connected samples in order of their total potential solution cost. This again improves RRT*’s convergence rate and retains its almost-sure asymptotic optimality, but can also not improve the accuracy of the heuristic as the search progresses and does not provide any benefits until an initial solution is found.
Sakcak et al. (2020) present a method that incorporates a heuristic into a version of RRT* that is based on motion primitives (Sakcak et al., 2019). This can improve the performance on kinodynamic problems but uses a discretization of the state space that suffers from the
Informed Subdivision Tree (IST; Bekris and Kavraki 2008), A*-RRT (Li and Bekris 2011), the
Bayesian Effort-Aided Search Trees (BEAST; Kiesel et al., 2017) is similar to these methods in that it runs PRM on a simplified abstraction of the problem and uses the resulting graph to calculate an effort heuristic for the samples in the simplified space. If searching the original space reveals that regions in the abstract space cannot easily be connected, then this effort heuristic is updated in a Bayesian manner. BEAST tends to find initial solutions faster than other planners but does not provide any guarantees on the quality of its solutions.
Quotient-space RoadMap Planner (QMP; Orthey et al., 2018), Quotient-space RRT (QRRT; Orthey and Toussaint 2019), and their asymptotically optimal variants QMP* and QRRT* (Orthey and Toussaint 2019) solve planning problems with sequences of admissible lower-dimensional simplifications of increasing dimensionality. Paths in lower-dimensional simplifications can guide the sampling of states in higher dimensional simplifications and can be seen as admissible heuristics. This can improve performance by orders of magnitude, especially for high-dimensional problems, but requires the user to manually specify the sequence of lower-dimensional simplifications for each problem.
Motion Planning using Lower Bounds (MPLB; Salzman and Halperin 2015) is an anytime adaption of Fast Marching Trees (FMT*) (Janson and Pavone 2013; Janson et al., 2015) that incorporates admissible cost heuristics. MPLB uses two passes of Dijkstra’s algorithm to restrict the set of samples to be searched and another pass of Dijkstra’s to calculate an admissible cost heuristic for these samples, all without detecting collisions. It then uses the resulting cost heuristic in a forward search with collision detection to find a path. This approach can result in accurate, admissible cost heuristics and requires few collision detections but does not update the heuristic when the forward search detects collisions on edges that were used to compute the heuristic.
BIT* samples batches of states and views these states as an increasingly dense edge-implicit Random Geometric Graph (RGG; Penrose 2003). It uses an admissible cost heuristic to search this graph in order of potential solution quality with techniques similar to Lifelong Planning A* (LPA
Similar to AIT* and EIT*, the methods in this section use heuristics to improve the performance of sampling-based planning algorithms. In contrast to AIT* and EIT*, these methods either do not apply heuristics to all aspects of the search, do not provide any bounds on the quality of their solution, require a preprocessing step, do not calculate problem-specific heuristics, or do not improve the accuracy of the heuristic as the search progresses.
2.2.2. Improved heuristics in graph-search
Developing and exploiting accurate heuristics is an important area of research in informed graph-search algorithms. Techniques to calculate more accurate heuristics have improved performance in various problem domains, e.g., the 15-Puzzle (Culberson and Schaeffer 1996), Rubik’s Cube (Korf 1997), and robot vacuum on a grid (Thayer et al., 2011).
Pattern databases (Culberson and Schaeffer 1996, 1998; Korf 1997) are precomputed tables of exact solution costs to potentially simplified subproblems of a problem domain. The highest solution cost of any remaining subproblem in an ongoing search can be used as an accurate heuristic for an informed search. Additive pattern databases (Felner et al., 2004) are constructed such that the heuristic remains admissible when the solution costs of all remaining subproblems are combined, which can result in more accurate heuristics. This increased accuracy can significantly improve performance, but is confined to problem domains for which pattern databases can be generated.
Hierarchical A* (HA
Adaptive A* (AA
The method in Thayer et al. (2011) also generates more accurate heuristics for any domain. It uses a relationship between the cost-to-go of a state and the cost-to-go of its best child to define a
The
Similar to AIT* and EIT*, the methods in this section generate increasingly accurate heuristics that result in increasingly efficient searches. In contrast to AIT* and EIT*, these methods either require preprocessing, cannot be used for the initial search of a graph, result in inadmissible heuristics, or increase the heuristic value for all unexpanded states uniformly and only order by estimated solution cost.
2.2.3. Effort heuristics in graph-search
Ordering the search based on (inflated) cost heuristics improves anytime performance the most when the cost of a path correlates well with the computational effort required to find it (Wilt and Ruml 2012). Directly ordering the search on the computational effort of a path can instead improve performance even when this is not the case. The graph-search literature often uses the number of states that must be expanded to find a solution as a proxy for the total search effort.
Dynamically Weighted A* (DWA
Explicit Estimation Search (EES; Thayer and Ruml 2011) aims to always expand the node which most quickly leads to a solution whose cost is within a user-specified bound of the optimal cost. It uses an admissible cost heuristic to guarantee the bound on the suboptimality and inadmissible cost and effort heuristics to guide its search. This significantly improves search performance in domains where solution cost and depth can differ, but introduces computational overhead and algorithmic complexity because EES must maintain three queues ordered on three different quantities. AEES is an anytime version of EES and provides the foundation of the forward search of EIT*, which is discussed in Section 3.3.2.
Similar to AIT* and EIT*, the methods in this section use estimates of the computational effort required to discover a solution to guide the search and improve performance. In contrast to AIT* and EIT*, these methods do not increase the accuracy of their heuristics as the search progresses.
2.2.4. Lazy collision detection
A byproduct of using heuristics in sampling-based planning to bias the sampling and guide the search is often that fewer edges have to be fully evaluated (Figure 1). This relates informed path planning algorithms to algorithms with lazy collision detection that explicitly aim to minimize the number of collision detections.
Lazy PRM (Bohlin and Kavraki 2000) and Fuzzy PRM (Nielsen and Kavraki 2000) take similar approaches to minimizing computational effort through lazy collision detection. Both algorithms initially connect samples without performing any collision detection on the edges. The resulting graph is processed with an informed graph-search algorithm to find a path that connects the start and goal states, and only checked for collision once a path is found. If collisions are detected, then the corresponding vertices and edges are removed from the graph and the updated graph is processed again to find a new path between the start and goal states. This results in few fully evaluated edges and improved planning performance, but Lazy PRM and Fuzzy PRM do not provide any guarantee on the quality of its solution and do not improve the accuracy of the heuristic used in their graph-search algorithm. Almost-surely asymptotically optimal variants of similar approaches exist (Hauser 2015; Kim et al., 2018) but these algorithms also do not improve the accuracy of their heuristics as the search progresses.
The Single-Query, Bi-Directional, Lazy Collision Detection (SBL) planner (Sánchez and Latombe 2003) combines lazy collision detection with ideas from RRT-Connect. It grows two trees, similar to RRT-Connect, but only checks collisions on edges that it believes to be on a path connecting the start and goal states, like Lazy PRM and Fuzzy PRM. SBL achieves fast solution times, but does also not provide any guarantees on the quality of its solution and does not improve its solution given more computational time.
Lower Bound Tree-RRT (LBT-RRT; Salzman and Halperin 2014, 2016) extends RRT* with a graph whose edges are not fully evaluated and uses this graph to determine which edges to evaluate next. LBT-RRT is almost-surely asymptotically near-optimal and allows for continuous interpolation between RRT and Rapidly-exploring Random Graphs (RRG; Karaman and Frazzoli 2011) by only rewiring states that are
The Lazy Shortest Path (LazySP) Framework (Dellin and Srinivasa 2016) explicitly aims to minimize the number of edges that are checked for collision. It first finds a path from the start to the goal using a heuristic for the edge cost and then uses an
Similar to AIT* and EIT*, the methods in this section improve performance by reducing the number of full edge evaluations. In contrast to AIT* and EIT*, these methods either do not use heuristics, do not improve the accuracy of their heuristics, do not guarantee any bounds on the quality of their solution, do not improve their solution given more computation time, or can only optimize path length.
3. AIT* and EIT*
AIT* and EIT* are almost-surely asymptotically optimal path planning algorithms that build on BIT*. BIT* approximates the state space with a batch of samples, which it views as an edge-implicit RGG. BIT* searches this RGG in order of the total potential solution quality of its edges until it can guarantee that it has found the
To find solutions, BIT* processes the implicit edges of its current RGG approximation in order of their total potential solution cost, using incremental search techniques similar to an edge-queue version of TLPA*. It estimates the total potential solution cost of an edge as the sum of the current cost to come to the source of the edge, a heuristic estimate of the edge cost, and a heuristic estimate of the cost to go from the target of the edge. The formal guarantees of BIT* require that these cost heuristics are admissible.
AIT* and EIT* extend BIT* with an asymmetric bidirectional search that unifies many of the benefits reviewed in Section 2.2 by leveraging information implicit in the observed distribution of valid states (Figure 2). The reverse searches of AIT* and EIT* calculate accurate heuristics which are exploited by their forward searches. The forward searches in turn inform the reverse searches if they used invalid edges to compute the heuristic. In this way, both searches continuously inform each other with complementary information. An illustration of how AIT* and EIT* leverage the observed distribution of valid states to calculate accurate, problem-specific heuristics. The start and goal are represented by a black dot and circle, respectively. Sampled states are represented by small black dots. The state space obstacle is indicated with a gray rectangle. Euclidean distance is often used as a cost heuristic when optimizing path length (a, b). It suggests the green and red edges are equally promising, even though the red edge leads to a dead end (a). Calculating a problem-specific cost heuristic with a reverse search reveals that the green edge is more promising and can lead the forward search around the obstacle without evaluating many unnecessary edges (b). Some optimization objectives may not easily allow for informative admissible heuristics, such as obstacle clearance (c, d). Most informed search algorithms are ordered by cost-to-come in the absence of an informative heuristic, which again suggests that the green and red edges are equally promising (c). Calculating a problem-specific effort heuristic with a reverse search again reveals that the green edge can lead to a solution faster than the red edge, even in the absence of an informative admissible cost heuristic (d).
The reverse searches of AIT* and EIT* evaluate edges approximately and are therefore computationally inexpensive. The forward searches of AIT* and EIT* evaluate edges fully and are therefore computationally expensive, but focused by the calculated problem-specific heuristics. This computational asymmetry avoids the inefficiency of naive symmetric bidirectional informed search, where frontiers of expensive searches pass each other (Section 10.2, Pohl 1969).
An overview of the components in BIT*, AIT*, and EIT*. All three algorithms approximate the state space with the same series of increasingly dense RGGs. BIT* does not have a reverse search and finds valid paths with an edge-queue version of TLPA* ordered by the total potential solution cost according to an
3.1. Notation
The state space is denoted by
The forward and reverse search trees are denoted by
The true connection cost between two states is denoted by the function
The cost to come to a specific state from the start through the forward tree is denoted by the function
Admissible estimates of the cost of a path from the start to a goal constrained to go through a specific state is denoted by the function
Square brackets denote a label, e.g.,
The compounding operations,
3.1.1. EIT*-specific Notation
Potentially inadmissible effort heuristics between two states are denoted by
3.2. Adaptively Informed Trees (AIT*)
AIT* improves on BIT* by using the same increasingly dense RGG approximation but searching it with an asymmetric bidirectional search which calculates and exploits a more accurate cost heuristic that is specific to each RGG approximation (Figure 3). This results in a more efficient search with fewer evaluated edges when an admissible cost heuristic is available Five snapshots of AIT*’s search when minimizing path length. The start and goal states are represented by a black dot and circle, respectively. Sampled states are represented by small black dots. State space obstacles are indicated with gray obstacles. The forward search tree is shown with black lines and the reverse search tree with gray lines. The current best solution is highlighted in yellow. AIT* starts by initializing the RGG approximation and calculating an approximation-specific admissible cost heuristic with a reverse search without collision detection (a). AIT* exploits the calculated heuristic with its forward search and repairs the reverse search tree whenever the forward search reveals that it contains an invalid edge (b). When the forward search finds the resolution-optimal solution, the RGG approximation is improved by sampling and pruning and the heuristic is updated on this improved approximation (c). This updated heuristic is again exploited with the next forward search and repaired when found to use invalid edges (d). AIT* repeats these steps until stopped and almost-surely asymptotically converges towards the optimal solution (e).
AIT* consists of three high-level steps: (i) improving the RGG approximation (sampling; Section 3.2.1) (ii) updating the heuristic (reverse search; Section 3.2.2) (iii) finding valid paths in the current RGG approximation (forward search; Section 3.2.3), as shown by Algorithm 1. The full technical details of AIT* are given in Algorithms 2–7.
The reverse search of AIT* is a version of LPA* that calculates accurate cost heuristics by combining an admissible cost heuristic between multiple states into a more accurate cost heuristic between each state and the goal. The calculated cost heuristic is admissible for the current RGG and leverages information implicit in the observed distribution of valid states. This reverse search is computationally inexpensive because it does not perform collision detection on the edges.
If the reverse search finishes without reaching the start, then the start and goal are not in the same connected component of the current RGG approximation. AIT* skips the forward search in this case and directly improves the RGG approximation. This ensures that AIT* does not spend computational effort searching a graph that it knows cannot contain a solution.
The forward search of AIT* is an edge-queue version of A* which efficiently exploits the calculated heuristic and evaluates few edges that do not contribute to a solution when admissible cost heuristics are available
3.2.1. Approximation
AIT* incrementally approximates the state space by sampling batches of
The
AIT* considers the combination of both this RGG definition and any existing connections in the forward search tree and ignores edges known to be invalid (Alg. 3, lines 2 and 3). RGG complexity is reduced by pruning samples that are not in the informed set (Alg. 2, line 38 and Alg. 7).
3.2.2. Reverse Search
AIT* calculates an accurate cost heuristic between each processed state and the goal that is admissible for the current RGG approximation. It does this by calculating the
This is achieved by processing the RGG approximation with a version of LPA* that is rooted at the goal and uses the admissible
The queue of the reverse LPA* search in AIT* is denoted by
An uninitialized LPA* search is used to calculate the heuristic on the first batch of samples and after each new batch is added. This is more efficient than incrementally updating the heuristic with LPA* for the large changes in the graph that result from increasing its resolution (Aine and Likhachev 2016; Koenig et al., 2004; Likhachev and Koenig 2005). An uninitialized LPA* search is started by clearing the reverse search tree (except for the goals), setting the
The heuristic is updated whenever an edge in the reverse search tree is found to be invalid by removing this edge from the RGG approximation and repairing the reverse search tree with LPA*. This is accomplished by updating the cost-to-go of the source state of the invalid edge and then running LPA* to update the cost of all affected states as necessary (Alg. 2, lines 8–16 and 34–36 and Alg. 6).
The reverse search is suspended when the total potential solution cost of the best state in the reverse queue is greater than or equal to that of the best edge in the forward queue and the target of the best edge in the forward queue is consistent (Alg. 2 lines 9 and 10). This guarantees that no other edge in the forward queue would be better if the reverse search was continued (Strub 2021). The reverse search is also suspended when the reverse or forward queue is empty or when all edges in the forward queue have consistent targets with a reverse-key value less than or equal to the minimum reverse-key in the reverse queue, but these conditions are omitted from Algorithm 2 for clearer structure.
Adaptively Informed Trees (AIT*) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43
3.2.3. Forward Search
AIT* finds solutions to a planning problem by building a search tree rooted at the start with an edge-queue version of A* that uses the heuristic calculated by the reverse search. The edge-queue of the forward search is denoted by
A forward search iteration begins by testing if the forward queue contains an edge that can possibly improve the current solution (Alg. 2, line 17). If it does, then the edge with the lowest
If the edge is not in the forward tree but can possibly improve it, then it is checked for validity which is computationally expensive (Alg. 2, lines 22 and 23). If the edge is invalid, then it is added to the set of invalid edges (Alg. 2, line 34), and if it is also part of the reverse tree, then the heuristic is updated with the reverse search (Alg. 2, lines 8–16, 35, 36, and Alg. 6). If the edge is valid, then its true cost is evaluated which may also be computationally expensive and it is checked whether the edge can actually improve the current solution and forward tree (Alg. 2, lines 24 and 25).
If the edge can improve the current solution and forward search tree, then its target is added to this tree if it is not already in it (Alg. 2, lines 26 and 27). If it is already in the forward search tree, then the new edge constitutes a rewiring and the old edge is removed from the tree (Alg. 2, line 29). The new edge is added to the forward tree and its target is expanded regardless of whether the target was already in the tree or not (Alg. 2, lines 30 and 31).
AIT*: 1 2 3 4
AIT*: 1 2 3 4
AIT*: 1 2 3 4 5 6 7 8 9 10 11 12 13
AIT*: 1 2 3 4 5 6 7 8
AIT*: 1 2 3 A forward search iteration finishes by updating the current solution cost (Alg. 2, line 32). In practice, this is done efficiently by only checking the goals in the forward tree. The entire forward search terminates when it is guaranteed that the optimal solution in the current RGG approximation is found. This occurs when no edge in the forward queue can possibly improve the current solution (Alg. 2, line 17). The forward search also terminates when the start and goal are not in the same connected component of the RGG approximation. This occurs when the reverse search tree does not reach any edge in the forward queue, but this condition is omitted from Algorithm 2 for clearer structure. The three steps of AIT*, i.e., improving the RGG approximation, updating the heuristic with the reverse search, and finding valid paths with the forward search, are repeated for as long as computational time allows or until a suitable solution is found. This results in increasingly accurate cost heuristics for increasingly efficient searches of increasingly accurate RGG approximations and will almost-surely asymptotically converge to the optimal solution in the limit of infinite samples (Section 3.4.1).
3.3. Effort Informed Trees (EIT*)
Informed planning algorithms guided by admissible cost heuristics, such as BIT*, ABIT*, and AIT*, need effective
EIT* consists of the same three high-level steps as AIT*: (i) improving the RGG approximation (sampling; Section 3.2.1) (ii) updating the heuristics (reverse search; Section 3.3.1) (iii) finding valid paths in the current RGG approximation (forward search; Section 3.3.2), as shown in Algorithm 1. Identically to AIT*, EIT* also approximates the state space with a series of increasingly dense RGGs and skips the forward search if the reverse search terminates without reaching the start. In contrast to AIT*, the reverse search of EIT* includes adaptive sparse collision detection on the edges and calculates both problem-specific path-cost and search-effort heuristics which are exploited in an anytime manner by the forward search of EIT*. The full technical details of EIT* are given in Algorithms 8 and 9 using the same Five snapshots of EIT*’s search when optimizing obstacle clearance. The start and goal specifications are represented by a black dot and circle, respectively. Sampled states are represented by small black dots. State space obstacles are indicated with gray rectangles. The forward search tree is shown with black lines and the reverse search tree with gray lines. The current best solution is highlighted in yellow. EIT* starts by initializing the RGG approximation and calculating approximation-specific cost and effort heuristics with a reverse search without collision detection (a). EIT* exploits the calculated heuristics to guide its forward search and repairs the reverse search tree whenever the forward search reveals that it contains an invalid edge. The forward search is initially ordered on the least calculated effort-to-go from a state to the goal, which results in fast initial solution times (b). Once the initial solution is found, the forward search uses the calculated cost heuristics to find the resolution-optimal solution on the current RGG approximation (c). Having found the resolution optimal solution on an RGG approximation, EIT* improves this approximation, updates the heuristic, and aims to find the next best resolution-optimal solution with minimal computational effort (d). This process is repeated until the algorithm is stopped and will almost-surely asymptotically converge towards the optimal solution (e).
3.3.1. Reverse Search
EIT* calculates an admissible cost heuristic, an inadmissible cost heuristic, and an inadmissible effort heuristic for each RGG approximation. The calculated admissible cost heuristic is a lower bound on the optimal cost of a path from a state to the goal and is denoted by the label
These heuristics are computed as in AIT* with a reverse search that combines
The reverse search of EIT* is an edge-queue version of A* with adaptive sparse collision detection. Collision detection is traditionally considered a computationally expensive operation in sampling-based planning (Hauser 2015; Kleinbort et al., 2020b), but this is due to the computational cost of validating valid edges (Sánchez and Latombe 2003). Detecting invalid edges with sparse collision detection is computationally cheaper and was found to be of similar computational cost to other operations in the reverse search when solving the problems presented in Section 4.
The queue of the reverse A* search in EIT* is denoted by
New heuristics are calculated when the RGG approximation is initialized or improved and updated when the forward search detects that the heuristics were calculated with an invalid edge. If the heuristics are calculated because of an initialized or improved approximation, then the resolution of the adaptive sparse collision detection is reset to a user-specified parameter (Alg. 8, lines 6 and 56). If they are updated because of an invalid edge, then the resolution of the sparse collision detection in the reverse search is increased (Alg. 8, line 48).
Each iteration of the reverse search extracts the edge with the lowest
The edge is then checked if it can improve the admissible cost heuristic of its target (Alg. 8, line 17). If it can, then the heuristic is updated and the target is either rewired or added to the reverse search tree (Alg. 8, lines 18–22). The reverse search iteration is completed by expanding the outgoing edges of the target into the reverse queue.
Similar to AIT*, the reverse search is suspended when the total potential solution cost of the best edge in the reverse queue is greater than or equal to that of the best edge in the forward queue and the target of the best edge in the forward queue is closed (Alg. 8 lines 6 and 7). This guarantees that no other edge in the forward queue would be better if the reverse search was continued (Strub 2021). The reverse search is also suspended when the reverse or forward queue is empty, when all edges in the forward queue have closed targets, or when the inflation factor is infinity and any edge in the forward queue has a target in the reverse tree, but these conditions are omitted from Algorithm 8 for clearer structure.
3.3.2. Forward Search
The forward search of EIT* is an edge-queue version of AEES which exploits the cost and effort heuristics calculated by the reverse search of EIT* in an anytime manner. It leverages problem-specific cost and effort information and results in effective searches with fast initial solution times even when no admissible cost heuristics are available
AEES searches the same RGG approximation multiple times with successively tighter suboptimality bounds. It initially prioritizes quickly finding any solution over efficiently finding the resolution-optimum, which improves anytime performance. AEES is especially useful when no informative admissible cost heuristic is available
The forward search of EIT* orders its queue by considering (i) a lower bound on the optimal solution cost (ii) an estimate of the optimal solution cost (iii) and an estimate of the minimum remaining effort to validate a solution within the current suboptimality bound.
Optimal cost bound
A lower bound on the optimal solution cost in the current RGG approximation is computed as
Optimal cost estimate
A more accurate, but possibly inadmissible, estimate of the optimal solution cost in the current RGG approximation is computed as
This estimate is often more accurate than the admissible lower bound, because it can use information that may overestimate the true cost. The edge that leads to this possibly inadmissible estimate of optimal solution cost is denoted as
Effort Informed Trees (EIT*) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 ( 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57
EIT*: 1 2 3 4 5 6 7 8 9
Minimum effort estimate
An estimate of the minimum remaining effort to validate a solution within the suboptimality bound is computed as
The edge that results in this estimate of the minimum required effort remaining to find a solution within the current suboptimality bound is denoted as
ETI* first checks if the queue contains an edge that could improve the current solution (Alg. 8, line 27). If none of the edges in the forward queue can, then the RGG approximation is improved by pruning states that are not in the informed set and sampling more states (Alg. 7 and Alg. 8, lines 52 and 53). The reverse search tree and set of closed vertices are then reset (Alg. 8, line 54) and the forward and reverse search queues are reinitialized by inserting the outgoing edges of the start and goals, respectively (Alg. 8, line 55).
If at least one edge in the forward queue could improve the current solution, then the edge that is estimated to lead to the fastest improvement of the current solution is extracted from the queue (Alg. 8, lines 28 and 29, and Alg. 9). This edge is determined with the following steps: 1. If the edge with the minimum remaining effort required to validate a solution, 2. If the edge that is estimated to be on the optimal solution path, 3. Otherwise the edge that provides the lower bound on the optimal solution cost in the current RGG approximation,
The forward search in EIT* then proceeds similarly to AIT*. If the selected edge is in the forward search tree, then its target state is expanded and the forward search iteration is complete (Alg. 8, lines 30 and 31). If the selected edge is not part of the forward search tree but can possibly improve it, then it is checked for collisions (Alg. 8, lines 32 and 33).
If collisions are detected, then the edge is added to the invalid edges (Alg. 8, line 46) and if it is in the reverse search tree, then the reverse search tree and queue are reset and the sparse collision detection resolution is updated, which will improve the accuracy of the heuristic computed by restarting the reverse search (Alg. 8, lines 47–50). If no collisions are detected, then the true cost of the edge is evaluated to check whether it actually improves the current solution and forward search tree (Alg. 8, lines 34 and 35).
If the edge improves the current solution and forward search tree, then its target state is added to the tree if it is not already in it (Alg. 8, lines 36 and 37). If the target state is already in the tree, then the edge causes a rewiring and the edge from the old parent is removed from the tree (Alg. 8, lines 38 and 39). The new edge is then added to the tree, its target state is expanded into the forward queue, and the solution cost is updated (Alg. 8, lines 40–43). If the edge results in an improved solution, then the suboptimality factor is changed according to a user-specified update policy (Alg. 8, lines 44). Section 4 presents the updated policy used in the experimental evaluation of EIT*.
The entire forward search terminates when it is guaranteed that the optimal solution in the current RGG approximation is found. This occurs when no edge in the forward queue can possibly improve the current solution (Alg. 8, line 27). The forward search also terminates when the start and goal are not in the same connected component of the RGG approximation. This occurs when the reverse search tree does not reach any edge in the forward queue, but this condition is omitted from Algorithm 8 for clearer structure.
Similar to AIT*, the three steps of EIT*, i.e., improving the RGG approximation, updating the heuristic with the reverse search, and finding valid paths with the forward search, are repeated for as long as computational time allows or until a suitable solution is found. This results in increasingly accurate cost and effort heuristics for increasingly efficient and effective searches of increasingly accurate approximations and will also almost-surely asymptotically converge to the optimal solution in the limit of infinite samples (Section 3.4.2).
3.4. Analysis
Any path planning algorithm that processes a sampling-based approximation with a graph-search algorithm is almost-surely asymptotically optimal if the approximation almost-surely contains an asymptotically optimal solution and the graph-search algorithm is resolution-optimal. This is a sufficient but not necessary condition. The almost-sure asymptotic optimality of AIT* and EIT* follows from proven properties of their RGG approximations and graph-search algorithms.
3.4.1. AIT*
The RGG approximation constructed by AIT* almost-surely contains an asymptotically optimal solution because it contains all the edges in PRM* for any set of samples and PRM* is almost-surely asymptotically optimal (Karaman and Frazzoli 2011). AIT*’s forward search is resolution-optimal because A* is a resolution-optimal algorithm if it is provided with an admissible cost heuristic (Hart et al., 1968). AIT*’s reverse search without collision detection results in an admissible cost-heuristic because LPA* is also a resolution-optimal algorithm (Aine and Likhachev 2016) and because adding collision detection cannot decrease path cost. AIT* is therefore almost-surely asymptotically optimal.
3.4.2. EIT*
The RGG approximation constructed by EIT* almost-surely contains an asymptotically optimal solution because like AIT* it also contains all the edges in PRM* and PRM* is almost-surely asymptotically optimal (Karaman and Frazzoli 2011). EIT*’s forward search is resolution-optimal because AEES is a resolution-optimal algorithm when the cost heuristic is admissible and the suboptimality factor is one (Thayer and Ruml 2011). EIT*’s reverse search with sparse collision detection results in an admissible cost heuristic because denser collision detection cannot decrease path cost and A* is a resolution-optimal graph-search algorithm when provided with an admissible cost heuristic (Hart et al., 1968). EIT* is therefore also almost-surely asymptotically optimal.
4. Experimental results
The benefits of an asymmetric bidirectional search are shown on abstract, robotic manipulator, and knee replacement dislocation problems (Sections 4.1–4.3). AIT* and EIT* were compared against the Open Motion Planning Library (OMPL; Şucan et al., 2012) implementations of RRT, RRT-Connect, RRT*, LBT-RRT, LazyPRM*, FMT*, BIT*, and ABIT* 1 .
The planners were tested when optimizing path length and obstacle clearance. Path length was optimized by minimizing the arc length of the path in state space. Obstacle clearance was optimized by minimizing the reciprocal of clearance integrated over the arc length of the path,
The lower limit on
The admissible cost heuristic,
The effort heuristic,
The inflation factor update policy in EIT* was configured to have an infinite inflation factor until the initial solution is found and then switch to a unity inflation factor. This results in fast initial solutions and efficient subsequent searches to improve them. The sparse collision detection resolution update policy used in EIT* was configured to initially search each batch with a single collision check and then double the resolution if the forward search detects a collision on an edge used in the reverse search tree.
RRT-based planners used maximum edge lengths of 0.3, 0.9, 1.25, 1.25, 2.4, and 3.0 in
FMT* is not an anytime algorithm and requires the user to specify the number of samples in advance. All experiments presented in this section tested configurations of FMT* with 10, 50, 100, 500, 1000, and 5000 samples. There are multiple lines for FMT* in the presented plots because median solution times and costs were computed separately for each configuration and the results of all configurations that were able to solve a specific problem were plotted.
4.1. Abstract Problem
State space obstacles have complex shapes even for relatively simple problems (e.g., Figure 1, Das and Yip 2020). This complexity often makes it difficult to gain insight about the underlying reason for a planner’s performance on a given problem. Directly designing abstract state space obstacles from basic geometries provides intuition on the performance of a planner for a given obstacle configuration and helps the algorithmic design process.
The basic geometries of these simple obstacle configurations make collision detection computationally much less expensive than in real-world problems. A simple way to simulate the more expensive collision detection of real-world problems in this abstract setting is to increase the collision detection resolution. The collision detection resolution was set to 5 ⋅ 10−6, which on the tested hardware makes evaluating a valid edge in this abstract setting as computationally expensive as evaluating a valid edge on a dual-arm manipulation problem (Section 4.2). While admissible cost heuristics exist for these abstract problems with clearance in state space (Strub and Gammell 2021), such heuristics often do not exist for real-world problems with clearance in workspace and therefore no heuristics were used for the clearance objective in these abstract problems either.
The abstract obstacle configuration on which the planners were tested consists of a wall with a narrow gap between the start and goal states (Figure 5). This obstacle configuration illustrates the speed with which planners find a hard-to-find optimal homotopy class when optimizing path length. When optimizing obstacle clearance, this configuration illustrates the challenges of searching in the absence of informative heuristics and ordering the search according to the total potential solution cost. A two-dimensional illustration of the wall gap experiment. The start and goal states are represented by a black dot and circle, respectively. State space obstacles are indicated with gray rectangles. Each state space dimension was bounded to the interval [0, 1].
Three versions of the wall gap obstacle configuration in dimensions
Figure 6 shows the performance of all algorithms on all six instances of the problem when optimizing path length and obstacle clearance. When optimizing path length, AIT* and EIT* perform similarly to Lazy PRM*, BIT*, and ABIT* and find initial solutions at least as fast as RRT-Connect and significantly faster than RRT* and LBT-RRT (Figures 6(a), (c), and (e)). When optimizing obstacle clearance, EIT* outperforms all other tested asymptotically optimal planners, including AIT*, by again finding initial solutions as fast as RRT-Connect, which has a computational advantage because it does not calculate path cost (e.g., obstacle clearance, Figures 6(b), (d), and (f)). The planner performances on the wall gap experiments described in Section 4.1 (Figure 5). The success plots show the percentages of successful runs over time. The cost plots show the median initial solution times and costs as squares and the median solution costs over time as thick lines, both with nonparametric 99% confidence intervals shown as error bars and shaded areas, respectively. Unsuccessful runs were taken as infinite costs. The results show that EIT* outperforms all other tested asymptotically optimal planners for both objectives in terms of success rates, median initial solution times, and median solution quality over time.
4.2. Manipulator problems
The algorithms were also tested on path planning problems for Barrett Whole-Arm Manipulator (WAM) arms in the Open Robotics Automation Virtual Environment (OpenRAVE; Diankov 2010). OpenRAVE was configured to use the Flexible Collision Library (FCL; Pan et al., 2012) for collision detection and clearance computation, using Oriented Bounding Box (OBB; Gottschalk et al., 1996) tree and Rectangle Swept Sphere (RSS; Larsen et al., 2000) volume representations, respectively. The collision detection and clearance computation resolution was set to 3.6 ⋅ 10−3, which resulted in a 1% false-negative collision detection rate for invalid edges on representative problems.
4.2.1. Single-Arm manipulator problem
Robotic manipulator arms are commonly used in pick-and-place tasks. In the single-arm experiment, the algorithms were instructed to find paths for a Barret WAM arm with seven degrees of freedom to place a small cube into a box (Figure 7). Illustrations of the single-arm manipulator problem. The top row shows the start configuration of the arm in position to pick up the red cube from the table (a–d). The bottom row shows the goal configuration of the arm in position to place a cube in the box (e–h).
Figure 8 shows the performance of all algorithms when optimizing path length and obstacle clearance. When optimizing path length, AIT* and EIT* perform similarly to Lazy PRM*, BIT*, and ABIT*, which all find initial solutions nearly as fast as RRT-Connect and significantly faster than RRT* and LBT-RRT (Figure 8(a)). When optimizing obstacle clearance, AIT* and EIT* outperform all other tested asymptotically optimal planners but do not find initial solutions as fast as RRT-Connect, which again has a computational advantage because it does not calculate path cost (Figure 8(b)). This figure shows the planner performances on the single-arm manipulator experiments described in Section 4.2.1 (Figure 7). The success plots show the percentages of successful runs over time. The cost plots show the median initial solution times and costs as squares and the median solution costs over time as thick lines, both with nonparametric 99% confidence intervals shown as error bars and shaded areas, respectively. Unsuccessful runs were taken as infinite costs. The results show that for the path length version of this simple problem, EIT* and AIT* have near identical performances to Lazy PRM*, FMT*, BIT*, and ABIT*, which outperform LBT-RRT and RRT* (a). In the obstacle clearance version of the problem, EIT* and AIT* outperform all other almost-surely asymptotically optimal planners (b).
4.2.2. Dual-Arm manipulator problem
In the dual-arm planning experiment, the algorithms were instructed to find paths for two Barret WAM arms with a total of 14 degrees of freedom from a start configuration at the bottom shelf to a goal configuration at the top shelf (Figure 9). Illustrations of the dual-arm manipulator problem. The top row shows the start configuration of the arms in position to pick up two objects on the bottom shelf (a–d). The bottom row shows the goal configuration of the arms in position to place these objects on the top shelf (e–h).
Figure 10 shows the performance of all algorithms when optimizing path length and obstacle clearance. When optimizing path length, AIT* performs similarly to BIT* and ABIT* which perform better than Lazy PRM* and EIT*, and find initial solutions nearly as fast as RRT-Connect and significantly faster than RRT* and LBT-RRT (Figure 10(a)). When optimizing obstacle clearance, AIT* and EIT* again outperform all tested asymptotically optimal planners but do not find initial solutions as fast as RRT-Connect, which still has a computational advantage because it does not calculate path cost (Figure 10(b)). This figure shows the planner performances on the dual-arm manipulator experiments described in Section 4.2.2 (Figure 9). The success plots show the percentages of successful runs over time. The cost plots show the median initial solution times and costs as squares and the median solution costs over time as thick lines, both with nonparametric 99% confidence intervals shown as error bars and shaded areas, respectively. Unsuccessful runs were taken as infinite costs. The results show when optimizing path length, AIT* performs nearly identically to BIT* and ABIT*, which all outperform Lazy PRM*, FMT*, and EIT*, and clearly outperform LBT-RRT and RRT* (a). When optimizing obstacle clearance, EIT* and AIT* outperform all other almost-surely asymptotically optimal planners (b).
4.3. Knee replacement dislocation problem
Calculating heuristics with an asymmetric bidirectional search can also improve performance on the feasible planning problem by guiding the search towards the goal. The knee replacement dislocation problem evaluates the potential of medial dislocation for the Oxford Domed Lateral Unicompartmental Knee Replacement (UKR; Pandit et al., 2010, Figure 11)
2
by searching for a path to free the mobile bearing. A multiview illustration of the knee replacement dislocation experiment. The 3D model of the Oxford Domed Lateral UKR is reproduced from Figure 1 in Pandit et al. (2010) (a). The experiments presented in this paper used a simplified approximation of the Oxford Domed Lateral UKR (b). The start state of the mobile bearing is between the two fixed parts (b) with the search space and goal region shown as red and green regions, respectively (c). The goal region is the set of bearing positions in medial dislocation between the tibial and femoral components (d).
The Oxford Domed Lateral UKR consists of metal femoral and tibial components which are fixed to the bone and a mobile polyethylene bearing which separates the metal components (Gunther et al., 1996). Medial dislocation occurs when there is enough space between the tibial and femoral components for the mobile bearing to move onto the tibial-component wall where it may be trapped by the femoral component. The dislocation risk for different relative poses of the femoral and tibial components has been analyzed by using planning algorithms to search for paths that allow the bearing to reach a region representative of dislocation (Yang et al., 2020, 2021). Figures 11(b) and (c) respectively illustrate the start state and goal region of the mobile bearing and the fixed poses of the tibial and femoral components used in this experiment. The state space of this problem is SE (3), and the mobile bearing is free to move and rotate in any direction not in collision with the fixed parts.
Figure 12 shows the performance of all planners when optimizing path length and obstacle clearance. When optimizing path length, EIT* outperforms all other tested planners. AIT* is the second best performing algorithm followed by FMT*, BIT*, and ABIT*. The OMPL implementations of LBT-RRT and RRT-Connect did not allow goals to be defined as a region and were replaced with RRT for this experiment. When optimizing obstacle clearance, EIT* again outperforms all other tested planners and is the only planner that achieved a success rate of 100%. The planner performances on the knee replacement dislocation problem described in Section 4.3 (Figure 11). The success plots show the percentages of successful runs over time. The cost plots show the median initial solution times and costs as squares and the median solution costs over time as thick lines, both with nonparametric 99% confidence intervals shown as error bars and shaded areas, respectively. Unsuccessful runs were taken as infinite costs. The results show that when optimizing path length, EIT* and AIT* outperform all other planners in terms of success rates, median initial solution times, and median solution quality over time. Lazy PRM* finds solutions as fast as EIT* for some random sequences of samples, but reaches the time limit before finding any solution for other random sequences. When optimizing obstacle clearance, EIT* again outperforms all other tested planners in terms of the above measures. RRT-Connect and LBT-RRT were not tested in this experiment, as their available OMPL implementations do not support goal regions.
5. Discussion
The experiments presented in Section 4 demonstrate the benefits of sampling-based path planning with an asymmetric bidirectional search. This section discusses the results of these experiments, elaborates on the algorithmic differences between AIT* and EIT*, and presents possible extensions to asymmetric bidirectional algorithms in sampling-based path planning and beyond.
Section 4 shows the performance of AIT*, EIT*, and eight other sampling-based algorithms on a diverse set of 12 problems optimizing two objectives. When optimizing path length, the experiments show that AIT* and EIT* are competitive to the other tested planners in terms of initial solution times, success rates, and solution quality over time.
When optimizing obstacle clearance, the experiments show that EIT* outperforms all other tested asymptotically optimal planners by finding initial solutions faster, reaching 100% success rates sooner, and providing the highest quality solution for most of the time. AIT* is often the second-best performing planner on this objective and even competitive to EIT* on the robotic arm experiments.
The batch size of AIT* and EIT* was kept constant for all presented experiments, while the performance of RRT-based planners was tuned to the problem dimension by adjusting the maximum edge length. This shows that AIT* and EIT* can perform well without problem-specific batch sizes but can also motivate future research to investigate advanced batch-size calculations, including variable and adaptive batch sizes.
The experiments presented in Section 4 keep the collision detection resolution constant within each problem. This resolution determines the false-negative collision rate, i.e., the percentage of edges that are considered valid but in reality are not. What is considered an acceptable false negative collision rate depends on the application of the planning algorithm. Experiments not presented in this paper showed that the
Edge evaluation is also computationally expensive for systems with kinodynamic constraints, when full edge evaluation requires solving a two-point Boundary Value Problem (BVP). The accurate heuristics calculated by AIT* and EIT* can reduce the number of BVP solved, but AIT* and EIT* require exact solutions to the BVPs. Other almost-surely asymptotically optimal algorithms do not require exact BVP solutions even for problems with kinodynamic constraints (Hauser and Zhou 2016; Li et al., 2016; Kleinbort et al., 2020a; Shome and Kavraki 2021).
AIT* and EIT* use different algorithms for the reverse and forward searches of their asymmetric bidirectional search (Table 1). The change in forward search algorithms from A* in AIT* to AEES in EIT* is motivated by the benefits of effort heuristics, which cannot be exploited with A*, and justify the computational overhead induced by the increased complexity of AEES. The change in reverse search algorithms from LPA* in AIT* to A* in EIT* is motivated by the observations that repairing the reverse search tree with LPA* is only more efficient than restarting A* for small changes in the search tree (between 1% and 2%; Table 1, Aine and Likhachev 2016), and that detecting invalid edges with the forward search often results in larger changes in the reverse search tree. Implementing increasingly dense collision detection in an LPA* reverse search would additionally require either further bookkeeping to keep track of which portion of the tree was checked with which resolution or result in duplicated collision detection effort on some of the edges.
The presented asymmetric bidirectional approach in which two searches inform each other with complementary information can potentially be beneficial in all problem domains where full edge evaluation is computationally expensive, e.g., because of computationally expensive true edge cost computation or complex collision detection. If this edge cost computation can inexpensively be approximated by a heuristic, then the reverse search can combine such heuristics between multiple states into more accurate heuristics between each state and the goal, similar to AIT* and EIT*.
6. Conclusion
Informed path planning algorithms can use problem-specific knowledge in the form of heuristics to improve their performance, but selecting appropriate heuristics is difficult. This is because heuristics are most effective when they are both accurate and computationally inexpensive to evaluate, which are often conflicting characteristics. Many informed planners additionally can only use problem-specific knowledge if it is expressible as admissible heuristics, which is not always possible for all optimization objectives.
This paper presents two almost-surely asymptotically optimal path planning algorithms that address these challenges by using asymmetric bidirectional searches that simultaneously calculate and exploit accurate, problem-specific heuristics. AIT* uses an inexpensive reverse search to combine admissible
EIT* builds on AIT* by additionally calculating inadmissible cost and effort heuristics with its reverse search. This additional knowledge about the computational effort to validate a path can be calculated for any optimization objective and exploited by EIT*’s forward search to order its search in a solution-oriented manner. This improves anytime performance when the cost of a path does not correlate well with the computational effort required to validate it and allows EIT* to find initial solutions quickly, even if an admissible cost heuristic is not available for an optimization objective.
The benefits of simultaneously calculating and exploiting ever more accurate heuristics through an asymmetric bidirectional search are demonstrated on 12 diverse problems in abstract, robotic, and biomedical domains. EIT* outperforms all other tested asymptotically optimal planners when optimizing obstacle clearance and performs competitively when optimizing path length. AIT* is often the second best performing asymptotically optimal planner when optimizing obstacle clearance and also performs competitively when optimizing path length.
Information on the OMPL implementations of AIT* and EIT* as well as the software framework for running the experiments and creating the corresponding plots will be available at https://robotic-esp.com/code/.
Supplemental Material
Supplemental Material
Supplemental Material
Footnotes
Acknowledgements
Declaration of conflicting interests
Funding
Supplementary Material
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
