Abstract
Introduction
Generally, a plan for the rail transit network in a city is formed through several stages, which mainly include the network size calculation, the morphology determination, the rail paths building, and then the initial network formation. Afterward, a revision and optimization, based on passenger flow test results, will be repeatedly implemented on the initial network until the ultimate rail transit network plan is formed. Throughout the whole process, the initial network is key to the determination of the geographical position and path direction of urban rails in all periods and scopes.
Existing studies on city rail transit network planning could be mainly grouped into two streams. One is theoretically focused on the generation and optimization of city rail transit network employing analytic network approach. This stream went through three individual stages. Studies in the first stage from 1970s to 1980s are single-goaled and achieved by the optimization of one path on another. For example, DiCesare 1 proposed a single-path model to minimize construction fee. Church and Clifford 2 proposed a simple network grid model to balance the path network configuration, by which a path rendering the lowest fees between the origin and destination (O-D) is generated. As the model fails to serve the majority of population in a region, it cannot be implemented. In the second stage from 1990s to the early 21st century, multiple goals were proposed for network design on certain premises. For example, based on Church’s studies, Current et al. 3 established the dual goal-based generation model to design a rail network serving the maximized passenger flow with the shortest line length. Utilizing the Tabu Search Heuristic algorithm, Dufourd et al. 4 developed a dual goal-based rail transit optimization model for rail network design. Afterward, Liu 5 proposed a virtual network model for urban rail transit. Based on current research, Bruno et al. 6 tried the K-shortest-paths algorithm to analyze options for rail transit networks with fixed demands and found that the resulted network was better performed in the optimization of minimizing construction and traveling cost. Wang et al. 7 used network graph theory in the investigation of the network size, morphology, and initial network layout. In the third stage from early 21st century till now, numerous planning methods for rail transit network layout come out, such as bi-level planning and designing models, joint studies on passenger flow forecast and network optimization, network planning based on elastic demands, and new path expansion method. Based on the four-phase model framework, Laporte et al. 8 defined a new model using the maximum coverage of the public demand and the path as an objective function and it could be used for a kernel rail transit network design and the ultimate urban rail transit network formation, Marín and García-Ródenasbs 9 established a bi-level path network optimization model.
The other stream is practically involved with methodologies applied in the generation and optimization of rail transit network including demand test, where both quantitative and qualitative analyses are employed. For instance, Beijing Urban Construction Institute advocated the plane-, hub-, and path-based factor analysis. 10 China Academy of Urban Planning and Design proposed a method for weaving transit network, which takes the planning for the goal and principle, the functional hierarchy for the orientation, and the traffic hub for the outline. 11 Escudero and Munoz 12 proposed a two-phase planning method that incorporates loop rail paths in network expansion. The first phase uses the integer planning and design model to arrange rail transit stations. The second phase connects specified stations under certain constraints. 12 Considering that most Chinese cities follow the multi-center axis development, Peng 13 proposed the station-based networking method. Kong 14 also proposed a technique for generating the initial rail transit network, which is separated into three steps: generating the initial network—designing alternative plans—determining the final plan. At the same time, the assessment system for rail transit network is also improving. Guo and Lu 15 examined the relationship among city space structure, land usage, and rail transit network layout and assessed the network layout of typical cities. Wang et al. 16 raised three qualitative indicators that are the fitness between rail transit network center and city center of gravity, the consistency of shape division dimensions between the city and rail transit network, and the similarity of the direction between population and rail transit network.
As mentioned above, numerous theories and methodologies on the rail transit network planning have been brought out, but still some gaps exist there. Current theoretical studies aimed at optimizing the whole rail network and building their corresponding network models lack the due practicability. Most mature methods on rail network planning still stay in a qualitative level, especially the determination of network morphology and passenger hubs. The process for line network generation lacks data for quantitative study. Therefore, the objective of this article is to generate a rail transit network using the reference drawn by the importance theory and employing the quantitative analysis from plane, hub, and path factors.
Determining the initial rail network morphology based on the plane factor
Constructing the adjacent matrix and border matrix
Adjacent matrix reflects the adjacency between zones, which is the basis for path searches. Suppose the adjacent matrix is represented by
The adjacent matrix of a zone is as follows
In practice, additional attention should be given to the city morphology and its future development that these eccentric or concentric zones should be considered to be adjacent. According adjustments should be made on the adjacent matrix and build a revised adjacent matrix
Border matrix is a set of zones that are possible to be destinations of a path and usually refer to zones in the bordering area (bordering zones for short). Suppose the border matrix is represented by
The border matrix between zones is as follows
Since the O-D of a rail transit may not always fall in the bordering zones of a city but sometimes fall in zones inside the city, adjustments should also be made to meet the actual situation and build a revised border matrix
Importance of traffic zones
The importance of a traffic zone is related not only to its population and number of jobs, but also to its geographical position in a city. Therefore, the calculation of a traffic zone importance involves two parts: position importance and transportation importance.
Position importance of a traffic zone
Assume that both origin and destination of a generated trip locate in the geographical center of the zones. As to traffic zones in the periphery with larger size, the main passenger hubs are selected as their centers. Suppose a research area is divided into
where
Transportation importance of a traffic zone
From the transportation perspective, the importance of a traffic zone typically depends on the trips generated in this zone (production and attraction). Therefore, the generated trip amount is the key determinant of the transportation importance. The transportation importance of a traffic zone
where
Overall importance of a traffic zone
The overall importance of
In equation (9),
Path initialization to searches
Based on adjacent and border matrixes and importance indicators, the direction weight in path searches could be defined by
where
As to the rail path search, the steps could be summarized as follows:
Step 1: Determine the origin, calculate the importance indicator of traffic zones, and rank these zones according to the importance. Select centers that are of higher importance and meanwhile distributed to different zones as the origin. The number of origins depends on the actual situation, mostly, 4 to 6.
Step 2: Path search starts from the origin to its adjacent zones with higher importance. The search could have four options in direction. During the search, if the weight difference between two directions is within 20%, search both directions. The search must comply with the smooth requirement that the turn angle should be between 90° and 180°.
Step 3: Once the centers between adjacent zones are connected, the searching would come to the next direction with the smaller weight to avoid repeated paths.
Step 4: If the searched point is a center of a border matrix, the search ends. The collection set of zones throughout the path is
Step 5: Conduct preliminary analyses of generated path morphologies. If some zones with obvious high importance are not searched or the rail network size is apparently small, repeat steps 1, 2, 3, and 4 until the general paths on the rail transit network show up.
Generating the virtual rail transit network based on the hub factor
Determining the importance of a hub
Identifying passenger hubs
A passenger hub is the place where the passenger flow produces and attracts, and it usually refers to a regional center (I) or transportation hub (II). Typically, a regional center system consists of an administrative center, a business center, a large-sized company, residence, and a resort center. On the other hand, a transportation hub is usually presented in the form of a bus station, passenger station, rail station, or an airport. Usually, (I) typed passenger hubs are classified according to the land use and development strength surrounding the hub and its detailed classification is shown in Table 1.
Classification of (I) typed passenger hubs.
According to the transportation functions and service scope, (II) typed passenger hubs are divided into regional, community, and urban passenger hubs, as shown in Table 2.
Classification of (II) typed passenger hubs.
Constructing the indicator system
Based on the factor analysis of city passenger hubs, a hierarchical analysis method that incorporates geographical condition, production and attraction of passenger flows, and accessibility of hubs is applied to build the passenger hub indicator system, as shown in Figure 1.

The indicator system of passenger hubs.
Determining and standardizing indicators
Both qualitative and quantitative indicators are included in this section. Quantitative indicators can be mainly obtained through corresponding surveys or statistics, while qualitative indicators can be identified by experts grading method and processed by the following expression
where
Since the units, dimensions, and magnitude of indicators are different, we cannot measure the importance of passenger hubs directly. So, there is a need to standardize these indicators.
Assume that
The standardization process is as follows
After standardization, the range of indicator
Determining the weights of indicators
Weight indicates the importance, value, and proportion of an indicator in the whole indicator system. In general, the weights determined at the goal layer are largely the results of qualitative experience, leading to high subjectivity. This thesis uses entropy method to determine indicators’ weights.17,18 Entropy, a measure of the disorder in a system, is commonly employed to draw the effective information or weights behind the given data. Therefore, its application for the weight calculation in the goal layer is reasonable and objective.
The decision matrix driven from data standardization is used for the weight calculation of the
Through the calculation based on the above expression, we can get the matrix
Then, the entropy of the
If
where
Calculating the importance
After the weights of indicators are determined, the importance indicator
where
Determining the path origin and destination
To connect rails with inter-city transportation effectively without any transfers, we need to select certain pairs of O-D in the border matrix according to the importance of passenger hubs. Sometimes, O-D in non-border matrixes (mainly refers to fringe regions with frequent traveling activities) should be considered with the vehicle land use as constraints. Afterward, we need to match origins with destinations, based on the search results in the plane-based initial rail transit network, trip distribution, the maximum rail length constraint, and the possibility of building a rail in between.
Determining the path search area
The path search area is determined by the origin and destination, as well as the distance in between. After the O-D is determined, it comes to the analysis of reasonable distance between the O-D. As mentioned above, the diameter of a central town is 30–40 km. Given that origins and destinations are generally not located in the city border, the straight line distance between them is around 35 km. Suppose the nonlinear coefficient of rail transit is 1.35 (the average nonlinear coefficient of rail transit in Beijing is 1.15 while that of Shanghai is 1.33), then the maximum O-D distance is 47 km. On this basis, the search area can be determined.
Determining the matching set of effective passenger hubs between O-D
Building a directed graph between O-D and passenger hubs
Take any point in the city plane for the origin and build a rectangular plane coordinates, as shown in Figure 2. Select the

The directed graph of O-D and hubs.
Algorithm for determining the matching set of effective passenger hubs
In the directed graph, the collection set of points in the
According to previous analysis, we need to calculate the multiple paths between O-D. This article uses the double-sweep algorithm 19 combined with the genetic algorithm (GA) to search the matching set of effective passenger hubs with meeting the requirements on smoothness (the turn angle is more than 90°) and maximum line length of the search area.
The genetic algorithm was developed by Holland 20 for solving complicated optimization and search problems. With the growing studies on heuristics, the application of GA in complex adaptive systems has been given more and more attention.
The genetic algorithm is inspired by the mechanism of natural evolution including inheritance, mutation, selection, and crossover. In the algorithm, a candidate solution to an optimization problem is denoted by the notion of “chromosome,” which is encoded in certain manner. Usually, the coding system is binary represented, but other encodings (e.g. floating coding) are also possible. The process of genetic algorithm for the optimal solution is analogous to that of natural evolution which obeys the principle of “survival of the fittest.” In the genetic algorithm, the evolution is an iteration process where a population of chromosomes in each iteration is called a generation. The initial generation is produced at random. In a generation, the quality of each chromosome is evaluated with a fitness function which is usually an adaptation of the objective function of the model. Chromosomes with higher fitness are more likely to be picked out from the current generation to produce a new generation by means of duplication, mutation, and crossover. Then the new generation can be used in the next iteration. Dominated by the principle of “survival of the fittest,” the iteration process will converge and the global optimal can be obtained. In practice, the algorithm stops when either the threshold of iterations or a satisfactory fitness level is reached for the population.
The iteration number, the population size, the crossover rate, and the mutation rate are several parameters that have great impacts of the performance of GA. Large iterations or population number hold more potential to obtain global optimal solution but will also increase searching time. High mutation rate can confirm the diversity of solutions to avoid trapping in local optimal but at the cost of huge time consumption.
The detailed steps are shown as follows:
Step 1. Select an encoding system; set population size
Step 2.
Do while
For
/*calculating the fitness of each chromosome of Generation
End for
If (a satisfactory fitness level obtained) then
End do
print solution
Else
For
/*calculating the selection odds of each chromosome of Generation
End for
End if
Do crossover:
Do mutation:
End do
Thus, the search process of genetic search can be seen in Figure 3.

Flowchart of the genetic search.
The obtained matching set of effective passenger hubs is
Determining the matching set of optimal passenger hubs
Next, we need to calculate the overall importance
where
Among all paths, select the path of the highest overall importance
Adjusting entity paths at the path layer
The passenger hub matching set
Since passenger hubs are relatively close and usually locate at the network nodes, the searched virtual rail paths highly match the actual transportation network, as shown in Figure 4. To keep a high consistency between actual rail paths and passenger flow corridors, the study selects the optimal rail paths and sets up the rail path layout accordingly.

Graph of rail path layout.
Case study of Hefei
Determining the morphology of the initial rail transit network
To elaborately show the morphology of the initial network, this article combined modeling zones according to their structure. First, build the distance matrix between zones according to the rail transit network plan, which is used to calculate the position importance of zones. Second, forecast the generated traveling needs of a zone and divide the value by the zone size, by which the transportation importance of the zone could be calculated. Third, combine the position importance and the transportation importance and get the overall importance of a zone, as shown in Figure 5.

Importance distribution and initial network morphology map.
What to follow is to build the adjacent matrix and border matrix and to calculate the direction weights of searching adjacent zones based on the importance indicator, distance matrix, and forecasted O-D distribution results. As suggested above, traffic zones that have higher importance and locate in different zones should be chosen as the origin. In this case study, traffic zones 1, 37, and 63 were selected as origins and then we searched their revised adjacent zones. The search network covering 59 zones was produced and shown in Figure 5, by which all possible rail paths in the central city are listed in Table 3.
Possible paths in the initial rail network.
Generating the entity rail path
To form a rail transit network, two steps are involved: (1) build the network structure and morphology and (2) set up and optimize each rail path one by one. As an example, this paper selected line A to follow the steps based on the importance indicator.
Determining passenger hubs, path origin, and path destination
The path search area is the main traffic corridor at the east-west direction in the major town of Hefei. This study selected Changing Avenue and Dazhong Road as the origin and destination considering the urbanization trend of Hefei, the match with the rail transit network, connection modes between rail paths, and land use constraints in the neighboring area. Within the covering area of the main corridor, 50 passenger hubs were selected as alternative options.
Determining the optimal passenger hubs
Based on the optimization of passenger hubs, a directed graph including origin, destination, and hubs was built and then using the double-sweep algorithm, we obtained a set of alternative paths between the O-D. According to the calculated importance for each passenger hub, and each alternative path a final set including 28 optimal passenger hubs is obtained: {Changing Avenue of Nan gang Town,…, Large East Gate,…, Dazhong Road of Long gang Town}.
In the next procedure, a path distribution was made with the consideration of the real road situation, an implementation was conducted for virtual path in the hub layer, and the optimal path for line A was determined as shown in Figure 6.

Passenger hubs matching and path optimization of line A.
Generating the initial network based on path-by-path searches
Following the path optimization of line A, other optimal paths could be searched. Moreover, key elements that are transfer stations and connection between multiple paths should be considered in the initial path search and analyzed in the formation of initial network plan for the major town of Hefei. The resulted initial network is presented in Figure 7.

The initial network search diagram of Hefei.
Conclusion
As the initial network is the basic form of the final network plan especially in the respects of network size and morphology, it is key to the determination of ultimate rail transit network. Moreover, the important role of the initial network in modeling test on passenger flow and path optimization confirms its significance in city rail transit planning. To ensure the reliability of the initial rail transit network generated in the network planning, this article proposed an importance-based technique. It conducts the analysis from the plane, hub, and path perspectives and makes the network optimization path-by-path.
The case study of Hefei has proved its proficient advantages in three aspects. First, its analysis on the initial network is considered to be more supportive for its quantitative feature in the calculation on the overall importance and search direction weight. Second, all procedures in initial virtual network generation using the importance calculation in the indicator system, and the random search method reflect the station-anchoring theory and its aim at maximizing network coverage. Third, the application of the importance theory in the generation of the initial rail transit network makes it possible that the rail path setup and optimization are conducted quantitatively. In summary, this article contributes to both theories and practices in the rail transit research area.
