Abstract
Keywords
Introduction
Path optimization for logistics distribution is comparatively complex, and it mainly involves path selection in logistics distribution. The transport fleet, time limit, transport cost, and other conditions should be considered in the logistics distribution path optimization process. In actual logistics distribution activities, the proportion of these factors changes; therefore, many aspects of logistics distribution path optimization are researched and various algorithms are also adopted. The problems encountered in the actual distribution process should be fully considered when selecting an algorithm by incorporating the distribution capability, time, and other cost restrictions; then, an excellent distribution path solution could be obtained by the algorithm.
With the thriving development of China’s logistics industry, the logistics industry urgently needs the support of the relevant theory and technology. In recent years, China’s cities have experienced very severe traffic congestion, and the logistic distribution path optimization has not only involved simple combinatorial optimization. Since the complexity and changes of road conditions have a great impact on the logistic distribution path optimization and the path travel time would have larger fluctuations in different time ranges, the road condition changes should be considered in the path optimization research for logistics distribution. Accordingly, the research on path optimization for logistics distribution has important application value and research significance for the development of the logistics industry.
Relevant research
In the 1960s, Dantzig and Ramser proposed path optimization 1 for the first time, and its essence is embodied in the development and change of traveling salesman. In the early, the road traffic had small pressure, the impact factor was less, the calculated quantity of road net was not huge, and the relevant theory just started its development. At such period, the optimization algorithm of logistics distribution path mainly refers to the accurate algorithm in combination with the mathematical theory, including the cutting plane algorithm, branch-and-bound method and Dijkstra algorithm. 2 These algorithms mainly apply the combinatorial optimization ideas in mathematics to find out the best solution as far as possible. Later, with the development of traffic scale and category, the road net becomes huge, and the traditional path optimization algorithm has not solved the complex road net optimization. At this time, with the prosperous development of natural calculation theory, many scholars raise the heuristic algorithm by combining the natural calculation theory and path optimization. Although such kind of algorithm guarantees the best solution, it approximates the best solution to the maximum extent, and its computation efficiency boosts greatly upon the large-scale solution. Common heuristic algorithm includes ant algorithm, genetic algorithm, simulated annealing algorithm, and particle swarm optimization.3–6 These heuristic algorithms are the natural calculation of simulating the phenomenon principle in the natural world, which seeks for the best solution or similar best solution rapidly based on the priori knowledge, and the best solution is sought within the acceptable time. Recently, there are more algorithms about the neural network, especially, the rising of deep learning provides a kind of new support in theory for solving the logistics distribution path optimization.
Most algorithms adopted by the scholars of China’s colleges and universities for path optimization in logistics distribution apply the ant algorithm. Zhang Wenguang 7 applies the ant algorithm to path optimization, and the slope and congestion degree are converted into energy to consume the equivalent flat journal so that the route solution better improves the efficiency of logistics distribution. Chen Yuquan 8 proposes an ant algorithm based on an improved Pareto approach, and Li Jing 9 proposes an ant transfer strategy and a pheromone updating method to address the slow convergence rate of the traditional ant algorithm and the best partial problems to improve the convergence speed and computational time. Chen Jianjun 10 applies the ant algorithm based on the optimized mathematical model of the logistics distribution path. In the simulation experiment, such an algorithm model has a strong optimizing ability overall, with a rapid search speed. Xu Xing 11 analyses the advantages and disadvantages of the ant algorithm, introduces a genetic manipulation, and amends the pheromone updating method to solve the logistics distribution path problem; this approach effectively determines the optimal route or a near optimal route. Nie Jingjing 12 introduces the partial best strategy and the nearest-neighbor algorithm to the ant algorithm, and addresses the low optimization speed and partial convergence.
Road condition prediction model based on deep belief network
Traffic big data and their characteristics
Traffic big data mainly include the driving position, license plate, traffic flow, road conditions, surrounding environment and other information collected by road traffic vehicle cameras, global positioning systems, road cameras, speedometers, radio-frequency identification technology, and other sensors; it may also include information from transportation and express delivery services regarding distribution vehicles, routes, and delivery times of logistics companies; weather information, such as sun, rain, temperature, humidity, and wind may be provided by the Meteorological Bureau; and road construction and closure information may be sourced from road construction companies. The characteristics of traffic big data are as follows. Traffic big data consist of many types of data from a wide number of sources, and the amount of data is massive. The amount of data for a medium-sized city is approximately 500 PB per year and there is a lack of data in some areas, and there may also be errors, redundancy, and other data issues. The data have multidimensional features, such as time, space, which contains great value, and the data stream is rapidly time-varying. Therefore, the accuracy of predicting the road traffic conditions depends on the data from motor vehicles, vehicle owners, roadside facilities, expressway toll stations, traffic police, meteorological bureaus, road construction companies, etc., and more data will enable a more accurate prediction.13–17
Deep belief network
Deep learning is used in popular technical models such as the deep belief network (DBN) and convolutional neural network.
13
The DBN model is characterized by a rapid training speed and strong learning ability. This paper applies a DBN to achieve the learning of traffic data, and the traffic forecast model is constructed based on the DBN. Since the DBN model is a directed acyclic graph overlaid by many restricted Boltzmann machines (RBMs), its input data are located at the bottom unit and are visible, and its parameter values are set according to a priori knowledge, so it could also be called a probability model. The DBN could also be understood as a network model or multilayer RBM based on Bayesian probability with the following training processes:
The unsupervised training is conducted in the RBM by a self-encoded network method layer by layer to ensure that the feature vector maps to different feature spaces, the feature information is reserved as much as possible, and the pre-training of the model is completed in this process. On the last layer of the DBN, the RBM input is regarded as the input of the classifier, and the supervised training classifier conducts the back propagation (BP) of the error from top to down. The entire DBN network is slightly adjusted to complete the training process.18-20
Road condition prediction model
Combined with the characteristics of DBN technology, to solve the accuracy problem of road condition prediction, this article proposes a road condition prediction model based on a DBN, as shown in Figure 1.

Flowchart of traffic forecast.
The data pre-processing module is mainly the combination of traffic data pre-processing and road condition information (weather, time, etc.). The data pre-processing uses data repair, data replacement, data denoising, and other techniques to pre-process the abnormal data, recording errors, data null values, data distortion, etc. to improve the quality of the data, which will facilitate the model learning and improve the training speed and learning effect.
DBN training module:
The traffic condition prediction of DBN needs to consider the weather, road construction, holidays, and commuting peaks, and the traffic flow data include the traffic volume
Model construction
Considering the scale of urban traffic data, the number of layers for DBN model is set as five layers, and the structure diagram is shown in Figure 2.

DBN model structure.
The five-layer DBN structure has met the requirements for traffic data learning very well, in which the first layer is the input layer, the last layer is the output layer, and the middle layer is the hidden layer. Based on the experience of setting the number of hidden layer neurons artificial neural networks, 14 the number of neurons on the second layer is two-thirds of that on the first layer; the number of neurons on the third layer is two-thirds of that on the second layer; and the fourth and fifth layers are the inverse of the first and second layers, respectively.
The training model
This paper adopts the greedy unsupervised learning algorithm layer by layer based on the self-encode model as the pre-training process of BP algorithm. Each layer of deep learning model applies the stacked self-coded method, and the input on the rear layer is from the output of the front layer. The weight parameter on the first layer of training depth network model corresponding to the first automatic encoder is

Training of prediction model.
At the classification stage, the learning sample data with manually set labels are provided to the classifier for learning. The classifier conducts the learning classification according to the characteristic quantity set after the reconstruction according to the labeled data and conducts the classification operation according to the characteristic quantity set after the learning process.
15
First, build the training set. The vector V for the traffic information characteristics and traffic impact factor will be the new vector X after the learning of the DNN model. It is predicted that the model outputs 10 traffic rank values (0–9), in which level 0 indicates the failure of a passage, and level 9 indicates the rare vehicle at the traffic road section, which passes unimpeded. In accordance with the rank value set manually, set the class label,
Then, the classifier predictions are conducted with the solution calculation method. The SoftMax model is promoted by and changed from the logistic regression model in the multi-classification research. The training set is input with the label vector set, L, and a method with more repetitions could be selected. The setting method of the hypothesis function is used to evaluate the probability value,
In the above formula, the
Road condition prediction algorithm
The unsupervised learning process of the model is the core process of the road condition prediction deep belief network traffic forecast algorithm (DBNTFA), and its specific steps are as follows:
Input: Training sample data queue ((
Output: Weight coefficient [
Step 1: Select an initial value of the weight coefficient matrix, W, and a learning rate value, μ;
Step 2: Use the unlabeled training data {
Step 3: Determine the labeled training data queue (
Step 4: Use the output layer
Step 5: Globally optimize the weights [
Step 6: Output the final weight matrix [
Logistics distribution path optimization algorithm based on road condition prediction
Path optimization concept
The DBNTFA algorithm and logistics distribution path optimization are combined to realize the DBNTFA-based logistics distribution path optimization. Its basic concept is to solve the time-share traffic class value,

Flowchart of logistics distribution path optimization based on DBNTFA.
Time-share weighted traffic network
The time-share weighted traffic network can be obtained by introducing the time-share weight into the topology of the traffic network, and the time-share weight could be solved on the basis of the traffic rank values and road section length. The weight could be understood as the combination of the passing time and risk. The expression of the time-share weight,
In formula (5),
A road section is the basic unit in the time-share weighted traffic network and is defined as follows in the traffic network: a road section is the minimum unit of a road, and there is no intersection leading to other road sections in the middle of a road section. The time period is determined by the updating cycle of the time-share weighted traffic network, and it is set according to the collection interval for the traffic data. The traffic data are collected every 5 min. The updating cycle of the rank value for the corresponding traffic network is 5 min. The weight in formula (5) is substituted into the traffic network to solve the time-share weighted traffic network.
Improvement of ant algorithm
Logistics distribution path optimization is solved by comparing the time-share weighted traffic network and the traditional ant algorithm with the following distinction aspects: in the traditional ant algorithm, the ant could directly visit other cities when starting from one city, and could only visit the conjoint road section in the traffic network; the path length of traditional ant algorithm is fixed, and the weight of the time-share traffic network is updated; the starting point in traditional ant algorithm includes all urban nodes, and the starting point in the time-share traffic network is the distribution point. The city and path length in ant algorithm are corresponding to the intersection node and road section weight in the time-share traffic network, respectively.
Formula (6) for the degree of expectation is defined as follows, and formula (7) is logistics distribution path formula. Substitute formulas (6) and (7) into formula (8) of the moving probability,
16
to obtain formula (9) that ant moves from one intersection node to another intersection node
The variable quantity of pheromone is
The pheromone on the line should be updated after the ant algorithm accesses the delivery point once (with time n)
In the above formula,
Through the above improvements, in the time-share traffic network, the following improved ant colony optimization algorithm is obtained as follows:
Input: Accessible node set {Rn}, delivery point [p]
Output: Optimal path S
Step1: Initialization time T = 0, cycle number
Step2: Obtain the current intersection accessible node set {Rn} from the traffic network, and move it to the next intersection node according to Formulas (11) and {Rn};
Step3: Update the taboo table, and place the intersection node moved in Step (2) into the taboo table;
Step4: Step 2 and step 3 are executed cyclically, and this iteration ends when the ant stops moving;
Step5: Calculate the weight and Wa of walking route of each ant that successfully accesses all delivery points, and update the road segment pheromone and the optimal path S according to Formula (7).
Logistics distribution path optimization algorithm
The road condition prediction and logistics distribution path optimization algorithm based on DBN may be referred to as the Deep belief network traffic forecast path optimization (DBNTFPO) algorithm. This algorithm is a hybrid new algorithm, which combines the advantages of both time-share weighted traffic network and improved ant colony algorithm to make the systematic predictions of urban road conditions, so as to give the optimal solution for the logistics distribution path optimization problem under complex road conditions. The DBNTFPO algorithm can be described in two parts: the construction of time-share weighted traffic network [
Inputs: traffic feature data [
Output: time-share weighted traffic network [ Construction of time-share traffic network
Use the traffic characteristic data [ Substitute the time-share traffic rank value The weight of logical structure [ Output the time-share weighted traffic network [ Solution of distribution route
(v) Initialize the time-share weighted traffic network [ (vi) Put m ants into (vii) Output the optimal route S.
The DBNTFPO algorithm combines the DBNTF algorithm and the improved ant algorithm. Although the process is more complex, after one-time solution, the route solution could be obtained for the future solution after the completion of Algorithm 1.
Algorithm analysis
The DBNTFPO algorithm consists of two parts: the construction of time-sharing weighted traffic network [
Test results and analyses
Application examples
For an intelligent distribution platform, if a large-scale logistics company stored approximately 3.9 PB of big data such as the information for the vehicle, cargo, customer, finance, road traffic, weather, and road condition, we could extract the road condition prediction-related parameters and distribution-related information from approximately 1 PB of data over the past 5 years, including the road segment number, time period, average speed, traffic volume, weather level, wind speed, holidays, weekends, vehicle number, delivery point, intersection node, and other information. Using Hadoop’s big data processing platform and Map Reduce, we extracted the traffic information, weather information, and road condition information of 1012 roads from traffic big data. Among them, the information of 712 road lines is used as the training set, with 128 road lines used as the evaluation set and 172 road lines as the test set.
Experimental environment and parameter settings
The experimental hardware consists of 6 Blade servers (256 CPUs), 2 T K40 GPU graphics accelerator cards, and the cloud platform with 2 PB capacity. The software includes Linux (CentOS 7_x64), hadoop-2.7.2, jdk1.8.45, Eclipse 4.3.2 and MATLAB 2012 R.
One training period is one-half year (180 days), and the traffic data of 5–22 points per day is selected as the sample time range with a data sample collection interval of 5 min. The conversion formula for time (Time) and the time period,
Some training sample data.
Road condition prediction results and analysis
In this paper, the DBNTFPO algorithm is compared with the historical average method and support vector machine (SVM) algorithm for the road condition prediction experiments. Figure 5 shows the comparison chart of the three algorithms in terms of the traffic class value prediction accuracy.

Prediction accuracy comparison chart.

Accuracy under different road conditions.

Different traffic class value prediction accuracy.

Distribution time figure in three methods within 1 week.
As shown in Figure 5, when the learning period is short, the road condition prediction accuracy of the proposed algorithm in this paper is relatively poor, and the road condition prediction based on the SVM is more accurate. However, with the increase of the learning period, the prediction accuracy of this algorithm is greatly improved and finally stabilizes at approximately 86%. The accuracy of the SVM decreases with the increasing training scale, which is approximately 71%. The calculation of the historical average method is simple and convenient, and the accuracy does not change much with the increase of the data, and the accuracy is low. The reason is that the scale of the study set is small at the beginning stage, and the road condition characteristic factor effect learning is not enough. In addition, the depth model is under-fitting, and the prediction effect is not ideal. When the training set is increased, the deep learning model has a high degree of training, and the prediction accuracy is correspondingly improved.
As shown in Figure 6, the accuracy for the normal working day is relatively stable, but its accuracy is lower than the other two cases, and the accuracy is the highest in rainy weather. The reason is that, under normal circumstances, the road traffic conditions have fewer influencing factors and the road conditions are more random, so the prediction accuracy is relatively low. In rainy weather, the road traffic is obviously affected, and various factors have a greater impact on the road conditions. Nanning is in a rainy area, so the data training is relatively adequate, and the prediction accuracy is relatively high.
As seen from Figure 7, the prediction model can realize accurate results in the faulty road section and can achieve accurate predictions in the congested road section and the smooth traffic road. However, when the road conditions are irregular or the features are not obvious, the prediction accuracy is correspondingly reduced. Table 2 shows a comparison of the performance for three different algorithms. The prediction accuracy values of the algorithm proposed in this paper are the highest in the various road conditions and are all above 90%, and the historical average method has the worst prediction effect.
Performance comparison of several road condition processing methods.
DBNTFPO: deep belief network traffic forecast path optimization; SVM: support vector machine.
Distribution path optimization test results and analysis
Now, take 1 week (7 days) for the distribution optimization test. There are 30 delivery vehicles every day to deliver the goods to 27,000 delivery points; the delivery time is from 7:00 a.m. to 19:00 p.m., and the distribution logistics path optimization algorithm proposed in this paper, ant algorithm, and driver experience methods are used to determine the distribution time, as shown in Table 3.
Number of solutions for each method in a week.
DBNTFPO: deep belief network traffic forecast path optimization.
Average schedule.
DBNTFPO: deep belief network traffic forecast path optimization.
The path length of traditional ant algorithm is fixed and is not updated after one-time solution, and the driver could select the path depending on the driving experience. The DBNTFPO algorithm generates plenty of route resolutions by depending on the abundant traffic data advantage. The average quantity value of road section for such three algorithms has little difference, because different road sections have a very large difference in road condition, the road section contains many road sections, and the time may be short uncertainly. These route solutions are evaluated on the basis of the distribution time.
The distribution time data are recorded and collected by the distribution driver. Based on Figure 8, the algorithm in such paper has a relatively obvious time advantage, and the distribution time is relatively stable. Traditional ant algorithm includes the single route, and has larger change and fluctuation in time, with poor anti-risk capability. The distribution time is also unstable in the driver experience method, because the drivers mainly select the path according to individual experience, with the relatively stable distribution, but the path is not the optimal.
As seen from Table 4, the traditional ant algorithm has become not suitable for the path optimization problem under the complex road condition, and the algorithm in this paper could effectively reduce the distribution time compared with the distribution experience of drivers. In the actual logistics distribution, the algorithm in this paper has excellent applicability and effectiveness, and the time complexity is consistent with the theoretical analysis.
Conclusion
The innovation point of this paper is to introduce the big data and deep learning method into the path optimization problem. All traditional path optimization algorithms are mostly based on the simulation theories, which disjoint with the actual application. This paper uses the deep learning method to study the traffic big data, extract the feature factors of road condition, and accurately predict the road condition in combination with the classifier. This paper solves the poor practical application problem of traditional algorithm, and provides a new thought for solving the logistics distribution path optimization problem. The development of deep learning brings new solution to many fields. Thus, in future, as the technical problems of deep learning is solved, more efficient method will be brought to the logistics distribution path optimization, and it will change the model of logistic distribution path optimization.
