Abstract
Keywords
1 Introduction
Brain–computer interface (BCI) system based on steady-state visual evoked potential (SSVEP) has attracted extensive attention in the field of BCI because of its high information transmission rate (ITR) and low training cost [1]. The widely used SSVEP paradigm evokes electroencephalography (EEG) signals in luminance modulation. However, this paradigm is easy to cause visual fatigue [2] and contains the risk of evoking photosensitive epilepsy. Additionally, the harmonic of the EEG signal evoked by the luminance modulation paradigm is prominent, which leads researchers to avoid the overlap between the fundamental frequency of a target and the harmonic of other targets when designing stimulation frequency, thus limiting the selection of stimulation frequency [3]. Han et al. studied a paradigm with radial contraction-expansion motion as the stimulus-evoked form and developed a high-interaction BCI based on flicker-free SSVEP [4]. This paradigm uses the feature that the human visual system is susceptible to motion perception to evoke EEG signals. The luminance of the stimuli in the paradigm remains constant, reducing the subjects’ discomfort caused by target flickering [5, 6]. In addition, the EEG evoked by this paradigm has low harmonic content and more frequencies in the range of sensitive frequencies [7] that can be used to stimulate the target than the luminance modulation paradigm. Therefore, the radial contraction-expansion motion paradigm is an excellent alternative to the traditional luminance modulation paradigm.
Majority of existing research on the radial contraction-expansion motion paradigm includes comparative studies of EEG characteristics evoked by different paradigms and practical research on the radial contraction-expansion motion paradigm. The identification algorithms used are mostly canonical correlation analysis (CCA) [8]. As a statistical method for measuring the potential correlation between two multidimensional variables, CCA is widely used in SSVEP-BCI [9]. However, in standard CCA-based methods, the canonical correlation value between SSVEP and the sin-cos reference signal tends to decrease as the flicker frequency increases, leading to a decrease in the accuracy of SSVEP detection at higher frequencies [10]. In addition, due to the different characteristics of evoked signals and the EEG signal intensity evoked by the radial contraction-expansion motion paradigm being weaker than that of the luminance modulation paradigm [11], the recognition effect of trained algorithms with excellent recognition performance under the luminance modulation paradigm is not satisfactory under the radial contraction-expansion motion paradigm, such as task-related component analysis (TRCA) [12], extended canonical correlation analysis (eCCA) [13].
The power spectral density of EEG signals is usually a decreasing function, and the power is low at higher frequencies [14]. Binary subband canonical correlation analysis (BsCCA) filters EEG data into two subbands. The first subband covers all target frequencies, and the second subband contains high-frequency components that need to be enhanced. It adjusts high- and low-frequency components through subband weights to improve the recognition accuracy of high-frequency stimuli [15]. Therefore, this study proposes an extended binary subband canonical correlation analysis (eBSCCA) detection algorithm oriented to the radial contraction-expansion motion paradigm in response to the above problems. The algorithm uses the BsCCA algorithm to improve the recognition accuracy of high-frequency stimuli and combines the dynamic window method to improve the ITR of the system. The experimental results show that, compared with the TRCA and eCCA algorithms, the eBSCCA algorithm proposed in this paper can effectively improve the recognition accuracy and ITR of EEG signals.
The structure of this paper is as follows: the second section introduces the characteristics of EEG signals evoked by radial contraction-expansion motion paradigm and the proposed algorithm; the third section introduces the experimental process, parameter selection, comparison algorithm, and algorithm performance evaluation criteria; the fourth section shows the parameter calculation results, offline experiment results, and online experiment results of the dataset; and in Section 5, the experimental results and paradigm performance are discussed; the sixth section is the conclusion of this paper.
2 Methods
2.1 Signal characteristics
In the radial contraction-expansion motion paradigm, the contraction-expansion motion of the target was realized by periodically reducing and increasing the radius of concentric circles with different radii. The cosine function adjusts the change process of the circle radius. The radial contraction-expansion process of a circle can be expressed by Eq. (1):
where
As shown in Fig. 1, when the phase
The frequency at which the movement direction changes is called the flip frequency denoted as

Schematic diagram of the contraction-expansion process of the stimulus target.
where the constraint

Signal characterization analysis of offline data. (A) The average unilateral amplitude spectrogram (11, 17, and 19 Hz), the abscissa represents the response frequency, and the ordinate represents the unilateral amplitude. (B) The average SNR of each stimulation frequency. The abscissa represents the stimulation frequency, and the ordinate represents the average SNR of the EEG response at this frequency.
2.2 Proposed algorithm
This paper proposes an extended BsCCA detection algorithm for the radial contraction-expansion motion paradigm. The CCA algorithm is improved based on the characteristics of the EEG evoked by this paradigm, which is used in training scenarios. The algorithm uses the BsCCA algorithm to improve the recognition accuracy of high-frequency stimuli [19] and uses the dynamic window method to improve the ITR further.
2.2.1 BsCCA
During the experiment, the SNR of the EEG signal in the high-frequency part was lower than that in the low-frequency part, which decreased the recognition accuracy of high-frequency components of some subjects. The BsCCA algorithm proposed by Islam et al. can improve the recognition accuracy of high-frequency stimuli [19]. As shown in Fig. 3(A), the BsCCA algorithm proposed in Ref. [19] was applied to the scene without training. However, this study further applied it to the scene with training. Figs. 3(B) and (C) show the flow of the BsCCA algorithm with training.

Schematic diagram of BsCCA algorithm. (A) Schematic diagram of the flowchart of the untrained BsCCA algorithm. (B) Schematic diagram of the training process of the trained BsCCA algorithm. (C) Schematic diagram of the detection process of the trained BsCCA algorithm.
In Fig. 3,
where
Fig. 3(A) shows the flow of the untrained BsCCA algorithm: after the test data
Fig. 3(B) shows the training process of the trained BsCCA algorithm: after the EEG template
where
Fig. 3(C) shows the detection process of the trained BsCCA algorithm: the test data
where cov(·) represents the covariance between two variables; and
2.2.2 eBSCCA
Fig. 4 shows the flowchart of the eBSCCA algorithm. The decision feature consists of three parts:
2.2.3 Dynamic window
Traditional SSVEP recognition algorithms only analyze fixed-length data, such as CCA and filter bank canonical correlation analysis (FBCCA) [21]. These algorithms usually estimate the optimal data window length in offline datasets based on the performance of the BCI system. The fixed window length is then applied to online experiments [18]. However, due to the complexity and non-stationarity of spontaneous EEG signals, the optimal window length is different among different subjects [22]. In addition, the psychological status and environmental changes of subjects are also the influencing factors of the optimal window length. The dynamic window method can effectively reduce the above adverse effects and dynamically adjust the data length required by the algorithm while maintaining the high precision of the decision to achieve a higher ITR, thereby improving the performance of the BCI [23].

Flowchart of eBSCCA algorithm.
In this study, the dynamic window method sets a minimum calculation window length
where

Flow chart of dynamic window method.
Algorithm 1 shows the pseudo-code of the eBSCCA algorithm.
eBSCCA algorithm.
3 Experimental design
3.1 SSVEP dataset
In this study, the subband parameters in the BsCCA algorithm are calculated using the SSVEP dataset of the finals of the 2020 World Robotics Competition – BCI Brain Control Robot Competition Technical Competition (Dataset I). The dataset contains the EEG data of 10 subjects, and each subject participated in 6 block experiments. The first three blocks were offline experiments, and the last three blocks were online experiments. The stimulation paradigm contains 35 targets, the stimulation frequency range was 3–20 Hz, the interval was 0.5 Hz, and all the initial phases were π / 2. The experimental data was in the block, and each block was the EEG data collected continuously. A single trial lasted 5 s, including 3 s stimulation time and 2 s rest time. In the stimulation process, 35 targets were presented simultaneously, and the motion amplitude of each target changed cosine according to its predetermined frequency. Subjects looked at the prompted target during stimulation to produce the steady-state visual evoked response in their EEG signal. The trigger was recorded at the beginning of the stimulation phase in each trial. The experimental data were 65 channels of EEG data with a sampling frequency of 1000 Hz without other filtering processing.
3.2 Paradigm design
As shown in Fig. 6(A), the paradigm consists of 35 radial contraction-expansion targets with a gray background outside the stimulus targets. The target stimulation frequency was from 3 to 20 Hz, the interval was 0.5 Hz, and the initial phase was π / 2. The static image of the stimulus pattern was composed of 8 concentric rings. Each concentric ring was divided into 24 equal parts, composed of white and black lattice alternately, and the area of the white area was equal to that of the black area. The radius of the outermost circle of the stimulus target was 100 pixels. The radii of the 8 concentric rings ranged from 12.5 pixels to 100 pixels and were evenly distributed according to the difference value of 12.5 pixels. A gray area with a radius of 6.25 pixels was set in the center of the circle, and a black dot with a radius of 1.25 pixels was set in the center of the gray area to focus on the subjects’ attention. Fig. 6(B) shows the distribution of the target stimulation frequency.

Paradigm interface design and single target flip frequency. (A) Paradigm interface. (B) Flip frequency of each target.
In the contraction-expansion motion paradigm, the display’s high refresh rate is an essential factor in reducing flicker perception [4]. This study sets the screen refresh rate to 240 Hz, consistent with the 2020 World Robotics Competition—BCI Brain Control Robot Competition SSVEP Training Competition.
3.3 Data collection
Neusen W series wireless EEG acquisition system produced by NEURACLE company was used in this study. In the experiment, 64 channels EEG cap arranged according to the international 10–20 system was used. The display model used was Dell AW2518H (1920 × 1080 resolution). The Tsinghua Institutional Review Board approved the experiment, and all subjects signed an informed consent form before the experiment and received a certain reward after the experiment.
Thirteen subjects participated in the experiment, including five women and eight men. All subjects had normal or corrected to normal vision. The subjects sat on a chair and relaxed during the experiment, keeping the distance between the eyes and the screen about 60 cm. After the experiment began, the subjects were required to minimize blinking and physical activity. The experiment was divided into two parts: offline and online. The experiment took the block as the unit for data collection. Four block data were collected for offline and online experiments, and the sampling rate was 1000 Hz. Each block contained 35 trials in the offline experiment, and each target was prompted once in random order. Each trial was stimulated for 4 s, and there was 1 s to find the target between the two trials and 2 min of rest between the two blocks. The offline data of each subject was used as the training data of the subject’s online experiment. Each block of the online experiment also contained 35 trials, and each trial was stimulated for 4 s. There was a 1 s search time between the two trials and a 2 min rest time between the two blocks.
3.4 Parameter selection
In data processing, the algorithm starts to intercept data with a delay of 140 ms from the start of each trial to avoid using the low SNR data at the start of the trial [26]. In the preprocessing stage, the algorithm used a comb filter to suppress the interference of 50 Hz power frequency components [27]. In brain topography, responses evoked by the radial contraction-expansion motion paradigm were concentrated in the occipital region [4]. Therefore, the algorithm used nine channels in the occipital region for calculation (PO3、PO4、PO5、PO6、PO7、PO8、Oz、O1、O2). The all-pass subband covers all target frequencies in the range of [3, 20] Hz, and the weight was set to 1; the high-frequency optimization subband covers the high-frequency components that need to be enhanced, in the range of [13, 20] Hz, and the weight was set to 0.25.
The offline algorithm comparison used the leave-one-out cross-validation method. Each group retained one block as the test sample, and the remaining blocks were used as the training sample. The maximum calculation time of the dynamic window of the eBSCCA algorithm was set to 2 s, the dynamic window threshold value ranged from 1 to 10, the value interval was 1, and there were 10 data points in total.
The online experiment used the four blocks of the offline experiment as training data, and the leave-one-out cross-validation method was used in the training process. One block was reserved for each group as a verification sample, and the rest of the blocks were used as training samples. During the training process, the maximum calculation time of the dynamic window of the eBSCCA algorithm was set to 2 s, the dynamic window threshold value ranged from 1 to 10, and the value interval was 1, with a total of 10 data points. Finally, the dynamic window threshold with the highest average ITR was used as the threshold used by the online algorithm.
3.5 Contrast algorithm
This study used the eCCA and TRCA algorithms as comparison algorithms. The eCCA algorithm is composed of four correlation coefficients [13]. The TRCA algorithm used filter banks to decompose the SSVEP into subband components to efficiently extract the independent information embedded in the harmonic components [12]. In the literature [12], the subband setting method of the TRCA algorithm is consistent with the FBCCA algorithm, and the upper and lower cutoff frequencies of the
where
In BCI research, average ITR and accuracy are common performance evaluation indicators of algorithms. The block accuracy rate was obtained by dividing the number of correct predictions for each block by the total number of pre-tests, and the accuracy rate for each subject was the average of the block accuracy rates. The formula for calculating the ITR is [28]:
where
4 Results
4.1 Subband and weight parameter calculations
Subbands and weights were obtained according to Dataset I. The all-pass subband was set to [3, 20] Hz, covering all target frequencies and the weight
Fig. 7 shows the grid search results. The results show that the optimal ITR (77.84 bits/min) can be obtained when the frequency range of the high-frequency optimized subband and weight is set to [13, 20] Hz and 0.25, respectively.
4.2 Offline experiment results
The offline analysis was derived from the offline experimental dataset in this study. Fig. 2(B) shows the SNR of SSVEP responses evoked by different stimulus targets in the offline dataset. From Fig. 2(B), it can be observed that the SNR of the SSVEP component increases with the frequency increase at the stimulation frequency of 3–13 Hz. The SNR of SSVEP components decreased with frequency increase under stimulation at 13–20 Hz. The high-frequency optimized subband calculated according to Dataset I in this study was [13, 20] Hz, which can completely cover the frequency range where the SSVEP SNR decreases.

Grid search result. The abscissa is the weight of the high-frequency optimized subband (subband2), the ordinate is the lower cutoff frequency of the high-frequency optimized subband (subband2), and the color blocks represent the average ITR value of 10 subjects.
In Fig. 8, each brain topography map corresponds to a stimulation frequency, and the energy of each channel was calculated from the offline data corresponding to the stimulation frequency. This study normalized each subject using its maximum channel energy as a standard to eliminate the effect of energy differences between subjects. It averaged the normalized values of 13 subjects. Fig. 8 shows that, except for the response of the lowest stimulation frequency of 3 Hz, which was relatively scattered, the rest of the target responses were concentrated in the occipital area.

Normalized EEG energy distribution topographic map.
As shown in Fig. 9, under the radial contraction-expansion motion paradigm, the eCCA algorithm can obtain the best recognition effect by using the reference signal with a harmonic number of 1. This optimization parameter will be used for algorithm comparison.

The influence of harmonic number of reference signal on the performance of eCCA algorithm. (A) Average recognition accuracy. The abscissa is the time window length, and the ordinate is the average recognition accuracy. (B) Average ITR. The abscissa is the time window length, and the ordinate is the average ITR.
As shown in Fig. 10, under the radial contraction-expansion motion paradigm, the TRCA algorithm can get the best recognition effect by setting the number of subbands to 2. This optimization parameter will be used for algorithm comparison.
Fig. 11 shows the comparison results of average accuracy and ITR under different algorithms. As shown in Fig. 11(A), the recognition accuracy of each algorithm increases with the increase in decision time. Fig. 11(B) shows that the TRCA algorithm reached the highest average ITR of 75.67 ± 49.85 bits/min in the 1.5 s time window. The eCCA algorithm reached the highest average ITR of 107.66 ± 61.79 bits/min in the 1.4 s time window. Furthermore, the eBSCCA algorithm reached the highest average ITR of 151.34 ± 42.45 bits/min in the average decision time of 1.59 s.
Each algorithm’s average ITR and accuracy under the highest average ITR were taken for a paired

The effect of the number of subbands on the performance of the TRCA algorithm. (A) Average recognition accuracy. The abscissa is the time window length, and the ordinate is the average recognition accuracy. (B) Average ITR. The abscissa is the time window length, and the ordinate is the average ITR.

Comparison of average accuracy and ITR under different algorithms. (A) Average accuracy comparison. The errorbar represents the standard deviation of the average accuracy of each subject. (B) Average ITR comparison. The errorbar represents the standard deviation of the average ITR of each subject. The TRCA and teCCA algorithms contain 16 data points corresponding to the decision window length in the range of 1–2.5 s, and the interval is 0.1. The eBSCCA algorithm contains 10 data points, which correspond to dynamic window thresholds in the range of 1–10, with an interval of 0.1. Since the decision window length when the dynamic window threshold is similar to that when the dynamic window threshold is 2, the first and second data points of the eBSCCA algorithm in the figure almost coincide.
represents TRCA algorithm,
represents eCCA algorithm,
represents eBSCCA algorithm. The hollow point corresponds to the highest point of ITR of the algorithm.
Paired t-test (with Bonferroni correction) for the mean ITR and mean accuracy of the optimal performance of each algorithm.
The offline experiment included 1820 trials. As shown in Fig. 12(A), in the eBSCCA algorithm, the calculation workload of the computer increases with the increase in the calculation window length of the trial, and the average calculation workload under the maximum calculation window length was less than 40 ms. In Fig. 12(B), the calculation window length distribution of the trials changes with the subjects’ signal quality. If the signal quality of the subject is good, the length of the calculation window is small. If the signal quality of the subject is poor, the calculation window is large. As shown in Fig. 12(C), the eBSCCA algorithm uses the dynamic window method and the computer calculation workload changes with the signal quality. The better the quality of the tested signal, the less the computer calculation workload. In Figs. 12(D) and (E), the TRCA and ECCA algorithms used fixed windows for the calculation, so the computer calculation workload of each subject was less different.
4.3 Online experiment results
Table 2 shows the specific values of the online experiment results of each subject.
As shown in Table. 2, the average discrimination accuracy of the subjects reached 88.68%, and the discrimination accuracy of all subjects was greater than 80%. The average ITR of the subjects reached 158.77 bits/min. Except for subject S3, the ITRs of the other subjects were all greater than 100 bits/min. Among them, the ITR of 4 subjects reached 200 bits/min, and the average ITR of subject S8 was the highest, reaching 237.72 bits/min. Except that the average decision time of subject S3 was longer than 2 s, which was 2.22 s. The decision times of the other subjects were all within the range of 1–2 s.

Different algorithms to calculate time-consuming statistics. (A) Computer calculation workload of different calculation window lengths in the eBSCCA algorithm. The abscissa represents the calculation window length of the trial (including 0.5 s of sight transfer time), and the ordinate represents the average calculation workload of the computer corresponding to each calculation window length range. (B) Statistics on the length of calculated data used in trials in the eBSCCA algorithm. The abscissa represents the calculation window length of the trial (including 0.5 s of sight transfer time), and the ordinate represents the number of trials corresponding to each calculation window length range. (C) The time-consuming statistics of the eBSCCA algorithm with optimal parameters under the dynamic stop window length condition. The abscissa represents the computer calculation workload of trials, and the ordinate represents the number of trials corresponding to the computer calculation workload range. (D) Time-consuming statistics of eCCA algorithm with optimal parameters. (E) Time-consuming statistics of TRCA algorithm with optimal parameters.
Online experiment results of each subject (including 0.5 s eye shift time).
5 Discussion
In the current work we propose an extended BsCCA detection algorithm for the radial contraction-expansion motion paradigm. This algorithm improves the CCA algorithm according to the characteristics of the high fundamental frequency component, less doubling frequency component, and the solid low-frequency response of EEG signals evoked by the radial contraction-expansion motion paradigm. The BsCCA algorithm was used to improve the recognition accuracy of high-frequency stimuli. The dynamic window method was used to minimize the required data length to achieve higher ITR and improve the performance of BCI. The offline experimental results showed that the proposed algorithm significantly outperforms the eCCA and TRCA algorithms in the signals evoked by the radial contraction-expansion motion paradigm (
As shown in Fig. 13, the recognition performance of the three algorithms was improved with the increase in the amount of training data. Among them, the TRCA algorithm was most affected by the amount of training data. In the case of less training data, the advantages of the proposed ebscca algorithm were more obvious. Research [12] has shown that both the eCCA and TRCA algorithms are greatly affected by training data. When the number of training blocks was 3, the prediction accuracy of the algorithm was less than 60%. In the offline algorithm comparison in this paper, the number of training blocks was 3. Therefore, the lack of training data may be one of the reasons for the poor performance of the eCCA and TRCA algorithms.
For subjects S3, S9, S11, and S13, the recognition accuracy of 30 target frequencies in the range of 5.5–20 Hz has been improved to varying degrees without 3–5 Hz target frequency. In the offline data set, the time window length of the CCA algorithm was set to 3 s. After removing the target frequency of 3–5 Hz, the average recognition accuracy of subject S3, S9, S11, and S13 increased from 61.67%–68.33%, 90.83%–99.17%, 60%–70.83%, and 55.83%–84.17%, respectively. The recognition accuracy was dramatically improved, especially in subject S13, by nearly 30%. Therefore, to enhance the universality of this paradigm, the stimulation frequency range of the paradigm should be improved. Although, the flip frequency of 3 Hz is too low, and some subjects have a poor recognition effect.
6 Conclusion
The radial contraction-expansion motion paradigm is different from the traditional luminance modulation paradigm. This paradigm has a weak sense of flicker. Additionally, the subjects are not prone to visual fatigue during use, reducing the risk of photosensitive epilepsy. Therefore, this paradigm is an excellent choice to replace the luminance modulation paradigm. However, the current research on this paradigm mainly focuses on the comparative study of the EEG performance evoked by the paradigm and the practical study of the paradigm. There is a lack of research on applicable algorithms. Thus, the extended BsCCA algorithm proposed in this paper for the radial contraction-expansion motion paradigm can effectively improve EEG signal recognition performance and provide a good algorithm choice for the practical application of this paradigm in the future.

The influence of the number of training data blocks on the performance of each algorithm. (A) The abscissa represents the number of training data blocks, and the ordinate represents the average recognition accuracy. (B) The abscissa represents the number of training data blocks, and the ordinate represents the average ITR.
The radial contraction-expansion motion paradigm has been developed for spellers at the application level, and it works well. In addition, this paradigm has also been used to develop attention-driven games, and researchers believe that this paradigm can be used to train participants’ attention, which can be used in special education schools or centers dealing with attention disorders in the future [29].
