Abstract
Keywords
Introduction
Over the last decades, composite laminates have become the predominant structural material in various engineering applications. Nowadays, the quest to develop safer and lighter structures still fosters the scientific community to investigate different damage mechanisms in composite materials and their reciprocal interaction. However, despite the impressive amount of research, open questions are still present, and the understanding of the physics behind failure modes in composites is limited. 1 Moreover, composite structures are particularly susceptible to flaws arising from the manufacturing process and service and exhibit complex failure modes as opposed to metals. Among them, delamination constitutes one of the most common damage mechanisms and can also occur in adhesive bonds. 2 Using large safety factors mitigates the risks of catastrophic structural failure but leads to heavier designs and might not be deemed sufficient to guarantee safety.
Consequently, delamination growth represents a severe threat to structural integrity in carbon fiber reinforced polymer (CFRP) structures, and it becomes necessary to implement Structural Health Monitoring (SHM) strategies. SHM can provide essential information about delamination existence, location, and size. Moreover, it can also deepen the understanding of other correlated damage mechanisms and thus promote the introduction of innovative composite materials and structures. 3 SHM offers a wide range of techniques, each with its strengths and weaknesses depending on the application.
Among them, Optical Fiber Sensors (OFS) provide numerous advantages over traditional strain sensing techniques. For example, they are intrinsically immune to electromagnetic interference; they have large bandwidth, which enables multiplexing solutions; and can survive harsh environmental conditions if protected with appropriate coatings and cable sheaths. Alj et al. provide further details about the durability of OFS. 4 Moreover, their lightweight and small size allow them to be embedded in composites 5 as well as 3D printed structures, 6 and they have recently been shown to be a viable alternative to accelerometers for modal analysis. 7 Recent advances in optical fiber technology fostered the use of Distributed Optical Fiber Sensors (DOFS) based on Raman, Brillouin, 8 and Rayleigh backscattering. DOFS based on Rayleigh backscattering are particularly promising for monitoring damage, such as delamination growth, in composites since they provide millimeter resolution along the fiber length within several meters of range. 9
However, assessing the damage detection performance of optical fibers is not a straightforward process. The optical fiber datasheet specifies the geometrical, mechanical, and optical properties. On the other hand, the interrogator datasheet provides the resolution, wavelength range, wavelength stability, maximum sensor length, measurement uncertainty, and sampling rate. Nevertheless, these metrics do not directly assess strain-based damage detection performance since damage is not a physical quantity that can be directly measured. Indeed, Axiom IVa of SHM states that sensors cannot measure damage and that a feature extraction process is needed to obtain damage-related information. 10 For example, considering delamination monitoring, the detection performance is expected to change depending on the loading conditions because they affect the damage-induced strain in the structure. In addition, depending on the strain transfer occurring from the structure to the fiber core, 11 DOFS may exhibit different detection performances. The current literature lacks well-established methodologies for certification and performance evaluation for damage detection, preventing the adoption of this technology in many applications.
The performance of Non-Destructive Evaluation (NDE) methods, widely accepted in many industries (aerospace, automotive, oil and gas, medical, and marine, to name a few), is evaluated following the guidelines provided in the MIL-HKBK-1823A. 12 First, damage detection performance is quantified using Probability of Detection (POD) curves and Probability of False Alarm (PFA). Furthermore, varying the threshold value makes it possible to evaluate the POD against the PFA and obtain the so-called Receiver Operating Characteristic curve.13,14
It is legitimate to ask whether these NDE reliability metrics can be applied to SHM. The naive application of POD curves in SHM would lead to inconsistent results. One of the most critical differences between NDE and SHM is their variability sources. The human factor represents the highest variability contribution in NDE, whereas SHM is affected by both temporal and spatial sources of variability. Moreover, SHM is typically characterized by repeated measurements over time, implying that the independent measurement assumption used in NDE does not hold. 15
Meeker et al. 12 reviewed and proposed statistical methods for SHM, 16 extending the theory described in the MIL-HKBK-1823A. The authors demonstrate that the Length at Detection (LaD), 17 and the Repeated Measures Random Effects Model (REM2), 18 are valid statistical methods to handle SHM data. However, in both cases, the lack of data often hinders their applications since it is challenging to manufacture and test many structures equipped with identical sensing systems. Model-Assisted POD (MAPOD) curves can reduce the amount of requested experimental data. They allow the modeling of many types of variability sources, but the computational cost can be prohibitive due to the curse of dimensionality. Surrogate modeling can mitigate this problem and is already available in software such as CIVA.19,20
A systematic literature review by the authors 21 shows that in SHM, POD curves were mainly applied to Guided Waves14,15,22–29 and occasionally to other techniques such as Comparative Vacuum Monitoring,17,30 Acoustic Emissions,31,32 and Carbon Nanotubes. 18 Supplemental Table A1, in the Appendix of Falcetelli et al., 21 highlights that only a few POD studies on OFS are present, and no POD studies on DOFS are available. Grooteman developed a numerical model of a three-stringer thermoplastic composite panel installed with Fiber Bragg Gratings (FBGs) and computed the frequency shift in the eigenmodes. Then, using the modal strain energy as a damage indicator, they generated a POD curve using the hit/miss approach. 33 Sbarufatti and Giglio 34 developed POD curves to quantify the performance of FBGs bonded onto an aluminum stiffened panel in terms of minimum detectable crack length. In this work, the authors compared the confidence interval for a population proportion method 35 with the one-sided tolerance interval (OSTI) for a normal distribution. 36
To the best of the authors’ knowledge, no study presents a rigorous methodology for qualifying DOFS in different scenarios using POD curves. A first attempt at generating POD curves for DOFS was explored by Falcetelli et al. 37 This study profoundly extends the preliminary research activity by establishing a systematic methodology based on the LaD method to qualify the damage detection performance of DOFS. The method is validated considering the use of DOFS for monitoring delamination in composite structures as a case study. Specifically, Single-Mode (SM) DOFS with ORMOCER® coating and Graded-Index Multimode (GIM) DOFS with a dual acrylate coating are surface mounted onto CFRP double cantilever beam (DCB) specimens under Mode I quasi-static and fatigue loading.
POD curves developed with the LaD method are used to evaluate the performance of the monitoring system of the two DOFS types in the two loading conditions. The results confirm that both strain transfer and loading conditions affect POD curves and prove that the proposed methodology can quantify the damage detection performance of DOFS in different scenarios.
Moreover, the authors introduce a practical approach to evaluating the required number of specimens based on the expected level of uncertainty. This method is two-fold since it can also serve for comparing POD curves generated from different sample sizes introducing the concept of virtual specimens. This dual functionality might be of great help in real applications where it is rare having homogeneous datasets.
The final aim of this article is not to promote the use of specific DOFS for delamination detection, rather is to develop a comprehensive methodology to assess the performance of DOSF and show the implications of making a POD study in SHM using DOFS. The proposed methodology aims to provide the SHM community with a reference procedure required to deploy DOFS in composite aircraft structures.
The article is organized as follows: Section “Materials and methods” presents the methodologies of this work, including the basic notions about POD in SHM, the method to estimate the required number of specimens and to compare POD curves produced from various sample sizes, the working principle of DOFS with Rayleigh backscattering, and the experimental methodology employed in the study; Section “Results” presents the results of the research; Section “Discussion” discusses the results and highlights their implications; Section “Conclusions” retraces the main stages of the article and suggests potential future research activities.
Materials and methods
POD for SHM: LaD method
The MIL-HKBK-1823A 12 describes how to derive POD curves using the famous â versus a method. As an example, Figure 1 shows the output of this methodology using synthetic data.

Example of the
The critical
Different from NDE, where all the observations can be considered independent, in SHM, engineers may have to deal with repeated measurements. In this case, the hypothesis of statistical independence of the observations does not hold anymore, making it impossible to apply the standard linear regression model used for POD in NDE studies. 21
The LaD method avoids the issue of dealing with repeated measurements by considering only one observation among the time series of data. Similar to the standard method proposed in the MIL-HKBK-1823A, 12 detection is positive if a certain measurement is above a certain threshold. However, only the measurements at which damage (delamination in this study) is detected for the first time are considered. The crack lengths at detection lay on the threshold, potentially following different statistical distributions. Assuming that the population is normally distributed, the POD curve can be determined as a function of the crack length using Equation (1):
The symbol
The
It should be noted that the LaD is not the only method to derive POD curves. For instance, the REM2 method16,18 is a valid alternative that allows more efficient data use. Nevertheless, when less than ten observations are available, issues arise to fit a five-parameters model such as the REM2.21,39 In these kinds of studies, where testing many specimens or even real structures becomes costly and time-consuming, the LaD approach seems more appropriate.
Required number of specimens
Here the authors propose a practical scheme to assess the required number of samples,
Pseudo-code for estimated sample size.
Where
Simulating the effect of virtual specimens on the lower bound
As the
The first step is to use the LaD method to compute the value of
Specimens manufacturing
The DCB coupons were produced following the guidelines described in the ASTM D5528 standard. 41 The AS4 HexPly 8552® unidirectional carbon prepreg 42 was employed to fabricate a 300 mm square panel with [024] stacking sequence by hand layup. A 12 µm TeflonTM film was placed during lamination at the panel mid-plane. This non-adhesive insert served as an initiation site for the delamination, providing an initial crack length of 50 mm. The specimens were cut from the panel utilizing an automated Proth® cutting machine such that 25 mm strips were obtained. Ad hoc loading bocks were machined, matching the specimens width of 25 mm. Before bonding, the loading block surface was sandblasted, whereas the bonding surface of the specimens was slightly scrubbed with traditional sandpaper. Impurities were removed with an alcoholic solution, and the 3M™ Scotch-Weld™ EC-9323 structural epoxy adhesive 43 was used for bonding.
Optical fiber sensors
Two types of DOFS were used in this study: SM OFSs with ORMOCER®
44
coating, produced by FBGS Technologies GmbH (Jena, Germany), and GIM DOFS, produced by Plasma Optical Fibre (Eindhoven, The Netherlands). These were connected via LC/APC connectors to an ODiSI-B,
45
developed by LUNA Innovations Inc. (Roanoke, VA, USA). The interrogator uses swept-wavelength coherent interferometry to measure Rayleigh backscattering,9,46,47 which originates as a result of non-propagating material-density fluctuations.
48
The scattered light exhibits a repeatable profile that is sensitive to longitudinal strain,
Where
Experimental setup
There are a large number of studies proposing analytical solutions for DCB specimens. The simplest analytical solution considers the DCB arms as cantilever beams clamped at the crack tip. 49 Both the Euler–Bernoulli beam theory and the Timoshenko beam theory can be used, with the latter providing more accurate results (Euler–Bernoulli-based solutions are a special case of Timoshenko-based solutions if the shear stiffness becomes infinite). 50
The plot in Figure 2(a) shows a qualitative representation of the theoretical (Euler–Bernoulli solution) and expected measured experimental strain profiles along the longitudinal direction (

DCB specimen geometry and the optical fiber layout.
Figure 3 shows an example of a DCB specimen used in the fatigue test and the DOFS positioned above its top surface. The bonding of the DOFS was achieved using ThreeBond 1742® cyanoacrylate adhesive. 51

Specimen example used in the fatigue test.
Before testing, one side of the DCB coupons was coated with a thin layer of white spray paint. After drying, 1 mm spaced vertical lines were used as a reference for visually estimating the crack length from the camera. An extra vertical mark is placed at the crack tip after the pre-cracking procedure explained in the D5528 standard. 41 Figure 4 shows a picture captured from a 9-Megapixel camera positioned in front of the specimen.

Crack length estimation in the DCB specimen.
The true crack length is estimated by exploiting its relationship with the compliance
Where

Compliance calibration.
Data structure
Acquired strain data during static and fatigue tests of the
Where
Similarly, the crack length is organized in a vector
Static tests
A Zwick—20 kN tensile test machine was used for static testing, as shown in Figure 6.

DCB specimen installed in the Zwick—20 kN tensile test machine.
The Zwick software was set up to synchronize the LUNA system and the camera. The tensile load is applied at a displacement rate of 1 mm/min. A sampling frequency of 0.5 Hz was used to collect data. The first experimental campaign used a total of five DCB specimens equipped with SM OFSs with ORMOCER® coating. Since three optical fiber segments are bonded onto each specimen, the number of linear regressions used to build POD curves can be multiplied by three.
The same methodology was applied in a second experimental campaign, where six specimens equipped with GIM DOFS were tested. Preliminary results revealed that GIM optical fibers are more sensitive to small bending radii. As a result of the repeated bending of the optical fiber, the configuration shown in Figure 2 would have resulted in an unsatisfactory signal-to-noise ratio. Therefore, in this case, only one central optical fiber segment was bonded in the specimen.
Fatigue test
An experimental fatigue test campaign was carried out on three specimens, where SM DOFS with ORMOCER® coating were surface bonded using the scheme previously shown in Figure 2. The DCB specimens were mounted in an MTS—10 kN Elastomer hydraulic test machine equipped with a 10 kN load cell. The whole experimental setup is shown in Figure 7.

Fatigue test setup.
The fatigue tests were performed in load control. Figure 8 shows a schematic overview of how the cycling loading was applied to the DCB specimens. Preliminary fatigue tests using DCB specimens manufactured from the same CFRP laminate were performed to assess the optimal load level to be used during fatigue testing. This preliminary study found that 80% of the pre-cracking load was the optimum load level for delamination growth. Lower loads would have led to very slow delamination growth, whereas higher loads would have resulted in an unstable delamination growth, which is not suitable for developing POD curves.

Fatigue test loading and measurement scheme.
The MTS software was programmed to reach 80% of the pre-cracking load,
Damage index definition
The first step in developing POD curves is to identify a proper damage-sensitive feature. From theory, it is possible to predict that the stress field reaches its maximum compressive value at the crack tip. Therefore, the strain value at the crack tip is a potential damage-sensitive feature. For a generic delamination value, and thus a generic time value
Figure 9 shows an example of the strain profiles obtained using DOFS at different times in the static test profile. The black stars, placed in correspondence with the lower peak of each strain profile, highlight the crack tip position and its relative propagation as delamination grows. Due to the non-linear strain transfer occurring between the specimen and the optical fiber,
11
and the distortion in the measured strain due to the interrogator resolution, the strain does not decrease linearly with the delamination length. The linearity is restored by applying a logarithmic transformation to the damage index. The new definition of the damage index at generic time value,
Then, it is possible to define a damage index vector

Strain profiles generated from DOFS analysis.
Results
Static test
SM optical fibers
Figure 10 shows the application of the LaD method to SM DOFS with ORMOCER® coating. The abovementioned damage index behaves linearly with respect to the crack length, and linear regression is performed for every damage index vector

LaD method applied to SM DOFS data for crack detection.
In Figure 10 the abscissa assumes zero value at the onset of the bonding length of each DOFS segment. The threshold was chosen by quantifying the noise level in preliminary experiments. Precisely, three standard deviations related to noise data were summed to the highest intercept of the regression lines. This procedure avoids negative lengths at detection, which would be the equivalent of saying that the crack was detected before it reached the bonded region of the DOFS, which should not be possible in principle.
The normality assumption of the lengths at detection can be verified using the Anderson Darling test (Figure 11). The null hypothesis, H0, states that the data follow a normal distribution. The null hypothesis can be rejected if, for a certain significance level,

Darling–Anderson test.
Under the assumption that the crack lengths at detection, denoted as black squares in Figure 10, follow a normal distribution, it is possible to build a POD and its relative lower bound by applying Equations (1) and (2), respectively. Figure 12 shows the POD that was obtained using this methodology.

POD and its lower 95% confidence bound of SM DOFS.
The identified values for
GIM fibers
The same methodology used for SM DOFS in Section “SM Optical Fibers” is now applied to the static test data obtained with GIM DOFS. The LaD results are shown in Figure 13. Although it is difficult to verify the normality assumption using the Anderson–Darling due to the low number of samples, the collected data are enough to show how a different strain transfer performance affects the resulting POD curve. The GIM DOFS has a dual acrylate coating whose stiffness is lower than the ORMOCER® coating of SM DOFS. This results in a lower strain transfer performance and a higher discrepancy between the real strain (the one present on the specimen surface) and the measured strain (strain present in the fiber core).

LaD method applied to GIM DOFS data for crack detection.
Figure 14 displays the corresponding POD curve with

POD and its lower 95% confidence bound of GIM DOFS.
Applying the method proposed in Section “Required number of specimens,” it is possible to show the convergence of the lower bound as the number of specimens increases (Figure 15). For example, when the number of specimens is equal to 97, the

Lower bound convergence as the number of specimens increases.
Fatigue test
Figure 16 shows that the LaD method was applied to fatigue test data. The variability within segments of the same specimens and between different specimens is more pronounced than for the static case, even if the same type of SM DOFS was used (with ORMOCER® coating).

LaD method applied to SM DOFS data in fatigue loading conditions for crack detection.
The corresponding POD curve is shown in Figure 17, with

POD and its lower 95% confidence bound of SM DOFS in fatigue loading conditions.
The data highlight that both variability sources due to between-specimens and within-specimen heterogeneity are present. The first two specimens (black and blue color in Figure 16) have more data points with respect to the static case because of the large number of samples acquired every 500 cycles. On the other hand, the third specimen (magenta color in Figure 16) has few data points because the crack propagated beyond the bonded region of the DOFS right after the application of the pre-cracking load and propagated faster than in the previous two cases.
Compared to the static case, the measured strain is lower because the specimens were fatigue loaded at 80% of the pre-cracking load,
Discussion
Comparison of POD curves
Table 2 summarizes the results in terms of
Summary of
SM: Single-Mode.
Different optical fibers in the same loading configuration exhibit different
Comparison of POD lower bounds with virtual samples
The difference between
In this study, the number of DOFS segments (samples) in each case is different. This situation is likely to occur in real applications due to the availability of different DOFS or, for example, a limited amount of time to perform fatigue tests compared to static tests.
As described in Section “Required number of specimens,” since the tolerance factor
Nevertheless, it would be interesting to compare the results obtained in this research by having the same number of samples for each case study. Referring to the procedure outlined in Section “Simulating the effect of virtual specimens on the lower bound,” the authors virtually augmented the number of samples of the different case studies to 30 units. Under the assumption that the experimental data correctly captured the variability sources involved in the experimental setup, this methodology allows a fair comparison between the different cases, eliminating a potential bias error due to the different sample sizes. Applying this procedure to Table 2, one obtains the results in Table 3.
Comparison of
SM: Single-Mode.
The results confirm what is already seen in Table 2, even if the differences within the case studies are less accentuated.
Interpretation and implications of the results
The DOFS type proved to be a determinant factor in the POD analysis, which can be directly correlated to different strain transfer properties. On the other hand, the loading type is also shown to be a key variable. This is not surprising since DOSFs are sensitive to strain which depends on the applied load. The higher scattering in the fatigue data can be attributed to a lower signal-to-noise ratio. First, the test itself involves a higher amount of noise due to vibrations. Second, the crack propagates at a lower load, thus further reducing the signal-to-noise ratio. Moreover, the mechanisms involved in delamination growth are different in fatigue loading compared to quasi-static loading. 2 For example, a different amount of fiber bridging can affect the strain field in the process zone, 53 thus affecting the damage index and the POD parameters.
This result suggests that also the loading mode could potentially lead to different POD curves. Indeed, different mode mixites of Mode I and Mode II would affect the process zone and the strain profile, thus affecting the damage index. In such a case, a novel and more appropriate damage index should be developed because the strain at the crack tip may no longer be the best damage-sensitive feature.
Temperature variations are not considered in this study but are expected to be determinant in the POD analysis due to the relation between Δ
Upscaling POD curves
In real applications, it could be inconceivable to test a sufficiently high number of structures to perform a statistically consistent POD study for DOFS. Indeed, one should be able to produce and replicate a large number of identical complex structures, each equipped with an identical DOFS setup. Even though the proposed methodology was developed considering DOFS in laboratory case studies, it offers a framework for assessing POD curves in real applications in two different ways.
First, it is possible to use the same methodology as a basis to derive MAPOD for DOFS. This could be achieved by simulating the outcome of the LaD method given the noise level, the loading conditions, and the strain transfer properties of the DOFS-structure mechanical system. The variability sources can be modeled assigning a certain probability distribution to the most critical parameters.
Second, POD curves obtained at a coupon level could be transferred at a structure level to monitor a specific damage type. The objective is to use the proposed methodology and build an experimental setup that mimics the local perturbation caused by damage in the strain field of a real structure. For example, in a hot spot monitoring scenario, where the structure is expected to fail due to mode-I delamination, the POD curves obtained from equivalent DCB specimens can provide an acceptable estimate of the damage detection performance of the system in the real application.
Conclusions
To the best of the authors’ knowledge, this is the first time an experimental POD study has been performed for DOFS based on the Rayleigh backscattering. The study proposed a methodology to develop POD curves using the LaD method focusing on delamination, which is one of the major causes of failure for composites. Mode I static and fatigue loading experiments were performed on DCB specimens with two types of DOFS (SM fibers with ORMOCER® coating and GIM fibers with dual acrylate coating).
Probably, better POD curves could be obtained by using stiffer adhesives, redesigning the experimental setup to have lower noise, or using DOFS with higher strain transfer properties. However, the case studies that have been shown only serve as examples to show the implications of performing a POD study in SHM using DOFS. The goal is to develop an easily reproducible methodology to assess the performance of DOSF and to bring the attention of the SHM community to this topic which is often underestimated.
The following bullet points summarize the main finding of this research:
Both loading conditions and DOFS type affect the performance in delamination detection
POD curves for DOFS can also be sensitive to different loading modes, damage types, and laminate stacking sequences, dramatically increasing the problem complexity compared to classical NDE applications.
The LaD model proved effective in producing POD curves for DOFS, but the normality assumption is difficult to verify as the sample size decreases.
Other POD models, such as the REM, do not require any normality assumption but are difficult to fit with small sample sizes.
In many cases, the only feasible solution is to derive a MAPOD. The proposed framework, combined with preliminary knowledge regarding the most frequent damage modes in the structure, could be used to develop MAPOD for DOFS.
The study provides a practical approach to estimating the required number of samples for the POD study.
The same approach can be used to simulate the lower bound convergence, imposing a certain number of virtual samples to compare POD curves obtained from different sample sizes. Caution must be taken in interpreting the results since the underlying assumption is that the available samples properly captured the variability.
The presence of unexpected variability sources, which are not captured in the experiments, such as varying EOCs, leads to unconservative results.
Based on the finding of this work, further research is needed and should be devoted to the following aspects:
development of multi-dimensional POD curves varying the mode mixites between Mode I and Mode II for delamination;
development of a MAPOD framework for DOFS;
link the concepts of strain transfer and POD curves;
development of compensation strategies for varying EOCs, sensor drift, and other variability sources potentially affecting POD curves.
analysis of upscaling potentialities and limitations of such methodology, from both structural complexity and loading complexity aspects;
The final aim of this work is to spark a constructive debate in the SHM community about developing the most appropriate methodologies to certify DOFS for damage detection using POD curves.
