Abstract
Introduction
Cumin (

Plant diagram A and medicinal materials diagram (seed) B of
As a TCM, the medicinal properties of cumin are warm and the flavor is pungent. It has been used to treat stomach pain, dyspepsia, flatulence, hoarseness, toothache, hypertension, scorpion bites, weight loss, jaundice and diarrhoea.2,4 It can be administered alone or in combination with other Chinese herbs. 2 Moreover, modern research has demonstrated various pharmacological effects of cumin, including antioxidant, antibacterial, insect-killing, anti-inflammatory, pain-relieving, antitumor, platelet disaggregating, and antiallergic activities3,5 as well as, tension-preventive, hyperglycemic, hypoglycemic, antihyperlipidemic, bronchodilatory, immunomodulatory, contraceptive, anti-amyloidogenic, 6 anti-osteoporotic, antifungal, aldose reductase, alpha-glucosidase, 7 tyrosinase inhibitory, and neurovs protective properties. 8 Relevant studies also indicate that cumin lowers blood glucose in diabetic rats and reduces plasma and tissue levels of cholesterol, phospholipids, free fatty acids, and triglycerides.6,7,9
The primary reason cumin may help treat diseases is its close link between chemical composition and pharmacological effects. Cumin contains about 10% fatty oil, volatile oil, protein, cellulose, sugar, minerals, and elements.10–12 Its volatile oils have been extensively studied, which cuminaldehyde identified as the major bioactive component.
13
Additional constituents include cymene, cuminic alcohol (cuminol), γ-terpinene, safranal, limonene, eugenol,
Materials and Methods
Instrument
Q Exactive Plus Orbitrap High Resolution LMS (Thermo Fisher), U3000 Ultra High Performance Liquid Chromatography and Automatic Sampler (Thermo Fisher), ACQUITY UPLC HSS T3, 2.1 × 100 mm, 1.8 μm column (Waters Company), Agilent 7890A gas chromatograph combined with a 7000B triple-quadrupole MS detector (Agilent, Palo Alto, CA, USA), HP-5MS capillary column (30 m × 0.25 mm ×0.25 μm), Vortex 2 Genie Vortex Mixer (Scientific Industries), 5810R Low temperature centrifuge (Eppendorf), WD-9415C ultrasonic cleaner (Beijing Liuyi Instrument Factory), AL 204 Electronic Analytical Balance (Mettler-Toledo, USA), KQ-250 Ultrasonic Cleaner (Gong Yu hua Instrument Co., Ltd); Genie G10 (C111) Ultrapure water preparation device (Inner Mongolia Jingding Yuanye Technology Co., LTD, China).
Drugs and Reagents
Sample Solution Preparation
UPLC-Q-Orbitrap-MS Analysis Conditions
Instrumentation and Conditions
The chromatographic separations were performed using a Waters ACQUITY UPLC HSS T3 C18 column (2.1 mm × 100 mm, 1.8 μm). The flow rate was set at 0.2 mL/min. The injection volume of the samples was 10 µL. The mobile phase was composed of solvent A (ultrapure water with 0.1% formic acid) and solvent B (acetonitrile). The gradient program was as follows: 0 min, 100% A; 0∼10 min, 100→70% A; 10∼25 min, 70→60% A; 25∼30 min, 60→50%A; 30∼40 min, 50→30% A; 40∼45 min, 30→0%A; 45∼60 min, 0→100% A; 60∼70 min, 100% A.
Mass spectrometric detection was conducted on the Q Exactive PlusTM Orbitrap MS system. The detection mode is Full MS-ddMS2, positive and negative ion modes are scanned simultaneously, and the scanning range is m/z 100–1200. The other parameters were set as follows: MS1 resolution set to 70,000, MS2 resolution set to 17500, capillary voltage was 3.2 kV, capillary temperature was set to 320°C, Aux gas heater temperature was set to 350°C, Sheath gas flow rate was 40L/min, Aux gas flow rate was 15L/min, AGC Target set to 1e6, TopN was set to 5, and the collision energy triggering the MS2 scan adopts the stepped fragmentation voltage NCE, which is set to 30, 40, 50.
Data Analysis and Identification of Compounds
The chromatographic peak identification was implemented, comparing the chromatographic retention time, excimer ion peak and fragment-ion of the test sample with the standard sample, compounds with scores greater than 80 were screened by mzCloud (www.mz-cloud.com) and mzVoult (self-made database) software. Combined with the literature reports on the relevant chemical components of cumin, in the light of the exact molecular weight provided by mass spectrometry, calculated the accurate molecular formula of components, determined and assigned to each component peak. The maximum mass errors between the measured and calculated values were < 5 ppm, and the matching compounds would be generated from predicted fragments from the structure. Further analysis of the compounds was conducted by considering relevant literature and utilizing online databases such as PubChem, CNKI and PubMed.
GC-MS Analysis Conditions
Instrumentation and Conditions
The GC-MS analysis volatile oil were measured on HP-5MS capillary column (30 m × 0.25 mm ×0.25 μm) was adopted for the chromatographic separation. Helium gas was used as carrier gas at a constant flow rate of 1 ml/min and an injection volume of 1 μl in size ratio of 10:1. The ion source temperature was 270°C. The temperature program employed was as follows: starting at 40°C for 1 min, then ramping to 60°C at a rate of 5°C/min, followed by an increase to 100°C at a rate of 8°C/min, and finally reaching 280°C at a rate of 10°C/min, with a hold time of 3 min. The electron ionization (EI) source was utilized for ionization, with the analysis being ionized at 70 eV and 230°C in the ion source and solvent delay was 5 min. The scanning mass range was set from 40 to 650.
Data Analysis and Identification of Compounds
The raw data files were imported into Qualitative Analysis 10.0 software for further analysis. Peak integration and mass spectra extraction were performed using this software. The extracted mass spectra were compared to the NIST standard library for identification. Additionally, the raw data was subjected to peak extraction and peak alignment. The compounds were identified by combining relevant literature and utilizing online databases. Then, find the Kovats retention Index (RI) of the obtained volatile components from NIST (National Institute of Standards and Technology) to further identify and verify their volatile components.
Statistic Analysis
All Data was processed and summarized using Microsoft Office Excel 2019. The Pie Chart was made using Microsoft Office Word 2019. Bar chart was drawn with Wei Sheng Xin (https://www.bioinformatics.com.cn/).
Results
UPLC-Q-Orbitrap MS Analysis of Chemical Components from Cumin Aqueous Extraction
The high resolution MS data of cumin aqueous extract were rapidly acquired using the UPLC-Q-Orbitrap-MS method. 62 compounds were systematically identified from the aqueous extract of cumin using UPLC-Q-TOF-MS in positive and negative ion modes. The total ion chromatogram was analyzed by the standards, fragmentation patterns, literature, mzCloud and mzVault database (Figure 2, Table 1). These including 10 flavonoids, 7 phenylpropanoids, 5 terpenoids, 4 alkaloids, 7 organic acids, 8 fatty acids, 8 amino acids, 2 glycosides and 10 other components. The proportion of different types of compounds is shown in Figure 3. Furthermore, representative compositions of these different chemical types were used as examples to analyze the possible fragmentation pathways and characteristic products, in conjunction with reported literature.

Total ion current chromatograms of cumin aqueous extract under positive (A) and negative (B) ion modes.

Types of compounds contained in cumin aqueous extract.
Identification of Chemical Constituents of Cumin Aqueous Extract by UPLC-Q-TOF-MS.
Flavonoids
There were two types of flavonoids including flavonoid aglycone compounds and flavonoid glycoside compounds. It is well known that the main MS behavior of flavonoid aglycones was RDA fragmentation pathway and losses of small molecules or radicals39,40 like -CH3, -CO, -CO2, -CH2 and -H2O. For flavonoid glycosides, the cleavage at glycosidic linkages could happen in both positive and negative ion modes, that is mainly forms high-abundance flavonoid aglycones by continuously losing sugar groups.41,42 We identified the 10 flavonoids, here took apigenin and luteolin as examples to describe the fragment patterns and possible fragmentation pathways of these components, respectively. Using negative ion mode, compound 48 exhibited a [M-H]− signal at m/z 269.0455. The possible elemental composition of the compound is C15H10O5, and the main fragment ions include m/z 225.0555, 201.0552, 181.0659, 159.0405, 117.0345, etc, in which m/z 225.0555 is derived from the loss of a neutral CO2 molecule at the excimer ion peak and then the loss of a CO2 to m/z 181.0659 fragment ion. The m/z 201.0552 ions are derived by hydrogen rearrangement after the loss of excimer ion peaking C3O2, but its structure is unstable and easy to lose C2H2O to form a stable conjugated ion m/z 159.0405, and m/z 159.0405 can lose C2H2O to form fragmentation ion m/z 117.0345. The fragment information is basically consistent with the literature report 50. So it was identified as the compound is apigenin. The secondary mass spectrum and possible cleavage pathway was shown in Figure 4A. Compound 47 exhibited a [M-H]− signal at m/z 285.0404, the cleavage fragments of RDA from the flavonoid mother nucleus were also founded at m/z 151.0035 and m/z 133.0294 fragments. Its C the loss of CO and O in the ring results in a characteristic ion fragment 33 m/z 241.0500. Through database search, it is speculated that the compound is luteolin. The secondary mass spectrum and possible cleavage pathway was shown in Figure 4B.

The secondary mass Spectrum and possible cleavage pathway of apigenin (A) and luteolin (B).
Phenylpropanoids
A total of 7 phenylpropanoids were identified. The cleavage characteristics of secondary mass spectrometry were mainly the loss of neutral molecules such as H2O, CH2O2 and CO2. Here took chlorogenic acid was taken as example to describe the fragment patterns and possible fragmentation pathways of these components. 42 Using negative ion mode, compound 32 exhibited a [M-H]− signal at m/z 353.0877, the possible elemental composition of the compound is C16H18O9. In the secondary mass spectra, the fragment ion m/z 191.0554 [M-H-C9H6O3]− was formed after losing the quinic acid component, or the fragment ion m/z 135.0452 [M-H-C7H10O5-CO2]− was formed after losing the caffeic acid component and then further losing CO2. The cleavage pathway of the compound was consistent with that reported in the literature. 37 Compound 54 was identified as chlorogenic acid and its possible cleavage pathway is shown in Figure 5.

The secondary mass Spectrum and possible cleavage pathway of chlorogenic acid.
Amino Acids
Amino acids are a class of compounds containing amino and carboxyl groups. Generally, the characteristic fragments of the parent ions H2O and COOH2 corresponding to 18 and 46 are lost,43,44 and some amino acids will lose NH3 corresponding to 17. In this study, we have identified the 8 amino acids, here took L-Tyrosine as an example, to describe the fragment patterns and possible fragmentation pathways of these components. The molecular ion peak given by quasi-positive ions is m/z 182.0816 [M + H]+, the possible elemental composition of the compound is C9H11NO3, and the ion fragments are 136.0757 [M + H-CH2O2]+ and 119.0492 [M + H-CH2O2-NH3]+ respectively. Therefore, combined with mass spectrometry information and related literature,21,45 it was speculated that it may be L-Tyrosine. The specific secondary spectrum and possible cleavage pathway is shown in Figure 6.

The secondary mass Spectrum and possible cleavage pathways of L-tyrosine.
Terpenoids
A total of 5 terpenoids were identified. The lactone ring cracking is the most basic cleavage pathway in the mass spectrometry behavior of lactone compounds. In ESI multistage mass spectrometry, the lactone ring is opened first, CO2, H2 is removed, and then C2H4, C3H6 are lost step by step. 4 Here took germacrone as example to describe the fragment patterns and possible fragmentation pathways of these components. Using positive ion mode, compound 46 exhibited a [M + H]+ signal at m/z 219.1742, the possible elemental composition of the compound is C15H22O. The fragment ion m/z 201.1638 is formed by the loss of a neutral molecule of H2O from m/z 219.1742. During subsequent collision-induced cracking, fragmentation ions m/z 163.1117 and 159.1169 are produced. Therefore, it is speculated that it may be germacrone. 46 The specific secondary spectrum and possible cleavage pathway is shown in Figure 7.

The secondary mass Spectrum and possible cleavage pathways of parthenolide.
GC-MS Analysis of Volatile Components from Cumin
Samples were injected according to the GC-MS conditions and a total ion chromatogram of the volatiles in cumin was obtained. The obtained data were searched against and matched with the mass spectrometry database by the National Institute of Standards and Technology (NIST 2017). The relative percentage was calculated by the peak area normalization method.
47
A total of 40 chemical components were identified in cumin volatile oil, accounting for 98.70% of the total peak area. These compounds included 16 monoterpenes, 5 sesquiterpenes, 8 alkanes, 3 alcohols, 4 aldehydes, 2 phenols, 1 ketone, and 1 alkylbenzene compound (Figure 8). Additionally, the volatile oil extract of cumin was found to have the highest percentage of cuminaldehyde (42.88%), γ-terpinene (12.29%), γ- terpinen-7-al (11.50%), o-cymene (7.9%), and

Comparison of the contents of various compounds in the volatile components of cumin.

Total ion current chromatograms of volatile components cumin.
Identification of Chemical Constituents of Volatile oil Components in Cumin by GC-MS.
*Kovats’ RI taken from NIST.
Discussion
UPLC-Q-TOF-MS and GC-MS are highly efficient analytical techniques, known for their high sensitivity, precision, resolution, and rapid data acquisition capabilities, which have been effectively applied to analyze the complex chemical profiles of herbal plants. These methods enhance detection accuracy in the analysis of complex samples and are also used for the simultaneous determination of various components in TCM. 34
Cumin is not only widely used in TCM, 48 but also holds significant value in Mongolian and Uyghur medicinal practices, where it has been employed for the treatment of various diseases. 49 Current research on cumin's chemical composition has primarily focused on its volatile oils, with a particular emphasis on cuminaldehyde and other monoterpenes. 10 However, there is a lack of detailed reports on the identification of highly polar substances such as aqueous and alcoholic extracts of cumin. Therefore, this study establishes UPLC-Q-Orbitrap MS and GC-MS analysis methods to comprehensively characterize the chemical constituents of cumin, integrating various databases and related literature.
In this study, a total of 102 compounds were identified in cumin, of which 62 were systematically identified from the aqueous extract using UPLC-Q-Orbitrap-MS. These include 10 flavonoids, 7 phenylpropanoids, 5 terpenoids, 4 alkaloids, 7 organic acids, 8 fatty acids, 8 amino acids, 2 glycosides and 10 other components. Among these, apigenin, a flavonoid component, is commonly found in the Umbelliferae family of plants and is known for its diverse biological and therapeutic properties, such as antioxidant, anti-inflammatory, anticancer, and antimicrobial effects.
50
According to relevant literature, apigenin also alleviates nephropathy, pancreatic
The GC-MS analysis of cumin's volatile oil revealed components accounting for 98.70% of the total peak area. These include monoterpenes, sesquiterpenes, alkanes, alcohols, aldehydes, phenols, ketones, and alkylbenzenes. As the primary active substance in cumin, the chemical composition and content of its volatile oil provide a scientific basis for its comprehensive utilization. Cuminaldehyde, the main component of cumin, exhibits anti-injury and anti-neuropathic effects on chronic compression nerve injury by stimulating opioid receptors, modulating the L-arginine /NO/ guanyyl cyclase /cGMP pathway, and inhibiting inflammatory cytokines. 52 Additionally, cuminaldehyde demonstrates antibacterial and anticancer activities.13,45 In summary, cuminaldehyde possesses broad pharmacological activities, high research value, and extensive application prospects.
The various types and sources of active ingredients in TCM can achieve comprehensive multi-component, multi-pathway, and multi-target therapy through synergistic or complementary effects. To conduct multi-mechanism and multi-target studies on TCM, it is essential to systematically investigate the chemical composition of these medicines. Only through a comprehensive understanding of the material basis of these medicines can further research be pursued. However, this study has some limitations; it primarily focused on the identification and analysis of chemical components and did not include basic research on blood components or pharmacodynamic substances.
Conclusion
The composition of TCM is complex and diverse, making it insufficient to evaluate quality control based solely on a single component. The quality control of TCM has evolved from “micro-analysis” of individual components to “macro-analysis” of the entire system. In this study, UPLC-Q-TOF MS and GC-MS analytical methods were established to systematically analyze the chemical composition of cumin, providing a foundation for exploring its biological activities, potential applications, compatibility, and pharmacological mechanisms in various fields, including pharmaceuticals, cosmetics, and functional foods.
