Abstract
Introduction
Butyrate is a nutritional element produced by bacterial fermentation of dietary fibers in the digestive track of all mammals. Ruminant species metabolize the short-chain fatty acids (acetate, propionate, and butyrate) to fulfill up to 70% of their nutrient energy requirements. It is now well established that microbial fermentation in the gastrointestinal tract contributes to the energy balance of all mammalian species.1,2 This can be of great advantage to animals and human beings, since no digestive enzymes exist for breaking down cellulose or other complex carbohydrates. The large number of bacterial species, the complex nature of their interactions, and the end products of their fermentation processes are all likely to have significance in human and animal health.
Butyrate appears to be involved in metabolism in ways that differ from its role as a nutrient. It has also proven to be a strong regulator of genomic activities. Butyrate participates in metabolism both as a nutrient and as an inhibitor of histone deacetylases (HDACs). The major biochemical change that occurs in cells treated with butyrate is the global hyperacetylation of histones. In our previously reported studies, the potential biological roles of butyrate were also investigated, using the established Madin-Darby bovine kidney (MDBK) epithelial cell line. This study focused on determining whether normal bovine cells in a standard cell culture condition were sensitive to the growth inhibitory effects of butyrate. Butyrate not only induced apoptosis but also induced cell cycle arrest at the G1/S boundary and G2/M in MDBK cells. The cell responses were concentration dependent. 3
Histone modifications, especially histone acetylation and methylation, play a dominant role in epigenomic regulation. Butyrate is an important player in inducing histone acetylation. For many years, it has been recognized that consumption of dietary fibers has many positive metabolic health effects, such as increasing satiety and lowering blood glucose and cholesterol levels, as well as inducing cell differentiation and apoptosis. These effects may be associated with short-chain fatty acids, particularly propionic and butyric acids. In the past decade, the multiple beneficial effects of butyrate at the intestinal and extraintestinal levels have been demonstrated. Butyrate has different mechanisms of action, and many of these involve epigenetic regulation of gene expression through the inhibition of HDACs. There is a growing interest in butyrate because its impact on epigenetic mechanisms could lead to more specific and efficacious therapeutic strategies for the prevention and treatment of different diseases, ranging from genetic/metabolic conditions to neurological degenerative disorders, as well as cancer. 4 Butyrate as a cancer prevention and therapeutic reagent has been extensively studied and reviewed.5–8
At physiologic concentrations, butyrate regulates the expression of individual genes by inhibition of histone deacetylation and attendant chromatin remodeling.4,9,10 Utilizing global gene expression profiling, recent studies indicate that butyrate induces significant changes in the expression of genes associated with the regulatory pathways critical to cell growth, immune response, and signal transduction.3,11,12 A majority of these genes is repressed by butyrate and is associated with cell cycle control. Next-generation sequencing (NGS) has provided unique tools for transcriptornic profiling and genome analysis.13–15 As a fundamental step toward an ample understanding of the molecular mechanisms of butyrate-induced acetylation and its biological effects, NGS technology was utilized to give a more complete categorization of the RNA transcripts of MDBK cells. 16 In this study, we performed further bioinformatics analysis of the entire transcriptome of cell populations with or without butyrate treatment and identified a unique group of genes that only become activated after butyrate treatment. In order to further understand the process, we analyzed the transcriptome alterations induced by butyrate and discovered a set of genes uniquely expressed only after butyrate treatment. We also conducted a complementary bioinformatics analysis of the functional category, pathway, and integrated network using Ingenuity Pathways Analysis (IPA, http://www.ingenuity.com) of those uniquely expressed genes to identify the possible mechanisms underlying gene expression regulation induced by butyrate.
Methods and Materials
Cell culture and butyrate treatment
Bovine epithelial cells (MDBK epithelial cells, American Type Culture Collection, Catalog No. CCL-22) were used in this study. The culture condition and butyrate treatment were described in our previous report. 3 Briefly, at ~50% confluence, the cells were treated with 10 mM sodium butyrate (Calbiochem) for 24 hours. A total of eight samples (four replicate flasks of control cells and four replicates of butyrate-treated [BT] cells) were used for the RNA sequencing technology (RNA-seq) experiments.
RNA extraction and sequencing using RNA-seq
Total RNA was extracted using Trizol (Invitrogen). Any potential DNA contamination was cleaned by DNase digestion and Qiagen RNeasy column (Qiagen), as previously described. 11 The RNA integrity was verified using an Agilent Bioanalyzer 2100 (Agilent). An Illumina TruSeq RNA sample preparation kit was used to prepare a high-quality RNA sequencing library (RNA integrity number [RIN] >9.0) by following the manufacturer's instruction (Illumina). RNA sequencing was performed at 50 bp/sequence using an Illumina HiSeq 2000 sequencer, as described previously. 17 Approximately 67.5 million reads per sample (mean ± standard deviation [SD] = 67,527,111 ± 8,330,388.6) were generated.
Bioinformatics and data analysis
Raw sequence reads were first through the quality control pipeline. TopHat
18
was used to align trimmed reads to the bovine reference genome (Btau 4.0). Mapped reads were normalized based on the upper-quartile normalization method and Cuffdiff was used to model the variance in fragment counts across replicates using the negative binomial distribution as described previously. The mean number of sequence reads generated per sample was 67,527,111 ± 8,330,388.58 (αSD). A total of eight replicates were used in the two groups, BT and phosphate-buffered saline treated (control [CT];
IPA (Ingenuity® Systems, and www.ingenuity.com) was used to further evaluate molecular processes, molecular functions, and genetic networks of differentially expressed genes.19–23 IPA is a web-based software application that allows analysis, integration, and understanding of data from gene expression. Data analysis and search capabilities help in understanding the significance of data, specific targets, or candidate biomarkers in the context of larger biological or chemical systems. The software is supported by the Ingenuity Knowledge Base of highly structured, detail-rich biological, and chemical findings.
Results and Discussion
Butyrate-induced differential expression of genes and unique genes detected with deep transcriptomic sequencing
High-throughput RNA sequencing has opened a new horizon for revealing global profiles in gene expression allowing the detection of differential transcription, transcript isoforms, alternative splicing, and genomic structural variation. To decipher the gene regulatory mechanisms induced by butyrate treatment, we used untransformed normal bovine epithelial cells (MDBK) exposed to 10 mM butyrate for 24 hours, which is an established model for butyrate stimulation.3,11,12 Using high-throughput RNA sequencing, we generated 57,303,693 to 78,933,744 sequencing reads per sample (NIH NCBI Sequence Read Archive [SRA] 051007; database accession numbers SRX130570, SRX130571, SRX130572, SRX130573, SRX130576, SRX1305680, SRX130582, and SRX130583). There were 24,526 genes that were detected in at least one sample. There were 16,212 genes that were detected in all the samples and these were accounted for as the core transcriptome of the bovine epithelial cell. In the BT group, 19,477 ± 155 (mean ± SD) genes were identified, while 17,626 ± 124 (mean ± SD) genes were identified in the control group. Figure 1 summarizes the results of RNA sequencing analysis. RNA-seq data from BT cells and control cells allow for the detection of differentially expressed genes of biological significance. 16 Figure 1A shows the volcano plot of differential RNA expression between the control group and BT cells. Among those differentially expressed genes, we observed expression changes in over 11,000 genes at a strict false discovery rate of 0.05 (Fig. 1B). The principal component analysis shows the gene expression patterns of control cells and butyrate-treated cells (four replicates each) as two distinctive components (Fig. 1C). To generate a unique set of bovine genes induced by butyrate, cuffcompare was implemented in this study. We detected 588 genes expressed in BT cells of four replicates but not in the normal control cells of the four replicates (Supplementary Table 1). Very interestingly, among these 588 genes, we detected 241 novel genes that have not been annotated. Some of these novel genes are abundantly expressed with butyrate treatment (Supplementary Table 1). The Manhattan plots in Figure 2 show the total number of differentially expressed genes (lower panel) and unique genes in BT cells with genomic coordinates on 29 autosomal chromosomes as well as the X chromosome.

Summary of RNA sequencing data. (

Manhattan plots of differentially expressed genes chromosomal distribution. Lower panel shows all differentially expressed genes and upper panel shows the genes only expressed with butyrate treatment.
Upstream regulators and their targets in the data set.
Comprehensive function and pathway analyses of unique genes activated by butyrate treatment
In order to further systematically discover more about the biological function of these unique genes, the function and pathway analysis of uniquely activated genes in cells treated with butyrate were investigated using the IPA. The biological functions that were most significantly enhanced in the data set were identified. When a functional analysis of the genes was performed, genes from the data sets that were associated with biological functions were cataloged, and

Global functional analysis. The top molecular and cellular functions altered by butyrate-activated genes were summarized. Data sets were analyzed by the IPA software (Ingenuity® Systems, www.ingenuity.com). The significant value associated with a function in global analysis is a measure for how likely it is that genes from the data set file under investigation participate in that function. The significance is expressed as a
In addition to the functional annotation, IPA analysis also revealed that genes uniquely expressed due to butyrate treatment are involved in some very important canonical pathways (Fig. 4). The canonical pathways include 3′,5′-cyclic adenosine monophosphate (cAMP)-mediated signaling and G protein-coupled receptor (GPCR) signaling. GPCRs are a large family of integral membrane proteins that respond to a variety of extracellular stimuli. 24 GPCRs transduce the information provided by these stimuli into intracellular secondary messengers that are interpreted as meaningful signals by the cell. This process involves the coupling of agonist-activated GPCRs to a wide variety of effector systems via their interaction with heterotrimeric guanine nucleotidebinding proteins (G proteins). It is now apparent that GPCR activity and function is regulated by an incredible variety of mechanisms. These mechanisms act at the level of GPCR ligand specificity, G protein activation, and effector regulation. The cAMP-dependent pathway, also known as the adenylyl cyclase pathway, is a GPCR-triggered signaling cascade used in cell communication.25,26 Butyrate treatment induced GPCR and cAMP-dependent pathway activation indicates that these pathways might be involved in the regulation of the biological effects of butyrate. Further investigations are necessary to understand how does butyrate perturb intracellular signal transduction of these signaling pathways.

Global Canonical Pathway analysis: data sets were analyzed by the IPA software (Ingenuity® Systems, www.ingenuity.com). The significance is expressed as a
Upstream regulators and transcription factors
IPA analysis also revealed potential upstream regulators that may be involved in the activation of this group of unique genes. Transcription factors control the transcription of genetic information from DNA to mRNA by binding to specific DNA sequences. 27 Transcription factors can function either as activators or as repressors for the expression of specific genes. Table 1 lists the major transcription regulators (factors) and other upstream regulators identified by IPA analysis. Only the upstream regulators with a regulation Z-score, either positive or negative, are listed. A Z-score is a statistical measurement of a score's relationship to the mean in a set of scores. The positive or negative Z-scores suggest whether the score is above or below the mean and by how many SDs. In addition to these transcription factors, some cell signaling factors, such as cytokines, as well as growth factors, may be involved in the activation of this group of genes. Their regulatory target genes in the data set revealed by IPA analysis are also listed in Table 1. It is worth noting that among all those upstream regulators, LEP, the gene that encodes protein leptin, is itself upregulated. Leptin was initially thought to have a role in energy homeostasis and obesity. More recently, it has also been implicated in the regulation of reproduction, glucose homeostasis, bone formation, wound healing, and the immune system.
The biologically relevant networks in the uniquely expressed gene data set
The IPA analysis was also used to determine the biologically relevant networks among the uniquely expressed genes, as well as the potential association with the uniquely expressed genes induced by butyrate. The top two networks are presented in Figure 5. The first network has associated network functions of cell signaling, nucleic acid metabolism, and small molecular biochemistry. This network also shows the involvement of HDACs and histone 3 and histone 4 in the perturbation of the functional gene network and provides a direct correlation of gene functions and butyrate as an HDAC inhibitor (HDACi). The second network has associated network functions of inflammatory response, cellular movement, and hematological system development and function. The associated network functions underlie the biological effects of the butyrate treatment and shine light on the mechanisms by which histone modification is induced by butyrate treatment.

The biologically relevant network: the data set of uniquely expressed genes was analyzed by the IPA software (Ingenuity® Systems, www.ingenuity.com). The color indicates the expression level of the genes (red indicating upregulated genes and green indicating downregulated genes). The upper panel shows the first network with 24 focus genes, which are involved in biological functions, such as cell signaling, nucleic acid metabolism, and small molecule biochemistry. The lower panel shows the second network with 24 focus genes, which are involved in biological functions, such as tissue morphology, carbohydrate metabolism, and molecular transport.
Using HDAC inhibitors to study histone acetylation and gene regulation contributed significantly to our knowledge of histone modification and epigenetics regulation. Our previous studies3,11,16 revealed that butyrate participates in metabolism as regulators of histone modification and provides an example of nutrient-induced specific modulation of epigenetic regulation. Development of the NGS technique has presented unique tools for transcriptomic profiling and for genome analysis.15,28 We utilized NGS technology to completely characterize the RNA transcripts of MDBK cells and the effects of butyrate treatment. 16 This is a crucial step toward abroad comprehension of the epigenetic system of butyrate-induced hyperacetylation, as well as its biological consequences. The current report is a complementary study to compare the transcriptomes of the normal cells and the cells treated with 10 mM butyrate for 24 hours. Our results uncovered the unique set of genes activated by butyrate treatment. With the technical advantages of NGS deep-sequencing and with an average of >67 million reads per sample, the differential expression of genes induced by butyrate treatment was revealed in much more detail than that of our earlier study that used a microarray. Our previous study, using microarray technique, detected only 450 genes as significantly regulated by butyrate treatment. 11 However, using NGS deep sequencing, ~11,408 genes were detected as significantly impacted by butyrate. 16 NGS also facilitated the revealing of a unique group of genes activated by butyrate treatment from all differentially expressed genes, as described in this report.
Butyrate exerts a very broad range of effects on many biological pathways via its inhibitory ability on HDAC. The genes that were identified (Supplementary Table 1), which fall within a broad range of functional categories, appeared to provide the molecular basis for its biological effects. The pathway analysis on the butyrate-activated genes, using the Ingenuity Pathway Analysis tool, successfully identified remarkable changes in genetic networks, transcription regulators, and functions related to various cell functional activities. These uniquely expressed genes, induced by butyrate treatment, revealed a basis for understanding the molecular mechanisms of the biological effects of butyrate provided by global gene expression profiling using deep RNA sequencing. It is also an expedient step toward understanding the histone modification as an important epigenomic regulatory mechanism.
Author Contributions
CJL, RWL, WL and SW analyzed the data. RLB and LAB contributed to the construction of the manuscript. CJL wrote the manuscript and made critical revisions. All authors reviewed and approved of the final manuscript.
Disclaimer
Mention of a product, reagent, or source does not constitute an endorsement by the USDA to the exclusion of other products or services that perform a comparable function. The US Department of Agriculture is an equal opportunity provider and employer.
