NEAT1 can be a diagnostic biomarker in the breast cancer and gastric cancer patients by targeting XIST, hsa-miR-612, and MTRNR2L8: integrated RNA targetome interaction and experimental expression analysis

Background The most frequent malignancy in women is breast cancer (BC). Gastric cancer (GC) is also the leading cause of cancer-related mortality. Long non-coding RNAs (lncRNAs) are thought to be important neurotic regulators in malignant tumors. In this study, we aimed to evaluate the expression level of NEAT1 and the interaction of this non-coding RNA with correlated microRNAs, lncRNAs, and mRNAs or protein coding genes, experimentally and bioinformatically. Methods For the bioinformatics analyses, we performed RNA-RNA and protein–protein interaction analyses, using ENCORI and STRING. The expression analyses were performed by five tools: Microarray data analysis, TCGA data analysis (RNA-seq, R Studio), GEPIA2, ENCORI, and real-time PCR experiment. qRT-PCR experiment was performed on 50 GC samples and 50 BC samples, compared to adjacent control tissue. Results Based on bioinformatics and experimental analyses, lncRNA NEAT1 have a significant down-regulation in the breast cancer samples with tumor size lower than 2 cm. Also, it has a significant high expression in the gastric cancer patients. Furthermore, NEAT1 have a significant interaction with XIST, hsa-miR-612 and MTRNR2L8. High expression of NEAT1 have a correlation with the lower survival rate of breast cancer samples and higher survival rate of gastric cancer patients. Conclusion This integrated computational and experimental investigation revealed some new aspects of the lncRNA NEAT1 as a potential prognostic biomarker for the breast cancer and gastric cancer samples. Further investigations about NEA1 and correlated mRNAs, lncRNAs, and microRNAs – specially the mentioned RNAs in this study – can lead the researchers to more clear information about the role of NEAT1 in the breast cancer and gastric cancer.


Introduction
Breast cancer (BC) is the most common cancer type in women. Long non-coding RNAs (lncRNAs) are considered crucial gene expression controllers in malignant growths [1]. Gastric cancer (GC) can also be considered the most significant cause of cancer-related death [2].
Among the various known types of lncRNAs, Nuclear Enriched Abundant Transcript 1 (NEAT1), with five recognized splice structures, is situated on chromosome 11. Based on the Gene cards, this gene delivers a long noncoding RNA (lncRNA) deciphered from the numerous endocrine neoplasia locus. This lncRNA is held in the core, where it frames the center primary part of the paraspeckle sub-organelles. It might go about as a transcriptional controller for a very long time, incorporating a few qualities associated with disease movement. NEAT1 can enable miRNA binding and RISC complex binding. This lncRNA has an important role in the positive inflammatory response regulation, negative regulation of gene silencing by miRNA, and positive regulation of synoviocyte proliferation [3].
Mutation in the promoter region of NEAT1 can lead the normal breast and renal cells to the carcinoma [4,5]. In hepatocellular carcinoma cells, treatment with 5-AZA enhanced NEAT1 expression, demonstrating that DNA methylation is a significant determinant of NEAT1 expression [6]. Based on previous studies, NEAT1 controls the initiation and progression of cancer by three molecular mechanisms: (i) influencing the expression level of downstream factors of EZH2, by acting as a scaffold RNA molecule for EZH2, (ii) playing crucial role as a miRNA sponge to disrupt the connections of several tumor suppressor miRNAs with their target mRNAs, and (iii) suppressing the expression of miR-129 (promotion of DNA methylation in the promoter region of miR-129) [7].
Recent studies revealed that NEAT1 has a significant role in developing different cancer types. For example, Xuesong Wang et al. at 2020 found that lncRNA NEAT1 had a significantly high expression in colorectal cancer (CRC) tissues and cells. Also, they demonstrated that the knockdown of NEAT1 can promote the invasion and apoptosis of CRC cells. They also find a novel microRNA (miR-150-5P) that NEAT1 can sponge [8]. In May 2020, Gao M et al. demonstrated that NEAT1 could promote the progression of GC by sponging miR-356a-3p and regulating ABCC4 [9]. NEAT1 expression may be controlled by signal transducer and activator of transcription 3, and changed NEAT1 expression epigenetically influences downstream gene transcription during herpes simplex virus-1 infection and Alzheimer's disease, implying that NEAT1 functions as a stress sensor and effector. The chemicals and regulatory patterns that control NEAT1 gene expression, as well as the molecular mechanism by which NEAT1 regulates the expression of its target genes, are summarized and discussed in this study, bringing new insights into NEAT1's essential function in gene regulation [10]. This molecule also can regulate the regulate the liver fibrosis in the alcoholic steatohepatitis [11]. By modulating the TLR2/NF-B signaling pathway, the LncRNA NEAT1 reduces sepsis-induced myocardial damage [12]. The repressor complex FOXN3-NEAT1-SIN3A promotes the development of hormonally sensitive breast cancer [13]. Both depletion of mitochondrial proteins and treatment of mitochondrial stressors result in abnormal NEAT1 expression as well as changes in the shape and quantity of paraspeckles via ATF2. The retention of mRNAs of nuclear-encoded mitochondrial proteins (mito-mRNAs) in paraspeckles is improved as a result of these alterations. NEAT1 depletion, on the other hand, has a significant impact on mitochondrial dynamics and function through affecting mito-mRNA sequestration in paraspeckles [14].
NEAT1 is a nuclear architectural long non-coding RNA with a high abundance. NEAT1-1 and NEAT1-2 are two overlapping NEAT1 isoforms, with the latter serving as a scaffold for the construction of paraspeckles, a type of nuclear ribonucleoprotein body [15]. It was recently discovered that NEAT1-2 expression, but not NEAT1-1, predicts progression-free survival in ovarian cancer patients receiving platinum-based treatment [16]. Erik Knutsen et al. showed that NEAT1-2 expression level is significantly associated with the breast cancer tumor grade and HER2 positive breast cancer samples. Also, during lactation, NEAT1-2 expression is increased in human breast tissue [17]. Previous studies revealed that NEAT1-1 is expressed in a variety of cell types in the adult mouse tissue. however, the epithelial layers of digestive tissues are where NEAT1-2 is mostly expressed [7].
Mentioned information about the role of NEAT1 reveals that this non-coding RNA is a crucial molecule in regulating different biological processes and pathological statuses. Unwanted changes in the expression of this lncRNA can lead the cells into different diseases, including breast cancer and gastric cancer. In this study, we aimed to evaluate the differences in the expression level of NEAT1 in the high-throughput breast cancer and gastric cancer datasets and the human GC and BC samples of the Isfahan population. Also, we find an RNA regulatory interaction network that can regulate NEAT1 expression level and evaluate the expression of the most significant hub RNAs in this network, directly or indirectly.

Microarray analysis
Microarray data analysis was performed on the two gastric and breast cancer datasets. GSE54129 was analyzed for finding the differentially expressed genes (DEGs) in the gastric cancer microarray samples. Twenty-one control samples and 111 GC samples in this dataset were analyzed. Also, this study analyzed GSE10810 [18] with 27 control samples and 31 case samples to find the DEGs in the BC. These datasets are provided by GPL570 ([HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array). The raw data were downloaded from GEO online database (https:// www. ncbi. nlm. nih. gov/ geo/) and moved to the R Studio environment and normalized by the affy package (Read.Affy command in R) [19]. Statistical analysis of the microarray dataset was performed by limma package [20]. Affy and limma packages were downloaded from Bioconductor online database (https:// www. bioco nduct or. org/). The significance level of microarray data analysis was considered as 0.05 (adjusted p value). The plots of microarray data analysis were created by the ggplot2 and pheatmap packages, downloaded from Bioconductor (Bioconductor.org). In this microarray analysis, the expression of 54,675 mRNA and lncRNA transcripts were analyzed. After normalization (quantiles normalization method) of raw data, transforming the expression data into logarithmic scale, and deleting the transcripts with no expression in the dataset, the difference in the expression level of all RNAs was calculated. The RNAs with logFC > 3 were selected as the up-regulated RNAs and the logFC < -3 was selected as the threshold of low expression.

Clinical characteristics of tissue samples
The Ethics Committee of Al-Zahra Hospital, Isfahan University of Medical Science, approved all procedures for the research in this study that involved human samples, and all patients signed written consent forms. Breast cancer and surrounding normal breast tissue samples from 50 individuals with breast cancer were analyzed in a case-control study. Also, the same expression analysis was performed on 50 gastric cancer samples compared to 50 adjacent normal samples. None of the patients had previously had radiation or chemotherapy. Tissue biopsies were rinsed in distilled water before being submerged in RNA later solution (Invitrogen, USA) and quickly preserved in liquid nitrogen for pathologist evaluation. The clinicopathological characteristics of breast cancer and gastric cancer patients are provided in Table 1 and Table 2.

Real-time PCR
The total RNA content of breast cancer tissue samples and normal breast tissue equivalents from the same individuals was acquired and extracted according to the manufacturer's procedure using an RNA extraction kit (GeneAll, Seoul, Korea). According to the manufacturer's procedure, the first-strand cDNA synthesis kit (Thermo Fisher Scientific, Waltham, MA, USA) was used to make cDNA. The cDNA products were stored at -20 •C for the expression analysis of NEAT1 and GAPDH as the reference gene. Using oligo 7 software, the specific primers were designed for the NEAT1 and GAPDH ( Table 3). The qRT-PCR experiment was performed using Magnetic Induction Cycler (MIC) (Bio molecular Systems, Australia).

Statistical analysis
GraphPad Prism software performed statistical analysis of real-time PCR data and the related graphs (version 8). qRT-PCR data were analyzed using the 2− ΔΔCT method to compare expression levels between the tumor and control samples. The Shapiro-Wilk test was performed on the expression data to evaluate the normality of data. Paired t-test was performed on the -ddCt data to compare expression levels in tumor and control samples. DEG analysis of microarray and the TCGA dataset was performed by R Studio (4.1.0). The GraphPad prism performed the ROC analysis for the real-time PCR datasets based on sensitivity and specificity. P-value less than 0.05 was considered as the significance level of this study.
In the ROC analysis, AUC between 0.7 -0.8 is a fair AUC value, AUC between 0.8-0.9 is a good AUC value (indicating a good biomarker), and AUC between 0.9-1 revealed an excellent biomarker.

NEAT1 had a significantly high expression in breast cancer and low expression in the gastric cancer tissue
Based on the ENCORI (Fig. 1) and data analysis, NEAT1 has a significant down-regulation in breast cancer (FC: 0.73, FDR: 0.015) and gastric cancer (FC: 1.81, FDR: 0.0016). GEPIA2 (Fig. 2) online data analysis revealed that NEAT1  had a significant downregulation in the breast cancer and gastric cancer samples. Survival analysis by ENCORI and GEPIA2 revealed that high expression of the NEAT1 has a non-significant correlation with the low survival rate of BC patients. Also, the survival analysis revealed that the lowexpression of NEAT1 has a not-significant relation with the low survival rate of GC patients (Fig. 3).

Real-time PCR data analysis of NEAT1 in the BC and GC samples, compared to control (Isfahan population)
Real-time PCR data analysis of lncRNA expression level revealed that the expression of this lncRNA had no significant difference in the breast cancer cohort compared to the control. However, there was a significant dysregulation in expression level in the different clinicopathological situations. There was a significant up-regulation in expression level in the samples with the size of tumor bigger than 2 cm, compared to the samples smaller than 2 cm (Fig. 4).
Real-time PCR data analysis revealed that NEAT1 has a significant down-regulation in the Isfahan human gastric cancer samples, compared to control (logFC: -3.775, p-value < 0.0001, Fig. 5a). ROC analysis revealed that NEAT1 can be an excellent prognostic biomarker for the Isfahan gastric cancer patient and can be a novel factor for distinguishing the tumor samples from control samples (AUC: 0.924, p-value < 0.0001, Fig. 5b).
For validation of mentioned results about the expression of NEAT1, the TCGA data analysis was performed

Integrated RNA interaction analysis
For finding the RNA and protein interaction network surrounding the NEAT1, ENCORI online database was used. Based on the interaction number of RNAs with NEAT1, the hub mRNAs, miRNAs, and lncRNAs that have significant interactions with NEAT1 were detected (Figs. 6 and 7). Based on this interaction analysis, NEAT1 Fig. 12 The protein-protein interaction analysis of MTRNR2L8, based on the STRNIG online database Fig. 13 The low-expression of lncRNA XIST in the BC samples, based on the ENCORI relative expression analysis Azadeh et al. Genes and Environment (2022) Table 4.

Expression analysis of MTRNR2L8, hsa-miR-612, and XIST in the GC and BC samples
After RNA interaction analysis, it is demonstrated that MTRNR2L8, hsa-miR-612, and XIST are three coding and non-coding RNAs that could regulate the expression and activity of NEAT1. For understanding the possible role of mentioned coding and non-coding RNAs in the BC and GC patients, the relative expression analysis of the mentioned RNAs was performed by ENCORI, GEPIA2, and microarray data analyses. Based on the ENCORI data analysis, MTRNR2L8 had a significant low expression in the breast cancer samples (FC: 0.66, FDR < 0.0001) and a not significant lowexpression in the GC samples (FC: 0.83, FDR: 0.26, Fig. 8). GEPIA2 online expression analysis revealed no significant dysregulation in the BC and GC samples (Fig. 9). Also, GEPIA2 expression analysis revealed that there is no significant correlation between the stages of breast and gastric cancer and the expression level of MTRNR2L8 (Fig. 10). Survival analysis of ENCORI and GEPIA2 databases revealed a not significant correlation between the low-expression of MTRNR2L8 and the low survival rate of BC and GC patients (Fig. 11). Figure 11 presented a protein-protein interaction for MTRNR2L8. The ENCORI relative expression analysis of lncRNA XIST (Fig. 12) revealed that XIST has a significantly low expression in the BC samples (FC: 0.62. FDR < 0.0001) and has a not significant up-regulation in the GC samples (FC: 1.44, FDR: 0.74). GEPIA2 online expression analysis revealed that XIST has a significantly low expression in the GC samples (Fig. 13). Survival analysis revealed no significant correlation between the expression pattern of lncRNA XIST and the survival rate of the BC and GC patients (Fig. 14).

Microarray data analysis
Microarray data analysis was performed on BC and GC microarray datasets. The principal component analysis (PCA) and Pearson correlation tests were performed to evaluate the quality of microarray samples (Figs. 15 and 16). DEG analysis revealed that lncRNA XIST has a significant low expression in the BC samples and a high expression in the GC samples (Figs. 17 and 18). Differentially expressed genes in the breast cancer and gastric cancer are provided in the Table 5 and Table 6.

Discussion
To better understand the more accurate function of NEAT1 in the breast cancer and gastric cancer statuses, we designed integrated bioinformatics and experimental expression analyses to find the expression pattern of NEAT1 and related lncRNAs and mRNAs that have RNA interactions with NEAT1. Also, we performed the survival analyses by different methods and software to demonstrate the effects of the changes in the expression in NEAT1 and corresponding mRNA and lncRNA on the survival rate of the patients. Our results indicated that NEAT1 had a significant up-regulation in the samples bigger than 2 cm, compared to the smaller samples. NEAT1 had a significant down-regulation in the gastric cancer samples compared to the control. Also, NEAT1 can be an excellent prognostic biomarker for the Isfahan GC patients based on ROC analysis. The TCGA, ENCORI, and GEPIA2 data analyses and the experimental methods can validate each other in this study.
Furthermore, we demonstrated that NEAT1 had a significant RNA interaction with MTRNR2L8 proteincoding RNA, which had a significant down-regulation in the BC samples. Also, lncRNA XIST had a significant lncRNA-lncRNA interaction with NEAT1. This study demonstrated that XIST had a significant down-regulation in the BC and GC samples. hsa-miR-612 is a critical regulatory microRNA for NEAT1 with more interaction numbers.
Previous studies showed that miR-612 could have a remarkable effect on regulating invadopodia of hepatocellular carcinoma [25]. This microRNA also has a   16 These analyses revealed that these dataset's control and tumor samples could be separated and are ready for the DEG analysis. Blur color represents the control samples and red color represents the tumor samples Fig. 17 The heatmap of the correlation between microarray samples of BC and GC proven role in regulating tumorigenesis in the neurofibromatosis type 1 by involving in the NFKB1-miR-612-FAIM2 signaling pathway [26]. In 2020, Li T et al. showed that the miR-612/HOXA13 signaling pathway could promote cardiomyocyte apoptosis in chronic heart failure [27]. Another significant role of miR-612 that has been proved by Jin Y et al. in 2020 is inhibiting cervical cancer progression. This microRNA can perform this function by targeting NOB1 [28]. Regulation of SEMA4D in the cholangiocarcinoma can be affected by sponging miR-612 via lncRNA LINC01061 [29]. About the role of microRNA-612 in gastric cancer, we find only one experiment, published in 2018 by Liyan Wang et al., that indicated the tumor suppressor effect of micro-RNA-612 that can be induced by FOXM1 [30]. Also, about the role of miR-612 in the BC, only Hye Kyung Kim et al. revealed that two common single nucleotide polymorphisms (SNPs) within the miR-612 do not affect the breast cancer cell lines [31]. Nevertheless, for the first time, we find a biological interaction between hsa-miR-612 and NEAT1 that can have significant effects on breast cancer and gastric cancer development. Yang X et al. in 2018 revealed that the PTBP3 splicing factor could destroy the splicing balance of NEAT1 and pre-miR-612 [32]. Based on our bioinformatics approach, hsa-miR-612 can regulate the function of NEAT1 by the higher interaction number compared to the other micro-RNAs, and this mechanism can affect the GC and BC progression.
About the MTRNR2L8, we find no previous research about the role of difference in the expression of MTRNR2L8 in gastric and breast cancer development. So, this is the first investigation about the possible role of MTRNR2L8 in the development of GC and BC. This protein-coding gene can affect BC development by unwanted changes in the expression and can be regulated by NEAT1. Based on the previous investigations, this human peptide is a biological product of the MT-RNR2 gene from mitochondria [33].
About the lncRNA XIST, Yang X et al. showed that the downregulation of lncRNA XIST can lead the CRC cell into proliferation and metastasis [34]. Also, it is revealed that the down-regulation of XIST can inhibit the development of non-small cell lung cancer [35]. This non-coding RNA also can promote the invasion and migration of papillary thyroid cancer cells [36] and pancreatic cancer [37]. About the proven role of XIST in the BC, Xing F et al. at 2018 showed that the loss of XIST can promote the brain metastasis of BC by activating MSN-c-Met and reprogramming the microglia exosomal miRNA [38]. Also, Zheng W et al. showed that the targeting miR-337 by lncRNA XIST can lead the GC cells into migration and proliferation [39]. Also, this lncRNA can promote the progression of GC through TGF-beta 1 signaling pathway by targeting miR-185 [40].
In this experiment, we had some limitations during the study. Some of our plots (including boxplots of GEPIA2 and ENCORI database) are provided by the online database and some of important details (including p-values) may have not good quality. Also, we had limitations in the accessing human clinical samples and the technics for validation of RNA interaction analyses.
Based on our investigation and precious studies, the dysregulation of RNA expression and interactions can lead the normal cells into unwanted pathological statuses, including the different cancer types. We aimed to Fig. 18 Volcano plot indicates the differentially expressed genes in the BC a and GC b samples, compared to the control samples. In this plot, red indicates the up-regulated genes, and green indicates the low-expressed genes in these datasets. The lncRNA XIST indicates in these plots by a blue point as a low-expressed gene in the BC and high-expressed genes in the GC samples evaluate the expression and interactions of NEAT1 in the BC and GC patients experimentally and bioinformatically. The RIP or luciferase assay method is highly recommended to perform the mentioned interaction analyses. Also, we suggest that the expression level of XIST and MT-RNR2 in the human breast cancer and GC patients to evaluate the accurate expression pattern of these gene and lncRNA in the patients. The flow chart of the workflow in this study is presented in the Fig. 19.