Systematic Literature Review on Multivariate Analysis: Research Trends. Methods and Frameworks Fatimah ZUHRA Sultan Syarif Kasim State Islamic University. Riau Indonesia Corresponding Email: fatimah. zuhra@uin-suska. ARTICLE HISTORY Received . February 2. Revised . March 2. Accepted . April 2. KEYWORDS Statistics. Multivariate Analysis. Systematic Literature Review. Bibliometric Analysis. This is an open access article under the CCAe BY-SA license ABSTRACT The focus of this research is to look at the development of analytical multivariate statistical research from 2000-2025. This study tries to identify fundamentally and thoroughly which journal-related multivariate analysis developments are the most significant, the most influential researchers, what research topics are developing, the types of datasets and methods used in multivariate analysis. Bibliometric analysis is used to look systematically based on relevant information from scientific publications, multivariate analysis articles accessed through Ie Explore and Scopus. In the bibliometric analysis stage using Vosviewer, as many as 483 journals were analyzed for the number of citations, analysis of interconnected keywords and others. Therefore, it was found that the most significant journals were Ie Transactions on Biomedical Engineering. Ie Transactions on Geoscience and Remote Sensing. The most influential authors are B. Aiazzi. Baronti. Selva. total of 643 citations writing about the regression multivariate adopted to improve spectral quality. Emerging Research Topics Multivariate Time Series Forecasting. Methods that often appear are Regression. Correlations. Clustering. Principle Component Analysis. INTRODUCTION There is a lot of research that has been done on multivariate analysis applied in various fields of science (Adegoke et al. , 2019. Georga et al. , 2013. Lv et al. , 2020. Mujahid et al. , 2017. Raj et al. , 2020. Riaz et al. , 2019. Sahu et al. , 2016. Tandeo et al. There is still very little research done on the literature review on statistics as a whole (Lim. Yong B. et al. , 2016. Marchy & Juandi, 2. Although there have been some studies on the multivariate analysis review literature (James & McCulloch, 1990. Krzanowski, 2. But very few have conducted a literature review on multivariate analysis as a whole. Many research phenomena or problems involve many variables that should be analyzed simultaneously. Research involving more than one or two variables is called multivariate analysis (Deepa et al. , 2020. Ding et al. , 2020. Karadayi et al. , 2020. Khan et al. , 2. In simple terms, variables are said to be multivariate if the variables are observed simultaneously or simultaneously (Santoso, 2. If the observation of the research variables is not carried out simultaneously, then the right analysis is univariate The second characteristic of multivariate analysis is the data obtained from observations in which the data analysis is carried out simultaneously, where the variables are interconnected, both theoretically and empirically (Raj et al. , 2020. Rezaeieh et al. , 2020. Riaz et al. , 2. In multivariate analysis, the interpretation of the results of the analysis is carried out comprehensively or comprehensively (Georga et al. , 2013. Shen et al. , 2020. Succetti et al. , 2020. Sun et al. , 2. Multivariate analysis can be said to be the use of statistical methods related to several variables whose measurements are carried out JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . 109 simultaneously from each research object, with a simultaneous analysis process and comprehensive interpretation implementation. So that the important key to multivariate analysis is research in which there are relationships, so the analysis process must be carried out simultaneously (Solimun et al. , 2. The development of multivariate statistics is very rapid, this is because it is supported by the advancement of statistical software in its calculation and the increasing complexity of data problems that must be calculated multivariate , such as (Toubeau et al. , 2019. Wang et al. , 2018. Zeng et al. , 2016. Zhao et al. , 2. which tells a lot about deep learning. Neural network and others. In this study, we will look at the development of multivariate statistical analysis research for the last 25 years from 2000 to 2025 in Indonesia and other countries using the bibliometric analysis method. More specifically, this study has the following objectives: RQ1 Multivariate journal Which analysis is the most significant? RQ2 Who is the most influential researcher in the field of Multivariate Analysis? RQ3 What kind of research topics did researchers choose in the field of Multivariate Analysis? RQ4 What types of datasets are most used for Multivariate Analysis? RQ5 What methods are often used for Multivariate Analysis? This means that this research tries to identify fundamentally and comprehensively the development of Multivariate Analysis research applied in various fields of science such as in the fields of computer science, health, medicine, and so on. So that the development of methods, datasets, and multivariate analysis topics can be seen clearly. This research has four parts. The first contains an introduction that tells in an outline and in depth the reasons why this topic was raised. Next is the literature review, this is about reviewing the relevant literature related to Multivariate Analysis. Then the research methodology was also presented and how the analysis techniques of the bibliometrics were discussed. Furthermore, the findings of the research continued with the discussion and the last part is the Conclusion. LITERATURE REVIEW Multivariate Statistics Statistics is one of the fields of science required in various scientific fields that involve data processing or data analysis. Statistics itself is a science and method used to collect, organize, present, analyze, and interpret data into information to support effective decision-making (Suharyadi, 2. Statistics is practical knowledge and as an applied science that plays an important role in the application of methods and concepts to data analysis, experimentation activities, as well as observation and inference. However, many research phenomena or problems involve many variables that should be analyzed simultaneously. Research involving more than one or two variables is called multivariate analysis (Santoso, 2. In simple terms, there are four characteristics of multivariate digging variables Variables can be said to be multivariate if the variables are observed simultaneously or simultaneously. If the observation of the research variables is not carried out simultaneously, then the right analysis is univariate analysis. The second characteristic of multivariate analysis is that data obtained from observations is thus analyzed simultaneously, where the variables are interconnected, both theoretically and empirically. The third characteristic of multivariate analysis is the interpretation of the results of the analysis carried out comprehensively or comprehensively. JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . 110 Multivariate analysis can be said to be the use of statistical methods related to several variables whose measurements are carried out simultaneously from each research object, with a simultaneous analysis process and comprehensive So that the important key to multivariate analysis is research in which there are relationships, so the analysis process must be carried out simultaneously (Solimun et , 2. The development of multivariate analysis is currently very rapid, both in theory, method and application. This is because his research generally involves many variables that are measured or observed simultaneously, and very rarely that depend on only one variable. In addition, in line with the development of computing technology with the development of computers and Artificial Intelligence are easily accessible throughout the world and at relatively affordable prices. Thus, the complexity constraint in the multivariate analysis calculation process can be overcome. Bibliometric Analysis Bibliometric analysis is not only used to evaluate performance, but also for Mapping is a process that allows one to recognize the elements of knowledge and their configuration, dynamics, mutual dependencies, and interactions (Madjido. Knowledge mapping in bibliometrics is used for technology management purposes, including the definition of research programs, decisions regarding activities related to technology, the design of knowledge base structures, and the creation of education and training programs. In relation to bibliometrics, science mapping is a method of visualizing a field of science. This visualization is done by creating a landscape map. In the map, science topics will appear by entering bibliographic data, keywords, citations and others (Skute, 2. The bibliometric method uses bibliographic data from publication databases to build a picture of the structure of the scientific field (Zupic & Uater, 2. Bibliometric analysis is a study based on the assumption that researchers must communicate the results of their research to other researchers. Because this will provide scientific development if researchers carry out joint activities to study certain research topics (Tupan et al. , 2. Systematic review analysis is a method of understanding large amounts of information, and is a means of contributing to answers to questions about what works and what doesn't, and many other types of questions (Petticrew & Roberts, 2. addition, this method can also be used to map areas of uncertainty and identify where little or no relevant research has been conducted, but where new studies are needed (Petticrew & Roberts, 2. Bibliometric analysis is based on relevant information about scientific publications that can be retrieved from specific data sources. The information is usually organized in the appropriate search field. Relevant information from bibliometric databases on scientific publications, such as Source Identification . uch as Journal Title. Volume. Pag. Author Name. Company Address. Reference. Document Type. Title. Keyword. Abstract. Subject Title. Controlled term. Acknowledgement. (Glanzel, 2. While in (Arksey & O'Malley, 2. The relevant information used is as follows: Author, year of publication, location of study. Type of intervention and comparison . f an. duration of intervention. Study population. Research objectives. Methodology. Outcome of the results. JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . RESEARCH METHODOLOGY This type of research is descriptive research using bibliometric analysis. This study uses data from scientific publications in the field of multivariate statistics indexed on the Ie Explore and Scopus websites from 2000-2022. Steps to be taken in mapping science with bibliometric or systematic review methods according to (Zupic & Uater, 2. Broadly speaking. Research Design. Compilation Of Bibliometric Data. Analysis. Visualization. Interpretation. However, in this study, the research stage was inspired by several previous studies (Karatu et al. , 2023. Watts, 2. which is seen in the image Home Identify the need for systematic review Determination Of Common Keywords. Search Queries: "Multivariate" OR "Multivariate Analysis" OR "Multivariate Data" Planning Internship Search On Scientific Data Set:Ie Explore: 1705 Journal. ConferenceScopus: 201 Journal. N=1906 Data Processing n = 483 Bibliometric Analysis Citation Analysis Disseminate Results NetworkAnalysi Bibliometric Analysis Reporting Stage Source: Processed Research Data, 2024 Figure 1. Systematic Stages of Literature Review The research began by reviewing that a systematic approach of literature review would be used to look at the development of multivariate statistical analysis. This is because the systematic approach of literature review is a series of processes of identifying, assessing and interpreting all research evidence with the aim of answering certain research questions (Wahono, 2. JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . 112 As in Figure 1, the research stage consists of 3 stages, namely the Planning Stage. Bibliometric Analysis, and Reporting Stage. At the Planning Stage or research planning, it begins by identifying the need for this research, namely bibliometric analysis in the development of multivariate statistical analysis. The next stage is the determination of common keywords to be used on search engines. The keywords used with the following syntax are "Multivariate" OR "Multivariate Analysis" OR "Multivariate Data". Next is to search on the Ie Explore webside and with the keywords that have been determined, there are 1705 Journals and Conferences that appear related to multivariate analysis. The next search was on Scopus and found 201 multivariate journals with a total of 1906 journals and conferences. The second stage of Bibliometric Analysis begins with the selection of the data that has been obtained so that from 1906 data, only 483 data can be obtained. The first criterion for data selection is that articles are used only articles that are published in journals and not in conferences, then the time of publication of articles from 2000 to 2022, articles that are published in English. Data selection is also assisted by using software, namely OpenRifine software. Furthermore, biblimetric analysis was carried out using VosViewer software using Autor Keyword so that citation analysis and Network analysis could be seen. The final stage in this research is to make a report on the results of the research which will later be published in international journals. Research Question Research Question . bbreviated as RQ) is a research question that is determined to keep the review focused (Wahono, 2. The Research Question in this study has been modified from the research (Wahono, 2. are as follows: RQ1 RQ2 RQ3 RQ4 RQ5 Research Question Which journal is the most significant Multivariate Analysis Who are the most influential researchers in the field of Multivariate Analysis? What kind of research topics are chosen by researchers in the field of Multivariate Analysis What types of datasets are most used for Multivariate analysis? What methods are often used for Multivariate analysis Motivation Identification of the most significant journals in the field of Multivariate Analysis Identify the most active and influential researchers who contributed more to the reset of Multivariate Analysis Identify research topics and trends in Multivariate Analysis Identification of common datasets used in Multivariate analysis methods Identify opportunities and trends for methods from Multivariate analysis Source: Processed Research Data, 2024 RESULT AND DISCUSSION RQ1 Journal of the Most Significant Multivariate Analysis The distribution of data from 2000 to 2022 shows that the development of applied research in multivariate analysis has increased significantly and is still in demand by From Figure 2, it can be seen that more research has been published since 2010 until now. This is due to the increasingly complex development of computerization and the internet that requires thorough and complex data analysis (Chen & Wang, 2. So that in the future multivariate analysis research in various fields will continue to develop and is very interesting to do. JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . 113 Figure 2. Multivariate Research Trends Analysis from 2000-2025 The first Reseach Question is which Journal is the most significant application of Multivariate Analysis. Multivariate analysis is applied in various fields of science such as computer science, engineering, health, medicine and others. Of the 483 journals with more than 1000 articles, the author took the highest number of articles and the highest number of citations, 29 journals were found that applied multivariate analysis. The most citations are in the Ie Transaction on Biomedical Engineering journal with 12 articles and 1037 citations. Table 1. Number of Journal Publications and Citations NO. SOURCE TITLE QUANTITYARTICLES Ie Access Ie Communications Letters Ie Geoscience and Remote Sensing Letters Ie Journal of Selected Topics in Applied Earth Observations and Remote Sensing Ie Latin America Transactions Ie Sensors Journal Ie Transactions on Automatic Control Ie Transactions on Biomedical and Health Informatics Ie Transactions on Circuits and Systems for Video Technology Ie Transactions on Communications Ie Transactions on Control Systems Technology Ie Transactions on Cybernetics Ie Transactions on Geoscience and Remote Sensing Ie Transactions on Image Processing Ie Transactions on Industrial Electronics Ie Transactions on Industrial Informatics Ie Transactions on Information Theory Ie Transactions on Instrumentation and Measurement NUMBER CITATIONS JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . 114 Ie Transactions on Knowledge and Data Engineering Ie Transactions on Medical Imaging Ie Transactions on Neural Networks Ie Transactions on Neural Networks and Learning Systems Ie Transactions on Neural Systems and Rehabilitation Engineering Ie Transactions on Pattern Analysis and Machine Intelligence Journal of Statistical Software Journal of Computational and Graphical Statistics Multilevel Analysis: Techniques and Applications: Second Edition Multivariate Behavioral Research Neural Networks Source: Processed Research Data, 2025 Table 1 shows that multivariate analysis is applied in various journals with the most articles found in the Ie Acces journal, which is an interdisciplinary journal covering various fields of science. Furthermore, multivariate analysis is also widely applied in the field of computer science Ie Transactions on Neural Networks. Ie Transactions on Neural Networks and Learning Systems. Ie Transactions on Neural Systems and Rehabilitation Engineering. Journal of Computational and Graphical Statistics. In the field of Health. Ie Transactions on Biomedical and Health Informatics. Ie Transactions on Medical Imaging and many other journals. RQ2 Most Influential Researcher in the Field of Multivariate Analysis Table 2. Authors with the Highest Number of Citations NO. WRITER Aiazzi. Baronti. Mr. Selva your Rehman. Mandic Mandic. Rehman. Wu. Huang Ma. Derksen. Hong. Wright Chambon. Mr. Galtier. Arnal. Wainrib. Gramfort -H. Park. Simeone. Sahin. Shamai Diansheng Guo. Jin Chen. MacEachren. The Liao Bhattacharyya. Pachori Ghosh. Basu. Mr. O'Mahony Sagias. Karagiannidis NUMBER CITATIONS Source: Processed Research Data, 2024 JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . 115 With the systematic method of literature review, the authors who contribute the most and have an influence on Multivariate Analysis research can be detected well. The best authors are seen from the most cited articles. Table 2, shows the authors who are cited the most in the Multivariate study be B. Aiazzi. Baronti. Mr. Selva. with a total of 643 citations writing about the multivariate regression adopted to improve spectral quality (Aiazzi et al. , 2. Furthermore, research conducted by N. your Rehman. Mandic with a total of 313 citations with the research topic Multivariate empirical mode decomposition algorithm (Ur Rehman & Mandic, 2. RQ3 Research Topics Chosen by Researchers in the Field of Multivariate Analysis Broadly speaking, there are 3 divisions of research topics that can be taken in figure 3 below. The first topic that comes up is about Methods, the second is about Dataset Analysis, and the Application of Multivariate data. The word Multivariate data (Dataset Analysi. is connected with several other keywords Visualization. Visual Analytics. Information Visualization. Time Series. Clustering. Diversity. Change Detection. Principal Component Analysis. TimeFrequency Analysis. Empirical Model Decomposition. Correlation. Source: Processed Research Data, 2024 Figure 3. Lexical Network Analysis The word Multivariate Time Series (Data applicatio. is connected by several words, including variable Selection. Data Mining. Anomaly Detection. Machine Learning. Clustering. Regression. Prediction. Claasification. Copula. Forecaseting. Convolutional Neural Network. Deep leaning. Feature Selection. Mahalanobis Distance. Change Detection. Missing data. Time Series Prediction. Electroenchephalography . The topic that comes up the most is Multivariate Time Series forecasting, which researches JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . 116 the forecasting of time series data in a multivariate manner in various fields such as Health, multimedia, finance, biomedicine, computing, and others. What kind of research topics are chosen by researchers in the field of Multivariate Analysis Figure 4. Multivarite Research Topics Figure 5. Multivariate Time Series Research Topics Figure 6. Multivariate Data Research Topics Figure 7. Multivariate Analysis Research Topics Source: Processed Research Data, 2024 Meanwhile, the word Multivariate Analysis (Metho. is connected to the words Statistical Process Control. Multivariate Visualization. Data Mining. Anomaly Detection. Alzheimer's Disease. Classification. Correlated Failing. Diversity. PCA. Fault Detection. Correlation. EEG. Furthermore, for multivariate topics related to the words Neural Network. Regression. Selection variables, anomaly Detection. Multiple Linear Regression. Prediction. Machine Learning. Time Series. Classification. Deep Learning. Missing Data. Fault Detection. Time series Prediction. JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . 117 RQ4 Types datasets are most used for Multivariate analysis Types of datasets that are widely used in multivariate research of time series data analysis, data mining. Deep learning. Forecasting. Machine Learning, missing data, variable selection. Time series Prediction. Source: Processed Research Data, 2024 Figure 8. Temporal Network Analysis Figure 8, shows the development of the research topic from year to year. Dark blue indicates the longest research topic while light yellow indicates the most recent research topic on multivariate analysis. The latest research topics shown are multivariate time series. Machine Learning. Deep Learning. Neural Network. Missing Data. Anomaly detection, online Learning. Time Series Prediction. Missing Data. RQ5 What methods are often used for Multivariate analysis From the results of the systematic analysis of the literature review, data were obtained about the methods that are most often used in the research. Based on the results of the review from (A-Review-of-Multivariate-Analysis-2u9l6a05k2, n. James & McCulloch, 1990. Krzanowski, 2. Multivariate analysis consists of several methods, namely Decision Theory & Bayes Inference. Discriminant Analysis. Explanatory (Clustering. Multidimensional Scaling. Graphical Method. Regression. Canonical Correlation. Principla Componen Analysis. Factor Analysis (Explanatory Factor Analysis. Confirmatory Factor Analysis. Interpretatio. Path Analysis & Lisrel. Testing Hypothesis. Discreate Multivariate Analysis. JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . 118 Table 2. Writer Yes Method Analysis Regression Method Multivariate Correlation Method Clustering Method Principal Component Analysis Method Article Several papers discuss the regression method of analysis for multivariate data. (Arenas-Garcia et al. , 2013. Crino & Brown. Dell'Acqua et al. , 2015. Durichen et al. , 2015. Georga et , 2013. Marchant et al. , 2016. Miao et al. , 2013. Ming-Da Ma et al. , 2010. Ren et al. , 2020. Shanableh, 2012. Wang et al. , 2016. Ya Su et al. , 2. In addition, there is also a paper that discusses the correlation method (Ahmed & Mandic, 2012. Anderson et al. , 2012. Durichen et al. , 2015. Naveed & Rehman, 2020. Park et al. Vallet et al. , 2012. Wang et al. , 2016. Ye et al. , 2. There is also a Clustering method approach (Ahmad & Brown. An & Liu, 2019. Cappers & Van Wijk, 2018. Diansheng Guo, 2009. He & Tan, 2020. Javed et al. , 2015. Karami et al. Li et al. , 2019. Lu & Huang, 2020. Markley & Miller. Teng-Yok Lee & Han-Wei Shen, 2009. Yu et al. , 2. Furthermore, the method commonly used in multivariate analysis is the Principal Component Analysis method (Ahmad & Brown, 2014. Alduais et al. , 2017. Ermolova & Tirkkonen. Fujiwara et al. , 2021. He & Tan, 2020. Karami et al. Peruchi et al. , 2020. Sun et al. , 2020. Yu et al. , 2. Explanatory Factor Analysis. Confirmatory Factor Analysis. Interpretatio. Explanatory (Clustering. Multidimensional Scaling. Graphical Method. Path Analysis Lisrel Source: Processed Research Data, 2024 JURNAL SAINTIFIK (Multi Science Journa. Vol. 23 No. 2 May 2025 page: 109 Ae . CONCLUSION Based on the analysis carried out, the following conclusions can be drawn the focus of this study is to see the development of multivariate statistical analysis research from 2000 to 2025. This study tries to identify fundamentally and thoroughly about the development of multivariate analysis related to which journals are the most significant, the most influential researchers, what research topics are developing, the types of datasets, and the methods used in multivariate analysis. Bibliometric analysis is used to look systematically based on relevant information from scientific publications, multivariate analysis articles accessed through Ie Explore and Scopus. In the bibliometric analysis stage using Vosviewer, as many as 483 journals were analyzed for the number of citations, analysis of interconnected keywords and others. The development of the trend of multivariate analysis is applied in various fields of science with various applications, such as in the fields of Health, medicine, computer networks, taransport, biology, engineering and many others. The most developed methods of multivariate analysis are regression methods, correlation methods, factor analysis and analysis component principles. The goal is to simplify complex data or models into something easier to analyze. The most widely used types of datasets are multivariate time series data types and time series forecasting. And the direction of multivariate analysis research in the future that will continue to develop is multivariate time series and multivariate forecasting. It was found that the most significant journals were Ie Transactions on Biomedical Engineering. Ie Transactions on Geoscience and Remote Sensing. The most influential authors are B. Aiazzi. Baronti. Selva. with a total of 643 citations writing about the multivariate regression adopted to improve spectral quality. Emerging Research Topics Multivariate Time Series Forecasting. Methods that often appear are Regression. Correlations. Clustering. Principal Component Analysis. There are still many literature studies that have not been carried out in this study that can be followed in the next research such as the framework study of each multivariate analysis model. REFERENCES