J. Indones. Math. Soc. Vol. No. , pp. 338Ae351. ANALYSIS OF UNEMPLOYED YOUTH IN INDONESIA BY PANEL DATA REGRESSION WITH MODERATING VARIABLE Sri Subanti1 , and Nabila Tri Amanda2 Universitas Sebelas Maret. Surakarta. Central Java. Indonesia, sri subanti@yahoo. Universitas Sebelas Maret. Surakarta. Central Java. Indonesia, nabilatriamanda32@gmail. Abstract. Indonesia is entering the era of demographic bonus with the productive age dominating the population. Productive age is the main focus of the government in maximizing the demographic dividend, but Indonesia has the highest percentage of Not in employment, education or training (NEET) in Asia. NEET are people on 15-24 years old who do other activities outside of school, work or training. This study aims to analyze NEET in Indonesia using panel data with moderate regression The analysis of multiple linear regression is focused on the relationship between the independent and dependent variables without taking into other possible By inserting a moderating variable, this study explores the relationship between the independent and dependent variables differently and aims to strengthen or weaken it. Under certain conditions, the relationship between the independent and dependent variables can be explained by the moderating variable. The research data used were obtained from the employment book and the website of BPS Indonesia, in the form of 34 cross section and 5 years time series data that tends to be The dependent variable is NEET with 5 independent variables including Human development index, the open unemployment rate, labor force participation rate, proportion of individuals who own phone, and proportion of informal employment. The moderating variable is the proportion of youth aged with ICT skills. The best model in regression analysis panel data is FEM with 4 significant independent variables and 92. 75% of R-square. Moderating variable can moderates the relationship of NEET with its independent variables and increased the R-square to 94. Key words and Phrases: Not In Employment Education Or Training. Moderating. Panel Data Regression 2020 Mathematics Subject Classification: 91B82 Received: 05-01-2024, accepted: 09-09-2024. Analysis of Unemployed Youth in Indonesia INTRODUCTION Indonesia is a country that ranks fourth in the world as a country with a large Based on Worldometers Indonesia . IndonesiaAos population in 2020 29% of the global population, or as many as 269 million people. The findings of the National Labor Force Survey (Sakerna. in August 2021. Indonesia even entered the demographic bonus era, namely the young productive age . -64 year. dominated the number of residents in the country. According to EUROFOUND . , this demographic dividend can occur because the number of dependency groups is low, but this cannot be realized if the policies implemented by the government and private sector to increase labor supply and youth labor force participation are not realized. In 2017, the International Labour Organization (ILO) defined NEETs as young individuals aged 15 to 24 who are not employed, educated, or trained, education or training. Indonesia has one of the highest NEET rates in Asia . Referring to Figure 1, the percentage of NEET in Indonesia exhibited an increase Figure 1. IndonesiaAos Percentage of NEET Despite a decrease to 21. 77% in 2019, the highest peak occurred in 2020, this is allegedly reinforced by the Covid-19 pandemic. The NEET issue is crucial and demands attention from the government. A substantial and prolonged NEET phenomenon will hamper the economy, industrial development, and endanger the stability and development of a country. The percentage of young NEET in Indonesia is assumed to be influenced by several variables. This study aims to analyze the independent variables that affect the young NEET with the moderating variables. Multiple linear regression analysis allows the independent variable to impact the dependent variable without considering any other influences. The aim of this study is to gain a more complete SUBANTI. AND AMANDA. NT. understanding of the phenomenon being investigated by incorporating variables that are presumed to enhance or weaken the relationship between the independent and dependent variables. The moderating variable is able to explain the possibilities and conditions under which the relationship between the independent and dependent variables can change. The independent variables include the human development index (HDI), open unemployment rate, the proportion of individuals who own mobile phones, labor force participation rate, and the proportion of informal employment. The moderating variable in this study was the proportion of adolescents and adults aged 15-24 years with information and computer technology (ICT) skills. Based on the previous explanation, the percentage of young NEET will be analyzed using panel data regression with moderation variables. This study is expected to provide an overview of young NEET in Indonesia, in order to provide insights for the government and the public in minimizing the number of NEET in Indonesia. THEORETICAL BASIS Not in Employment. Education or Training (NEET). The NEET percentage is calculated using Equation . The Central Statistics Agency defines Not in Employment. Education or Training (NEET) as young individuals who participate in other activities outside of school, work, or training. The population in the young age category is the male and female population aged 15-24 The NEET percentage is calculated using Equation . PrNEET = (PTB15Oe24 PTS15Oe24 PTT15Oe24 ) /P15Oe24 y 100%, . PrNEET : percentage of young people who are not in school, work, or training (%). PTS15Oe24 : the number of young people who do not go to school . PTB15Oe24 : the number of young people who do not work . PTT15Oe24 : the number of young people who do not attend training . P15Oe24 : total population aged 15-24 years . Regression Analysis of Panel Data. Based on Gujarati . , panel regression is a regression with a panel data structure that combines cross-sectional data and time-series data. Cross-sectional data are observations collected from one or more units at a single point in time, while time-series data are observations collected over time for one or more variables. The panel data regression equation is shown in Equation . Yit = it A Xit AAit , . Analysis of Unemployed Youth in Indonesia Yit : individual unit response variable at period t. it : intercept coefficients of the i-th individual units and t-th time periods slope. A : slope coefficient vector. 1 : coefficient of independent variable. 2 : coefficient of moderating variable. 3 : coefficient of interaction. Xit : predictor variable of i-th individual unit at period t. Mi : moderating variable. AAit : residual component with mean 0 and variance E 2 . Belonging to Baltagi . , panel data regression has three approaches, namely Common Effect Model (CEM). Fixed Effect Model (FEM), and Random Effect Model (REM). The Common Effect Model (CEM). Combines data without regard to individuals and time. Refers to Baltagi . , the CEM regression equation is shown in Equation . Yit = A Xit AAit . Fixed Effect Model (FEM). FEM is an estimation of panel data that can be differentiated by individual and time. The slope coefficient of the FEM model is constant, but the intercept coefficient changes with each individual and time. Appropriate to Gujarati . , the FEM equation is shown in Equation . Yit = AAi t A Xit AAit . Random Effect Model (REM). The correlation between error terms due to changes in time and individuals is involved in REM. Greene . The REM Model is often called the generalized least square with Equation 5. Yit = it A Xit Oit . Panel Data Regression Model Selection. In choosing the panel data regression model, two tests are performed: the Chow Test and Hausman test. CEM and FEM are compared to select the best model in the Chow Test. This test can be done by looking at the significance of FEM model using the F-statistical test. Based on Widarjono . , the Hausman test is used to test between two models, fixed effect model, and random effect model. If both tests are performed, and the p-value is smaller than the significance level, then the null hypothesis is rejected, and the chosen model is the fixed effect model (FEM). SUBANTI. AND AMANDA. NT. Classical Assumption Test. Normality tests are used to determine if variables that disrupt or remain have a normal distribution or not in regression models. A good regression model should have a normal or near-normal distribution. To test whether the data is normally distributed or not, the Jarque-Bera test (J-B) can be employed. Baltagi . Multicollinearity test is necessary to determine whether or not there are independent variables that share similarities between them in a regression model. correlation exists, it is indicated that the regression model is experiencing a multicollinearity issue. Gujarati . The multicollinearity test is done by examining at the value of tolerance and the value of the variance inflation factor (VIF). Multicollinearity problems are not present when the VIF value is below 10. To test heteroscedasticity, the glacier test can be applied. The Glejser test is conducted by making a regression equation of the absolute value of the residue with the independent variable . The autocorrelation test is aimed at establishing a correlation in a linear regression model between the disturbance error . in the period t with the error in the period t Oe 1 . If there is a correlation, then there is a problem of autocorrelation. A regression model is considered good when detached from autocorrelation. The Durbin-Watson test can be used to determine whether autocorrelation is present or absent. Moderated Regression Analysis (MRA). Based on Liana . Moderated Regression Analysis (MRA) is a special application of multiple linear regression in which the regression equation contains interaction . ultiplication of two or more predictor variable. The moderated regression model is shown in Equation . Yi = 1 Xi 2 Mi 3 Xi A Mi A. The classification of moderation variables is determined by the significance of the parameters in the regression model of equation 6, which is explained in Table Table 1 shows the classification of moderation types based on the significance of Analysis of Unemployed Youth in Indonesia Table 1. Classification of Moderation Moderation Type Coefficient Absolute Moderation 1 and 2 insignificant 3 significant Pure Moderation 2 insignificant 3 significant Quasi Moderation 2 significant 3 significant Homologiser Moderation 2 insignificant 3 insignificant Prediktor Moderation 2 significant 3 insignificant Model Goodness Test. Model goodness test consists of 3 tests, namely simultaneous test (F-tes. Partial Test . -tes. , and coefficient of determination. Hypothesis testing research which includes F-test is done to determine the presence or absence of the influence of predictor variables with response variables. Misbahuddin and Hasan . If the value of Fcount > Ftable or p-value is smaller than the significant level, the null hypothesis is rejected, indicating that the independent variables jointly affect the dependent variable. Partial test is used to test the coefficient . regression individually. This test is done for each coefficient separately . If the value of tcount > ttable or p-value is less than the significant level, then the null hypothesis is rejected. This means that the independent variables individually affect the dependent variable. METHODOLOGY Study Area and Data Sources. The study was conducted on the percentage of Not in Employment Education Training (NEET) young people . -24 year. based on provinces in Indonesia in The Labor book in the third edition of 2021 is where we got the data for this study, the Satu Data Ketenagakerjaan website, and the official website of the Central Statistics Agency (BPS). The observation units used are the 34 provinces in Indonesia. Study Variables. Variables used in this study consisted of a dependent variable, five independent variables, and one moderating variable. The explanation of variables is shown in Table 2. Research Methodology. The software used is Microsoft Excel and EViews 10. The steps taken in this study are as follows: SUBANTI. AND AMANDA. NT. Table 2. Variables used in this research Category M . oderating variabl. Name of Variable Percentage of Not In Employment Education Or Training (NEET) Of Youth . -24 Years ol. Human Development Index (HDI) The open unemployment rate The labor force participation rate Individuals who own a mobile phone The proportion of informal employment The proportion of youth aged 15-24 years old with information technology and computer skills . Determining the dependent, independent, and moderating variables . Building the regression models: Common Effect Model (CEM). Fixed Effect Model (FEM), and Random Effect Model (REM) . Performing model selection tests, namely Chow test and Hausman Test . Conducting assumption tests consisting of heteroskedasticity and multicollinearity tests . Performing Moderated Regression Analysis on predictor variables to response variables with the moderating variables . Checking the goodness of fit of the MRA model panel data using Partial Test . -tes. , simultaneous test (F-tes. , and coefficient of determination (R2 ) . Model interpretation and drawing conclusions RESULT AND DISCUSSION Regression model estimation. Analysis regression model for the NEET variable, found that the X4 variable did not significantly affect the NEET variable, so the X4 variable was excluded from the model. The estimation results of the CEM regression parameters are shown in Table 3. Table 3. Estimation of Common Effect Model Var Coeff Prob The estimation results of the FEM regression parameters are shown in Table Fixed effect model has fixed Intercept values that are used between individuals. Analysis of Unemployed Youth in Indonesia Table 4. Estimation of Fixed Effect Model Var Coeff Prob The estimation results of the REM regression parameters are shown in Table Random effect model has intercept value that is not considered a constant, but is considered a random variable with an average value. Table 5. Estimation of Random Effect Model Var Coeff Prob Based on Tables 4, 5, and Table 3 it can be seen that the variables X1 . X2 . X3 , and X5 are significant in all three models, but the moderation variable has a p-value > 0. 05 in the estimated fixed effect model (FEM) which means that the moderation variable is not significant simultaneously to the relationship between the variable Y and the predictor variable. Judging from the value of R-square, common effect model is worth 71%, fixed effect model is worth 92%, and random effect model is worth 50%, with this FEM model has the ability to explain the dependent variable highest among 2 other models. Selection of Model Estimates. The results of the Chow and Hausman tests are shown in Table 6. Table 6. The Chow and Hausman Test Effects Test Cross-section F Cross-section Chi-square Cross-section random Statistic Prob. Based on Table 6, it can be seen that in the Chow Test, a probability value 00 < . is obtained. Therefore, the fixed effect model has been proven to be better than the common effect model. The testing continued to the Hausman SUBANTI. AND AMANDA. NT. test with a probability value of 0. 00 < . Thus, the test is stopped, and the selected model is the fixed effect model. Panel Data Regression Model Equation. The selected model through testing is the fixed effect model shown in Table Table 7. Fixed Effect Model (FEM) Variable Coefficient Prob. The panel data regression equation with moderation variables for the fixed effect model (FEM) as the estimation of the selected model is written as follows: NEET = Oe48. 11X1 0. 44X2 Oe 0. 29X3 0. 26X5 Oe 0. 04M A. From the regression model formed, the interpretation obtained is that every increase in one unit of HDI with other variables constant, then NEET in Indonesia will increase by 1. If there is an increase of one unit of the open unemployment rate with other variables constant, then NEET in Indonesia will increase by 0. Next, for each increase in the labor force participation rate with other variables constant, then NEET in Indonesia will decrease by 0. 2%, and each increase of one unit of the proportion of informal employment with other variables constant, then NEET in Indonesia will increase by 0. Classical Assumption Test. Heteroscedasticity test results are shown in Table 8. Table 8. Heteroscedasticity Test Results Variable Coefficient t-Stat Prob. The heteroscedasticity test is done by the Glejser method, which is to regress the independent variable to the absolute value of the residual. Based on Table Analysis of Unemployed Youth in Indonesia 8, it is known that the value of p-value = 0. 00 < . the residuals have a homogeneous variety. Multicollinearity test results are shown in Table 9. Table 9. Multicollinearity Test Results Variable VIF Result 091 No multicollinearity 864 No multicollinearity 725 No multicollinearity 298 No multicollinearity 822 No multicollinearity Based on Table 9, it can be seen that the Variance Inflation Factor (VIF) values are all less than 10, which means that there is no multicollinearity. The results show that the independent variables used and the dependent variable or related variables are not in a linear relationship. Moderated Regression Analysis. Moderated Regression Analysis results are shown in Table 10. Table 10. Moderated Regression Analysis Variable X1*M X2*M X3*M X5*M Coefficient t-Statistic Prob Based on Table 10, the panel data regression model with moderation variables can be written as follows: NEET = Oe149. 917X1 0. 0003X5 1. 662M Oe 0. 0137X1 M Oe 0. 009X3 M. From the equation, the interpretation is as follows: every increase of one unit of HDI variable with other variables constant, then NEET will increase by 1. Every increase of one unit of the informal employment variable proportion with other variables constant, then NEET will increase by 0. 0003 percent. Each increase of one unit variable proportion of young age with ICT skills with other SUBANTI. AND AMANDA. NT. variables constant, then NEET will increase by 1,662 percent. Every increase of one unit of interaction variable between HDI with ICT skills proportion with other variables is constant, then NEET will decrease by 0. 0137 percent. Each increase of one unit of interaction variable between the labor force participation rate with ICT skills proportion with other variables is constant, then NEET will decrease by 009 percent. Test Goodness Test of Model MRA Panel Data. Goodness test of the panel data MRA model is conducted by examining the simultaneous and partial significance of moderation variables and their interactions. Additionally, the comparison between the coefficient of determination of the MRA model and panel data regression without interaction is observed. Simultaneous Influence Significance Test (F Tes. The F test is conducted to determine the presence or absence of the influence of independent variables together against the dependent variable. Table 11. F-test F-statistic Prob Based on Table 11, moderated regression analysis shows that the value of Prob (F-sta. 000 < 0. This indicates that the variables X1 . X2 . X3 . X4 , and moderating variables together affect the percentage of NEET variables in Indonesia. Partial Significance Test . -tes. Partial testing is performed to determine the nature of the moderating variables in the relationship between dependent and independent variables. Table 12. Partial test of moderation against X1 Variable Coeff Interaksi -0. Prob Based on Table 12, the coefficient values of 1 , 2 , and 3 are not significant because they have a probability > 0. Thus, the moderation variable is categorized as homologiser moderation for the relation of NEET and HDI. Analysis of Unemployed Youth in Indonesia Table 13. Partial test of moderation against X2 Variable Coeff Prob Interaksi 0. Based on Table 13, it is known that the value of the coefficients 3 is significant with a probability of < 0. 05, which is equal to 0. 045, but the value of coefficients 1 and 2 is not significant because it has a probability > 0. 05, so the moderation variable is categorized as pure moderation of the relations between the open unemployment rate with the percentage of NEET. Table 14. Partial test of moderation against X3 Variable Interaksi Coeff Prob Based on Table 14, it is known that the coefficient values of 1 , 2 , and 3 are not significant because they have a probability > 0. 05, so the moderation variable is homologiser moderation for the relation of NEET and labor force participation Table 15. Partial test of moderation against X5 Variable Interaksi Coeff Prob Based on Table 15, it is known that the coefficient values of 1 , 2 , and 3 are not significant because they have a probability > 0. 05, so the moderation variable is homologiser moderation for the relation of NEET and the proportion of informal Coefficient of Determination. Tabel 16 shows the coefficient of determination for models with and without moderator variables. SUBANTI. AND AMANDA. NT. Table 16. Coefficient of Determination R-squared with MRA R-squared without MRA From the test results, obtained coefficient of determination (R2 ) of 0. This means that the ability of the model can explain the diversity of the dependent variable by 94%. According to this analysis, the modelAos ability to explain the dependent variable can be improved by 1. 3% by using panel data regression with moderation variables. CONCLUDING REMARKS Conclusion. Based on the results of the analysis, it was concluded that NEET analysis using panel data regression has the best model of Fixed Effect Model with the following equations: N EET = Oe10. 55X1 0. 48X2 Oe 0. 28X3 0. 23X5 A. The Human Development Index variable, open unemployment rate, labor force participation rate, and proportion of informal employment influence the percentage of the NEET variable in Indonesia with an R-square of 92%. The equation Model, including moderation variables, is considered better because it can increase the value of R-square to 94% with the following equation: N EET = Oe149. 918X1 0. 426X5 1. 66M Oe 0. 013X1 A M Oe 0. 009X3 A M A. Moderating variable, the proportion of young age with ICT skills, moderates the relationship of the NEET percentage variable with its independent variable. The interaction between the variable X1 and the moderating variable is a homologizer moderation for the relationship between Y with X1 . The interaction between the variable X3 and the moderating variable is also a homologizer moderation for the relationship between Y with X3 . Suggestion. Based on the results and discussion of this research, there are several suggestions that can be carried out for further research, namely the need to add more independent variables that influence NEET. Researchers can also compare interaction tests with regression analysis which is moderated by several independent variables with other independent variables. In addition, there is a need for various methods in analyzing the NEET variable in the case of moderating variables, for example using the logistic regression method with moderating variables. This research is also open to constructive criticism and suggestions, in order to perfect the analysis more clearly. Analysis of Unemployed Youth in Indonesia REFERENCES