p-ISSN : 2745-7141 e-ISSN : 2746-1920 Jurnal Pendidikan Indonesia Vol. 6 No. January 2025 ASSESSMENT OF SCIENTIFIC REASONING-COMMUNICATION SKILLS (SR-CS) TEST ON WORK AND ENERGY CONCEPT: DEVELOPMENT. CONTENT VALIDITY AND RASCH MODEL ANALYSIS Fajar Fanika1*. Selly Feranie2. Parlindungan Sinaga3 Departemen Pendidikan Fisika. Universitas Pendidikan Indonesia. Indonesia fajarfanika@gmail. INFO ARTIKEL Keywords: Scientific reasoning skills. communication skill. rasch model. ABSTRACT Studies indicate that scientific reasoning assessments are still rarely integrated as higher-level assessments and new science education standards in school instruction. This study aims to develop a scientific reasoning skills test instrument that is integrated with a scientific communication skills test instrument (SR-CS tes. on work and energy concept. This study used the ADDIE procedure. The research procedure consists of five stages including analyzing, designing, developing, implementing, and evaluating. The initial draft of SR-CS test consisted of 14 multiple choice scientific reasoning skills (SRS) questions and 8 open-ended scientific communication skills (SCS) questions. The results of the expert judgement were analyzed using the content validity index (CVI) and obtained a value of 0,94 . ery suitabl. for the SRS instrument and 0. ery suitabl. for the SCS After being revised based on the expert suggestions, the test instrument was tested on 25 students . girls,10 boy. aged 16-17 years. The trial data were analyzed using the Rasch Model to obtained item fit . , reliability, distinction level, and difficulty level. The results show that 14 questions of SRS instrument have item validity and 8 questions of SCS instrument have item validity. Besides that, the item reliability of the SRS and SCS test instrument is 0. 79 and 0. 91, respectively. Meanwhile, the person reliability is 0. 82 (SRS) and 0. 91 (SCS). Therefore, the SR-CS test is valid and reliable so that it can be used to measure scientific reasoning skills and scientific communication skills of students in further research. Introduction Scientific reasoning is one of the 21st-century skills that students must learn. One of the science process skills that must be mastered is reasoning, which is necessary for the planning and interpretation of experimental outcomes (Coleman et al. , 2. Critical thinking skills even frequently include reasoning (Birgili, 2015. Tiruneh et al. , 2. Critical thinking skills and other higher-level cognitive skills cannot fully develop without powerful reasoning skills. Previous research has provided information on Indonesian high school students' capacity for scientific thinking, particularly in physics classes. Research on high school students' capacity for scientific reasoning reveals that students' scientific reasoning skill is low Jurnal Pendidikan Indonesia. Vol. 6 No. 1 January 2025 Fajar Fanika1*. Selly Feranie2. Parlindungan Sinaga3 (Ayuningtyas & Pramudya, 2019. Khoirina & Cari, 2018. laela Ermaya & Mashuri, 2. According to early research, 49% of participants are thought to be at the Concrete Operational level, 49% to be at the Early Transitional level, and 2% to be at the Late Transitional level. The assessment of students' scientific reasoning on the subject, regardless of the learning method employed, reveals that students' scientific reasoning level falls into the low category. Scientific reasoning skills can be trained with good communication efforts from students in conveying their arguments. A person can communicate well if they have good arguments. This can be realized if it is familiarized with the atmosphere of discussion, exchanging opinions in conveying ideas or scientific arguments. This is also what is emphasized during the learning process in the classroom so that there is always social interaction between students and students, students and teachers, and students and the environment in conveying their thinking Knowledge that has been formed by students actively, not only passively received from the teacher but also must communicate their thinking process both orally and in writing (Fadly, 2. Training science communication skills to students makes students able to express the science ideas they have. However, a report prepared by (McInnis et al. , 2. for the Australian Council of Deans of Science said that communication skills . ral, interpersonal, and writte. consistently did not meet the predetermined criteria. Based on this, researchers will develop instruments that can measure students' scientific reasoning skills and scientific communication skills. Assessment is the activity of interpreting measurement data based on certain criteria or rules (Widoyoko, 2. A good assessment instrument must meet reliability criteria. Reliability criteria include validity, reliability, difficulty level, and differentiation/discrimination. To test the reliability/quality of the learning outcomes assessment instrument, an instrument trial was conducted. Instrument testing can be done internally and externally. Internal trials are conducted to experts to see content and construct validity, as well as grammar (Widoyoko, 2. Expert judgment is needed to consider the structure of the instrument is correct or not with the structure or scientific arrangement used in compiling the instrument. To stabilize the validity of the assessment instrument, it is necessary to conduct an external trial, namely a field trial (Arikunto, 2. Field trials can be carried out on subjects who are similar / equivalent to the subjects to be Then after the trial, it is necessary to analyze the results of the trial, including analyzing the validity, reliability, difficulty level, and differentiation analysis of the questions. Analysis of learning outcomes assessment items from field trials / empirical trials can be done in a classic way known as classical test theory (CTT) or modern with item response theory (IRT). Classical test theory is based on the observed score which is the sum of the true score and the measurement error score. The quality of the items is determined by the level of difficulty and distinguishing power (Hardianti et al. , 2. , but the characteristics of the items are inconsistent depending on the ability of the students (Erfan et al. , 2. The modern method with the Rasch Model, which was first proposed by Dr. Georg Rasch, a mathematician from Denmark, is here to overcome the weaknesses of the classical method. The Rasch Model uses raw scores in different ways to produce a measurement scale with the same interval, so that it can provide accurate information about test takers and question quality (Sumintono & Widhiarso, 2. This article tries to describe the results of analyzing the quality of test instruments in aspects of validity, reliability, difficulty level, and differentiability through Rasch Model The test instrument analyzed in this study is a scientific reasoning skill measurement test instrument integrated with scientific communication skills (SR-SC). Research Methods Research and Development (R&D) is the type of research being done to create the physics assessment, which is the process for developing and validating product (Ardiyanto & Jurnal Pendidikan Indonesia. Vol. 6 No. 1 January 2025 Assessment of Scientific Reasoning-Communication Skills (SR-CS) Test on Work and Energy Concept: Development. Content Validity and Rasch Model Analysis Fajaruddin, 2. The five stages of the ADDIE procedure, analyzing, planning, developing, implementing, and evaluating were applied in this investigation. The SR-CS test is a predetermined assessment tool created by the researcher to gauge scientific reasoning and communication abilities. Five experts were given a content validity exam by the researcher. addition, the instrument was refined in response to professional advice before being given to the pupils. Following deployment. Ministep 9. 0 software was used to assess each student's reaction using Rasch analysis. The participants of this research are the 11th-grade public high school students who are locating in Balaraja. Tangerang Regency. Indonesia. This A-accriditate public high school has been established since 1995. Most of the students in Tangerang Regency are a mix of Sundanese and Javanese ethnic. The participants were 25 students . females, 10 male. aged 16-17 years. The scientific reasoning skills (SRS) test, which had 14 multiple-choice questions, and the scientific communication skills (SCS) test, which had 8 open-ended questions, were the test instruments that were designed. The SRS test instrument includes seven aspects, there are control of variables, probability reasoning, correlational reasoning, hypothetical-dedeuktive reasoning, deduktive reasoning, induktive reasoning, dan causal reasoning. The SCS test instrument consists of information representation and scientific reading. In expansion, there's an expert assessment rubric for content validity Five experts assess the completed test instrument. The content validity index (CVI) developed by Lynn . , was used to analyze the expert assessment results. After that, the test instrument was enhanced according to of the experts' recommendations and comments. students were assigned the test after it had been revised. The Rasch Model was then used to analyze the test results. The test instruments' item fit, reliability, difficulty level, and distinction level are all The outfit z-standard (-2. 0 JMLE Measure Ou 1SD), moderate . SD > JMLE Measure Ou mean logi. , easy . ean logit > JMLE Measure Ou -1SD), and very easy (JMLE Measure < -1SD) (Soeharto & Csapy, 2. Jurnal Pendidikan Indonesia. Vol. 6 No. 1 January 2025 Fajar Fanika1*. Selly Feranie2. Parlindungan Sinaga3 Results and Discussion Analyzing The first step of the research is a review of the literature on scientific communication skills (SCS) and scientific reasoning skills (SRS). The researcher discovered how the test instrument was created after reviewing the literature. When creating the SRS test instrument, the researcher refers to the Lawson Classroom Test Scientific Reasoning (LCTSR) rubric. the meantime, scientific reading and information representation are being considered in the creation of the SCS test instrument. The physics content for which the SRS and SCS test instruments will be created has been determined to be work and energy. Designing At this point, the researchers created SRS and SCS test instrument indicators. The aspects noticed serve as the basis for developing the indicators. Control of variables, probabilistic reasoning, correlational reasoning, hypothetical-deduktive reasoning, deduktive reasoning, induktive reasoning, and causal reasoning are some of the components of science reasoning skills that are observed. In the meantime, scientific reading and information representation were considered components of scientific communication skills. In addition, indicators are used to create the SRS and SCS items. There are 14 multiple-choice questions on the SRS test, and 8 open-ended questions on the SCS test. Figure 1 shows an example of an SRS and SCS Developing During the development phase, the researcher made enhancements to the SRS and SCS test instruments, which were created using feedback and recommendations from experts. The Jurnal Pendidikan Indonesia. Vol. 6 No. 1 January 2025 Assessment of Scientific Reasoning-Communication Skills (SR-CS) Test on Work and Energy Concept: Development. Content Validity and Rasch Model Analysis content validity index (CVI) is used to determine expert judgment in order to assess the validity of the instrument. The I-CVI and S-CVI are the two components that form the overall CVI score assessment. A question item's validity score is displayed using I-CVI. Conversely. S-CVI displays an instrument's overall validity. For 3-5 experts, the I-CVI value should ideally However, the S-CVI value does not go below 0. According to Lynn . , the I-CVI value should be at least 0. The overall CVI scores for the SRS and SCS test instruments were determined by an expert evaluation, and the results were 0. 94 and 0. 97, respectively. Implementing Test subjects were 25 public high school students in their 11th grade, using the recently revised SRS and SCS test instruments. SR-CS test instrument testing was carried out at the beginning of the year, in March. Students can use a paper-based testing system to access the SRS and SCS test instruments. The time allocation for doing the SRS and SCS tests (SR-CS tes. is 90 minutes. Evaluating To ascertain item validity, reliability, distinction level, and difficulty level, each student's results were analyzed using ministep software. Table 3 shows the results for the MNSQ outfit. ZSTD outfit, and Pt-Measure Corr. of scientific reasoning skills items. Table 4. The interpretation of Scientific Reasoning Skills Item Fit and Distinction Level Outfit Pt. Mea Corr. Item Fit Interpre Accepted Accepted Accepted Accepted Accepted Accepted Accepted Accepted Accepted R10 Accepted Deduc tive R11 Accepted Hypothe tical-de R12 Accepted Causal R13 Accepted Probabi R14 Accepted SRS Aspect Item Num Inductive Correla Probabi Hypothe tical-de Control of Varia bles Correla Causal Probabi Hypothe tical-de Inductive Jurnal Pendidikan Indonesia. Vol. 6 No. 1 January 2025 Distinc Interpre Very Good Very Good Very Good Very Good Very Good Very Good Very Good Very Good Very Good Very Good Very Good Very Good Very Good Very Good Fajar Fanika1*. Selly Feranie2. Parlindungan Sinaga3 According to the table, 12 items satisfy all three criteria, indicating that they are fit. Even though two of the goods only satisfy the ZSTD and Pt Measure Corr requirements, they can still be considered approved (Sumintono & Widhiarso, 2. Items R1 and R2 indicate that the MNSQ output value surpasses the 1. 5 score limit, while item R5 indicates that the Pt. Measure Corr value is less than 0. The Pt. Measure Corr value for the remaining items is in the vicinity of one. The degree of discriminating power increases with proximity to one. Table 4 shows the results for the MNSQ outfit. ZSTD outfit, and PT-Measure Corr. of Scientific Communication Skills items. Table 4. The interpretation of Scientific Communication Skills Item Fit and Distinction Level Outfit SCS Aspect Informa tion repre Scientific Pt. Mea Corr. Item Fit Interpre Accepted Accepted Accepted Accepted Accepted Accepted Accepted Accepted Item Num Distinc Interpre Very Good Very Good Very Good Very Good Very Good Very Good Very Good Very Good According to Table 4, all items match the criteria for MNSQ. ZSTD, and Pt-Measure Corr. values, indicating that the items are suitable and can be used in research aimed at detecting students' scientific communication skills. While the MNSQ outfit value of six items is close to 00, indicating that the items have a fair level of consistency, the range of PT-Measure Corr. values for eight items is close to between 0. 50 and 0. this suggests that the items have impoverished differentiating strength. In addition. Ministep Software will generate Cronbach's Alpha (), item reliability, and person reliability. Table 5 displays the SRS and SCS test instrument summary statistics. Table 5. Summary Statistic of Measured Item and Person for each SRS and SCS Test Instrument Mean Mean Measure Mean Outfit MNSQ Mean Outfit ZSTD Realibility Item SRS Person Item SCS Person Jurnal Pendidikan Indonesia. Vol. 6 No. 1 January 2025 Assessment of Scientific Reasoning-Communication Skills (SR-CS) Test on Work and Energy Concept: Development. Content Validity and Rasch Model Analysis CronbachAos For SRS items, the Rasch model yields good person and item reliability scores of 0. 89 and 81, respectively. In contrast, the SCS test instrument's person and item dependability values 59 and 0. 61, respectively, indicating low item reliability (Sumintono & Widhiarso. The results presented suggest that the science reasoning and scientific communication skills of students can be accurately assessed using the SRS and SCS test instruments. Students' earnestness about taking the SR-CS test is indicated by person reliability ratings that are not significantly different. Furthermore, the quality of the interaction between person and item as illustrated by the Cronbach Alpha value is 0. for the SRS test instrument and 0. for the SCS test instrument. Furthermore, the results of the analysis of the level of difficulty on the SPS test instrument can be seen in Table 6. Tabel 6. Difficulty Level of SRS Test Instruments Entry Number JMLE Measure Mean Difficulty Level Interpreation Difficult Difficult Difficult Difficult Moderate Moderate Moderate Moderate Easy Easy Easy Easy Very Easy Very Easy The JMLE Measure scores are shown in Table 6 from highest to lowest. The more challenging the item, and vice versa, the higher the JMLE Measure score. Three difficult items, four moderate items, four easy items, and two very easy items make up the SRS test instrument. the meantime. Table 7 shows the findings of the difficulty level analysis on SCS test devices. Tabel 6. Difficulty Level of SCS Test Instruments Entry Number JMLE Measure Mean Difficulty Level Interpreation Difficult Moderate Moderate Moderate Moderate Easy Very Easy Very Easy Jurnal Pendidikan Indonesia. Vol. 6 No. 1 January 2025 Fajar Fanika1*. Selly Feranie2. Parlindungan Sinaga3 The JMLE Measure score is displayed in Table 7 from highest to lowest. There are one easy, two very easy, four moderate, and one difficult things. The distribution of students' abilities and the degree of difficulty of items on the same scale can be described by the Rasch model analysis (Sumintono & Widhiarso, 2. The distribution is shown on a map known as the Wright map. The outcome of the Wright map is displayed in Figure 2. The top position showed the studentsAo who has the highest ability (SRS) The middle showed the studentsAo who has (SRS) The bottom position showed the studentsAo who has the lowest ability (SRS) The top showed the most difficult The of item The bottom showed the of item . Jurnal Pendidikan Indonesia. Vol. 6 No. 1 January 2025 Assessment of Scientific Reasoning-Communication Skills (SR-CS) Test on Work and Energy Concept: Development. Content Validity and Rasch Model Analysis The top position showed the studentsAo who has the highest ability (SCS) The middle showed the studentsAo who has middle ability (SCS) The top showed the most difficult The middle position showed the middle difficulty of . Figure 2 above illustrates some of the Wright maps of objects and people. The outline person appeared on the left, and the outline item displayed on the right. The participant's proficiency in scientific thinking and scientific communication is nearly medium, according to the plan. The SRS and SCS test instruments produced can be deemed valid and reliable based on the study of item fit, reliability, distinction level, and difficulty level using the Rasch Model. Thus, for future research, the SRS and SCS test instrument (SR-CS tes. can be utilized to assess scientific communication and reasoning abilities Conclusion The scientific reasoning skills test instrument designed had 14 multiple-choice questions. The SRS test instrument's distinction level fell into the very good range, and each item exhibited item fit validity. The person and item reliability of the SRS test instrument as a whole obtained a score of 0,81 and 0. 89 in the good category. For the scientific communication skills test instrument that was developed, there were 8 open-ended questions, and all of questions had item fit. The distinction level of 8 items is in the very good category. The person and item reliability of the SCS test instrument as a whole obtained a score of 0. 61 and 0. 59 in the low The quality of interaction between the person and items illustrated by Cronbach Alpha value, the SRS test instrument scored 0. ery goo. and the SCS instrument scored 64 . Therefore, the SR-CS test which consists of 14 multiple choice questions and 8 Jurnal Pendidikan Indonesia. Vol. 6 No. 1 January 2025 Fajar Fanika1*. Selly Feranie2. Parlindungan Sinaga3 open-ended questions is valid and reliable so that it can be used to measure studentsAo scientific reasoning skills and scientific communication skills in further research. References