AL IBTIDA: JURNAL PENDIDIKAN GURU MI . VOL 12 . : 232- 244 DOI: http://dx. org/10. 24235/al. Al Ibtida: Jurnal Pendidikan Guru MI ISSN: 2442-5133, e-ISSN: 2527-7227 Journal homepage: http://syekhnurjati. id/jurnal/index. php/ibtida Journal email: alibtida@syekhnurjati. Epistemic Agency in Multimodal Reading: How Primary Learners Construct Knowledge Across Modes Hasna Fitri Labibah* *Doctoral Program of Language Education. Faculty of Languages and Arts. Universitas Negeri Semarang. Indonesia E-mail: canachan77@students. Januarius Mujiyanto** **Doctoral Program of Language Education. Faculty of Languages and Arts. Universitas Negeri Semarang. Indonesia E-mail: yanmujiyanto@mail. Dwi Rukmini*** ***Doctoral Program of Language Education. Faculty of Languages and Arts. Universitas Negeri Semarang. Indonesia E-mail: wiwidwirukmini71@gmail. Widhiyanto**** ****Doctoral Program of Language Education. Faculty of Languages and Arts. Universitas Negeri Semarang. Indonesia E-mail: pakwidhi_english@mail. Received: August 19th, 2025. Accepted: October 10th, 2025. Published: October 31st, 2025. Abstract Reading literacy remains a persistent challenge in many English as a Foreign Language (EFL) contexts. However, primary EFL learners frequently encounter difficulties in integrating linguistic, visual, spatial, and gestural modalities. This study investigates how primary learners exercise epistemic agency in multimodal reading and construct knowledge across different modes. Utilizing multimodality theory and epistemic agency, the study implemented an explanatory sequential mixed-methods design involving thirty fifth-grade students from Madrasah Ibtidaiyah Negeri (MIN) 1 Tulang Bawang Barat. Lampung. In Phase 1, a one-group pretestAeposttest design measured changes in multimodal comprehension using a validated twenty-item instrument. Results showed significant improvement, with mean scores rising from 61. 40 (SD = 8. 30 (SD = 7. , t. = 8. 52, p < 001, and CohenAos d = 1. 56, indicating a large effect. In Phase 2, six students representing varied improvement levels were interviewed and engaged in task-based Thematic analysis identified five dimensions of epistemic agency: taking initiative, decision-making across modes, justification of understanding, reflection and self-regulation, and collaboration and dialogue. The integration of both phases demonstrated that learners' agentive engagement with multimodal resources underpinned comprehension gains, transforming affordances into strategies for meaning-making. These findings affirm that multimodal pedagogy fosters not only measurable comprehension outcomes but also autonomy, critical thinking, and collaborative engagement. The study recommends embedding multimodal literacy Copyright A 2025 Al Ibtida: Jurnal Pendidikan Guru MI. Department of Madrasah Ibtidaiyah Teacher Education. UIN Siber Syekh Nurjati Cirebon. Indonesia. Hasna Fitri Labibah. Januarius Mujiyanto. Dwi Rukmini. Widhiyanto. Epistemic Agency in Multimodal ReadingA into EFL curricula and providing scaffolds that balance cognitive support with opportunities for agency. Keywords: epistemic agency, knowledge construction, multimodal reading, primary school. Abstrak Literasi membaca tetap menjadi tantangan yang terus-menerus dalam banyak konteks Bahasa Inggris sebagai Bahasa Asing (EFL). Namun, pembelajar EFL tingkat dasar sering menghadapi kesulitan dalam mengintegrasikan modalitas linguistik, visual, spasial, dan gestural. Studi ini menyelidiki bagaimana pembelajar primer menjalankan agensi epistemik dalam membaca multimodal dan membangun pengetahuan di berbagai moda. Dengan memanfaatkan teori multimodalitas dan agensi epistemik, penelitian ini menerapkan desain metode campuran sekuensial eksplanatif yang melibatkan tiga puluh siswa kelas lima dari Madrasah Ibtidaiyah Negeri (MIN) 1 Tulang Bawang Barat. Lampung. Pada Fase 1, desain pretes-postes satu kelompok mengukur perubahan dalam pemahaman multimodal menggunakan instrumen dua puluh item yang tervalidasi. Hasil menunjukkan peningkatan yang signifikan, dengan skor rata-rata meningkat dari 61,40 (SD = 8,. menjadi 74,30 (SD = 7,. , t. = 8,52, p < 0,001, dan Cohen's d = 1,56, yang menunjukkan efek yang besar. Pada Fase 2, enam siswa yang mewakili berbagai tingkat peningkatan diwawancarai dan dilibatkan dalam refleksi berbasis tugas. Analisis tematik mengidentifikasi lima dimensi agensi epistemik: mengambil inisiatif, pengambilan keputusan lintas moda, pembenaran pemahaman, refleksi dan pengaturan diri, serta kolaborasi dan dialog. Integrasi kedua fase menunjukkan bahwa keterlibatan agen pembelajar dengan sumber daya multimoda mendukung perolehan pemahaman, mengubah affordance menjadi strategi untuk pembuatan makna. Temuan ini menegaskan bahwa pedagogi multimoda tidak hanya mendorong hasil pemahaman yang terukur tetapi juga otonomi, pemikiran kritis, dan keterlibatan kolaboratif. Studi ini merekomendasikan penanaman literasi multimodal ke dalam kurikulum EFL dan menyediakan perancah yang menyeimbangkan dukungan kognitif dengan peluang untuk agensi. Kata kunci: agensi epistemik, konstruksi pengetahuan, pedagogi literasi, membaca multimodal, sekolah dasar. INTRODUCTION Reading literacy has increasingly been recognized as a multidimensional practice that extends beyond the decoding of printed text to the orchestration of diverse semiotic resources. Research on multiliteracies underscores that learner must navigate multimodal environments where meaning is conveyed through linguistic, visual, spatial, gestural, and digital resources (Cope & Kalantzis, 2020. Fylth et al. , 2023. Jewitt, 2. In EFL contexts, this challenge is particularly urgent, as students are expected not only to acquire linguistic competence but also to mobilize multimodal cues to support comprehension (Rohi & Nurhayati, 2024. Yi. Dong, et al. , 2024. Yi. Zhao, et al. , 2. International assessments such as PISA 2022, have shown persistent gaps in reading performance, especially in developing countries, highlighting the need for more inclusive and innovative literacy pedagogies (AlAli & Wardat, 2024. Mahapoonyanont & Songsang, 2024. Norman, 2. Al Ibtida: Jurnal Pendidikan Guru MI. Vol. 12 No. October 2025 Hasna Fitri Labibah. Januarius Mujiyanto. Dwi Rukmini. Widhiyanto. Epistemic Agency in Multimodal ReadingA In response, scholars have turned their attention to multimodal literacy as a framework for rethinking reading pedagogy. Multimodality views meaning-making as distributed across linguistic, visual, audio, gestural, and spatial resources rather than confined to print alone (Jewitt, 2014. Mills, 2. Such perspectives challenge traditional text-based instruction by emphasizing the orchestration of multiple semiotic modes to enhance comprehension and Studies report that multimodal designs engage learners more deeply, foster critical and inferential comprehension, and bridge gaps for those who struggle with linguistic texts (Januarty & Nima, 2018. Weninger, 2. Evidence suggests that multimodal approaches also promote equity by allowing diverse learners to access meaning through multiple entry points (Daulay & Dewi, 2025. Santoso & Mangkuluhur, 2024. White, 2. However, multimodality is not only about exposure to multiple resources but also about how learners actively use them. Scholars argue that literacy pedagogy must foreground agency, positioning learners as decision-makers who strategically select, evaluate, and justify their use of multimodal resources (Biesta et al. , 2015. Van de Pol et al. , 2. This emphasis on epistemic agency represents a critical shift in literacy research. Rather than viewing learners as passive consumers of multimodal texts, recent work conceptualizes them as epistemic agents who take responsibility for advancing their own understanding and contributing to collective meaning-making (Rappa & Tang, 2018. Yang, 2. Empirical studies highlight how agency emerges in tasks that require learners to reflect, collaborate, and regulate their strategies (Biesta et al. , 2015. Van de Pol et al. , 2. In multimodal environments, agency becomes especially salient, as learners must make choices across modes, negotiate tensions, and justify their interpretations (Vaughn et al. , 2020. Zhang et al. Despite this growing body of literature, several gaps remain. Much existing research on multimodal literacy has been conducted in secondary or tertiary contexts, often focusing on digital or academic literacy practices (Mills, 2016. White, 2. Studies with younger learners, particularly in primary EFL settings, remain limited, even though early interventions are crucial for developing lifelong literacy habits (Daulay & Dewi, 2025. Santoso & Mangkuluhur, 2. Furthermore, while multimodal approaches have been shown to improve comprehension, fewer studies explicitly investigate the role of epistemic agency in mediating these gains (Vaughn et al. , 2020. Zhang et al. , 2. There is still insufficient evidence on how learners in primary schools navigate multimodal resources and how agency manifests in their comprehension processes. Another gap lies in methodological approaches. Previous studies often rely on either quantitative assessment of reading gains or qualitative explorations of classroom practices, but rarely integrate both systematically (Kim et al. , 2025. Vaughn et al. , 2020. Yang, 2. As a result, the field lacks comprehensive evidence that not only demonstrates measurable improvements in comprehension but also explains the agentive processes underpinning them. Mixed-methods designs, particularly explanatory sequential approaches, are well-suited to address this gap by linking quantifiable outcomes with in-depth qualitative accounts (Creswell & Plano Clark, 2023. Zhang et al. , 2. Addressing these gaps, the present study investigates how primary EFL learners exercise epistemic agency in multimodal reading and how they construct knowledge across Specifically, it examines both the measurable impact of multimodal reading on Al Ibtida: Jurnal Pendidikan Guru MI. Vol. 12 No. October 2025 Hasna Fitri Labibah. Januarius Mujiyanto. Dwi Rukmini. Widhiyanto. Epistemic Agency in Multimodal ReadingA comprehension and the ways in which learners demonstrate initiative, decision-making, justification, reflection, and collaboration. By situating the study in an Indonesian primary school context, it responds to calls for more research in underrepresented settings where literacy achievement remains a national concern (Daulay & Dewi, 2025. Januarty & Nima. Santoso & Mangkuluhur, 2. The study contributes to the literature in three ways. First, it extends research on multimodal literacy by focusing on younger EFL learners, offering insights into how agency develops in primary classrooms. Second, it empirically demonstrates how comprehension gains are linked to agentive engagement with multimodal resources, providing an explanatory account rather than merely reporting outcomes. Third, it employs a mixed-methods design to bridge the gap between quantitative and qualitative approaches, producing findings that are both measurable and richly contextualized. In doing so, it not only advances scholarly understanding of multimodal literacy and epistemic agency but also offers practical implications for pedagogy and policy. METHODS This study employed an explanatory sequential mixed-methods design (Creswell & Plano Clark, 2. to examine the impact of multimodal reading on EFL learnersAo comprehension and epistemic agency. This design was chosen because it enabled the researcher to first identify measurable learning gains through quantitative analysis, and then explain these gains by exploring learnersAo lived experiences through qualitative inquiry. The study was theoretically grounded in multimodality (Jewitt, 2. , which views meaningmaking as distributed across linguistic, visual, audio, gestural, and spatial modes, and epistemic agency (Biesta, 2009. Mercer, 2. , which emphasizes learnersAo initiative, decision-making, and responsibility in constructing knowledge. Phase 1: Quantitative A one-group pretestAeposttest design was implemented to measure changes in reading comprehension before and after the multimodal reading intervention. The participants were 30 fifth-grade students from State Islamic Elementary School (MIN) 1 Tulang Bawang Barat. Lampung Province recruited using total population sampling. The reading comprehension test, adapted from the national curriculum standards, consisted of 20 multiple-choice items covering literal, inferential, and evaluative comprehension. Content validity was established through expert judgment by two EFL specialists, and reliability analysis yielded a CronbachAos alpha of 0. 87, confirming internal consistency. The intervention was conducted over four weeks . wo sessions per week, 90 minutes per sessio. Students engaged in multimodal reading activities that integrated linguistic texts, visual representations . mages, diagram. , audio narration, gestural enactments, and spatial layouts . age design and digital navigatio. These activities encouraged learners not only to interpret texts across modes but also to exercise epistemic agency by making predictions, justifying interpretations, and negotiating meaning collaboratively. Phase 2: Qualitative In the second phase, qualitative data were collected to explain the quantitative results. Semi-structured interviews were conducted with six students, selected using maximum Al Ibtida: Jurnal Pendidikan Guru MI. Vol. 12 No. October 2025 Hasna Fitri Labibah. Januarius Mujiyanto. Dwi Rukmini. Widhiyanto. Epistemic Agency in Multimodal ReadingA variation sampling to represent different levels of comprehension improvement and The interviews explored learnersAo experiences of multimodal reading, their strategies for navigating different modes, and their sense of agency in constructing meaning. For the qualitative phase, trustworthiness was enhanced through triangulation, member checking, and peer debriefing, ensuring credibility and interpretive depth. Data were transcribed and analyzed thematically following Braun & Clarke . , combining inductive and deductive coding. Deductive codes were informed by the theoretical constructs of multimodality (Jewitt, 2. and epistemic agency (Biesta et al. , 2. , while inductive coding allowed new insights to emerge from studentsAo accounts. Phase 3: Integrative Phase In the final stage, quantitative and qualitative findings were integrated using a joint display (Creswell & Plano Clark, 2. Quantitative results . retestAeposttest gain. informed the selection of interview participants, while the joint display juxtaposed statistical outcomes with thematic insights . , initiative, reflection, collaboratio. This approach enabled direct alignment between what improved and how learnersAo epistemic agency explained those gains. Data Analysis Quantitative data were analyzed using descriptive statistics and paired-sample t-tests to compare pretest and posttest scores (Wallen & Fraenkel, 2. Qualitative data were analyzed thematically to identify recurring patterns related to learnersAo multimodal meaningmaking and epistemic agency. Integration of the two strands was achieved by comparing numerical trends with thematic insights, thereby providing a comprehensive explanation of how and why multimodal reading influenced learnersAo comprehension. RESULTS AND DISCUSSION Quantitative Findings The first phase of the study examined whether multimodal reading significantly improved learnersAo comprehension. Descriptive statistics were calculated to capture the overall distribution of scores, followed by a paired-sample t-test to determine the significance of pretestAeposttest differences. The analysis provided evidence not only of measurable improvement but also of the magnitude of the interventionAos impact on studentsAo reading Table 1. Descriptive Statistics of Reading Comprehension Scores (N = . Test Pretest Posttest Mean (M) Standard Deviation (SD) Minimum Maximum Table 1 presents the descriptive statistics of studentsAo reading comprehension scores before and after the multimodal reading intervention. The pretest mean score was 61. 40 (SD = , indicating that students initially demonstrated a moderate level of comprehension with considerable variation among individuals. After the intervention, the mean posttest score rose 30 (SD = 7. The increase of nearly 13 points suggests that students benefited from engaging with multimodal texts, which provided them with additional scaffolds for The minimum score also improved from 45 on the pretest to 60 on the posttest. Al Ibtida: Jurnal Pendidikan Guru MI. Vol. 12 No. October 2025 Hasna Fitri Labibah. Januarius Mujiyanto. Dwi Rukmini. Widhiyanto. Epistemic Agency in Multimodal ReadingA showing that even lower-achieving students made progress. Meanwhile, the maximum score increased from 75 to 88, reflecting the potential of multimodality to push high achievers Overall, the descriptive results indicate consistent improvement across the class. Table 2. Paired-Sample t-test Results for Pretest and Posttest Scores Pair PreAePost Test Mean Difference t . f = . p-value < . CohenAos d Table 2 summarizes the results of the paired-sample t-test conducted to determine whether the observed improvements were statistically significant. The analysis revealed a mean difference of 12. 90 points between pretest and posttest scores. This difference was highly significant, t . = 8. 52, p < . 001, providing strong evidence that the intervention effectively enhanced learnersAo comprehension. The calculated effect size. CohenAos d = 1. indicates a large effect according to CohenAos . benchmark, suggesting that the observed gains were not only statistically significant but also educationally meaningful. This implies that multimodal reading activities had a substantial impact on learnersAo ability to construct meaning across modes. The quantitative results demonstrate that multimodal reading yielded both statistically significant and practically meaningful improvements in learnersAo comprehension. The nearly 13-point gain, coupled with a large effect size . = 1. , confirms that the intervention had more than a modest impact and was effective across ability levels, as reflected in the raised minimum and maximum scores. These findings align with previous research showing that multimodal scaffolds expand access to meaning and reduce achievement. Notably, the reduction in score variability (SD decreased from 8. 25 to 7. suggests that the intervention may have narrowed performance disparities within the class. However, as a one-group pretestAeposttest design was employed, improvements cannot be attributed exclusively to multimodal instruction without considering possible external influences. Despite this limitation, the magnitude and consistency of gains indicate that the intervention substantially contributed to enhancing comprehension, thereby justifying further exploration through qualitative analysis. Qualitative Findings The second phase explored how learners exercised epistemic agency while engaging with multimodal texts. Semi-structured interviews with six purposively selected students provided insights into their strategies, reflections, and collaborative practices. Thematic analysis revealed recurring patterns that explained the quantitative gains, highlighting how students actively initiated, justified, and regulated their meaning-making across modes. Table 3. Themes of LearnersAo Epistemic Agency in Multimodal Reading Theme Subtheme / Indicator Taking Initiative Asking exploring beyond text Al Ibtida: Jurnal Pendidikan Guru MI. Vol. 12 No. October 2025 Representative Quotes A AuWhen I saw the picture. I wanted to ask my teacher what it meantAy (S. A AuI tried to guess the story before reading because the picture gave Hasna Fitri Labibah. Januarius Mujiyanto. Dwi Rukmini. Widhiyanto. Epistemic Agency in Multimodal ReadingA Decision-Making Across Modes Selecting visual and text Justification Understanding Explaining defending interpretation. Reflection Self-Regulation Recognizing difficulties, monitoring progress Collaboration and Dialogue Sharing building on peersAo ideas me ideasAy (S. A AuI focused on the diagram first because it helped me understand the storyAy (S. A AuThe words were long, so I looked at the picture first to know the main ideaAy (S. A AuI think my answer is correct because both the picture and the words say the sameAy (S. A AuThe gesture in the video shows sadness, that is why I chose that answerAy (S. A AuAt first I only saw the picture, but then I read carefully to check my guessAy (S. A AuI changed my answer after I saw the diagram matched the textAy (S. A AuWe discussed the gestures in the video and agreed the boy was angryAy (S. A AuMy friend told me a different idea, and I added it to my answerAy (S. The qualitative findings provided rich insights into how learners exercised epistemic agency while engaging with multimodal texts. A recurring theme was taking initiative, where students demonstrated curiosity and self-directed exploration. For example. S1 described. AuWhen I saw the picture. I wanted to ask my teacher what it meantAy (S1. Intervie. , while S3 explained. AuI tried to guess the story before reading because the picture gave me ideasAy (S3. Intervie. These actions show that students were not passive recipients but actively initiated meaning-making processes. Another prominent theme was decision-making across modes, reflecting learnersAo strategic selection and prioritization of multimodal resources. S2 noted. AuI focused on the diagram first because it helped me understand the storyAy (S2. Intervie. , whereas S4 explained. AuThe words were long, so I looked at the picture first to know the main ideaAy (S4. Intervie. This indicates that learners exercised agency by deliberately navigating different modes to support comprehension. The theme of justification of understanding highlighted studentsAo ability to defend their interpretations with multimodal evidence. For example. S5 argued. AuI think my answer is correct because both the picture and the words say the sameAy (S5. Intervie. , while S1 emphasized. AuThe gesture in the video shows sadness, that is why I chose that answerAy (S1. Al Ibtida: Jurnal Pendidikan Guru MI. Vol. 12 No. October 2025 Hasna Fitri Labibah. Januarius Mujiyanto. Dwi Rukmini. Widhiyanto. Epistemic Agency in Multimodal ReadingA Intervie. Such reasoning demonstrates epistemic agency in the form of critical evaluation and evidence-based justification. Learners also engaged in reflection and self-regulation, adjusting strategies when faced with difficulties. As S3 reflected. AuAt first I only saw the picture, but then I read carefully to check my guessAy (S3. Intervie. , while S6 admitted. AuI changed my answer after I saw the diagram matched the textAy (S6. Intervie. These reflections reveal metacognitive awareness and a willingness to refine strategies in response to challenges. Finally, collaboration and dialogue emerged as an essential dimension of epistemic S2 explained. AuWe discussed the gestures in the video and agreed the boy was angryAy (S2. Intervie. , and S4 described. AuMy friend told me a different idea, and I added it to my answerAy (S4. Intervie. These collaborative interactions indicate that agency was not only individual but also socially distributed, reinforcing the importance of dialogic engagement in multimodal literacy practices. Finally, these themes show that multimodal reading environments provided opportunities for learners to act agentively by initiating inquiry, making decisions across semiotic modes, justifying their reasoning, reflecting on their strategies, and co-constructing meaning with peers. Such findings align with the theoretical lens that positions epistemic agency as central to knowledge construction, and multimodality as the affordance that enables learners to navigate and integrate diverse meaning-making resources. Integrative Findings The final phase merged the quantitative and qualitative strands to provide a comprehensive explanation of how multimodal reading shaped learnersAo comprehension. Using a joint display, statistical improvements were aligned with thematic insights, allowing the study to connect the what of score gains with the how and why of learnersAo epistemic This integration demonstrated that measurable improvements in comprehension were underpinned by studentsAo initiative, decision-making, justification, reflection, and collaboration across modes. Table 4. Integration of Quantitative and Qualitative Findings Research Questions RQ1: what extent learnersAo Quantitative Findings LearnersAo significantly from 40 (SD = 8. in the pretest to 30 (SD = 7. in the posttest. The gain of 12. 90 points significant, t . = 52, p < . 001, with Qualitative Findings Students resources . isuals, gesture. supported comprehension by offering alternative They described strategies such as predicting from images, cross- Al Ibtida: Jurnal Pendidikan Guru MI. Vol. 12 No. October 2025 Integrated Interpretation Quantitative improvement is learnersAo strategic use of multimodal Engagement with multiple modes enabled students effectively, thus Hasna Fitri Labibah. Januarius Mujiyanto. Dwi Rukmini. Widhiyanto. Epistemic Agency in Multimodal ReadingA a large effect size . = 1. RQ2: How do Agency was not directly captured by test scores, but modes, and verifying collaboratively . AuI looked at the diagram first to know the storyAy Ae S2. Intervie. Five emerged: . Taking . Decision-making across modes, . Justification . Reflection and selfregulation, and . Collaboration Representative quotes show learners their own meaningmaking process. The gains (RQ. are underpinned by epistemic agency (RQ. Students exercised agency which amplified the benefits of The integration of quantitative and qualitative results demonstrates a coherent picture of how multimodal reading shaped learnersAo comprehension and epistemic agency. The quantitative phase (RQ. showed a statistically significant improvement in comprehension, with learnersAo mean scores rising from 61. 40 to 74. The large effect size . = 1. indicates that the intervention was not only effective but also had a substantial educational The qualitative phase (RQ. adds explanatory depth to these gains by showing that learners did not passively receive information. instead, they exercised epistemic agency through multimodal resources. They initiated strategies such as predicting meaning from images, prioritizing diagrams or layouts before reading the text, and cross-checking answers across different modes. In doing so, students demonstrated agency by making decisions, justifying their interpretations, and regulating their comprehension processes. This alignment between numerical improvement and thematic insights underscores that the gains in comprehension were driven by learnersAo active agency. Without the exercise of initiative, reflection, and collaboration, the multimodal materials alone may not have produced such substantial results. Thus, the findings confirm that multimodal reading not only provides multiple channels of representation (Jewitt, 2. but also creates spaces for learners Al Ibtida: Jurnal Pendidikan Guru MI. Vol. 12 No. October 2025 Hasna Fitri Labibah. Januarius Mujiyanto. Dwi Rukmini. Widhiyanto. Epistemic Agency in Multimodal ReadingA to assert control over knowledge construction, consistent with epistemic agency frameworks (Biesta, 2009. Mercer, 2. In summary, the explanatory sequential design reveals that the AuwhatAy of improved comprehension (Phase . is explained by the AuhowAy of learner agency in multimodal interaction (Phase . Together, these findings highlight the pedagogical value of multimodal resources when they are coupled with opportunities for learners to exercise epistemic agency. Discussion The significant quantitative gains in reading comprehension suggest that multimodal resources provided powerful scaffolds for meaning-making. This supports the growing body of evidence that multimodality enhances comprehension by integrating visual, spatial, and gestural cues with linguistic input (Daulay & Dewi, 2025. Hardison & Pennington, 2021. Zhang et al. , 2. Unlike earlier literacy approaches that relied predominantly on linear text, multimodal pedagogies acknowledge the semiotic richness of texts and allow learners to mobilize diverse cues for interpretation (Jewitt, 2014. Mills, 2. The results of this study extend these insights by showing that such integration was not incidental but actively driven by learnersAo agentive engagement. The qualitative findings further illuminated the mechanisms behind these gains: learners took initiative, justified their answers, and reflected on their strategies across modes. These behaviors exemplify epistemic agency, positioning students as decision-makers in the learning process (Biesta et al. , 2015. Van de Pol et al. , 2019. Vaughn et al. , 2. Previous research shows that when learners are invited to exercise epistemic authority, they become more engaged and construct knowledge collaboratively (Eriksson & Lindberg, 2016. Yang, 2019. Zhang et al. , 2. The present findings converge with this literature by demonstrating how agency manifested concretely in multimodal tasks such as interpreting diagrams, gestures, or spatial layouts. Importantly, the integration of agency with multimodality highlights that comprehension improvement is not attributable to semiotic resources alone but to how learners activated them. This resonates with the view that multimodal texts afford possibilities, but learnersAo choices and reflections determine whether these affordances become pedagogically meaningful (Martin, 2020. Zhang et al. , 2. In our study, students did not simply notice images or gestures. they strategically used them as evidence, a process consistent with reflective agency and knowledge-building pedagogy (Jewitt, 2014. Rappa & Tang, 2. These findings also align with research on differentiated instruction and universal design for learning, which argue that multiple pathways to meaning support a wider range of learners (Mayer, 2014. Wan, 2. The fact that lower-achieving students particularly benefited from multimodal activities echoes Daulay & Dewi's . conclusion that multimodal pedagogy narrows achievement gaps. Similar studies have shown that visual and spatial resources are especially valuable for learners who struggle with linguistic decoding, allowing them to access higher-order comprehension (Januarty & Nima, 2018. Santoso & Mangkuluhur, 2024. White, 2. At the same time, the study underscores the relational nature of epistemic agency. Collaborative dialogue emerged as a key practice through which students co-constructed Al Ibtida: Jurnal Pendidikan Guru MI. Vol. 12 No. October 2025 Hasna Fitri Labibah. Januarius Mujiyanto. Dwi Rukmini. Widhiyanto. Epistemic Agency in Multimodal ReadingA meaning, consistent with sociocultural theories of learning (Eriksson & Lindberg, 2016. Martin, 2. Prior research has shown that dialogic engagement fosters accountability and deepens comprehension in literacy classrooms (Vaughn et al. , 2020. Zhang et al. , 2. our data, students often refined their interpretations by negotiating gestures, images, and textual evidence with peers, illustrating that agency is socially distributed rather than purely Nevertheless, multimodal pedagogy also carries challenges. Cognitive load theory reminds us that too many modes can overwhelm learners if not scaffolded carefully (Mayer. Weninger, 2023. White, 2. Some students initially focused only on visuals and had to adjust by cross-checking with text, a finding consistent with studies highlighting the need for guided structuration in multimodal tasks (Mills, 2016. Zhang et al. , 2. This suggests that teachers must balance freedom for agency with explicit scaffolding to ensure that multimodal affordances are transformed into effective strategies rather than sources of In summary, the findings affirm that multimodal literacy cannot be separated from epistemic agency. Learners improved comprehension not simply because they were exposed to multiple modes, but because they actively navigated, selected, and justified meaning across those modes. This echoes global calls for 21st-century literacies that integrate criticality, collaboration, and self-direction into reading pedagogy (Cope & Kalantzis, 2020. Jewitt. By situating learners as epistemic agents, multimodal reading equips them with both measurable literacy skills and broader competencies for knowledge construction in complex communicative environments. CONCLUSION This study highlights that fostering multimodal literacy is most effective when it is understood as an agentive process of knowledge construction rather than mere exposure to multiple modes. When learners are positioned as epistemic agents able to choose, justify, and reflect on their meaning-making strategies multimodal reading becomes a space for developing critical thinking, self-regulation, and collaborative learning. Such a perspective shifts literacy pedagogy toward empowering students to navigate complex semiotic environments and construct understanding actively. These insights call for instructional designs that balance scaffolding with learner autonomy, curriculum policies that recognize multimodal competence as part of core literacy development, and further research exploring how agency can be nurtured across diverse contexts and digital platforms. REFERENCES