This study aims to verify the effects of scaffolding on learning outcomes in the acquisition of visual-to-auditory substitution skills and to analyze learners' perceptions of four types of scaffolding. To achieve this, an instructional program was designed based on Cognitive Load Theory and Scaffolding Theory to manage high cognitive load and facilitate learning. An experimental study was conducted with 45 adult learners, employing an explanatory sequential mixed-methods design. To analyze the differences in learning outcomes between the control group and experimental group, the provision of scaffolding was set as the independent variable and learning outcomes as the dependent variable. An Analysis of Covariance(ANCOVA) was performed, controlling for prior musical ability. The results revealed that while no significant difference existed between the groups at pre-test, the experimental group demonstrated statistically significant gains in post-test learning outcomes. Furthermore, a repeated measures ANCOVA confirmed a significant interaction effect over time. A qualitative analysis of the four types of scaffolding indicated that all types were perceived positively, with metacognitive scaffolding reported as the most effective. These findings suggest that scaffolding is an effective instructional strategy for managing the high cognitive load associated with novel and non-intuitive tasks, thereby promoting learners' self-regulated learning.