DEVELOPING CIPP EVALUATION INSTRUMENT FOR TAHFIZ AL-QUR’AN IN PONDOK PESANTREN
Abstract
The study aimed to: (1) develop an evaluation model for the tahfiz Al-Qur’an learning program, named Coni P2; (2) produce a technique for implementing the evaluation of the tahfiz Al-Qur’an learning program; and (3) produce the component structure and indicators of the evaluation model.
The study was a research and development (R&D) project that implemented nine of the ten steps in Borg and Gall’s model. The first trial involved 33 subjects, the second 49, and the third 224. The components of the evaluation model followed Stufflebeam’s CIPP model, and the evaluation steps followed those of Malcolm Provus. Data were gathered through Delphi, focus group discussion (FGD), questionnaires, observation, interviews, and document study. Construct validity was analyzed using confirmatory factor analysis (CFA), and reliability was analyzed using Cronbach’s alpha. The results were as follows: (1) the Coni P2 program evaluation model was developed through theoretical review, field findings, Delphi, FGD, and three trials; (2) the evaluations performed in three pondok pesantren, namely Al-Ittifaqiah, Raudhatul Ulum, and Raudhatul Qur’an, found discrepancies in learning facilities, teacher performance, and santri’s learning motivation; and (3) the construct of the Coni P2 evaluation model consisted of context, input, process, and product components, divided into 13 indicators. The results of the CFA were as follows: (1) Chi-square (χ²) = small; (2) p-value > 0.05; (3) Root Mean Square Error of Approximation (RMSEA) < 0.08; and (4) Goodness of Fit Index (GFI) < 0.90.
Keywords: development, evaluation, tahfiz Al-Qur’an
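The abstract reports reliability through Cronbach's alpha without showing the computation. As a minimal sketch of the standard formula, alpha = k/(k-1) × (1 − Σ item variances / variance of total scores), the function below may help readers reproduce that step. The function name `cronbach_alpha` and the sample scores are hypothetical and are not taken from the study's instrument or data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score table.

    scores: list of rows, one row per respondent, each row holding
    that respondent's scores on the same k questionnaire items.
    """
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item (column) across respondents.
    item_vars = [variance(col) for col in zip(*scores)]
    # Sample variance of each respondent's total score.
    # Note: raises ZeroDivisionError below if all totals are identical.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical Likert-style responses (4 respondents, 3 items),
# used only to illustrate the call.
sample = [
    [4, 5, 4],
    [3, 3, 4],
    [5, 5, 5],
    [2, 3, 2],
]
alpha = cronbach_alpha(sample)
```

A value closer to 1 indicates higher internal consistency among the items; the threshold treated as acceptable is a convention that the study itself would need to state.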