[MAICE Dev Log 4] QAC checklist development: making educational quality measurable

1. Why this post pivots to QAC

Persona simulation was useful for exploration, but it did not directly guarantee production-quality educational behavior.

What actually supported iterative improvement was a consistent evaluation framework.

That is why this post focuses on QAC (Question-Answer-Context).

Educational AI quality cannot be reduced to factual correctness only.

We needed a framework that evaluates, together:

QAC has three domains:

Session-level scores are computed from checklist items and aggregated by domain.

In the thesis workflow:

Interpretation rule:

Compared to persona simulation, QAC produced clearer implementation artifacts:

This changed iteration from subjective intuition to item-level traceable improvement.

Persona testing is still useful, but now as a supporting tool:

Core quality judgment remains QAC-centered.

The key outcome of this stage is not “better persona realism.” It is making educational quality measurable and actionable.

Master’s thesis: Development and Effectiveness Analysis of AI Agent Supporting Question Clarification in High School Mathematics Learning (Kim Kyubong, Pusan National University, 2026)