Multi-Model Essay Grading
Education
An essay is distributed to three LLMs for parallel grading. A score aggregation step combines the results, a bias detection agent checks for systematic errors, and a final grade is generated.
Node types: agent, system
Why OSOP matters here
AI grading must be fair. OSOP records each model's score, the aggregation method, and the bias check results, so educators can verify that grading is consistent across demographics.
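As a rough illustration of what such a record could hold, the sketch below stores the three per-model scores, the name of the aggregation method, and the bias check outcome. The class `GradingRecord`, its field names, the median aggregation, the 0-100 scale, and the sample scores are all assumptions made for illustration; none of them is taken from the OSOP format itself.

```python
from dataclasses import dataclass
from statistics import median
from typing import Dict, Optional


@dataclass
class GradingRecord:
    """Hypothetical audit record for one essay (not an OSOP schema)."""
    essay_id: str
    scores: Dict[str, float]            # grader name -> score on an assumed 0-100 scale
    aggregation_method: str = "median"  # assumed; the source does not name a method
    bias_detected: bool = False         # outcome of the bias check
    final_score: Optional[float] = None

    def aggregate(self) -> float:
        """Combine the per-model scores using the recorded method."""
        if self.aggregation_method != "median":
            raise ValueError(f"unsupported method: {self.aggregation_method}")
        self.final_score = median(self.scores.values())
        return self.final_score


# Made-up example scores, one per grader.
record = GradingRecord(
    essay_id="essay-001",
    scores={"Grader A": 82.0, "Grader B": 78.0, "Grader C": 85.0},
)
record.aggregate()
```

Keeping the method name next to the raw scores means a reviewer can recompute the final score from the record alone, which is the kind of consistency check the paragraph above describes.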
Workflow Steps (7)
1. Essay Intake [system]
2. Grader A (GPT-4o) [agent]
3. Grader B (Claude) [agent]
4. Grader C (Gemini) [agent]
5. Score Aggregation [system]
6. Bias Detection Agent [agent]
7. Final Grade & Feedback [agent]
Connections (9)
Essay Intake → Grader A (GPT-4o) [parallel]
Essay Intake → Grader B (Claude) [parallel]
Essay Intake → Grader C (Gemini) [parallel]
Grader A (GPT-4o) → Score Aggregation [parallel]
Grader B (Claude) → Score Aggregation [parallel]
Grader C (Gemini) → Score Aggregation [parallel]
Score Aggregation → Bias Detection Agent [sequential]
Bias Detection Agent → Final Grade & Feedback [conditional: bias.detected == false]
Bias Detection Agent → Grader A (GPT-4o) [fallback: bias detected, re-grade with adjusted prompts]
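The nine connections can be written down as plain data and queried, as in the minimal sketch below. The names `Edge`, `EDGES`, and `next_steps` are hypothetical helpers, not part of OSOP. The sketch assumes the conditional edge is taken only when `bias.detected == false` and the fallback edge only when bias is detected.

```python
from typing import List, NamedTuple


class Edge(NamedTuple):
    source: str
    target: str
    kind: str  # "parallel", "sequential", "conditional", or "fallback"


GRADERS = ["Grader A (GPT-4o)", "Grader B (Claude)", "Grader C (Gemini)"]

# Fan-out to the three graders, fan-in to aggregation, then the bias gate.
EDGES: List[Edge] = (
    [Edge("Essay Intake", g, "parallel") for g in GRADERS]
    + [Edge(g, "Score Aggregation", "parallel") for g in GRADERS]
    + [
        Edge("Score Aggregation", "Bias Detection Agent", "sequential"),
        Edge("Bias Detection Agent", "Final Grade & Feedback", "conditional"),
        Edge("Bias Detection Agent", "Grader A (GPT-4o)", "fallback"),
    ]
)


def next_steps(step: str, bias_detected: bool = False) -> List[str]:
    """Return the steps that follow `step`, honoring the bias condition."""
    targets = []
    for edge in EDGES:
        if edge.source != step:
            continue
        if edge.kind == "conditional" and bias_detected:
            continue  # condition on this edge is bias.detected == false
        if edge.kind == "fallback" and not bias_detected:
            continue  # fallback is assumed to fire only when bias is detected
        targets.append(edge.target)
    return targets
```

For example, `next_steps("Essay Intake")` returns the three graders, while `next_steps("Bias Detection Agent", bias_detected=True)` returns only `Grader A (GPT-4o)`, matching the single fallback edge listed above.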
7 steps, 9 connections, 2 node types