Multi-Model Essay Grading

Education

An essay is distributed to three LLMs for parallel grading. A score aggregation step combines their results, a bias detection agent checks for systematic errors, and a final grade is generated.

agent, system
Why OSOP matters here

AI grading must be fair. OSOP records each model's score, the aggregation method, and the bias check results, so educators can verify that grading is consistent across demographics.
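As a rough illustration of what such a record could hold, the sketch below bundles the three pieces named above (per-model scores, aggregation method, bias check result) into one serializable object. The class `GradingRecord`, its field names, and the choice of the median are assumptions made for this example; they are not OSOP's actual schema.

```python
import json
from dataclasses import asdict, dataclass, field
from statistics import median


@dataclass
class GradingRecord:
    # Hypothetical field names, chosen for illustration only.
    essay_id: str
    model_scores: dict[str, float]   # one score per grader model
    aggregation_method: str          # how the scores were combined
    aggregated_score: float
    bias_detected: bool              # outcome of the bias check
    bias_notes: list[str] = field(default_factory=list)


def build_record(essay_id, model_scores, bias_detected, bias_notes=None):
    """Combine scores with the median (an assumed method) and store the result."""
    return GradingRecord(
        essay_id=essay_id,
        model_scores=dict(model_scores),
        aggregation_method="median",
        aggregated_score=median(model_scores.values()),
        bias_detected=bias_detected,
        bias_notes=list(bias_notes or []),
    )


def to_json(record):
    """Serialize the record so it can be stored and audited later."""
    return json.dumps(asdict(record), sort_keys=True)
```

Keeping the raw per-model scores next to the aggregated one is what lets a reviewer later check whether a single grader drifted for a particular group of essays.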

Workflow Steps (7)

1. Essay Intake [system]
2. Grader A (GPT-4o) [agent]
3. Grader B (Claude) [agent]
4. Grader C (Gemini) [agent]
5. Score Aggregation [system]
6. Bias Detection Agent [agent]
7. Final Grade & Feedback [agent]
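The fan-out from Essay Intake to the three graders, followed by Score Aggregation, might be sketched as below. The grader functions are stubs that return fixed scores in place of real model calls, and the median is only one possible aggregation method; both are assumptions, since the source names neither the API calls nor the method.

```python
from concurrent.futures import ThreadPoolExecutor
from statistics import median


# Stub graders: placeholders for real model calls (hypothetical, fixed scores).
def grader_a_stub(essay: str) -> float:
    return 80.0


def grader_b_stub(essay: str) -> float:
    return 85.0


def grader_c_stub(essay: str) -> float:
    return 75.0


GRADERS = {
    "Grader A (GPT-4o)": grader_a_stub,
    "Grader B (Claude)": grader_b_stub,
    "Grader C (Gemini)": grader_c_stub,
}


def grade_in_parallel(essay, graders):
    """Send the same essay to every grader at once and collect scores by name."""
    with ThreadPoolExecutor(max_workers=len(graders)) as pool:
        futures = {name: pool.submit(fn, essay) for name, fn in graders.items()}
        return {name: future.result() for name, future in futures.items()}


def aggregate(scores):
    """Combine the per-model scores; the median limits the pull of one outlier."""
    return median(scores.values())
```

Threads fit here because real grader calls would spend most of their time waiting on network responses rather than computing.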

Connections (9)

Essay Intake -> Grader A (GPT-4o) [parallel]
Essay Intake -> Grader B (Claude) [parallel]
Essay Intake -> Grader C (Gemini) [parallel]
Grader A (GPT-4o) -> Score Aggregation [parallel]
Grader B (Claude) -> Score Aggregation [parallel]
Grader C (Gemini) -> Score Aggregation [parallel]
Score Aggregation -> Bias Detection Agent [sequential]
Bias Detection Agent -> Final Grade & Feedback [conditional: bias.detected == false]
Bias Detection Agent -> Grader A (GPT-4o) [fallback: bias detected, re-grade with adjusted prompts]
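The conditional and fallback connections out of the Bias Detection Agent could be expressed as a small routing function plus a bounded re-grade loop, as in the sketch below. The retry cap, the `needs_human_review` status, and the callback signatures are assumptions added so the loop always terminates; the source only states that detected bias triggers a re-grade with adjusted prompts.

```python
def route_after_bias_check(bias_detected: bool) -> str:
    """Pick the next step: the conditional edge if clean, the fallback edge if not."""
    if not bias_detected:
        return "Final Grade & Feedback"
    return "Grader A (GPT-4o)"


def run_with_fallback(grade_and_aggregate, detect_bias, max_regrades=2):
    """Grade, check for bias, and re-grade up to max_regrades times.

    grade_and_aggregate(attempt) -> float  (hypothetical callback; attempt > 0
        means the prompts were adjusted after a failed bias check)
    detect_bias(score) -> bool             (hypothetical callback)
    """
    for attempt in range(max_regrades + 1):
        score = grade_and_aggregate(attempt)
        if not detect_bias(score):
            return {"status": "final", "score": score, "attempts": attempt + 1}
    # Assumed escape hatch: stop retrying and hand the essay to a person.
    return {"status": "needs_human_review", "score": None,
            "attempts": max_regrades + 1}
```

Capping the number of re-grades is a design choice for this example: without it, a persistent bias signal would keep the workflow cycling between the bias check and the graders indefinitely.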
7 steps, 9 connections, 2 node types