Moderation & safety
Human-in-the-loop moderation
A moderation model where AI handles the volume and auto-decides the clear-cut cases, while a human moderator arbitrates the ambiguous ones the machine cannot judge with confidence.
Human-in-the-loop moderation is a hybrid model where AI processes the full volume of contributions and decides the clear-cut cases on its own, while a human moderator arbitrates only the ambiguous ones. The machine does the reading; the person does the judging where judgement is actually needed.
Why neither side works alone
The reason this design exists is that the two pure approaches both fail, but in opposite ways.
- AI alone gets context wrong. A model scores text, not intent. Irony, quoted slurs, in-group reclaiming, regional slang and replies that only make sense against the parent comment all trip it up. Left unsupervised, it over-blocks legitimate speech and lets context-dependent abuse through.
- Humans alone cannot scale. A busy comment section produces far more contributions than any team can read in real time. Pure manual moderation means either long delays or rules applied unevenly as moderators tire.
Human-in-the-loop keeps the strengths of both : the throughput of automation, the discernment of a person, with the human attention spent only where it changes the outcome.
How it works in Logora
Logora runs this model by default :
- AI handles roughly 85% on-site. Clean contributions are auto-approved and clearly abusive ones are auto-blocked, so the editorial team never sees the bulk of the traffic.
- The team reviews only the ~15% to judge. The uncertain cases land in a dedicated human queue, each one presented with its toxicity score and the surrounding context the moderator needs to decide.
- QA at launch. During the first three months, decisions are reviewed to calibrate thresholds to the publication’s own rules and tone.
Related concepts
- AI moderation, the automated layer
- Content moderation, the broader practice
- Moderation queue, where the human work happens
- Toxicity detection