Open
1
Constitutional AI Critique Generator
alignmentevalsrobustness
Difficulty
Intermediate
Verification
Human Review
Compute
Inference Only
Description
Build an open-source implementation of Constitutional AI's critique and revision pipeline. **Goal:** Given a potentially harmful model output, generate: 1. A critique identifying specific problems 2. A revised output that addresses the issues 3. Explanation of changes made **Use Cases:** - Training data generation for safety fine-tuning - Real-time output filtering - Red-teaming feedback loops
Created: 1/18/2026
Last updated: 1/19/2026