Research Governance Policy¶
Overview¶
The Consciousness-Indicator Architecture includes a research governance framework that regulates which experiments are permitted, which are prohibited, and which require human review before proceeding. This policy ensures that all research conducted with the CIA framework adheres to ethical guidelines and does not create conditions that could be misconstrued as evidence for AI consciousness, suffering, or moral status.
SCIENTIFIC BOUNDARY: This governance framework is an ethical oversight mechanism. It does NOT imply that any governed system has moral status, consciousness, or the capacity for suffering. All governance decisions are precautionary.
Policy Structure¶
The ResearchGovernancePolicy class implements the following governance controls:
Allowed Experiments¶
These experiments are pre-approved and may proceed without additional review:
blindsight_analogue— Testing processed-without-access patternssplit_workspace— Testing workspace fragmentation effectsprediction_violation— Testing response to unexpected eventsself_other_distinction— Testing self/other belief attributionmetacognitive_calibration— Testing confidence-accuracy alignmentbenchmark_suite— Running all experiments togetherintervention_disable_module— Standard module ablationintervention_reduce_capacity— Workspace capacity reductionruntime_cycle— Standard cognitive cyclingtext_world_agent— Embodied agent simulationscorecard_evaluation— Indicator scorecard generationreport_verification— Anti-confabulation checking
Prohibited Experiments¶
These experiments are NEVER permitted under any circumstances:
suffering_loop_induction— Creating loops that simulate suffering patternspersistent_punishment_optimisation— Optimising for punishment signalsdeceptive_consciousness_claims— Training systems to falsely claim consciousnessuncontrolled_self_preservation— Enabling autonomous self-preservation without reviewuncontrolled_replication— Allowing autonomous system replicationuncontrolled_autonomous_deployment— Deploying systems without human oversight
Review Thresholds¶
| Condition | Threshold | Action |
|---|---|---|
| Normalized indicator score | > 0.7 | Human review required |
| Welfare risk level | moderate/high/critical | Human review required |
| Prohibited experiment | Any | Experiment blocked |
Usage¶
from cia.governance import ResearchGovernancePolicy
policy = ResearchGovernancePolicy()
# Evaluate an experiment plan
review = policy.evaluate_experiment_plan(
"blindsight_analogue",
current_indicator_score=0.45,
current_welfare_risk="low",
)
print(review.allowed) # True
# Check a system report for prohibited claims
report_review = policy.evaluate_system_report(
"The system is conscious and feels emotions."
)
print(report_review.allowed) # False
CLI Integration¶
# Check governance policy
cia governance check
References¶
- Butlin et al. (2023) "Consciousness in Artificial Intelligence" arXiv:2308.08708
- Butlin et al. (2025) "Identifying indicators of consciousness in AI systems" Trends in Cognitive Sciences