12.2 - Alignment
Artificial Intelligence Policy
In Progress
π§ Think:
π Read:
π Browse:
- claude constitution
- openai spec
- Hidden AI instructions reveal how Anthropic controls Claude 4. 2025. Ars Technica
- Shao et al, 2026. Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce. arXiv.
- Farrell et al, 2025. βLarge AI models are cultural and social technologiesβ. Science.
- Mobayed, 2025. βYour Brain on ChatGPTβ Psychology Today.
- Bastani et al, 2025. βGenerative AI without guardrails can harm learning: Evidence from high school mathematicsβ PNAS.
- Berg and Rosenblatt, 2025. βThe Monster inside ChatGPTβ Wall Street Journal.
- βIs AI Rewiring our Minds? Scientists probe cognitive cost of chatbotsβ Washington Post.
- Li et al, 2025. β(Core Knowledge Deficits in MMLMs)(https://arxiv.org/abs/2410.10855v4)β arXiv.
- See also: Grow AI like a Child
- Gao et al, 2025. Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation
π Submit:
- Discussion question to course chat
TipTip
- βπ Readβ, βπ§ Listenβ, and/or βπΊ Watchβ items are required content for the day, and should be read/heard/watched before class on that day.
- βπ Browseβ items should be briefly looked at but do not need to be read deeply unless you want to
- βπ Additional Resourcesβ do not need to be looked at; they are there to serve, if useful, as further references for your debates, final projects, and general edification later.