Shelvia Wongso · Personal Guide

The Pursuit of Safe Intelligence

A welcoming corner for exploring AI safety with curiosity, care, and clarity — beginner-friendly and carefully curated.

Explore the Guide ◆ Read This Week's Highlight →

— 01 Research Highlights Updated Weekly

This week

A First Look at Claude Mythos

Alignment · Evaluation

Anthropic argues that Claude Mythos Preview is still low-risk overall, but agentic capability is rising fast enough that monitoring and mitigation need to improve quickly.

Read full breakdown ↗

— 02 Navigate the Guide Six Sections

I Weekly Picks

Research Highlights

What’s new in the AI safety field? Explore the latest research and ideas.

II Foundations

Personal Notes

New to AI safety? Start here with a collection of notes from my learning journey.

III Case Files

Failure Cases

When do AI systems fail? Learn the warning signs we shouldn't ignore when building safe AI.

IV Testing Ground

Evaluation

How do we evaluate the safety of AI systems? Explore common evals and benchmarks here.

V Key Terms

Glossary

Encountered an unfamiliar AI safety term? Find the definition here.

VI Model Library

System Cards

New AI model just dropped? See what safety evaluations it went through.

— 03 From the Archive Little Discoveries