arxiv:2604.12710
Junxiao Yang
yangjunxiao2021
AI & ML interests
Alignment/AI safety
Recent Activity
authored a paper 6 days ago
LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety submitted a paper 7 days ago
LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety