从人工智能安全的角度来看,拥有一个清晰的设计原则和一个清晰的表明了它解决了什么问题的特性描述就意味着我们不必去猜测哪些智能体是安全的。在本文和这篇论文中,我们描述了一种称为当下奖励函数优化的设计原理如何避免奖励函数篡改问题。
从人工智能安全的角度来看,拥有一个清晰的设计原则和一个清晰的表明了它解决了什么问题的特性描述就意味着我们不必去猜测哪些智能体是安全的。在本文和这篇论文中,我们描述了一种称为当下奖励函数优化的设计原理如何避免奖励函数篡改问题。
A Reasonable Theology for Our Time
What if we understood more things?
A Research Blog
Computing with Meaning and Values
LASP - Learning And Signal Processing
Just another WordPress.com site
Ph.D. Candidate at Stanford
Massively Collaborative Theoretical Computer Science Projects
Philosophy, Mathematics, and Logic
by Jessica Taylor
Updates on my research and expository papers, discussion of open problems, and other maths-related topics. By Terence Tao
Random things about software development, machine learning and image processing research.
Just another WordPress.com weblog
Looking askance at reality