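We need a good way to provide feedback and to have the agent reliably understand what we want, so that it can help us achieve these goals. In other words, we want to train AI systems with human feedback in a way that aligns the system's behavior with our intentions.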