Skip to content

AGI Watchful Guardians

  • Home
  • About
  • Alignment Newsletter in Chinese
  • Nick Bostrom’s latest work in Chinese
  • Research

Tag: Alignment

January 2, 2019 Xiaohu Zhu AGI, Alignment

基于奖励建模的可扩展智能体对齐

我们需要一种良好的方式提供反馈并让智能体可靠地理解我们所想要的东西,从而帮助我们达成这些目标。换言之,我们希望在有人类反馈的情形下以一种让系统行为和我们的意图对齐的方式来训练人工智能系统。

Posts navigation

Newer posts

Strong, but safe.

Search

Archives

Categories

AGI AI Safety Alignment Beneficial CID CSAGI DeepMind Intelligence KL divergence Machine Learning Theory OpenAI Side effects 未分类

Follow me on Twitter

My Tweets

Blogs I Follow

  • Foundation Operation X for languages, cultures and perspectives
  • Brian Lui's blog
  • The Divine Life Communion
  • Aceso Under Glass
  • Windows On Theory
  • Victoria Krakovna
  • Self-Aware Systems
  • Steve Omohundro
  • laspucl2016.wordpress.com/
  • Neural Networks Blog
  • Mina Lee
  • The PolyTCS Project
  • Neil Barton
  • Unstable Ontology
  • What's new
  • William J Shipman
  • Kris Carlson
  • Marco Bonzanini
  • The sideways view
  • Gregory Lewis

Tags

AAAI AGI AI AI Safety AIsafety Alignment AN bayes-optimal Beneficial AI Books CID COLT CSAGI DeepMind DeepRL Divergence Exploration HAI ICML Incentives KL Learning Machine Learning Nick Bostrom ontological conflicts PapeRman Papers Planning REALab Reinforcement Learning Research reward modeling risks RL SeftEffects Shakir social ontology Stanford Tutorial UL

Authors

  • Xiaohu Zhu
    • 人工智能书籍推荐:将这些添加到您的阅读列表
    • 齐智通讯 第 173 期 来自DeepMind的语言模型
    • Compositional game theory reading list
    • 本体论冲突与欧洲人民的故事
    • 读论文:本体危机
Create a website or blog at WordPress.com
Foundation Operation X for languages, cultures and perspectives

Brian Lui's blog

The Divine Life Communion

A Reasonable Theology for Our Time

Aceso Under Glass

What if we understood more things?

Windows On Theory

A Research Blog

Victoria Krakovna

Self-Aware Systems

Computing with Meaning and Values

Steve Omohundro

laspucl2016.wordpress.com/

LASP - Learning And Signal Processing

Neural Networks Blog

Just another WordPress.com site

Mina Lee

Ph.D. Candidate at Stanford

The PolyTCS Project

Massively Collaborative Theoretical Computer Science Projects

Neil Barton

Philosophy, Mathematics, and Logic

Unstable Ontology

by Jessica Taylor

What's new

Updates on my research and expository papers, discussion of open problems, and other maths-related topics. By Terence Tao

William J Shipman

Random things about software development, machine learning and image processing research.

Kris Carlson

Just another WordPress.com weblog

Marco Bonzanini

The sideways view

Looking askance at reality

Gregory Lewis

  • Follow Following
    • AGI Watchful Guardians
    • Already have a WordPress.com account? Log in now.
    • AGI Watchful Guardians
    • Customize
    • Follow Following
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar