TF-Agents: A library for Reinforcement Learning in TensorFlow

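Note: The current pre-release of TF-Agents is under active development, and interfaces may change at any time. Feel free to provide feedback and comments.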

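To get started, we recommend checking out one of our Colab tutorials. If you need an introduction to RL (or a quick recap), start here. Otherwise, check out our DQN tutorial to get an agent up and running in the Cartpole environment.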

Table of contents

Agents
Tutorials
Examples
Installation
Contributing
Principles
Citation
Disclaimer

Agents

In TF-Agents, the core elements of RL algorithms are implemented as Agents. An agent encompasses two main responsibilities: defining a Policy to interact with the Environment, and defining how to learn/train that Policy from collected experience.
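To make the two responsibilities concrete, here is a minimal, library-free sketch of the idea. It does not use or reproduce the TF-Agents API: `BanditAgent` and `EpsilonGreedyPolicy` are hypothetical names invented for this illustration, and the "environment" is reduced to a list of (action, reward) pairs.

```python
import random


class EpsilonGreedyPolicy:
    """Hypothetical policy: turns the agent's value estimates into an action."""

    def __init__(self, values, epsilon, rng):
        self._values = values    # list shared with the agent, updated by train()
        self._epsilon = epsilon  # probability of picking a random action
        self._rng = rng

    def action(self):
        # Explore with probability epsilon, otherwise pick the best-known action.
        if self._rng.random() < self._epsilon:
            return self._rng.randrange(len(self._values))
        return max(range(len(self._values)), key=self._values.__getitem__)


class BanditAgent:
    """Hypothetical agent: owns the policies and the rule that trains them."""

    def __init__(self, num_actions, epsilon=0.1, seed=0):
        self._values = [0.0] * num_actions
        self._counts = [0] * num_actions
        rng = random.Random(seed)
        # Responsibility 1: define policies that interact with the environment.
        self.collect_policy = EpsilonGreedyPolicy(self._values, epsilon, rng)
        self.policy = EpsilonGreedyPolicy(self._values, 0.0, rng)  # purely greedy

    def train(self, experience):
        # Responsibility 2: learn from collected (action, reward) experience
        # by keeping a running mean of the reward seen for each action.
        for action, reward in experience:
            self._counts[action] += 1
            step = (reward - self._values[action]) / self._counts[action]
            self._values[action] += step


agent = BanditAgent(num_actions=3)
agent.train([(0, 0.1), (1, 0.9), (2, 0.5), (1, 0.7)])
print(agent.policy.action())
```

The policies only read the value estimates and the agent only writes them, which mirrors the split described above: acting is the policy's job, learning is the agent's.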

Currently the following algorithms are available under TF-Agents:

Tutorials

See the project's tutorials for walkthroughs of the major components provided.

Examples

End-to-end examples of training agents can be found under each agent directory.

Installation

To install the latest version, use nightly builds of TF-Agents under the pip package tf-agents-nightly, which requires that you install one of tf-nightly and tf-nightly-gpu, and also tfp-nightly. Nightly builds include newer features, but may be less stable than the versioned releases.

To install the nightly build version, run the following:

pip install --upgrade tf-agents-nightly

If you clone the repository, you will still need a nightly TensorFlow installation. You can then install, from the agents directory, the dependencies needed to run the tests.
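After installing, one way to confirm which build is present is to ask Python's packaging metadata. The sketch below is only a convenience check, not part of TF-Agents; `installed_version` is a hypothetical helper name, and the tuple of distribution names is an assumption you can extend with the TensorFlow packages you chose.

```python
from importlib import metadata


def installed_version(distribution_name):
    """Return the installed version of a pip distribution, or None if absent."""
    try:
        return metadata.version(distribution_name)
    except metadata.PackageNotFoundError:
        return None


# Distribution names to check; add your TensorFlow nightly package here.
for name in ("tf-agents-nightly",):
    print(name, installed_version(name) or "not installed")
```

Because nightly versions change daily, printing the version string is also a quick way to record which build a given experiment ran against.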

Contributing

We're eager to collaborate with you! See the contributing guide for how to contribute. This project adheres to TensorFlow's code of conduct. By participating, you are expected to uphold this code.

Principles

This project adheres to Google's AI principles. By participating in, using, or contributing to this project, you are expected to adhere to these principles.

Citation

If you use this code, please cite it as:

@misc{TFAgents,
  title = {{TF-Agents}: A library for Reinforcement Learning in TensorFlow},
  author = "{Sergio Guadarrama, Anoop Korattikara, Oscar Ramirez,
    Pablo Castro, Ethan Holly, Sam Fishman, Ke Wang, Ekaterina Gonina, Neal Wu,
    Chris Harris, Vincent Vanhoucke, Eugene Brevdo}",
  howpublished = {\url{https://github.com/tensorflow/agents}},
  url = "https://github.com/tensorflow/agents",
  year = 2018,
  note = "[Online; accessed 25-June-2019]"
}

Disclaimer

This is not an official Google product.
