JapaneseClass.jp

Category: 強化学習 (Reinforcement learning)

人間のフィードバックによる強化学習
Reinforcement learning from human feedback
近接方策最適化
Proximal policy optimization
SARSA法
State–action–reward–state–action
ベイズ強化学習
Bayesian reinforcement learning
DQN (コンピュータ)
Deep Q-network (DQN)
モデルフリー (強化学習)
Model-free (reinforcement learning)
分布ソフト・アクター・クリティック法
Distributional Soft Actor Critic
エンドツーエンドの強化学習
End-to-end reinforcement learning
階層型強化学習
Hierarchical reinforcement learning
Q学習
Q-learning
時間差分学習
Temporal difference learning
強化学習
Reinforcement learning
