강화 학습 참고할 사이트 :: 세상은 내가 정하는 대로

ABOUT ME

-

Today: -

Yesterday: -

Total: -

강화 학습 참고할 사이트

AI 2021. 9. 23. 13:16

728x90

https://dnddnjs.gitbooks.io/rl/content/policy_iteration.html

Policy Iteration · Fundamental of Reinforcement Learning

dnddnjs.gitbooks.io

https://github.com/rlcode/reinforcement-learning-kr

GitHub - rlcode/reinforcement-learning-kr: [파이썬과 케라스로 배우는 강화학습] 예제

[파이썬과 케라스로 배우는 강화학습] 예제. Contribute to rlcode/reinforcement-learning-kr development by creating an account on GitHub.

github.com

https://sumniya.tistory.com/10?category=781573

[Ch.4] Dynamic Programming

이번 포스팅에서 다룰내용은 Dynamic Programming(이하 DP)입니다. 강화학습은 시간에 따라 step별로 action을 취하는 문제를 MDP로 정의하여 푸는 방법 중에 하나인데, DP도 마찬가지 입니다. 차이점

sumniya.tistory.com

https://tonylim.tistory.com/88

Reinforcement Learning

각 state에서 어떤 액션을 취하는게 좋은것인지를 각각의 Q 값을 통해 알수있다. 학습이 잘되어있는 경우에는 하지만 Q table은 처음부터 정답을 알려주지 않는다. 학습을 계속 반복적으로 수행해

tonylim.tistory.com

https://www.codenong.com/cs105527154/

Chapter 1 – 9: Solve OpenAI Gym’s Taxi-v2 Task | 码农家园

import numpy as np from collections import defaultdict class Agent: def __init__(self, nA=6): """ Initialize agent. Params ====== - nA: number of actions available to the agent - Q: state-ac

www.codenong.com

728x90

'AI' 카테고리의 다른 글

[2023 Spring NLP Seminar] Data-Centric AI : Chapter 1 (0)	2023.04.05
Discriminative Model / Generative Model (1)	2023.02.24
Argparse 자습서 (0)	2021.09.08
Global Average Pooling (0)	2021.09.03

관련글 관련글 더보기

인기포스트

ABOUT ME

LINK

ADMIN

티스토리툴바