Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AI-related issues to address #75

Open
Bartleby2718 opened this issue Oct 25, 2018 · 0 comments
Open

AI-related issues to address #75

Bartleby2718 opened this issue Oct 25, 2018 · 0 comments
Labels
question Further information is requested

Comments

@Bartleby2718
Copy link
Owner

  • How should I design my reward function? Having read an OpenAI post, I'm thinking of...

    • a big constant function for winning the game
    • an exponentially decaying function for 'good' or 'bad' behavior
      • ex) good: shouting "die" when the chance of winning is low
      • ex) bad: shouting "die" when the chance of winning is high
  • What should the value of my epsilon be?

    • a constant function: e_k = .1 (.05 afterwards) (source)
    • an inverse function: e_k = 1/k (source)
    • an exponentially decaying function: e_k = 0.9 * a^k for 0 < a < 1 (source)
  • How should I implement multiple rules? Rules should change depending on whether an input is timed or what happens when the sums of both players are equal. I'm thinking of subclassing Game and overriding necessary classes/methods, but not sure yet.

  • Both players have the same knowledge about their own or the opponent's deck, like the game of Go, Chess, and Gomoku. That means the same algorithm can be applied to both sides in a game, doubling the data received per game. However, I also read that "[Q learning] isn't likely to lead to very good results if you assume that the opponent can also learn. "

@Bartleby2718 Bartleby2718 added the question Further information is requested label Oct 25, 2018
@Bartleby2718 Bartleby2718 self-assigned this Oct 25, 2018
@Bartleby2718 Bartleby2718 removed their assignment Mar 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant