PeeteKeesel / sokoban-ai Public

ToDos

General

state_after_action(self, a)
- test
successors()
- test
set_children()
- test
get_children()
- test
class structure
- tests
for a given state, build a game tree until either max_steps is reached or the game is finished
- test
clean code
track metrics in a separate file for later plotting
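
As a rough illustration of the game-tree item above, here is a minimal sketch of a depth-limited tree builder. The names `Node`, `build_tree`, `is_finished` and `count_nodes` are hypothetical and not taken from the repository; `successors` is passed in as a plain function, so the toy example below uses integers instead of real Sokoban states.

```python
from dataclasses import dataclass, field
from typing import Callable, Hashable, Iterable, List


@dataclass
class Node:
    """One node of the game tree (hypothetical structure, not the repo's class)."""
    state: Hashable
    depth: int = 0
    children: List["Node"] = field(default_factory=list)


def build_tree(
    state: Hashable,
    successors: Callable[[Hashable], Iterable[Hashable]],
    is_finished: Callable[[Hashable], bool],
    max_steps: int,
) -> Node:
    """Expand the tree from `state` until max_steps is reached or the game is finished."""
    root = Node(state)
    stack = [root]
    while stack:
        node = stack.pop()
        # Stop expanding at the depth limit or at a finished game.
        if node.depth >= max_steps or is_finished(node.state):
            continue
        for next_state in successors(node.state):
            child = Node(next_state, node.depth + 1)
            node.children.append(child)
            stack.append(child)
    return root


def count_nodes(node: Node) -> int:
    """Count all nodes in the subtree rooted at `node`."""
    return 1 + sum(count_nodes(child) for child in node.children)


# Toy domain: a state is an integer, each step adds 1 or 2, the game ends at >= 3.
toy_successors = lambda n: [n + 1, n + 2]
toy_finished = lambda n: n >= 3
shallow = build_tree(0, toy_successors, toy_finished, max_steps=2)
full = build_tree(0, toy_successors, toy_finished, max_steps=10)
```

The sketch does not merge repeated states, so the tree grows quickly; a real Sokoban version would probably want a visited set or a transposition table.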

Comparison Algorithms

implement different search algorithms
- Backtracking
  - tests
- Depth First Search (DFS)
  - tests
- Breadth First Search (BFS)
  - tests
- Uniform Cost Search (UCS)
  - tests
- A*
  - tests
  - manhattan_distance + test
  - manhattan_heuristic + test
  - other Heuristics
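
The two Manhattan items above could look roughly like the following sketch. It assumes positions are `(row, col)` tuples and that the heuristic sums, for each box, the distance to its nearest goal; the repository may define `manhattan_heuristic` differently.

```python
from typing import Iterable, Tuple

Position = Tuple[int, int]  # assumed (row, col) representation


def manhattan_distance(a: Position, b: Position) -> int:
    """Grid distance between two cells, ignoring walls."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


def manhattan_heuristic(boxes: Iterable[Position], goals: Iterable[Position]) -> int:
    """Sum over all boxes of the distance to the nearest goal.

    Assumes at least one goal exists. Several boxes may pick the same goal,
    so the value can only underestimate the number of pushes still needed.
    """
    goals = list(goals)
    return sum(min(manhattan_distance(box, goal) for goal in goals) for box in boxes)


example = manhattan_heuristic(boxes=[(1, 1), (3, 4)], goals=[(1, 3), (3, 3)])
```

Because it never overestimates the remaining pushes, this estimate should be usable as an admissible A* heuristic when the path cost counts pushes.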

RL Algorithm Ideas

implement MCTS and make it runnable
- different policies
  - random
  - eps-greedy
Single Agent from Feng et al., 2020 (page 6)
- implement ResNets/ConvNets for Learning
- implement MCTS for Planning
  - tests
AlphaGo
- MCTS
- Integrate a DCNN to predict the value of a state and the probabilities of the possible moves
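
For the random and eps-greedy policies listed under MCTS above, a minimal sketch of the action choice might look like this. The function name `eps_greedy` and the `action_values` mapping (action to estimated value) are assumptions for illustration, not the repository's API.

```python
import random
from typing import Dict, Hashable


def eps_greedy(action_values: Dict[Hashable, float], eps: float, rng=random) -> Hashable:
    """Pick a random action with probability eps, otherwise the best-valued one.

    eps = 1.0 gives the purely random policy; eps = 0.0 is purely greedy.
    """
    actions = list(action_values)
    if rng.random() < eps:
        return rng.choice(actions)
    return max(actions, key=action_values.get)


# Hypothetical value estimates for the four Sokoban moves.
values = {"U": 0.1, "D": 0.7, "L": -0.2, "R": 0.3}
greedy_choice = eps_greedy(values, eps=0.0)
random_choice = eps_greedy(values, eps=1.0, rng=random.Random(0))
```

Passing a seeded `random.Random` instance keeps rollouts reproducible, which may help when writing the tests planned above.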

Additional Todos

implement deadlock detection
train CNN to predict best possible action for a given state
figure out how to play a single world to test an agent's behaviour
research which algorithms to implement

Optional Tasks

Deadlock detection (helps make the game tree sparser)
change DFS and BFS to recursive implementations
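
As one possible starting point for the deadlock detection item above, the sketch below checks only the simplest case: a box that sits in a corner formed by two walls and is not on a goal can never be pushed again. The set-based representation of walls and goals and the function names are assumptions; more general deadlocks (for example frozen box groups) are not covered.

```python
from typing import Iterable, Set, Tuple

Position = Tuple[int, int]  # assumed (row, col) representation


def is_corner_deadlock(box: Position, walls: Set[Position], goals: Set[Position]) -> bool:
    """True if the box is stuck in a wall corner and is not on a goal."""
    if box in goals:
        return False
    r, c = box
    blocked_vertically = (r - 1, c) in walls or (r + 1, c) in walls
    blocked_horizontally = (r, c - 1) in walls or (r, c + 1) in walls
    # A wall on one vertical side and one horizontal side rules out every push.
    return blocked_vertically and blocked_horizontally


def has_deadlock(boxes: Iterable[Position], walls: Set[Position], goals: Set[Position]) -> bool:
    """True if any box is in a corner deadlock; such states can be pruned."""
    return any(is_corner_deadlock(box, walls, goals) for box in boxes)


# Top-left corner of a room: walls above and to the left of cell (1, 1).
walls = {(0, 0), (0, 1), (1, 0)}
stuck = has_deadlock([(1, 1)], walls, goals=set())
on_goal = has_deadlock([(1, 1)], walls, goals={(1, 1)})
```

Skipping successors for which `has_deadlock` is true is one way to make the game tree sparser, as the item suggests.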

Questions

When comparing the performance of the previous and the new network, how should the set of Sokoban environments to test on be chosen?
How to structure the NN architecture?
