
Commit 03008ec

adding testing drl topic
Signed-off-by: ncardozo <[email protected]>
1 parent 66f754e commit 03008ec

1 file changed

+29
-0
lines changed

_thesistopics/testing-drl.md

Lines changed: 29 additions & 0 deletions
@@ -0,0 +1,29 @@
1+
---
2+
title: Testing Deep Reinforcement Learning Algorithms.
3+
period: 2024
4+
level: master
5+
area: ai
6+
abstract:
7+
people: Nicolás Cardozo
8+
file: testing-drl
9+
layout: default
10+
---
11+
12+
### Context
13+
14+
Reinforcement Learning (RL), and in particular Deep Reinforcement Learning (DeepRL), are popular learning techniques. However, the complexity and high dimensionality of such systems make it difficult to provide any assurances about system behavior. Moreover, because most of the responsibility for the behavior is delegated to the neural network, which is a black box for programmers, evaluating and testing the algorithms is not transparent.
15+
16+
### Project proposal
17+
18+
In this project, we propose to build a testing framework for RL algorithms. The idea is to develop a unifying theory for testing different properties (e.g., fairness, correctness, convergence, bias) across different DeepRL algorithms (e.g., DQN, DDQN, A3C, PPO, T3PO).
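
To make the idea of testing a property across algorithms more concrete, here is a minimal sketch in Python. It is only an illustration under strong assumptions: a tabular Q-learning agent stands in for a DeepRL algorithm, the environment is a hypothetical 5-state chain, and the names `step`, `q_learning`, `converges_to_optimum`, and `failing_seeds` are invented for this example and are not part of any existing framework. The property checked is a simple form of convergence: for every random seed, the learned greedy policy always moves toward the goal and the learned values are close to the known optimum.

```python
import random

# Hypothetical toy environment: a chain of 5 states, goal at the right end.
N_STATES, N_ACTIONS, GOAL = 5, 2, 4
GAMMA = 0.9


def step(state, action):
    """Deterministic chain: action 1 moves right, action 0 moves left."""
    nxt = min(state + 1, GOAL) if action == 1 else max(state - 1, 0)
    done = nxt == GOAL
    return nxt, (1.0 if done else 0.0), done


def q_learning(seed, episodes=500, alpha=0.5, epsilon=0.1):
    """Tabular Q-learning with optimistic initial values (assumed stand-in agent)."""
    rng = random.Random(seed)
    q = [[1.0] * N_ACTIONS for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap the episode length
            if rng.random() < epsilon:
                action = rng.randrange(N_ACTIONS)
            else:
                action = max(range(N_ACTIONS), key=lambda a: q[state][a])
            nxt, reward, done = step(state, action)
            target = reward if done else GAMMA * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def converges_to_optimum(q, tol=1e-2):
    """Property: greedy policy always moves right and Q(s, right) is near optimal."""
    for s in range(GOAL):
        greedy = max(range(N_ACTIONS), key=lambda a: q[s][a])
        optimal = GAMMA ** (GOAL - 1 - s)  # known optimum for this chain
        if greedy != 1 or abs(q[s][1] - optimal) > tol:
            return False
    return True


def failing_seeds(train, prop, seeds):
    """Run a training function under several seeds; return seeds violating the property."""
    return [seed for seed in seeds if not prop(train(seed))]


if __name__ == "__main__":
    print("seeds violating convergence:",
          failing_seeds(q_learning, converges_to_optimum, range(5)))
```

Because `failing_seeds` only assumes a training function that returns a value table, the same property could in principle be reused for other agents; properties such as fairness or bias would need their own, more careful definitions.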
19+
20+
### Implementation plan
21+
22+
To build the testing framework for DeepRL, we will take inspiration from existing testing approaches for RL or general Deep Neural Network ML algorithms. Based on these approaches, we will build a unifying theory that allows us to test different properties over a range of DeepRL algorithms.
23+
24+
Initially, we will evaluate the appropriateness of existing techniques on DeepRL algorithms. Then, based on the limitations of these techniques, we will propose extensions to the theory to cover as many algorithms and as many system properties as possible.
25+
26+
27+
### Contact
28+
29+
n.cardozo
