-
Notifications
You must be signed in to change notification settings - Fork 0
Evaluation Method
ai-lab-projects edited this page Apr 29, 2025
·
1 revision
This page describes how the trained agents (buyer and seller) are evaluated.
- The sum of all rewards (profits and losses) achieved during the evaluation period.
- Higher is better.
- The percentage of profitable trades among all completed trades.
- Formula: (Number of winning trades) / (Total number of trades)
- The mean profit or loss per trade.
- Important for understanding trade quality beyond just win rate.
- Average number of days the asset was held before selling.
- Helps assess the strategy's turnover speed.
- Proportion of time spent holding positions vs. waiting.
- Statistical evaluation of strategy performance.
- Compares total reward to the distribution of random trading simulations.
- A lower p-value indicates a lower probability that the observed results are due to random chance.
- A measure comparing the total reward achieved to the mean reward of random strategies.
-
Evaluation on Training Data
- To check how well the model fits the training set.
-
Evaluation on Validation Data
- To assess the model’s ability to generalize to unseen data.
-
Random Simulations for p-Value
- Conduct random buy/sell simulations to establish a null distribution.
- Calculate the p-value by comparing the agent's result to random performance.
- Evaluation is conducted without exploration (i.e., agents use deterministic policies without random actions).
- Early stopping is performed based on validation results to avoid overfitting.
- Models are saved if they achieve the best validation p-value during training.