@prashantcraju

Summary

  • align the task dependency simulator’s stats aggregation with the refactored metta/rl/stats.py, summing per_label_samples/evictions/tracked_task_completions instead of averaging them (see the sketch after this list)
  • confirm the simulator still logs identical WandB telemetry (mean performance, tasks above threshold, sampling gini/entropy, dependency waterfall)
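
A minimal sketch of the intended aggregation behavior, assuming a simple list-of-dicts stats layout (the function name, dict shapes, and helper below are illustrative, not the actual metta/rl/stats.py API): count-like stats are summed across environments, while rate-like metrics such as mean performance remain averaged.

```python
# Illustrative sketch only: count-like stats (samples, evictions, completions)
# are summed across environments; other metrics keep being averaged.
from collections import defaultdict
from statistics import mean

# Stats that represent counts and must be summed, not averaged.
SUMMED_KEYS = {"per_label_samples", "evictions", "tracked_task_completions"}


def aggregate_env_stats(per_env_stats: list[dict[str, float]]) -> dict[str, float]:
    """Aggregate a list of per-environment stat dicts into one dict."""
    grouped: dict[str, list[float]] = defaultdict(list)
    for stats in per_env_stats:
        for key, value in stats.items():
            grouped[key].append(value)

    aggregated: dict[str, float] = {}
    for key, values in grouped.items():
        if key in SUMMED_KEYS:
            aggregated[key] = sum(values)   # counts accumulate across envs
        else:
            aggregated[key] = mean(values)  # e.g. mean performance stays a mean
    return aggregated


# Example: three envs each report 2 evictions -> total 6, not an average of 2.
print(aggregate_env_stats([
    {"evictions": 2, "mean_performance": 0.5},
    {"evictions": 2, "mean_performance": 0.7},
    {"evictions": 2, "mean_performance": 0.6},
]))
```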

Testing

uv run ./tools/run.py recipes.experiment.curriculum_test.task_dependency_simulator.train num_epochs=2 samples_per_epoch=5 run=test_refactor_stats

uv run ./tools/run.py recipes.experiment.curriculum_test.task_dependency_simulator.train num_epochs=500 samples_per_epoch=10 num_envs=32 run=integration_test-

WandB proof: https://wandb.ai/metta-research/curriculum_test/runs/d6ozn564

Evidence

  • Gini_report.pdf

  • Combined plot showing mean performance, tasks above threshold, sampling gini/entropy, eviction counts, and dependency waterfall
