final = w1 × base + w2 × prior + bias
And learns: w1, w2 per class.
PCA embedding ← acoustic features
raw score ← Perch output
prior score ← site/hour prior
base score ← fused base
prev/next score ← temporal context
mean/max score ← smoothing context
Plan is as follows:
P(class | site, hour)PCA reduces the feature vector to its most relevant components:
2048 → maybe 64 or 128 dimensionsProbes grab the feature vectors + other characteristics and predicts the scores itself, rather than letting Perch compute the scores from the feature/embeddings only
probe sees: