The effect of ScaleKernel in PairwiseGP #2779

Thomasq99 · 2025-03-20T10:43:15Z

Hi everyone,

I have a question about how PairwiseGP handles the likelihood: it implicitly fixes the noise at 1 ($\sigma = 1$). Instead of treating $\sigma$ as a hyperparameter, PairwiseGP sets it to 1 and uses ScaleKernel(Kernel).

Does this mean that if the actual noise in the stated preference relations is 0.1, the scale kernel will scale by 10? How does ScaleKernel compensate for fixing the noise at 1?

Additionally, in the paper cited in the code (Chu, W., & Ghahramani, Z. (2005, August). Preference learning with Gaussian processes. In Proceedings of the 22nd International Conference on Machine Learning (pp. 137-144)), the predictive preference (Equation 19) includes $\sigma$. If I want to compute the predictive preference, do I still need to use $\sigma$, or is it handled by the scale kernel?

Balandat (Collaborator) · 2025-03-21T03:26:25Z

At a high level, learning both a kernel output scale and a noise variance would overparameterize the model. @ItsMrLin should be able to provide more details on the reasoning here!
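A minimal numerical sketch of the overparameterization point, assuming the probit preference likelihood in the form given by Chu & Ghahramani (2005), $\Phi\big((f(u) - f(v)) / (\sqrt{2}\,\sigma)\big)$, and a predictive preference of the form $\Phi\big((\mu_u - \mu_v) / \sqrt{2\sigma^2 + \mathrm{Var}[f(u) - f(v)]}\big)$. The function names and the numbers below are hypothetical and chosen only for illustration; this is not BoTorch's implementation. It suggests that dividing the latent function by $\sigma$ (which multiplies the kernel variance by $1/\sigma^2$) while setting the noise to 1 leaves both probabilities unchanged, so only the ratio of signal scale to noise is identifiable.

```python
import math


def normal_cdf(z: float) -> float:
    """Standard normal CDF, written with the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def preference_likelihood(f_u: float, f_v: float, sigma: float) -> float:
    """P(u preferred over v | f) under the assumed probit noise model."""
    return normal_cdf((f_u - f_v) / (math.sqrt(2.0) * sigma))


def predictive_preference(mu_u: float, mu_v: float, var_diff: float, sigma: float) -> float:
    """Assumed predictive form; var_diff is the posterior Var[f(u) - f(v)]."""
    return normal_cdf((mu_u - mu_v) / math.sqrt(2.0 * sigma**2 + var_diff))


# Hypothetical "true" setting: latent utilities on a small scale, noise 0.1.
sigma = 0.1
f_u, f_v = 0.32, 0.25
var_diff = 0.004

# Original parameterization.
p_lik_original = preference_likelihood(f_u, f_v, sigma)
p_pred_original = predictive_preference(f_u, f_v, var_diff, sigma)

# Rescaled parameterization: latent values divided by sigma, noise fixed at 1.
# Variances scale with the square of the factor, i.e. by 1 / sigma**2.
p_lik_rescaled = preference_likelihood(f_u / sigma, f_v / sigma, 1.0)
p_pred_rescaled = predictive_preference(f_u / sigma, f_v / sigma, var_diff / sigma**2, 1.0)

print(abs(p_lik_original - p_lik_rescaled) < 1e-12)
print(abs(p_pred_original - p_pred_rescaled) < 1e-12)
```

Under these assumptions, a true noise of 0.1 corresponds to latent values that are 10 times larger and a kernel variance (output scale) that is 100 times larger, and the predictive preference would be evaluated with $\sigma = 1$ on that rescaled latent function.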

Thomasq99 Mar 21, 2025
Author

Thank you! I managed to find out the reasoning. I could not find a way to delete the discussion.
