Add metrax_example colab notebook #104
base: main
Conversation
This is great, thanks Jiwon! Left a bunch of comments but they're mainly optional suggestions.
"source": [
"Please connect to `Metrax (go/metrax)` colab runtime.\n",
"\n",
"If you don't see `Metrax (go/metrax)` from the dropdown menu, please run `/google/bin/releases/colaboratory/public/tools/authorize_colab` on your gLinux workstation or cloudtop and try again."
Remove from external version
done.
"The core `metrax` API is functional and stateless, making it a natural fit for JAX. It works by creating immutable `Metric` state objects that can be merged.\n",
"\n",
"Each `metrax` metric inherits the CLU [`metric`](http://shortn/_e70RtO7j36) class and provides the following APIs:\n",
"\n",
One idea (feel free to ignore): it might be useful to describe the lifecycle of a CLU metric so it's easier for users to understand the list of methods below. Something like:

The usual pattern of using a CLU metric is to call `Metric.empty()` once to create a metric object, then call `metric.merge(Metric.from_model_output(y_true, y_pred))` for each subsequent batch of outputs, then finally call `metric.compute()` to get the final result.
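That lifecycle can be sketched with a minimal hand-rolled running-mean metric; the class below is illustrative and stands in for the real CLU `Metric` base class, it is not the actual API:

```python
import dataclasses


@dataclasses.dataclass(frozen=True)
class Mean:
    """Illustrative stand-in for a CLU-style metric: immutable, mergeable state."""
    total: float
    count: int

    @classmethod
    def empty(cls):
        # Identity element: merging it with any state is a no-op.
        return cls(total=0.0, count=0)

    @classmethod
    def from_model_output(cls, values):
        # Build a state object from one batch of model outputs.
        return cls(total=float(sum(values)), count=len(values))

    def merge(self, other):
        # Combine two states; the operation is associative and commutative.
        return Mean(self.total + other.total, self.count + other.count)

    def compute(self):
        # Reduce the accumulated state to the final metric value.
        return self.total / self.count


# Usual pattern: start empty, merge one state per batch, compute at the end.
metric = Mean.empty()
for batch in [[1.0, 2.0], [3.0], [4.0, 5.0, 6.0]]:
    metric = metric.merge(Mean.from_model_output(batch))
print(metric.compute())  # 3.5
```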
done.
"print(\"--- Method 1: Full-Batch Calculation (on all 32 samples) ---\")\n",
"full_batch_results = {}\n",
"for name, MetricClass in metrics_to_compute.items():\n",
" # Conditionally add sample_weights for supported metrics.\n",
Do you think it would make sense to split this cell into an initial simpler example without sample weights, then an example with sample weights? I just worry that there's a lot of logic in this cell that might obscure the basic usage of the API.
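For reference, the simpler first cell could start from something this small; the data and plain-Python precision math below are illustrative stand-ins for the actual metrax call:

```python
# Hypothetical "simpler first cell": binary precision on one full batch,
# no sample weights, no conditional kwargs. Data is made up for illustration.
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 1, 1, 0, 0, 1, 0, 1]  # already-thresholded predictions

true_positives = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
predicted_positives = sum(p == 1 for p in y_pred)
precision = true_positives / predicted_positives
print(precision)  # 3 TP out of 5 predicted positives -> 0.6
```

A second cell could then layer `sample_weights` on top of the same data, so the weighting logic is the only new thing the reader has to absorb.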
"for name in metrics_to_compute.keys():\n",
" assert np.allclose(full_batch_results[name], iterative_results[name])\n",
"\n",
"print(\"✅ Success! Both methods produce identical results.\")"
I wonder if verifying that batch and iterative results match is relevant to end users? It seems like more of a detail that's important to the implementers of the library; as long as it's tested, I'm not sure end users will be worrying about doing this check.
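To make that point concrete: the check passes by construction, because the metric state is just additive sufficient statistics. A plain-Python sketch with illustrative precision counts (not the metrax internals):

```python
# The metric state here is (true positives, predicted positives). Addition is
# associative, so accumulating per batch yields the same state as one
# full-batch pass; the equality is a property of the design, not of the data.
batches = [([1, 0, 1], [1, 1, 1]), ([1, 0, 0], [0, 0, 1])]  # (y_true, y_pred)

# Full-batch pass.
all_true = [t for ts, _ in batches for t in ts]
all_pred = [p for _, ps in batches for p in ps]
full_tp = sum(t and p for t, p in zip(all_true, all_pred))
full_pp = sum(all_pred)

# Iterative pass, one batch at a time.
tp = pp = 0
for ts, ps in batches:
    tp += sum(t and p for t, p in zip(ts, ps))
    pp += sum(ps)

assert (tp, pp) == (full_tp, full_pp)
print(tp / pp)  # 0.5 either way
```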
" update_kwargs['sample_weights'] = sample_weights\n",
" if name in metrics_with_threshold:\n",
" update_kwargs['threshold'] = 0.5\n",
" metric_obj.update(**update_kwargs)\n",
There's a lot of logic and kwarg updating here; I wonder if it'd be easier for users to understand if it didn't automate as much? I.e. just calling `Precision.update()` directly instead of computing the kwargs. It may result in around the same number of LOC and be more readable for newcomers.
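A toy before/after of that suggestion, with a stub `update()` standing in for the real `metric_obj.update` (all names here are illustrative, not the notebook's actual variables):

```python
def update(metric_name, y_true, y_pred, **kwargs):
    # Stub for metric_obj.update(); it just records what it was called with,
    # so the two styles below can be compared.
    return (metric_name, sorted(kwargs))


# Automated style: build kwargs conditionally, then make one generic call.
metrics_with_weights = {"precision"}
metrics_with_threshold = {"precision"}
name = "precision"
update_kwargs = {}
if name in metrics_with_weights:
    update_kwargs["sample_weights"] = [1.0, 1.0]
if name in metrics_with_threshold:
    update_kwargs["threshold"] = 0.5
automated = update(name, [1, 0], [0.9, 0.2], **update_kwargs)

# Explicit style: one direct call per metric, no kwarg bookkeeping.
explicit = update("precision", [1, 0], [0.9, 0.2],
                  sample_weights=[1.0, 1.0], threshold=0.5)

assert automated == explicit  # same call; the second reads top-to-bottom
```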
" print(f\"{name}: {full_batch_results_nnx[name]}\")\n",
"\n",
"\n",
"# --- Method 2: Iterative Updating by Batch (nnx) ---\n",
(Optional) it might be worth splitting these two methods into separate cells for clarity (feel free to ignore)
" print(f\"{name}: {iterative_results_nnx[name]}\")\n",
"\n",
"\n",
"# --- Verification ---\n",
IMO this might be able to be removed depending on how you feel about how relevant this is to end users
"\n",
"### Method 2: The `jit` and `Mesh` Approach (Advanced Parallelism)\n",
"\n",
"For more advanced control over distributed computation, JAX provides an explicit sharding mechanism using the `jax.sharding` API. This **SPMD (Single Program, Multiple Data)** approach is more powerful and flexible than `pmap` and is the standard for large-scale models.\n",
I thought Method 1 was also SPMD?
"id": "C3YWS1_x19DJ"
},
"source": [
"## 🧠 Advanced Use: Multi-Host Environments\n",
I like the shoutout here, for anything further I think it makes sense for this to be in its own Colab
"\n",
"# --- 1. Metric Calculation Functions ---\n",
"\n",
"# Method 1: pmap (Simple Data Parallelism)\n",
This is great! Just confirming: is pmap still recommended? It seems like maybe shard_map is the new API for manual parallelism[1], though for this intro guide I think it could make sense to only discuss `jit()`.
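Whichever API the guide settles on (`pmap`, `shard_map`, or `jit` with sharding annotations), the metric-side pattern is the same: each device computes a partial metric state on its shard, and the states are merged. A device-free plain-Python sketch of that reduction, with the actual sharding machinery elided:

```python
# Each "device" computes a partial (sum, count) state on its shard of the
# batch; the states are then merged exactly as in single-device use. This is
# the reduction that pmap/shard_map/jit would perform, minus the real devices.
def partial_state(shard):
    return (sum(shard), len(shard))


def merge(a, b):
    return (a[0] + b[0], a[1] + b[1])


shards = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]  # one per "device"
states = [partial_state(s) for s in shards]  # the embarrassingly parallel step

total, count = states[0]
for s in states[1:]:  # the all-reduce step
    total, count = merge((total, count), s)
print(total / count)  # 4.5
```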