NVIDIA
diff --git a/‎docs/README.md‎
Lines changed: 2 additions & 0 deletions b/‎docs/README.md‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎fern/README.md‎
Lines changed: 2 additions & 0 deletions b/‎fern/README.md‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎fern/build_docs.sh‎
Lines changed: 22 additions & 0 deletions b/‎fern/build_docs.sh‎
Lines changed: 22 additions & 0 deletions
diff --git a/‎fern/docs.yml‎
Lines changed: 7 additions & 3 deletions b/‎fern/docs.yml‎
Lines changed: 7 additions & 3 deletions
diff --git a/‎fern/pages/build.md‎
Lines changed: 5 additions & 5 deletions b/‎fern/pages/build.md‎
Lines changed: 5 additions & 5 deletions
diff --git a/‎fern/pages/c_api/c-api-cluster-kmeans.md‎
Lines changed: 159 additions & 11 deletions b/‎fern/pages/c_api/c-api-cluster-kmeans.md‎
Lines changed: 159 additions & 11 deletions
@@ -2,6 +2,8 @@
 
 The cuVS documentation is a Fern project in [../fern](../fern).
 
+Fern requires Node.js 18 or newer. If the docs fail with an error such as `SyntaxError: Unexpected token '.'`, check `node --version` and activate a newer Node.js runtime.
+
 ## Preview locally
 
 ```bash
 
@@ -4,6 +4,8 @@ The cuVS documentation lives in this Fern project. Pages are in `fern/pages`, an
 
 The C, C++, Python, Java, Rust, and Go API reference pages are generated from the source tree by `fern/scripts/generate_api_reference.py`. `fern/build_docs.sh` refreshes those pages before validation, preview, and publish runs.
 
+Fern requires Node.js 18 or newer. If the docs fail with an error such as `SyntaxError: Unexpected token '.'`, check `node --version` and activate a newer Node.js runtime.
+
 ## Preview locally
 
 Start the local preview server from the repository root:
 
@@ -30,6 +30,28 @@ Examples:
 EOF
 }
 
+require_node_18() {
+  if ! command -v node >/dev/null 2>&1; then
+    echo "Fern docs require Node.js 18 or newer, but node was not found on PATH." >&2
+    echo "Install or activate Node.js 18+ before running fern/build_docs.sh." >&2
+    exit 1
+  fi
+
+  local node_version
+  local node_major
+  node_version=$(node -p 'process.versions.node' 2>/dev/null || true)
+  node_major="${node_version%%.*}"
+
+  if [[ ! "${node_major}" =~ ^[0-9]+$ || "${node_major}" -lt 18 ]]; then
+    echo "Fern docs require Node.js 18 or newer, but found Node.js ${node_version:-unknown}." >&2
+    echo "Older Node.js versions can fail with errors such as \"SyntaxError: Unexpected token '.'.\"" >&2
+    echo "Install or activate Node.js 18+ before running fern/build_docs.sh." >&2
+    exit 1
+  fi
+}
+
+require_node_18
+
 if [[ -n "${FERN_CLI:-}" ]]; then
   FERN_CMD=("${FERN_CLI}")
 elif command -v fern >/dev/null 2>&1; then
 
@@ -2,7 +2,7 @@
 
 title: "cuVS"
 instances:
-   - url: "nvidia-cuvs.docs.buildwithfern.com/cuvs"
+  - url: "nvidia-cuvs.docs.buildwithfern.com/cuvs"
     custom-domain: docs/nvidia.com/cuvs
 footer: "./theme/nvidia/components/CustomFooter.tsx"
 logo:
@@ -159,8 +159,12 @@ navigation:
         path: "./pages/user_guide/integration_patterns.md"
   - section: "Developer Guide"
     contents:
-      - page: "Guidelines"
-        path: "./pages/developer_guide.md"
+      - section: "Guidelines"
+        contents:
+          - page: "C++ Guidelines"
+            path: "./pages/cpp_guidelines.md"
+          - page: "Python Guidelines"
+            path: "./pages/python_guidelines.md"
       - section: "Advanced Topics"
         path: "./pages/advanced_topics.md"
         contents:
 
@@ -199,18 +199,18 @@ cuVS has the following configurable cmake flags available:
 
 ### Preview documentation
 
-The cuVS documentation is a Fern project in the repository's `fern` directory. Install the Fern CLI, then run the local preview from the repository root:
+The cuVS documentation is a Fern project in the repository's `fern` directory. Fern requires Node.js 18 or newer. If the docs fail with an error such as `SyntaxError: Unexpected token '.'`, check `node --version` and activate a newer Node.js runtime.
+
+Run the local preview from the repository root:
 
 ```bash
-npm install -g fern-api
-fern docs dev
+fern/build_docs.sh dev
 ```
 
 Fern serves the preview at [http://localhost:3000](http://localhost:3000) by default.
 
 Run the Fern checks before publishing documentation changes:
 
 ```bash
-fern check --warnings --strict-broken-links
-fern docs md check
+fern/build_docs.sh check
 ```
@@ -30,6 +30,8 @@ typedef enum { ... } cuvsKMeansInitMethod;
 
 Hyper-parameters for the kmeans algorithm
 
+NB: The inertia_check field is kept for ABI compatibility. Removed in cuvsKMeansParams_v2. TODO: CalVer for the replacement: 26.08
+
 ```c
 struct cuvsKMeansParams { ... };
 ```
@@ -46,10 +48,40 @@ struct cuvsKMeansParams { ... };
 | `oversampling_factor` | `double` | Oversampling factor for use in the k-means\|\| algorithm |
 | `batch_samples` | `int` | batch_samples and batch_centroids are used to tile 1NN computation which is useful to optimize/control the memory footprint Default tile is [batch_samples x n_clusters] i.e. when batch_centroids is 0 then don't tile the centroids |
 | `batch_centroids` | `int` | if 0 then batch_centroids = n_clusters |
-| `inertia_check` | `bool` | Check inertia during iterations for early convergence. |
+| `inertia_check` | `bool` | Deprecated, ignored. Kept for ABI compatibility. |
 | `hierarchical` | `bool` | Whether to use hierarchical (balanced) kmeans or not |
 | `hierarchical_n_iters` | `int` | For hierarchical k-means , defines the number of training iterations |
 | `streaming_batch_size` | `int64_t` | Number of samples to process per GPU batch for the batched (host-data) API. When set to 0, defaults to n_samples (process all at once). |
+| `init_size` | `int64_t` | Number of samples to draw for KMeansPlusPlus initialization. When set to 0, uses heuristic min(3 * n_clusters, n_samples) for host data, or n_samples for device data. |
+| `metric` | [`cuvsDistanceType`](/api-reference/c-api-distance-distance#cuvsdistancetype) |  |
+
+<a id="cuvskmeansparams-v2"></a>
+### cuvsKMeansParams_v2
+
+Hyper-parameters for the kmeans algorithm
+
+TODO: Remove this after cuvsKMeansParams is replaced in ABI 2.0
+
+```c
+struct cuvsKMeansParams_v2 { ... };
+```
+
+**Fields**
+
+| Name | Type | Description |
+| --- | --- | --- |
+| `n_clusters` | `int` | The number of clusters to form as well as the number of centroids to generate (default:8). |
+| `init` | [`cuvsKMeansInitMethod`](/api-reference/c-api-cluster-kmeans#cuvskmeansinitmethod) | Method for initialization, defaults to k-means++:<br />- cuvsKMeansInitMethod::KMeansPlusPlus (k-means++): Use scalable k-means++ algorithm to select the initial cluster centers.<br />- cuvsKMeansInitMethod::Random (random): Choose 'n_clusters' observations (rows) at random from the input data for the initial centroids.<br />- cuvsKMeansInitMethod::Array (ndarray): Use 'centroids' as initial cluster centers. |
+| `max_iter` | `int` | Maximum number of iterations of the k-means algorithm for a single run. |
+| `tol` | `double` | Relative tolerance with regards to inertia to declare convergence. |
+| `n_init` | `int` | Number of instance k-means algorithm will be run with different seeds. |
+| `oversampling_factor` | `double` | Oversampling factor for use in the k-means\|\| algorithm |
+| `batch_samples` | `int` | batch_samples and batch_centroids are used to tile 1NN computation which is useful to optimize/control the memory footprint Default tile is [batch_samples x n_clusters] i.e. when batch_centroids is 0 then don't tile the centroids |
+| `batch_centroids` | `int` | if 0 then batch_centroids = n_clusters |
+| `hierarchical` | `bool` | Whether to use hierarchical (balanced) kmeans or not |
+| `hierarchical_n_iters` | `int` | For hierarchical k-means , defines the number of training iterations |
+| `streaming_batch_size` | `int64_t` | Number of samples to process per GPU batch for the batched (host-data) API. When set to 0, defaults to n_samples (process all at once). |
+| `init_size` | `int64_t` | Number of samples to draw for KMeansPlusPlus initialization. When set to 0, uses heuristic min(3 * n_clusters, n_samples) for host data, or n_samples for device data. |
 | `metric` | [`cuvsDistanceType`](/api-reference/c-api-distance-distance#cuvsdistancetype) |  |
 
 <a id="cuvskmeansparamscreate"></a>
@@ -58,9 +90,11 @@ struct cuvsKMeansParams { ... };
 Allocate KMeans params, and populate with default values
 
 ```c
-cuvsError_t cuvsKMeansParamsCreate(cuvsKMeansParams_t* params);
+CUVS_EXPORT cuvsError_t cuvsKMeansParamsCreate(cuvsKMeansParams_t* params);
 ```
 
+replaced by cuvsKMeansParamsCreate_v2.
+
 **Parameters**
 
 | Name | Direction | Type | Description |
@@ -69,17 +103,19 @@ cuvsError_t cuvsKMeansParamsCreate(cuvsKMeansParams_t* params);
 
 **Returns**
 
-[`cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
+[`CUVS_EXPORT cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
 
 <a id="cuvskmeansparamsdestroy"></a>
 ### cuvsKMeansParamsDestroy
 
 De-allocate KMeans params
 
 ```c
-cuvsError_t cuvsKMeansParamsDestroy(cuvsKMeansParams_t params);
+CUVS_EXPORT cuvsError_t cuvsKMeansParamsDestroy(cuvsKMeansParams_t params);
 ```
 
+replaced by cuvsKMeansParamsDestroy_v2.
+
 **Parameters**
 
 | Name | Direction | Type | Description |
@@ -88,7 +124,47 @@ cuvsError_t cuvsKMeansParamsDestroy(cuvsKMeansParams_t params);
 
 **Returns**
 
-[`cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
+[`CUVS_EXPORT cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
+
+<a id="cuvskmeansparamscreate-v2"></a>
+### cuvsKMeansParamsCreate_v2
+
+Allocate KMeans params
+
+```c
+CUVS_EXPORT cuvsError_t cuvsKMeansParamsCreate_v2(cuvsKMeansParams_v2_t* params);
+```
+
+Mirrors cuvsKMeansParamsCreate but operates on cuvsKMeansParams_v2. Will become the unsuffixed cuvsKMeansParamsCreate in cuVS 26.08.
+
+**Parameters**
+
+| Name | Direction | Type | Description |
+| --- | --- | --- | --- |
+| `params` | in | [`cuvsKMeansParams_v2_t*`](/api-reference/c-api-cluster-kmeans#cuvskmeansparams-v2) | cuvsKMeansParams_v2_t to allocate |
+
+**Returns**
+
+[`CUVS_EXPORT cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
+
+<a id="cuvskmeansparamsdestroy-v2"></a>
+### cuvsKMeansParamsDestroy_v2
+
+De-allocate KMeans params allocated by cuvsKMeansParamsCreate_v2.
+
+```c
+CUVS_EXPORT cuvsError_t cuvsKMeansParamsDestroy_v2(cuvsKMeansParams_v2_t params);
+```
+
+**Parameters**
+
+| Name | Direction | Type | Description |
+| --- | --- | --- | --- |
+| `params` | in | [`cuvsKMeansParams_v2_t`](/api-reference/c-api-cluster-kmeans#cuvskmeansparams-v2) |  |
+
+**Returns**
+
+[`CUVS_EXPORT cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
 
 <a id="cuvskmeanstype"></a>
 ### cuvsKMeansType
@@ -114,7 +190,7 @@ typedef enum { ... } cuvsKMeansType;
 Find clusters with k-means algorithm.
 
 ```c
-cuvsError_t cuvsKMeansFit(cuvsResources_t res,
+CUVS_EXPORT cuvsError_t cuvsKMeansFit(cuvsResources_t res,
 cuvsKMeansParams_t params,
 DLManagedTensor* X,
 DLManagedTensor* sample_weight,
@@ -127,6 +203,8 @@ Initial centroids are chosen with k-means++ algorithm. Empty clusters are reinit
 
 X may reside on either host (CPU) or device (GPU) memory. When X is on the host the data is streamed to the GPU in batches controlled by params-&gt;streaming_batch_size.
 
+replaced by cuvsKMeansFit_v2.
+
 **Parameters**
 
 | Name | Direction | Type | Description |
@@ -141,15 +219,48 @@ X may reside on either host (CPU) or device (GPU) memory. When X is on the host
 
 **Returns**
 
-[`cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
+[`CUVS_EXPORT cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
+
+<a id="cuvskmeansfit-v2"></a>
+### cuvsKMeansFit_v2
+
+Find clusters with k-means algorithm (v2 params layout).
+
+```c
+CUVS_EXPORT cuvsError_t cuvsKMeansFit_v2(cuvsResources_t res,
+cuvsKMeansParams_v2_t params,
+DLManagedTensor* X,
+DLManagedTensor* sample_weight,
+DLManagedTensor* centroids,
+double* inertia,
+int* n_iter);
+```
+
+Mirrors cuvsKMeansFit but takes cuvsKMeansParams_v2_t. Will become the unsuffixed cuvsKMeansFit in cuVS 26.08.
+
+**Parameters**
+
+| Name | Direction | Type | Description |
+| --- | --- | --- | --- |
+| `res` | in | [`cuvsResources_t`](/api-reference/c-api-core-c-api#cuvsresources-t) | opaque C handle |
+| `params` | in | [`cuvsKMeansParams_v2_t`](/api-reference/c-api-cluster-kmeans#cuvskmeansparams-v2) | Parameters for KMeans model (v2 layout). |
+| `X` | in | `DLManagedTensor*` | Training instances to cluster. The data must be in row-major format. May be on host or device memory. [dim = n_samples x n_features] |
+| `sample_weight` | in | `DLManagedTensor*` | Optional weights for each observation in X. Must be on the same memory space as X. [len = n_samples] |
+| `centroids` | inout | `DLManagedTensor*` | [in] When init is InitMethod::Array, use centroids as the initial cluster centers. [out] The generated centroids from the kmeans algorithm are stored at the address pointed by 'centroids'. Must be on device. [dim = n_clusters x n_features] |
+| `inertia` | out | `double*` | Sum of squared distances of samples to their closest cluster center. |
+| `n_iter` | out | `int*` | Number of iterations run. |
+
+**Returns**
+
+[`CUVS_EXPORT cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
 
 <a id="cuvskmeanspredict"></a>
 ### cuvsKMeansPredict
 
 Predict the closest cluster each sample in X belongs to.
 
 ```c
-cuvsError_t cuvsKMeansPredict(cuvsResources_t res,
+CUVS_EXPORT cuvsError_t cuvsKMeansPredict(cuvsResources_t res,
 cuvsKMeansParams_t params,
 DLManagedTensor* X,
 DLManagedTensor* sample_weight,
@@ -159,6 +270,8 @@ bool normalize_weight,
 double* inertia);
 ```
 
+replaced by cuvsKMeansPredict_v2.
+
 **Parameters**
 
 | Name | Direction | Type | Description |
@@ -174,15 +287,50 @@ double* inertia);
 
 **Returns**
 
-[`cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
+[`CUVS_EXPORT cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
+
+<a id="cuvskmeanspredict-v2"></a>
+### cuvsKMeansPredict_v2
+
+Predict the closest cluster each sample in X belongs to (v2 params layout).
+
+```c
+CUVS_EXPORT cuvsError_t cuvsKMeansPredict_v2(cuvsResources_t res,
+cuvsKMeansParams_v2_t params,
+DLManagedTensor* X,
+DLManagedTensor* sample_weight,
+DLManagedTensor* centroids,
+DLManagedTensor* labels,
+bool normalize_weight,
+double* inertia);
+```
+
+Mirrors cuvsKMeansPredict but takes cuvsKMeansParams_v2_t. Will become the unsuffixed cuvsKMeansPredict in cuVS 26.08.
+
+**Parameters**
+
+| Name | Direction | Type | Description |
+| --- | --- | --- | --- |
+| `res` | in | [`cuvsResources_t`](/api-reference/c-api-core-c-api#cuvsresources-t) | opaque C handle |
+| `params` | in | [`cuvsKMeansParams_v2_t`](/api-reference/c-api-cluster-kmeans#cuvskmeansparams-v2) | Parameters for KMeans model (v2 layout). |
+| `X` | in | `DLManagedTensor*` | New data to predict. [dim = n_samples x n_features] |
+| `sample_weight` | in | `DLManagedTensor*` | Optional weights for each observation in X. [len = n_samples] |
+| `centroids` | in | `DLManagedTensor*` | Cluster centroids. The data must be in row-major format. [dim = n_clusters x n_features] |
+| `labels` | out | `DLManagedTensor*` | Index of the cluster each sample in X belongs to. [len = n_samples] |
+| `normalize_weight` | in | `bool` | True if the weights should be normalized |
+| `inertia` | out | `double*` | Sum of squared distances of samples to their closest cluster center. |
+
+**Returns**
+
+[`CUVS_EXPORT cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
 
 <a id="cuvskmeansclustercost"></a>
 ### cuvsKMeansClusterCost
 
 Compute cluster cost
 
 ```c
-cuvsError_t cuvsKMeansClusterCost(cuvsResources_t res,
+CUVS_EXPORT cuvsError_t cuvsKMeansClusterCost(cuvsResources_t res,
 DLManagedTensor* X,
 DLManagedTensor* centroids,
 double* cost);
@@ -199,4 +347,4 @@ double* cost);
 
 **Returns**
 
-[`cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)
+[`CUVS_EXPORT cuvsError_t`](/api-reference/c-api-core-c-api#cuvserror-t)