[codex] Check UDF skill markdown license headers #15129
Signed-off-by: WilliamK112 <164879897+WilliamK112@users.noreply.github.com>
*.xml,
*.properties,
*.scala,
skills/udf-*/*.md,
Subdirectory markdown files are not matched by this pattern
skills/udf-*/*.md only matches files exactly one level deep inside each udf-* directory, because a bare * does not cross path separators. Four existing markdown files (skills/udf-convert-to-cuda/references/JNI_CUDA_GUIDE.md, NATIVE_BUILD_ENV.md, skills/udf-convert-to-cudf/references/RAPIDS_UDF.md, skills/udf-optimize-cudf/references/OPTIMIZATION_PATTERNS.md) sit deeper than that. All four already have headers, so there is no immediate CI failure, but any future markdown file added under references/ or another nested directory would silently escape the check. If the intent is to cover the full tree, skills/udf-*/**/*.md would do so.
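To make the depth difference concrete, here is a minimal Python sketch. It assumes the semantics described above (a bare * stays within one path segment, and **/ spans zero or more directories). The helper names glob_to_regex and matches are hypothetical, and the real license-header-check action may match patterns differently, so treat this as an illustration rather than a reproduction of its behaviour.

```python
import re


def glob_to_regex(pattern: str) -> re.Pattern:
    """Translate a path glob into a regex.

    Assumed semantics (not taken from the action's source):
    '*' matches within a single path segment, '**/' matches zero or
    more whole directories.
    """
    parts = []
    i = 0
    while i < len(pattern):
        if pattern.startswith("**/", i):
            parts.append("(?:[^/]+/)*")  # zero or more directory levels
            i += 3
        elif pattern[i] == "*":
            parts.append("[^/]*")  # never crosses a '/'
            i += 1
        else:
            parts.append(re.escape(pattern[i]))
            i += 1
    return re.compile("".join(parts))


def matches(pattern: str, path: str) -> bool:
    """Return True if the whole path matches the glob."""
    return glob_to_regex(pattern).fullmatch(path) is not None


if __name__ == "__main__":
    shallow = "skills/udf-*/*.md"
    deep = "skills/udf-*/**/*.md"
    for path in (
        "skills/udf-benchmark/SKILL.md",
        "skills/udf-convert-to-cuda/references/JNI_CUDA_GUIDE.md",
        "skills/README.md",
    ):
        print(path, matches(shallow, path), matches(deep, path))
```

Under these assumptions the recursive pattern still excludes skills/README.md, because the udf- prefix is required in the second path segment.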
SKILL.md front-matter copyright may not satisfy the license-header action
The SKILL.md files (e.g. skills/udf-benchmark/SKILL.md) embed copyright only in YAML front matter under metadata.spdx-file-copyright-text, not in a comment block. Every non-SKILL markdown file under skills/udf-* (e.g. CUDF_MICROBENCHMARKS.md) uses an <!-- SPDX-FileCopyrightText: … --> HTML comment. If NVIDIA/spark-rapids-common/license-header-check looks for a comment-style header (as it does for .yml, .scala, .py, etc.), all seven SKILL.md files will fail the check the moment any of them is touched in a PR, potentially blocking unrelated work. Has the action been confirmed (locally or via its source) to accept the metadata.spdx-file-copyright-text YAML front-matter field in SKILL.md files as a valid copyright header, or does it only recognise comment-block formats?
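One way to see which files carry which style before relying on the action is a small local classifier. The Python sketch below is a rough aid only: it does not reproduce the logic of license-header-check, the function name copyright_style is hypothetical, and the front-matter detection is a simple text search rather than a YAML parse.

```python
import re

# An HTML comment that contains an SPDX-FileCopyrightText tag.
HTML_COMMENT = re.compile(r"<!--.*?SPDX-FileCopyrightText:.*?-->", re.DOTALL)
# Leading YAML front matter delimited by '---' lines.
FRONT_MATTER = re.compile(r"\A---\n(.*?)\n---\n", re.DOTALL)


def copyright_style(text: str) -> str:
    """Classify where a markdown file declares its copyright.

    Returns "html-comment", "front-matter", or "none".
    """
    if HTML_COMMENT.search(text):
        return "html-comment"
    front = FRONT_MATTER.match(text)
    if front and "spdx-file-copyright-text:" in front.group(1):
        return "front-matter"
    return "none"


if __name__ == "__main__":
    # Hypothetical sample contents, not copied from the repository.
    skill = (
        "---\n"
        "name: example\n"
        "metadata:\n"
        "  spdx-file-copyright-text: Copyright (c) EXAMPLE OWNER\n"
        "---\n"
        "# Example skill\n"
    )
    reference = "<!-- SPDX-FileCopyrightText: Copyright (c) EXAMPLE OWNER -->\n# Notes\n"
    print(copyright_style(skill))
    print(copyright_style(reference))
```

Running it over the skills/udf-* tree would list the files whose copyright lives only in front matter, which are the ones at risk if the action accepts comment-style headers only.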
Fixes #15112.
Description
This adds skills/udf-*/*.md to the license-header workflow include patterns so changes to UDF skill markdown files are checked for a current-year NVIDIA copyright/license header. The pattern is scoped to UDF skill content and avoids broad *.md coverage, which would include repository docs that do not use skill front matter.

Validation:
- git diff --check
- .github/workflows/license-header-check.yml with Ruby YAML
- skills/udf-* markdown files
- skills/README.md and skills/docs/dev/VERSIONS.md are not matched by the new pattern

Checklists
Documentation
Testing
(Please provide the names of the existing tests in the PR description.)
Performance