website/scripts
Qiming Teng 95257a2edd Improve the linkchecker script
The linkchecker script is not working the same way as the `scripts/lsync.sh`.

- The path must start with '/docs'. This is not implied in any way.
- The language can be deduced if user provides a full path to a markdown
  file, e.g. `content/en/docs/concepts/security/controlling-access.md`.
- The path parameter could use a positional argument for ease of use.

This PR improves the user experience for the tool.
2022-05-03 18:56:34 +08:00
..
releng releases: Update script reference and front matter for release.md 2021-05-19 17:30:51 -04:00
.spelling_failures support ignoring files when check the spelling 2021-08-11 15:42:14 +08:00
README.md Add script for detecting bad characters. 2021-04-08 09:36:11 +00:00
check-ctrlcode.py Add script for detecting bad characters. 2021-04-08 09:36:11 +00:00
check-headers-file.sh Add _headers file checking logic (#9879) 2018-08-21 12:51:15 -07:00
diff_l10n_branches.py Enhance diff_l10n_branches script output 2020-12-30 10:14:48 +09:00
find_pr.py Added error handling to find_pr script (#14110) 2019-05-08 14:12:35 -07:00
get-changed-urls.sh Add script to print URLs updated by target PR 2021-07-19 10:10:22 +09:00
hash-files.sh automatically derive image version 2020-07-22 11:04:09 -07:00
linkchecker.py Improve the linkchecker script 2022-05-03 18:56:34 +08:00
lsync.sh Fix logic in lsync script 2020-11-05 11:36:11 +08:00
replace-capture.sh Autodetect content directory for capture cleanup tool 2020-06-29 18:26:56 +01:00
requirements.txt Add readme for scripts (#14913) 2019-07-08 15:24:00 -07:00
test_examples.sh Check all commits in branch for examples (to trigger testing) (#14243) 2019-06-10 19:54:16 -07:00
upstream_changes.py Add a script to check differences between translated and English version (#14731) 2019-06-12 06:49:33 -07:00
verify-spelling.sh Merge branch 'main' of github.com:swiftslee/k8s-website 2022-02-21 20:17:45 +08:00

README.md

Development scripts for Kubernetes documentation

Script Description
find_pr.py Find what GitHub pull requests touch a given file.
upstream_changes.py Find what changes occurred between two versions.
test_examples.sh This script tests whether a change affects example files bundled in the website.
check-headers-file.sh This script checks the headers if you are in a production environment.
diff_l10n_branches.py This script generates a report of outdated contents in content/<l10n-lang> directory by comparing two l10n team milestone branches.
hash-files.sh This script emits as hash for the files listed in $@
linkchecker.py This a link checker for Kubernetes documentation website.
lsync.sh This script checks if the English version of a page has changed since a localized page has been committed.
replace-capture.sh This script sets K8S_WEBSITE in your env to your docs website root or rely on this script to determine it automatically
check-ctrlcode.py This script finds control-code(0x00-0x1f) in text files.

Requirements

Some of those scripts have external requirements. You can install them with the following commands:

python3 -m pip install -r requirements.txt

find_pr.py

$ ./find_pr.py --help
Usage: find_pr.py [OPTIONS] PATH

  Find what GitHub pull requests touch a given file.

  ex: ./find_pr.py --tags "language/fr" "content/fr/_index.html"

Options:
  --tags TEXT          Tags of PullRequest (Can be passed multiple times)
  --token TEXT         GitHub API token. (Default env variable GITHUB_TOKEN)
  --last-n-pr INTEGER  Last n-th PullRequests
  --help               Show this message and exit.

upstream_changes.py

$ ./upstream_changes.py --help
Usage: upstream_changes.py [OPTIONS] PATH

  Find what changes occurred between two versions

  ex: ./upstream_changes.py content/fr/_index.html

Options:
  --reference TEXT  Specify the reference version of the file. Default to the
                    English one.
  --git-path TEXT   Specify git path
  --help            Show this message and exit.

test_examples.sh

This script tests whether a change affects example files bundled in the website.

To install the dependencies:

$ ./scripts/test_examples.sh install

To run the examples:

$ ./scripts/test_examples.sh run

check-headers-file.sh

This script checks the headers if you are in a production environment.

./scripts/check-headers-file.sh

diff_l10n_branches.py

$ scripts/diff_l10n_branches.py --help
Usage: diff_l10n_branches.py [OPTIONS] L10N_LANG L_COMMIT R_COMMIT

  This script generates a report of outdated contents in `content/<l10n-
  lang>` directory by comparing two l10n team milestone branches.

  L10n team owners can open a GitHub issue with the report generated by this
  script when they start a new team milestone.

  ex: `scripts/diff_l10n_branches.py ko dev-1.15-ko.3 dev-1.15-ko.4`

Options:
  --src-lang TEXT  Source language
  --help           Show this message and exit.

hash-files.sh

This script emits as hash for the files listed in $@.

$ ./scripts/hash-files.sh

linkchecker.py

This a link checker for Kubernetes documentation website.

  • We cover the following cases for the language you provide via -l, which defaults to 'en'.
  • If the language specified is not English (en), we check if you are actually using the localized links. For example, if you specify zh as the language, and for link target /docs/foo/bar, we check if the English version exists AND if the Chinese version exists as well. A checking record is produced if the link can use the localized version.

Usage: linkchecker.py -h

Cases handled:

- [foo](#bar)                         : ignored currently
+ [foo](http://bar)                   : insecure links to external site
+ [foo](https://k8s.io/website/...)   : hardcoded site domain name

+ [foo](/<lang>/docs/bar/...)  : where <lang> is not 'en'
  + /<lang>/docs/bar           : contains shortcode, so ignore, or
  + /<lang>/docs/bar           : is a image link (ignore currently), or
  + /<lang>/docs/bar           : points to shared (non-localized) page, or
  + /<lang>/docs/bar.md        : exists for current lang, or
  + /<lang>/docs/bar/_index.md : exists for current lang, or
  + /<lang>/docs/bar/          : is a redirect entry, or
  + /<lang>/docs/bar           : is something we don't understand, then ERR

+ [foo](/docs/bar/...)
  + /docs/bar                : contains shortcode, so ignore, or
  + /docs/bar                : is a image link (ignore currently), or
  + /docs/bar                : points to a shared (non-localized) page, or
  + /docs/bar.md             : exists for current lang, or
  + /docs/bar/_index.md      : exists for current lang, or
  + /docs/bar                : is a redirect entry, or
  + /docs/bar                : is something we don't understand

lsync.sh

This script checks if the English version of some localized contents have changed since a localized version has been committed.

The following example checks a single file:

./scripts/lsync.sh content/zh/docs/concepts/_index.md

The following command checks a subdirectory:

./scripts/lsync.sh content/zh/docs/concepts/

check-ctrlcode.py

This script finds control-code(0x00-0x1f) in text files. It will display illegal character in browser.

Usage: ./check-ctrlcode.py <dir> <ext>

  <dir>    Specify the directory to check.
  <ext>    Specify the extension to check.

For example, we can execute as following.

  ./check-ctrlcode.py ../content/en/ .md

The output is following format.

  "{0} <L{1}:{2}:{3}>: {4}"

  {0} : The path of file that a control-code exists.
  {1} : The line number that a control-code exists.
  {2} : The column number that a control-code exists.
  {3} : The found control-code.
  {4} : The one-line strings in the file.