ska-sw-integration-testing-badger

This repository contains the deployments and tests for software integration testing of the SKA Mid and SKA Low telescopes.

Repository Content Overview

This repository includes:

Kubernetes charts for deploying SKA Mid and SKA Low telescopes for software-only integration testing.
- Location: charts/ directory
A standard-ish Makefile to manage deployments and tests, supporting the differences between SKA Mid and SKA Low.
- Location: Makefile for the common structure, and Makefile-mid.mk and Makefile-low.mk for telescope-specific details.
A standard-ish GitLab CI/CD pipeline to manage deployments and tests.
- Location: .gitlab-ci.yml as the entry point, .gitlab/ for specific, slightly customised jobs for the two telescopes.
A set of system-level tests, written using pytest and Gherkin syntax (via pytest-bdd), to validate the deployments.
- Location: tests/ directory.
Automatic publishing of test results to Jira Xray using the ska-ser-xray library.
Automatic publishing of test results to a MariaDB database for comprehensive test data collection, analysis, and reporting across multiple pipeline runs and repositories.
- Location: .gitlab/publish-db.yml defines common MariaDB setup and base jobs that are extended by telescope-specific publishing jobs in .gitlab/publish-db-low.yml and .gitlab/publish-db-mid.yml.
- Database credentials are securely fetched from HashiCorp Vault using GitLab CI tokens.

NOTE: When we say “standard-ish” we mean standard as per the current SKA DevOps practices at the time of setting up this repository in PI28, inspired by repositories such as:

ska-cicd-training-pipeline-machinery

ska-tango-examples

Below is a more detailed overview of each of these components.

Charts Overview

Currently, there are two main charts: one for SKA Mid and one for SKA Low:

charts/ska-mid-sw-tests/
charts/ska-low-sw-tests/

Both charts essentially consist of the following set of sub-charts:

Common Tango base charts and related utilities, such as:
- ska-tango-base
- ska-tango-util
- ska-tango-tangogql
- ska-ser-skuid
- Taranta-related charts
- Archiver-related charts
Common or similar SKA Mid and Low sub-system charts that include:
- TMC
- SDP
- CSP.LMC
- CBF
Mid and Low specific sub-system charts that include:
- For Mid: Dish LMCs
- For Low: MCCS

Generally speaking, for Mid we are currently working with just one subarray, while for Low we are working with two subarrays. For Mid, at present, four dishes are used. This may change in future as we expand the tests and the deployed SUT.

These charts and value files are partially inherited from the repositories:

To make them work in this deployment and to have initial green pipelines as a baseline, we had to make a few modifications and some temporary patches, such as disabling certain non-essential faulty components. Also, we cannot yet claim to have full ownership or understanding of all the configuration options and settings used in these charts, so further work is required:

Determine exactly what we consider the SUT (System Under Test) for our software integration testing purposes.
Identify which extra components and services are needed for testing purposes (even if they are not strictly part of the SUT).
Understand, clean up, and rationalise the chart and value file configurations to reflect the above two points.
Clearly document the choices made and the reasoning behind them.

A protocol to control and manage changes to these charts and values also needs to be established. We have not yet decided whether to version the charts in this repository or keep them in separate repositories, as both approaches have advantages and disadvantages. Separate repositories would help to isolate changes to the charts and, ideally, to validate different versions of the SUT against different versions of the tests. However, they also introduce the practical complexity of managing multiple repositories, ambiguity about where to place particular configurations, and difficulty in debugging problems that require changing both the tests and the charts at the same time.

Makefile Structure for Mid and Low

The standard Makefile is the entry point for the deploy and test commands. It contains all the telescope-agnostic logic and structure, while the telescope-specific details are delegated to the Makefile-mid.mk and Makefile-low.mk files.

The standard-ish part is essentially what we import from:

.make/k8s.mk
.make/helm.mk
.make/python.mk
.make/raw.mk
.make/base.mk
.make/release.mk

OCI image building is necessary only for repositories that have source code with new Tango devices or subsystems to deploy and build images from. This repository does not have any such source code (it only has already built charts and tests), so we do not need support for OCI image building. Therefore, we do not include .make/oci.mk (nor do we have a Dockerfile or any image build logic). Instead, we build a Helm chart, which is then deployed to Kubernetes but does not require building new container images.

The most relevant customisations are the following:

Through a TELESCOPE variable, we select the target telescope, which can be either SKA-low or SKA-mid (defaulting to SKA-low). According to this variable’s value, we include the relevant telescope-specific file (Makefile-low.mk or Makefile-mid.mk).

You can set this variable in one of the following ways:
- From the command line when calling make, e.g.,
```
make ... TELESCOPE=SKA-mid
```
- By exporting it in your shell environment before calling make, e.g.,
```
export TELESCOPE=SKA-mid
make ...
```
- By setting it in the GitLab CI/CD pipeline jobs (see below for details)
This will activate all the necessary customisations for the selected telescope (test markers, chart to deploy, namespace to use, etc.). In the pipeline jobs, we also override the namespace to use (to have a unique namespace per pipeline run or to have more control in persistent deployment namespaces; see below for details).

NOTE: This variable only affects deployment and test commands. Other commands that are not telescope-specific (e.g., code formatting, linting, docs building, helm linting and building, etc.) are not affected by this variable, so you can run them as usual without setting it.
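The selection logic described above can be mirrored in a few lines of Python. This is only an illustrative sketch of the mapping, not the Makefile's actual implementation: the `TelescopeSettings` class and `select_settings` function are hypothetical names, and the chart names are assumed to match the directories listed in the Charts Overview.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class TelescopeSettings:
    """Hypothetical container for the telescope-specific choices."""

    makefile: str  # telescope-specific Makefile fragment to include
    chart: str     # chart directory name under charts/ (assumed)
    mark: str      # pytest marker used to select the tests


# Mapping from the TELESCOPE value to the settings named in this README.
SETTINGS = {
    "SKA-low": TelescopeSettings("Makefile-low.mk", "ska-low-sw-tests", "low"),
    "SKA-mid": TelescopeSettings("Makefile-mid.mk", "ska-mid-sw-tests", "mid"),
}


def select_settings(env=None):
    """Return the settings for the TELESCOPE found in `env` (default SKA-low)."""
    env = os.environ if env is None else env
    telescope = env.get("TELESCOPE", "SKA-low")
    try:
        return SETTINGS[telescope]
    except KeyError:
        raise ValueError(f"Unsupported TELESCOPE value: {telescope!r}") from None
```

Calling `select_settings({"TELESCOPE": "SKA-mid"})` returns the Mid settings, while an empty environment falls back to the Low ones, matching the default described above.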
The K8S_CHARTS are both ska-low-sw-tests and ska-mid-sw-tests, but the K8S_CHART and HELM_CHART actually used are set according to the selected telescope in the relevant Makefile-*.mk file.
In general, KUBE_NAMESPACE and HELM_CHART are set by the specific Makefile-*.mk file.
Since SDP needs a separate namespace to execute the scripts, we dynamically build a KUBE_NAMESPACE_SDP namespace and create it through some hooks that are executed when make k8s-install is called (see k8s-pre-install-chart, etc.)
In the K8S_CHART_PARAMS we inject some value customisations, such as:
- Cluster domain override
- Tango host
- Namespace for SDP
- Kafka host for SDP (generated dynamically in Makefile-*.mk)
- Taranta params
- Archiver params (with a path and a DB name defined in Makefile-*.mk)
- Extra params for Mid and Low respectively
- (Some others, see the files for details)
Regarding (integration) test runs, there are a few relevant customisations worth mentioning:
- We set a standard K8S_TEST_IMAGE_TO_TEST (since we do not build images here)
- K8S_TEST_RUNNER depends on the CI_JOB_ID
  - Not sure if this is strictly needed
- Before executing the tests, we export a requirements.txt file from the pyproject.toml (see test-requirements target)
  - If we do not do this, we do not see all the dependencies installed
- We have to set a few variables that indicate that we are not running any pairwise tests (see *_SIMULATION_ENABLED variables)
  - This is inherited from the old test harness; we have to keep this until we refactor the tests to not require the previous test harness infrastructure
- Through PYTHON_VARS_BEFORE_PYTEST we need to expose a few environment variables to the test execution environment, such as:
  - PYTHONPATH
  - KUBE_NAMESPACE and KUBE_NAMESPACE_SDP
  - TANGO_HOST
  - The *_SIMULATION_ENABLED variables mentioned above
- Through PYTHON_VARS_AFTER_PYTEST we:
  - Execute only the tests tagged with MARK (where MARK is set to mid or to low in the telescope-specific Makefile-*.mk files, to distinguish between Mid and Low tests)
  - Exit at the first failure (-x flag)
  - (Temporarily) skip tests with two subarrays, since they still need to be updated to work with the newer SUT component versions
- MARK is also used to set the JSON reports file path and the XRAY config files for Jira Xray publishing (see below for details).
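As an illustration of how test code might consume the variables exposed through PYTHON_VARS_BEFORE_PYTEST, the sketch below reads them from the environment and fails early if any is missing. The `DeploymentEnv` class and `read_deployment_env` function are hypothetical helpers, not part of this repository; only the variable names come from the list above.

```python
import os
from dataclasses import dataclass

# Variables that the Makefile is described as exporting before pytest runs.
REQUIRED_VARS = ("KUBE_NAMESPACE", "KUBE_NAMESPACE_SDP", "TANGO_HOST")


@dataclass(frozen=True)
class DeploymentEnv:
    """Hypothetical view of the deployment coordinates used by the tests."""

    kube_namespace: str
    kube_namespace_sdp: str
    tango_host: str


def read_deployment_env(env=None):
    """Build a DeploymentEnv from `env`, raising if a variable is unset."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return DeploymentEnv(
        kube_namespace=env["KUBE_NAMESPACE"],
        kube_namespace_sdp=env["KUBE_NAMESPACE_SDP"],
        tango_host=env["TANGO_HOST"],
    )
```

Failing with an explicit message when a variable is missing tends to be easier to diagnose than a connection error raised later by a test step.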

IMPORTANT NOTE: At present, neither deployment is intended to run locally with Minikube, since the resource requirements are too high (see below). Therefore, you will most likely be unable to run make k8s-install or the tests (make k8s-test) locally. You may still be able to do the following locally:

Format the code (make python-format)

Lint the code (make python-lint)

Lint the charts (make helm-lint)

Build the Sphinx documentation (make docs-build html, as soon as we set up the ReadTheDocs standard documentation)

(We will add some other meaningful local targets in the future)

For now, most of the useful processes run in the GitLab CI/CD pipelines.

Tests

The tests in this repository are system-level integration tests that interact with the telescope through TMC and verify the emitted events, mainly from TMC, but also from other high-level components, such as:

CSP.LMC (controller and subarrays)
SDP (controller and subarrays)
MCCS (for Low only, controller and subarrays)
Dish LMCs (for Mid only)

The verifications mainly involve state changes, on the Telescope State, on the Subarray Observation State, and on other relevant Tango attributes. Generally speaking, we do not go too deep into the subsystems’ details.

All the tests are located in the tests/ directory, and are written using pytest and Gherkin syntax via the pytest-bdd plugin.

Old tests

This set of tests is inherited from the repository this one was forked from (ska-sw-integration-testing). We are gradually refactoring and improving them. See the next section.

At present, Mid and Low tests are distinct and:

They reside respectively in the tests/mid/ and tests/low/ directories,
They are selected through the MARK variable in the respective Makefile-*.mk files (mid for Mid and low for Low), which permits you to run only the tests tagged with the relevant mark.

NOTE: As a consequence, it is important that all new tests that are added are labelled with @pytest.mark.mid or @pytest.mark.low accordingly, otherwise they will not be executed!
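A minimal sketch of what this labelling looks like is shown below. The test names are hypothetical, and the sketch assumes that the selection by MARK is done with pytest's standard `-m` option. The `marker_names` helper is only there to show which markers a test function carries.

```python
import pytest


@pytest.mark.low
def test_low_example():
    """Carries the `low` marker, so a run filtered with `-m low` selects it."""


@pytest.mark.mid
def test_mid_example():
    """Carries the `mid` marker, so a run filtered with `-m mid` selects it."""


def test_unmarked_example():
    """Carries no telescope marker, so neither filtered run selects it."""


def marker_names(test_function):
    """Return the names of the pytest markers applied to a test function."""
    return {mark.name for mark in getattr(test_function, "pytestmark", [])}
```

Here `marker_names(test_unmarked_example)` is an empty set, which is exactly the situation the note above warns about.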

Each of the two sets of folders at present contains:

A features/ sub-directory with the Gherkin feature files,
A data/ sub-directory with any test data needed (mainly JSON files that are used as command parameters),
A tests/ sub-directory with the step implementations (likely to be reorganised in the future),
A resources/ sub-directory with test harness pieces (mainly related to the old test harness, which will likely be removed in the future)
A conftest.py file with common fixtures and hooks (NOTE: Be careful, there may be other conftest.py files in sub-directories that may override or add to the main one)
A common pytest.ini file with common configuration for pytest and pytest-bdd.

NOTE: The tests/ and features/ directories still contain a leftover system_level_tests/ sub-directory structure, which comes from when we also had pairwise tests. This will be cleaned up in the future.

Even if the implementation of the tests is different for Mid and Low, the orchestration and high-level logic are quite similar:

all the tests assume the telescope to be in a certain fixed initial state (telescope state OFF, subarrays EMPTY)
in the given step, they execute a sequence of commands through TMC to prepare the system in the desired state for the next step (generally the telescope is turned on, and then the subarray operational flow is executed)
the when step executes a command (without waiting for completion)
the then steps both wait for the command to complete and verify the state changes using event-based assertions (generally implemented through ska-tango-testing TangoEventTracer or similar utilities)
at the end of each test execution, a tear down procedure is executed to bring the telescope back to the initial state (telescope state OFF, subarrays EMPTY)
the interaction between the tests and the SUT passes through a test harness that essentially serves the following main purposes:
- represent the SUT and its structure and provide access to its components in a structured way
- encapsulate some orchestration logic (e.g., waiting for commands to complete, moving the telescope to a certain state, tear down procedure, etc.)
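The flow above can be sketched with a self-contained simulation. Everything here is a stand-in: `FakeSubarray` imitates a device whose command returns immediately and changes state later, and `wait_for_state` is a hypothetical, much simplified substitute for an event-based assertion. It is not the ska-tango-testing API, and the state names are a simplified assumption.

```python
import queue
import threading
import time


class FakeSubarray:
    """Stand-in device: commands return at once, state changes arrive later."""

    def __init__(self):
        self.obs_state = "EMPTY"
        self.events = queue.Queue()  # stream of emitted state-change events

    def _set_state(self, new_state):
        self.obs_state = new_state
        self.events.put(new_state)

    def assign_resources(self, delay=0.05):
        # Long-running command: completes in a background thread.
        def work():
            self._set_state("RESOURCING")
            time.sleep(delay)
            self._set_state("IDLE")

        threading.Thread(target=work, daemon=True).start()

    def release_all_resources(self):
        self._set_state("EMPTY")


def wait_for_state(device, expected, timeout=2.0):
    """Consume events until `expected` is seen; fail once the timeout expires."""
    deadline = time.monotonic() + timeout
    seen = []
    while True:
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            raise AssertionError(f"{expected} not reached; events seen: {seen}")
        try:
            event = device.events.get(timeout=remaining)
        except queue.Empty:
            continue  # the next iteration detects the expired deadline
        seen.append(event)
        if event == expected:
            return seen


def run_scenario():
    subarray = FakeSubarray()
    assert subarray.obs_state == "EMPTY"  # given: assumed fixed initial state
    try:
        subarray.assign_resources()  # when: send the command without waiting
        return wait_for_state(subarray, "IDLE")  # then: wait and verify via events
    finally:
        subarray.release_all_resources()  # tear down: back to the initial state
```

Putting the tear down in a `finally` block means the system is brought back to the initial state even when the assertion in the then step fails.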

A few further notes about these tests:

Mid tests at present depend on ska-integration-test-harness, in particular on the so-called “Monolithic Harness” that is hardcoded around TMC. Having it in a separate repository is not ideal, and we plan to move towards having our own test harness directly in this repository in the future, as well as making a few updates to simplify some aspects, make it more flexible and support new test scenarios (e.g., multiple subarrays).
For Mid tests, there may also still be some leftover code and fixtures for the old test harness that supported pairwise tests. This will be cleaned up in the future.
Low tests at present have their own test harness code directly in this repository, in the tests/low/resources/ directory. In the future, we plan to replace it with the same one we are using for Mid, once we have adapted it to stay in this repository, be more flexible and support more test scenarios (e.g., multiple subarrays).

New tests

The old tests are slowly being rewritten with the purpose of improving at the same time:

the robustness of the tests themselves (we want more robust tests that require fewer implicit preconditions on the SUT state and that are able to “set themselves up” instead of relying on some fixed initial state)
the execution time and the performance of the tests (we want faster tests, with an optimised orchestration flow and, when possible and useful, reuse of the state left by previous test runs instead of always starting from the same initial state)
the readability and maintainability of the tests (we want more readable, less duplicated, more rationalised tests that leverage step reusability and modular generic components)
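One possible way for a test to “set itself up” from whatever state a previous run left behind is to compute the shortest sequence of commands from the current state to the desired one. The sketch below does this with a breadth-first search over a simplified, assumed subset of the subarray observation-state transitions. The `TRANSITIONS` table and `plan_commands` function are illustrative and do not describe how the harness in this repository actually works.

```python
from collections import deque

# Simplified, assumed subset of observation-state transitions:
# state -> {command: resulting state}
TRANSITIONS = {
    "EMPTY": {"AssignResources": "IDLE"},
    "IDLE": {"Configure": "READY", "ReleaseAllResources": "EMPTY"},
    "READY": {"Scan": "SCANNING", "End": "IDLE"},
    "SCANNING": {"EndScan": "READY"},
}


def plan_commands(current, target):
    """Return the shortest command sequence from `current` to `target` (BFS)."""
    frontier = deque([(current, [])])
    visited = {current}
    while frontier:
        state, commands = frontier.popleft()
        if state == target:
            return commands
        for command, next_state in TRANSITIONS.get(state, {}).items():
            if next_state not in visited:
                visited.add(next_state)
                frontier.append((next_state, commands + [command]))
    raise ValueError(f"No command sequence from {current} to {target}")
```

With this table, a subarray already in READY needs no commands to reach READY, whereas a test that always restarts from EMPTY would have to issue AssignResources and Configure again.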

This re-engineering process is based on the approach of the Test Harness as a Platform, where the tests themselves rely on two layers of modular components:

generic core components: those exposed by the Python library ska-integration-test-harness, which are generic and reusable across different SKA testing projects, and expected to remain relatively stable over time
custom components built on top of the generic ones to serve specific testing needs of this project (hosted in src/ska_sw_integration_testing_badger/ith) that evolve dynamically as the tests and needs evolve

The tests themselves are now hosted in tests/low_new/ and tests/mid_new/ directories, and they are executed immediately after the old ones in the same test job. Gradually we will skip and then remove the old tests that are replaced by the new ones.

Some important principles about the new tests and the related test harness customisation layer are the following.