| Concept | Description | Example |
|---|---|---|
| Algorithm insufficiency | Limitation of the intended algorithm that causes unsafe behaviour even when the system operates fault-free | AEB object classifier cannot distinguish a bicycle from a road sign |
| False negative (miss) | Algorithm fails to detect a real hazard object | Pedestrian in low-light not classified above confidence threshold |
| False positive (false alarm) | Algorithm classifies non-hazard as hazard | Bridge shadow classified as stopped vehicle; unnecessary braking |
| Performance boundary | Specific input condition at which the algorithm transitions from safe to unsafe output | Object classification accuracy drops below 70% at illumination under 10 lux |
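The performance boundary concept in the table above lends itself directly to measurement. As a hedged illustration, the sketch below sweeps binned illumination levels and reports the first bin whose recall falls below an assumed threshold; the bin edges, threshold, and sample data are invented for illustration, not taken from a real AEB dataset.

def find_performance_boundary(samples, bin_edges, threshold=0.70):
    """samples: (lux, detected) pairs; returns the first bin below threshold."""
    for lo, hi in zip(bin_edges, bin_edges[1:]):
        detections = [d for lux, d in samples if lo <= lux < hi]
        if not detections:
            continue  # no evidence in this bin; flag as untested in practice
        recall = sum(detections) / len(detections)
        if recall < threshold:
            return (lo, hi, round(recall, 2))
    return None  # no boundary found inside the tested envelope

# Hypothetical data: detection degrades as illumination drops
samples = [(5, 0), (5, 0), (5, 1), (15, 1), (15, 1), (15, 0),
           (50, 1), (50, 1), (80, 1)]
print(find_performance_boundary(samples, bin_edges=[0, 10, 20, 100]))
# -> (0, 10, 0.33): recall collapses below 10 lux, as in the table row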
Algorithm Insufficiency: Definition and Structure
Algorithm Insufficiency Types
| Type | Description | Automotive Example |
|---|---|---|
| Coverage gap | Algorithm not trained/designed for certain scenario class | AEB trained on European roads; deployed in Japan with different road signs |
| Distribution shift | Real-world input distribution differs from training distribution | Nighttime data underrepresented in training set |
| Boundary condition | Algorithm degrades near the edge of its operating envelope | Object detector accuracy at maximum range (40 m) falls below the acceptable threshold |
| Interaction effect | Two individually-acceptable limitations combine to cause failure | Moderate rain + partial occlusion: individually tolerable, combined safety-critical |
| Overconfidence | Algorithm reports high confidence even when wrong | ML classifier outputs 0.95 confidence for misclassified object |
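The overconfidence row deserves particular scrutiny because downstream planners often consume confidence scores as if they were calibrated probabilities. The sketch below is a minimal reliability-diagram computation, comparing reported confidence with observed accuracy per confidence bin; the bin count and sample scores are invented for illustration.

def reliability_gaps(confidences, correct, n_bins=5):
    """Per confidence bin, report (bin floor, mean confidence - accuracy).
    A large positive gap means the model is overconfident in that bin."""
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        bins[min(int(conf * n_bins), n_bins - 1)].append((conf, ok))
    report = []
    for idx, bucket in enumerate(bins):
        if not bucket:
            continue
        mean_conf = sum(c for c, _ in bucket) / len(bucket)
        accuracy = sum(ok for _, ok in bucket) / len(bucket)
        report.append((idx / n_bins, round(mean_conf - accuracy, 2)))
    return report

# Hypothetical scores: the model reports ~0.95 confidence but is only
# 50% correct in the top bin: a textbook overconfidence signature
confs = [0.95, 0.97, 0.92, 0.96, 0.55, 0.45]
correct = [1, 0, 1, 0, 1, 0]
print(reliability_gaps(confs, correct))  # top-bin gap of 0.45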
ML-Specific Insufficiency Analysis
"""Analyse ML model insufficiencies for SOTIF."""
import numpy as np
def analyse_classifier_performance(predictions: list,
ground_truth: list,
conditions: list) -> dict:
"""
Analyse classifier performance by condition.
Identifies conditions where performance is insufficient.
"""
results = {}
unique_conditions = set(conditions)
for cond in unique_conditions:
mask = [i for i, c in enumerate(conditions) if c == cond]
preds = [predictions[i] for i in mask]
truth = [ground_truth[i] for i in mask]
tp = sum(1 for p, t in zip(preds, truth) if p == 1 and t == 1)
fn = sum(1 for p, t in zip(preds, truth) if p == 0 and t == 1)
fp = sum(1 for p, t in zip(preds, truth) if p == 1 and t == 0)
recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
precision = tp / (tp + fp) if (tp + fp) > 0 else 0.0
results[cond] = {
"recall": round(recall, 3),
"precision": round(precision, 3),
"n_samples": len(mask),
"sufficient": recall >= 0.95 # SOTIF target
}
return results
# Example: analyse AEB detector across weather conditions
conditions = (["daylight"] * 500 + ["light_rain"] * 200 +
              ["heavy_rain"] * 50 + ["fog"] * 30)
# Simulated ground truth: every frame contains a real hazard,
# so recall is the informative metric here
gt = [1] * 780
preds = ([1] * 495 + [0] * 5 +   # daylight:   recall 0.99
         [1] * 186 + [0] * 14 +  # light_rain: recall 0.93 (below target)
         [1] * 35 + [0] * 15 +   # heavy_rain: recall 0.70 FAIL
         [1] * 22 + [0] * 8)     # fog:        recall 0.73 FAIL
results = analyse_classifier_performance(preds, gt, conditions)
for cond, r in results.items():
    status = "OK" if r["sufficient"] else "INSUFFICIENT"
    print(f"{cond:15s}: recall={r['recall']:.2f} [{status}]")

Summary
Algorithm insufficiency identification is the most technically demanding SOTIF activity for ML-based perception systems because the insufficiency boundaries are not defined by a specification: they emerge from the data distribution and the training process. Condition-stratified performance analysis is the key technique: instead of reporting a single overall accuracy metric (which can hide poor performance on safety-critical subgroups), it breaks performance down by environmental condition, target type, and range. A classifier with roughly 95% overall recall but 70% recall in heavy rain, as in the example above, has an insufficiency that is invisible in the aggregate number but safety-critical in the SOTIF context.
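The same stratification extends naturally to combinations of factors, which is how interaction effects (see the types table above) are caught. Below is a minimal sketch that reuses analyse_classifier_performance by encoding each weather/range pair as a composite condition label; the labels and sample counts are hypothetical, chosen so that each factor looks acceptable in isolation while one combination fails.

# Hypothetical composite labels: weather x range bin, analysed together
combo_conditions = (["rain|near"] * 200 + ["rain|far"] * 20 +
                    ["clear|near"] * 200 + ["clear|far"] * 180)
combo_gt = [1] * 600
combo_preds = ([1] * 198 + [0] * 2 +   # rain|near:  recall 0.99
               [1] * 12 + [0] * 8 +    # rain|far:   recall 0.60 FAIL
               [1] * 199 + [0] * 1 +   # clear|near: recall 0.995
               [1] * 179 + [0] * 1)    # clear|far:  recall 0.994
# Marginals pass (rain: 210/220 ~ 0.95, far: 191/200 ~ 0.96),
# yet the rain|far combination is far below target
for cond, r in analyse_classifier_performance(
        combo_preds, combo_gt, combo_conditions).items():
    status = "OK" if r["sufficient"] else "INSUFFICIENT"
    print(f"{cond:12s}: recall={r['recall']:.2f} [{status}]")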
🔬 Deep Dive — Core Concepts Expanded
This section builds on the foundational concepts covered above with additional technical depth, edge cases, and configuration nuances that separate competent engineers from experts. When working on production ECU projects, the details covered here are the ones most commonly responsible for integration delays and late-phase defects.
Key principles to reinforce:
- Configuration over coding: In AUTOSAR and automotive middleware environments, correctness is largely determined by ARXML configuration, not application code. A correctly implemented algorithm can produce wrong results due to a single misconfigured parameter.
- Traceability as a first-class concern: Every configuration decision should be traceable to a requirement, safety goal, or architecture decision. Undocumented configuration choices are a common source of regression defects when ECUs are updated.
- Cross-module dependencies: In tightly integrated automotive software stacks, changing one module's configuration often requires corresponding updates in dependent modules. Always perform a dependency impact analysis before submitting configuration changes (a minimal sketch follows this list).
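As a minimal illustration of such a dependency impact analysis, the sketch below walks a hand-maintained dependency map and lists every module transitively affected by a configuration change. The module names and the map itself are hypothetical; a real project would derive the graph from the ARXML model rather than a hard-coded dict.

# Hypothetical map: module -> modules that directly depend on it
DEPENDENTS = {
    "CanIf": ["PduR", "CanNm"],
    "PduR": ["Com", "Dcm"],
    "Com": ["Rte"],
    "CanNm": ["ComM"],
}

def impacted_modules(changed):
    """Return every module transitively affected by a change to `changed`."""
    impacted, stack = set(), [changed]
    while stack:
        for dep in DEPENDENTS.get(stack.pop(), []):
            if dep not in impacted:
                impacted.add(dep)
                stack.append(dep)
    return impacted

print(sorted(impacted_modules("CanIf")))
# -> ['CanNm', 'Com', 'ComM', 'Dcm', 'PduR', 'Rte']: all need review and retest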
🏭 How This Topic Appears in Production Projects
- Project integration phase: The concepts covered in this lesson are most commonly encountered during ECU integration testing — when multiple software components from different teams are combined for the first time. Issues that were invisible in unit tests frequently surface at this stage.
- Supplier/OEM interface: This topic frequently appears in technical discussions between Tier-1 ECU suppliers and OEM system integrators. Engineers who can speak fluently about these details earn credibility and are often brought into critical design reviews.
- Automotive tool ecosystem: Vector CANoe/CANalyzer, dSPACE tools, and ETAS INCA are the standard tools used to validate and measure the correct behaviour of the systems described in this lesson. Familiarity with these tools alongside the conceptual knowledge dramatically accelerates debugging in real projects.
⚠️ Common Mistakes and How to Avoid Them
- Assuming default configuration is correct: Automotive software tools ship with default configurations that are designed to compile and link, not to meet project-specific requirements. Every configuration parameter needs to be consciously set. 'It compiled' is not the same as 'it is correctly configured'.
- Skipping documentation of configuration rationale: In a 3-year ECU project with team turnover, undocumented configuration choices become tribal knowledge that disappears when engineers leave. Document why a parameter is set to a specific value, not just what it is set to.
- Testing only the happy path: Automotive ECUs must behave correctly under fault conditions, voltage variations, and communication errors. Always test the error handling paths as rigorously as the nominal operation; many production escapes originate in untested error branches (see the sketch after this list).
- Version mismatches between teams: In a multi-team project, the BSW team, SWC team, and system integration team may use different versions of the same ARXML file. Version management of all ARXML files in a shared repository is mandatory, not optional.
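To make the happy-path point from the list above concrete, here is a hedged sketch of a hypothetical raw-signal conversion helper exercised under nominal, timeout, and implausible-value conditions; the function, scaling factor, and valid range are invented for illustration.

def read_wheel_speed(raw):
    """Hypothetical helper: convert a raw CAN value with explicit fault handling."""
    if raw is None:                 # signal timeout on the bus
        raise TimeoutError("wheel speed signal missing")
    if not 0 <= raw <= 0x3FFF:      # implausible value (e.g. E2E check failed)
        raise ValueError(f"implausible raw value {raw}")
    return raw * 0.01               # scaling per a hypothetical DBC definition

# Exercise the error branches as deliberately as the nominal path
for raw in (1234, None, 0x7FFF):
    try:
        print(f"raw={raw}: {read_wheel_speed(raw):.2f} km/h")
    except (TimeoutError, ValueError) as exc:
        print(f"raw={raw}: handled fault -> {exc}")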
📊 Industry Note
Engineers who master both the theoretical concepts and the practical toolchain skills covered in this course are among the most sought-after professionals in the automotive software industry. The combination of AUTOSAR standards knowledge, safety engineering understanding, and hands-on configuration experience commands premium salaries at OEMs and Tier-1 suppliers globally.