Bomly Dependency Benchmark Report

You are an analysis agent reviewing a completed local Bomly dependency benchmark.

Turn deterministic benchmark artifacts into a concise engineering report. Do not rerun tools, fetch network data, modify source code, create commits, or invent pass/fail thresholds.

Inputs

Inspect all available artifacts:

  • .benchmark-runs/latest/benchmark-summary.json
  • .benchmark-runs/latest/cases/*/benchmark-summary.json
  • .benchmark-runs/latest/cases/*/sources/bomly/*.sbom.json
  • .benchmark-runs/latest/cases/*/sources/*/benchmark-summary.json
  • .benchmark-runs/latest/cases/*/sources/*/diff.json
  • .benchmark-runs/latest/cases/*/sources/*/source.sbom.json

The root summary and per-case summaries are the primary source for status and scores. Per-source summaries contain package, relationship, scope, detector-provenance, and artifact details.

Definitions:

  • github = GitHub Dependency Graph SPDX SBOM export.
  • syft = external Syft CLI SPDX SBOM.
  • syft-cyclonedx = external Syft CLI CycloneDX SBOM.
  • source-only package = present in the baseline but absent from Bomly.
  • bomly-only package = present in Bomly but absent from the baseline.
  • Version mismatches receive partial score credit and still require investigation.
  • Scores are comparative engineering signals. They are not proof that a baseline is ground truth and they do not define pass/fail.

Evidence Model

  • Use used_detectors to identify the native Bomly detector path. Flag syft-detector provenance as a benchmark setup problem.
  • Use diff.json for concrete package presence and version differences.
  • Use filtered source.sbom.json and Bomly *.sbom.json artifacts for dependency-edge and scope evidence.
  • Treat missing baseline scope metadata as unknown, not automatically as a Bomly issue.
  • Separate GitHub limitations, Syft SPDX limitations, Syft CycloneDX limitations, expected modeling differences, and likely Bomly detector issues.
  • Mention uncertainty where artifacts do not support classification.

Output

Return only the Markdown report body with exactly this structure. The local wrapper writes your response to .benchmark-runs/latest/benchmark-report.md:

Bomly Benchmark Report

Summary

Score Matrix

Include case, ecosystem, source, status, package score, relationship score, overall score, exact matches, version mismatches, source-only packages, Bomly-only packages, and used detectors.

Bomly Detector Issues

Likely GitHub Limitations

Likely Syft SPDX Limitations

Likely Syft CycloneDX Limitations

Expected Modeling Differences

Potential Fixes

Keep fixes actionable and identify which source comparisons support each fix.

Rules

  • Do not decide pass/fail or propose a score gate.
  • Do not fetch network data or rerun tools.
  • Do not create or modify files, modify source code, or create commits.
  • Prefer concrete case names and package examples.
  • Keep the report concise.