ASH v3 Release by scrthq · Pull Request #117 · awslabs/automated-security-helper

scrthq · 2025-04-16T14:42:33Z

ASH v3 Release

This PR includes the work comprising the next major version release of the Automated Security Helper.

Drivers
Breaking Changes
- aggregated_results.{txt,json} Structure
- Migration from git-secrets to detect-secrets
New Features / Enhancements

Feature Parity - Various Item Tracker

Drivers

The core drivers for the changes in this release are:

Standardization of ASH results data structure:
- ASH should produce machine-readable outputs by default so the outputs can be better leveraged by users and organizations integrating ASH into their SDLC processes.
Support for industry standard output formats:
- ASH should be able to produce reports from its standardized data structure that align with industry standards for security scanning and test reporting, e.g. SARIF, CycloneDX, JUnitXML.
ASH reports should be easily actionable:
- Reviewing an ASH report and identifying the issues that need action should be simple.
- ASH should support producing formats optimized for human-readability, e.g. HTML reports or text reports that display the findings in a way that focuses on what is important from the scan.
Extensibility and an overall better developer experience:
- ASH has historically been written mostly as shell scripts, with small amounts of various other languages being introduced over time depending on what was required at that time. This has made extensibility, development and testing overall difficult compared to focusing entirely on a language better suited for development such as Python.
- Extending or customizing ASH has also been difficult without a deep understanding of ASH, often requiring internalization and additional administrative overhead.
Configurability:
- A feature request we've received often has been to surface a mechanism to configure ASH, e.g. providing custom path exclusions or providing configuration to underlying scanners.

Breaking Changes

The following changes in this release could impact how you currently use ASH.

`aggregated_results.{txt,json}` Structure

One of the primary goals of this release has been to improve how ASH collects, processes, and formats the outputs it produces across the suite of scanners ASH employs. Up until this release, the output format has been raw stdout/stderr redirection from the scanners themselves. This makes scan result processing manual, and the results often include a large amount of "noise" because all of the scanner output is captured.

This release changes the output format for the aggregated results to a standardized data model named the "ASHARP" model (ASH Aggregated Results Parser). This model is emitted as a JSON file to the output directory named aggregated_results.json.

(If you are not currently parsing the aggregated_results.{txt,json} output of ASH, you are unlikely to be impacted by this change.)

The output model JSON schema is available at src/automated_security_helper/schemas/ASHARPModel.json
The Pydantic model that generates the JSON schema is available at src/automated_security_helper/models/asharp_model.py
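
For anyone who currently parses the aggregated output, a first look at the new file could be sketched as below. The sketch assumes only that aggregated_results.json holds a JSON object; the helper name `summarize_top_level` and the `ash_output` directory are illustrative, and actual field names should be taken from the schema file listed above rather than from this example.

```python
import json
from pathlib import Path


def summarize_top_level(results_path: Path) -> dict[str, str]:
    """Map each top-level key of the results file to the Python type of its value."""
    with results_path.open(encoding="utf-8") as handle:
        data = json.load(handle)
    if not isinstance(data, dict):
        raise ValueError(f"expected a JSON object in {results_path}")
    return {key: type(value).__name__ for key, value in data.items()}


if __name__ == "__main__":
    # Hypothetical location; point this at your own ASH output directory.
    path = Path("ash_output") / "aggregated_results.json"
    if path.exists():
        for key, kind in summarize_top_level(path).items():
            print(f"{key}: {kind}")
```

Listing the top-level keys first is a cheap way to compare what a downstream parser expects against what the new model actually emits before committing to a migration.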

Migration from `git-secrets` to `detect-secrets`

detect-secrets currently provides a full Python interface and can have the version pinned within our pyproject.toml.
detect-secrets provides the ability to baseline a directory or file so acknowledged findings do not continue to raise false positives.
In our testing, git-secrets found far fewer findings than detect-secrets, with a sample directory showing 2 secrets detected by git-secrets (an AWS key pair) vs. 157 by detect-secrets (including the AWS key pair that git-secrets found):
- git-secrets only matches AWS credentials unless custom rules/patterns are authored
- detect-secrets supports a large variety of predefined rules that greatly increase overall secret-type detection support:

$ detect-secrets scan --list-all-plugins
ArtifactoryDetector
AWSKeyDetector
AzureStorageKeyDetector
BasicAuthDetector
CloudantDetector
DiscordBotTokenDetector
GitHubTokenDetector
GitLabTokenDetector
Base64HighEntropyString
HexHighEntropyString
IbmCloudIamDetector
IbmCosHmacDetector
IPPublicDetector
JwtTokenDetector
KeywordDetector
MailchimpDetector
NpmDetector
OpenAIDetector
PrivateKeyDetector
PypiTokenDetector
SendGridDetector
SlackDetector
SoftlayerDetector
SquareOAuthDetector
StripeDetector
TelegramBotTokenDetector
TwilioKeyDetector
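
The baselining idea mentioned above (acknowledged findings stop being reported) can be illustrated with a small stand-alone sketch. This is not the detect-secrets API or its baseline file format: the `Finding` record, its fields, and the `new_findings` helper are all hypothetical and exist only to show the comparison.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Finding:
    """Hypothetical finding record; not detect-secrets' own data model."""

    filename: str
    secret_type: str
    hashed_secret: str


def new_findings(current: list[Finding], baseline: list[Finding]) -> list[Finding]:
    """Return findings from the current scan that are absent from the baseline."""
    acknowledged = set(baseline)  # frozen dataclasses are hashable
    return [finding for finding in current if finding not in acknowledged]


baseline = [Finding("config.ini", "AWS Access Key", "abc123")]
current = [
    Finding("config.ini", "AWS Access Key", "abc123"),  # already acknowledged
    Finding("deploy.sh", "Private Key", "def456"),  # new since the baseline
]
print([finding.filename for finding in new_findings(current, baseline)])
```

Comparing on a hash of the secret rather than the secret itself means a baseline of this shape can be stored without re-exposing the values it acknowledges.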

New Features / Enhancements

SARIF as primary data structure for SAST reports

The Static Analysis Results Interchange Format (SARIF) defines a standard format for the output of static analysis tools. ASH uses the SARIF 2.1.0 schema specification as an intermediary data format for SAST scanner results to emit reports from.

Along with being open source itself, SARIF has been chosen for ASH's SAST data format due to its broad ecosystem and existing integration support with common enterprise tooling.
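
As a rough illustration of why a standard format helps integrations, the sketch below counts results per level in a SARIF 2.1.0 document using only the `runs`, `results`, and `level` properties from the specification. The function name and the sample document are made up for the example, and treating a missing `level` as "warning" is a simplification that ignores rule-level default configuration.

```python
import json
from collections import Counter


def count_results_by_level(sarif: dict) -> Counter:
    """Count SARIF results per level across all runs."""
    counts: Counter = Counter()
    for run in sarif.get("runs") or []:
        for result in run.get("results") or []:
            # Simplification: fall back to "warning" when no level is given.
            counts[result.get("level", "warning")] += 1
    return counts


# Made-up sample document, not output from any real scanner.
example = json.loads("""
{
  "version": "2.1.0",
  "runs": [
    {
      "tool": {"driver": {"name": "example-scanner"}},
      "results": [
        {"ruleId": "EX001", "level": "error", "message": {"text": "first"}},
        {"ruleId": "EX002", "message": {"text": "second"}}
      ]
    }
  ]
}
""")
print(dict(count_results_by_level(example)))
```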

CycloneDX as primary data structure for SBOM reports

Similar to SARIF, OWASP CycloneDX is a full-stack Bill of Materials (BOM) standard that provides advanced supply chain capabilities for cyber risk reduction.

JSON output from ASHARP model for aggregated results

The ASHARP model is a lightweight metadata wrapper that allows collection of all relevant data from a scan necessary to produce scan reports.

Configuration Support

ASH now has a local configuration format backed by the ASHConfig model JSON schema. The configuration can be authored in either JSON or YAML. If an explicit path is not provided, ASH looks for a configuration file in the following locations by default:

The ASH_CONFIG environment variable, if set to a valid path to an ASH configuration file.
An ash.yaml or ash.yml in the root of the source directory of the scan.
An ash.json in the root of the source directory of the scan.
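
The lookup described above could be sketched as follows. This assumes the three locations are listed in order of precedence, which the text does not state outright; `resolve_config_path` and `CANDIDATE_NAMES` are hypothetical names, not ASH's own implementation.

```python
import os
from pathlib import Path
from typing import Optional

# File names checked in the source directory, in the order listed above.
CANDIDATE_NAMES = ("ash.yaml", "ash.yml", "ash.json")


def resolve_config_path(source_dir: Path, explicit: Optional[Path] = None) -> Optional[Path]:
    """Return the configuration file to use, or None if nothing is found."""
    if explicit is not None:
        return explicit  # an explicitly provided path skips the default lookup
    env_value = os.environ.get("ASH_CONFIG")
    if env_value and Path(env_value).is_file():
        return Path(env_value)
    for name in CANDIDATE_NAMES:
        candidate = source_dir / name
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    print(resolve_config_path(Path.cwd()))
```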

Plugin Support / Extensibility

ASH v3 introduces support for custom plugins in the form of Python modules extending the following module namespaces:

automated_security_helper.converters
- Converters are responsible for converting unscannable file formats into scannable ones.
- ASH currently includes the following ConverterPlugin implementations as of this release (checked means implemented, tested and ready to release):
  - ArchiveConverter: Identifies zip, tar, and tar.gz files in the source directory, searches for scannable files within the archive, and extracts the scannable files into the temporary working directory of the scan.
  - JupyterNotebookConverter: Identifies Jupyter Notebook (.ipynb) files and converts them to Python using nbconvert, outputting the converted Python files to the temporary working directory of the scan.
automated_security_helper.scanners
- Scanners are the core of ASH and are the integration point for SAST and SBOM scanners.
- ASH currently includes the following ScannerPlugin implementations as of this release (checked means implemented, tested and ready to release):
  - BanditScanner: Runs bandit to perform SAST scanning against Python files.
  - CdkNagScanner: Evaluates rendered CloudFormation YAML/JSON templates against CDK Nag's provided NagPacks. Defaults to including the AWS Solutions NagPack, but allows enabling any other CDK NagPack: HIPAA Security, NIST 800-53 rev 4, NIST 800-53 rev 5, and PCI DSS 3.2.1 NagPacks.
  - CfnNagScanner: Runs cfn-nag against rendered CloudFormation templates for IaC analysis.
  - CheckovScanner: Runs checkov to perform IaC/SAST scanning against applicable content in the source directory.
  - DetectSecretsScanner: Runs detect-secrets tool against scannable files in the source directory to identify secrets in code. Replaces git-secrets in ASH's scanner stack.
  - NpmAuditScanner: Runs npm/yarn/pnpm audit based on which package lock(s) are discovered in the source directory.
  - SemgrepScanner: Runs semgrep to perform SAST scans.
  - GrypeScanner: Runs grype to perform SAST scans.
  - SyftScanner: Runs syft to perform SBOM scans.
  - CustomScanner: Configuration-driven implementation that allows easy integration of custom scanner tools that emit SARIF and/or CycloneDX outputs.
automated_security_helper.reporters
- Reporters are responsible for ingesting the ASHARPModel and outputting the data into different formats or data stores, e.g. to file or to a centralized security finding aggregation service like Amazon Security Hub.
- ASH currently includes the following ReporterPlugin implementations as of this release (checked means implemented, tested and ready to release):
  - ASFFReporter: Converts report to ASFF (Amazon Security Findings Format), saves as ash.asff in the output directory.
  - CSVReporter: Converts report to simple CSV format, saves as ash.csv in the output directory.
  - CycloneDXReporter: Converts SBOM report to CycloneDX JSON format, saves as ash.cdx.json in the output directory.
  - HTMLReporter: Converts report to simple HTML format, saves as ash.html in the output directory.
  - JSONReporter: Converts report to simple JSON format, saves as ash.json in the output directory.
  - JUnitXMLReporter: Converts report to JUnitXML format, saves as ash.junit.xml in the output directory.
  - MarkdownReporter: Converts report to Markdown format, saves as ash.md in the output directory. Provides useful top-level information around the scan results, including listing file locations ranked by finding count to identify hotspots to focus on.
  - OCSFReporter: Converts report to OCSF (Open Cybersecurity Schema Framework) format, saves as ash.ocsf in the output directory.
  - SARIFReporter: Converts report to SARIF format, saves as ash.sarif in the output directory.
  - SPDXReporter: Converts SBOM report to SPDX JSON format, saves as ash.spdf.json in the output directory.
  - TextReporter: Converts report to a simple text-based report, saves as ash.txt in the output directory.
  - YAMLReporter: Converts report to simple YAML format, saves as ash.yaml in the output directory.
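
To make the NpmAuditScanner entry above more concrete, the sketch below shows one way lock-file discovery could select an audit tool. The lock-file names are the standard ones for npm, Yarn, and pnpm; the function name, the `node_modules` exclusion, and the return shape are assumptions for illustration and do not reflect how the ASH plugin is actually written.

```python
from pathlib import Path

# Standard lock-file names mapped to the package manager whose audit applies.
LOCKFILE_TO_TOOL = {
    "package-lock.json": "npm",
    "yarn.lock": "yarn",
    "pnpm-lock.yaml": "pnpm",
}


def audit_tools_for(source_dir: Path) -> dict[Path, str]:
    """Map each discovered lock file (relative to source_dir) to its audit tool."""
    found: dict[Path, str] = {}
    for path in sorted(source_dir.rglob("*")):
        relative = path.relative_to(source_dir)
        if "node_modules" in relative.parts:
            continue  # assumption: installed dependencies are not scanned directly
        if path.is_file() and path.name in LOCKFILE_TO_TOOL:
            found[relative] = LOCKFILE_TO_TOOL[path.name]
    return found


if __name__ == "__main__":
    for lockfile, tool in audit_tools_for(Path.cwd()).items():
        print(f"{lockfile}: {tool} audit")
```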

github-actions · 2025-04-16T14:42:51Z

ASH Security Scan Report

Report generated: 2025-08-01T15:52:01+00:00
Time since scan: 2 minutes

Scan Metadata

Project: ASH
Scan executed: 2025-08-01T15:49:58+00:00
ASH version: 3.0.0

Summary

Scanner Results

The table below shows findings by scanner, with status based on severity thresholds and dependencies:

Severity levels:
- Suppressed (S): Findings that have been explicitly suppressed and don't affect scanner status
- Critical (C): Highest severity findings that require immediate attention
- High (H): Serious findings that should be addressed soon
- Medium (M): Moderate risk findings
- Low (L): Lower risk findings
- Info (I): Informational findings with minimal risk
Duration (Time): Time taken by the scanner to complete its execution
Actionable: Number of findings at or above the threshold severity level that require attention
Result:
- PASSED = No findings at or above threshold
- FAILED = Findings at or above threshold
- MISSING = Required dependencies not available
- SKIPPED = Scanner explicitly disabled
- ERROR = Scanner execution error
Threshold: The minimum severity level that will cause a scanner to fail
- Thresholds: ALL, LOW, MEDIUM, HIGH, CRITICAL
- Source: Values in parentheses indicate where the threshold is set:
  - global (global_settings section in the ASH_CONFIG used)
  - config (scanner config section in the ASH_CONFIG used)
  - scanner (default configuration in the plugin, if explicitly set)
Statistics calculation:
- All statistics are calculated from the final aggregated SARIF report
- Suppressed findings are counted separately and do not contribute to actionable findings
- Scanner status is determined by comparing actionable findings to the threshold
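
The threshold comparison described above can be sketched in a few lines. The severity ordering follows the legend; how ASH implements this internally is not shown in the report, so the function names are hypothetical, and treating ALL as "every severity including Info is actionable" is an assumption.

```python
# Severities from lowest to highest, as listed in the legend above.
SEVERITY_ORDER = ["info", "low", "medium", "high", "critical"]


def actionable_count(counts: dict[str, int], threshold: str) -> int:
    """Sum unsuppressed findings at or above the threshold severity."""
    threshold = threshold.lower()
    # Assumption: ALL makes every severity, including info, actionable.
    start = 0 if threshold == "all" else SEVERITY_ORDER.index(threshold)
    return sum(counts.get(level, 0) for level in SEVERITY_ORDER[start:])


def scanner_result(counts: dict[str, int], threshold: str) -> str:
    """Return PASSED or FAILED for a scanner that ran to completion."""
    return "FAILED" if actionable_count(counts, threshold) > 0 else "PASSED"


# Counts taken from the bandit row of the table; suppressed findings are excluded.
bandit = {"low": 53}
print(scanner_result(bandit, "MEDIUM"))  # low findings sit below a MEDIUM threshold
print(scanner_result(bandit, "LOW"))  # the same findings count against a LOW threshold
```

This matches the bandit row in the table: 53 Low findings under a MEDIUM threshold leave nothing actionable, so the scanner passes.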

Scanner	Suppressed	Low	Result	Threshold
bandit	44	53	PASSED	MEDIUM (global)
cdk-nag	0	0	PASSED	MEDIUM (global)
cfn-nag	0	0	PASSED	MEDIUM (global)
checkov	1	0	PASSED	LOW (config)
detect-secrets	6	0	PASSED	MEDIUM (global)
grype	1	0	PASSED	MEDIUM (global)
npm-audit	0	0	PASSED	MEDIUM (global)
opengrep	13	0	PASSED	MEDIUM (global)
semgrep	0	0	PASSED	MEDIUM (global)
syft	0	0	PASSED	MEDIUM (global)

Report generated by Automated Security Helper (ASH) at 2025-08-01T15:52:01+00:00

github-advanced-security

AWS Labs - Automated Security Helper found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

src/automated_security_helper/cli/image.py

src/automated_security_helper/reporters/ash_default/report_content_emitter.py

src/automated_security_helper/cli/image.py

automated_security_helper/cli/image.py

automated_security_helper/plugins/discovery.py

automated_security_helper/plugins/loader.py

automated_security_helper/plugins/plugin_manager.py

automated_security_helper/utils/subprocess_utils.py

automated_security_helper/interactions/run_ash_container.py

awsmadi

LGTM! Can't wait for this new release!

rafaelpereyra

LGTM let's go!

awsntheule · 2025-07-31T20:25:32Z

It looks good to me, excited to see this get merged into main!

awsmadi · 2025-07-31T20:35:54Z

I have cut Issue #170 to investigate Grype soft fails. Thank you @rafaelpereyra for identifying this! This will not block merging at this time.

…flow Updated GitHub Actions workflow to use proper bash variable expansion syntax by changing $VARIABLE to ${VARIABLE} format for:
- ASH_UVX_SOURCE variable references in uvx commands
- ASH_OUTPUT_DIR variable references in file paths and commands

- Add environment variables for ASH_MODE, ASH_ARGS, FAIL_ON_FINDINGS, and VERBOSE
- Replace direct input substitution with environment variable references
- Use ASH_OUTPUT_DIR environment variable for output directory
- Improve shell safety by avoiding direct parameter expansion in command line

- Convert single quotes to double quotes for consistency across all string values
- Update expiration dates, rule IDs, paths, and reasons to use double quotes
- Remove expired GitHub Actions security suppression rule
- Maintain consistent formatting for better readability and YAML standards compliance

- Refactored fail-on-findings parameter handling by introducing FAIL_ON_FINDINGS_PARAM environment variable
- Moved conditional logic for --fail-on-findings/--no-fail-on-findings flags from inline command to environment variable
- Simplified the uvx command line by using the new environment variable instead of inline conditional expression

github-advanced-security bot found potential problems Apr 21, 2025

github-advanced-security bot found potential problems May 1, 2025

github-advanced-security bot found potential problems May 2, 2025

src/automated_security_helper/cli/image.py (Fixed)

github-advanced-security bot found potential problems May 4, 2025

github-advanced-security bot found potential problems May 5, 2025

automated_security_helper/interactions/run_ash_container.py (Fixed)

scrthq added 24 commits May 6, 2025 22:32

fix(plugins): fix plugins

5669956

feat(multi): cleanup log output, added more visibllity into missing d…

d34082c

…ependency checking via the validate method on plugins

feat(multi): cleanup log output, added more visibllity into missing d…

69e28e1

…ependency checking via the validate method on plugins

fix(custom-scanner): removed CustomScanner, no current plan to use an…

71cb2ee

…ymore

fix(tests): cleaned up custom scanner and found orphaned duplicate cl…

85973c3

…asses still with coupling

Merge pull request #138 from awslabs/v3/feat/cleanup

8182228

v3/feat/cleanup

feat(aggregated_result): Renamed ASHARPModel to AshAggregatedResult f…

d85efef

…or better clarity on intent

feat(aggregated_result): Renamed ASHARPModel to AshAggregatedResults …

a84dec1

…for better clarity on intent

fix(ocsf): Validated OCSF output against Security Lake OCSF validator.py

3e0f90b

Merge branch 'beta' into v3/feat/cleanup

131dab1

Merge pull request #139 from awslabs/v3/feat/cleanup

ce8ab9a

feat(aggregated_result): Renamed ASHARPModel to AshAggregatedResult for better clarity on intent

fix(cdk-nag): No files found error

ba39657

fix(table): Scanners should show as MISSING or SKIPPED in the Results…

b0ad346

… column of the metrics_table

fix(table): Scanners should show as MISSING or SKIPPED in the Results…

638d973

… column of the metrics_table

feat(reports): Fixed results and unified report tables

cfb3777

feat(reports): Fixed results and unified report tables

6400652

feat(reports): Fixed results and unified report tables

701cf82

feat(reports): Fixed results and unified report tables

09ef9bc

feat(reports): Fixed results and unified report tables

43b3117

feat(reports): Fixed results and unified report tables

3087bc6

feat(reports): Fixed results and unified report tables

0ba78e4

Merge pull request #140 from awslabs/v3/fix/table-results

047e889

v3/fix/table results

feat(semgrep): re-enabled by default

341a4dd

feat(opengrep): disabled while troubleshooting

4f7ffda

rafaelpereyra and others added 6 commits July 29, 2025 17:31

feat: Added handling of failed scans to reports

8043e7c

fix: Fixed regression on execution report

83f09d3

test: fixed failing tests. Added test for failed scan

0338e12

fix: regression on scanner results

7509cf6

Merge branch 'beta' into v3/fix-scan-error

53e1ca5

Merge pull request #168 from awslabs/v3/fix-scan-error

4ca413b

fix: scan error result

awsmadi marked this pull request as ready for review July 31, 2025 20:03

awsmadi previously approved these changes Jul 31, 2025

rafaelpereyra previously approved these changes Jul 31, 2025

awsntheule previously approved these changes Jul 31, 2025

scrthq added 2 commits July 31, 2025 20:45

fix(ocsf): Shifted OCSFReporter hierarchy so it outputs an array of V…

5cb2b6b

…ulnerabilityFinding objects

fix(ocsf): formatting

2acafa6

scrthq mentioned this pull request Aug 1, 2025

Added Vulert into README.md #172

Closed

scrthq added 2 commits July 31, 2025 21:07

fix(ocsf): test fixes

1ceba23

Merge pull request #173 from awslabs/v3/fix/ocsf-doc-structure

3e3eb6c

V3/fix/ocsf doc structure

scrthq dismissed stale reviews from awsntheule, rafaelpereyra, and awsmadi via 3e3eb6c August 1, 2025 07:13

rafaelpereyra added 9 commits August 1, 2025 09:18

fix: convert workflow description to comment format

39ba285

Convert the description field in the ASH Security Scan workflow from YAML description block to comment format to resolve workflow syntax issues.

ci: add explicit permissions to ASH security scan workflow

fc394ba

Added explicit permissions configuration to the ASH Security Scan GitHub workflow, specifying contents: read, pull-requests: write, and security-events: write permissions.

ci: add checks write permission to ASH security scan workflow

4a86af2

Added checks: write permission to the GitHub workflow permissions in .github/workflows/run-ash-security-scan.yml.

ci: added suppression for opengrep python compatibility rule

f669e07

ci: fixed permissions on workflow

8daf632

awsmadi merged commit ba32f87 into main Aug 1, 2025
32 checks passed

6 participants
