Skip to main content

APIs and scripts for validating STIX 2.0 documents.

Project description

NOTE: This is an OASIS Open Repository. See the Governance section for more information.

The STIX validator checks that STIX JSON content conforms to the requirements specified in the STIX 2.0 specification. In addition to checking conformance with the JSON schemas, the validator checks conformance with requirements that cannot be specified in JSON schema, as well as with established “best practices.” This validator is non-normative; in cases of conflict with the STIX 2.0 specification, the specification takes precedence.

The STIX 2.0 specification contains two types of requirements: mandatory “MUST” requirements, and recommended “SHOULD” best practice requirements. The validator checks documents against the “MUST” requirements using JSON schemas. Some of these mandatory requirements cannot be implemented in JSON Schema, however, so the validator uses Python functions to check them. The “SHOULD” requirements are all checked by Python functions, and options may be used to ignore some or all of these recommended “best practices.”

The only exception to this is the mandatory requirement that an object’s ‘type’ be one of those defined by a STIX Object in the specification. This rules out custom objects, so this check was made optional.

The validator also color-codes its output to make it easier to tell at a glance whether validation passed.

Installation

The easiest way to install the STIX validator is with pip:

$ pip install git+https://github.com/oasis-open/cti-stix-validator.git

Note that if you instead install it by cloning or downloading the repository, you will need to set up the submodules before you install it:

$ git clone https://github.com/oasis-open/cti-stix-validator.git
$ cd cti-stix-validator/
$ git submodule update --init --recursive
$ python setup.py install

Usage

As A Script

The validator comes with a bundled script which you can use to validate a JSON file containing STIX content:

$ stix2_validator <stix_file.json>

As A Library

You can also use this library to integrate STIX validation into your own tools. You can validate a JSON file:

from stix2validator import validate_file, print_results

results = validate_file("stix_file.json")
print_results(results)

You can also validate a JSON string, and check if the input passed validation:

from stix2validator import validate_string, print_results

stix_json_string = "..."
results = validate_string(stix_json_string)
if results.is_valid:
    print_results(results)

You can pass in a ValidationOptions object if you want behavior other than the default:

from stix2validator import ValidationOptions

options = ValidationOptions(strict=True)
results = validate_string(stix_json_string, options)

Checking Best Practices

The validator will always validate input against all of the mandatory “MUST” requirements from the spec. By default it will issue warnings if the input fails any of the “SHOULD” recommendations, but validation will still pass. To turn these “best practice” warnings into errors and cause validation to fail, use the --strict option with the command-line script, or create a ValidationOptions object with strict=True when using the library.

You cannot select which of the “MUST” requirement checks will be performed; all of them will always be performed. However, you may select which of the “SHOULD” checks to perform. Use the codes from the table below to enable or disable these checks. For example, to disable the checks for the report label and tool label vocabularies, use --disable 218,222 or disabled="218,222". All the other checks will still be performed. Conversely, to only check that custom property names adhere to the recommended format but not run any of the other “best practice” checks, use --enable 103 or enabled="103".

Enabling supersedes disabling. Simultaneously enabling and disabling the same check will result in the validator performing that check.

Some checks access Internet resources to determine valid values for certain properties. For instance, the ‘mime-type’ check accesses the IANA’s list of registered MIME types. These web requests are cached to conserve bandwidth, will expire after one week, and are stored in a file called ‘cache.sqlite’ in the same directory the script is run from. The cache can be refreshed manually with the --refresh-cache or refresh_cache=True, or cleared with --clear-cache or clear_cache=True. This caching can be disabled entirely with --no-cache or no_cache=True.

Recommended Best Practice Check Codes

Code

Name

Ensures…

1

format-checks

all 1xx checks are run

101

custom-object-prefix

custom object type names follow the correct format

102

custom-object-prefix-lax

same as 101 but more lenient; no source identifier needed in prefix

103

custom-property-prefix

custom object property names follow the correct format

104

custom-property-prefix-lax

same as 103 but more lenient; no source identifier needed in prefix

111

open-vocab-format

values of open vocabularies follow the correct format

121

kill-chain-names

kill-chain-phase name and phase follow the correct format

141

observable-object-keys

observable object keys follow the correct format

142

observable-dictionary-keys

dictionaries in cyber observable objects follow the correct format

143

custom-observable-object-prefix

custom observable object names follow the correct format

144

custom-observable-object-prefix-lax

same as 144 but more lenient; no source identifier needed in prefix

145

custom-object-extension-prefix

custom observable object extension names follow the correct format

146

custom-object-extension-prefix-lax

same as 145 but more lenient; no source identifier needed in prefix

147

custom-observable-properties-prefix

observable object custom property names follow the correct format

148

custom-observable-properties-prefix-lax

same as 148 but more lenient; no source identifier needed in prefix

149

windows-process-priority-format

windows-process-ext’s ‘priority’ follows the correct format

2

approved-values

all 2xx checks are run

201

marking-definition-type

marking definitions use a valid definition_type

202

relationship-types

relationships are among those defined in the specification

210

all-vocabs

all of the following open vocabulary checks are run

211

attack-motivation

certain property values are from the attack_motivation vocabulary

212

attack-resource-level

certain property values are from the attack_resource_level vocabulary

213

identity-class

certain property values are from the identity_class vocabulary

214

indicator-label

certain property values are from the indicator_label vocabulary

215

industry-sector

certain property values are from the industry_sector vocabulary

216

malware-label

certain property values are from the malware_label vocabulary

218

report-label

certain property values are from the report_label vocabulary

219

threat-actor-label

certain property values are from the threat_actor_label vocabulary

220

threat-actor-role

certain property values are from the threat_actor_role vocabulary

221

threat-actor-sophistication

certain property values are from the threat_actor_sophistication vocabulary

222

tool-label

certain property values are from the tool_label vocabulary

241

hash-algo

certain property values are from the hash-algo vocabulary

242

encryption-algo

certain property values are from the encryption-algo vocabulary

243

windows-pebinary-type

certain property values are from the windows-pebinary-type vocabulary

244

account-type

certain property values are from the account-type vocabulary

270

all-external-sources

all of the following external source checks are run

271

mime-type

file.mime_type is a valid IANA MIME type

272

protocols

certain property values are valid IANA Service and Protocol names

273

ipfix

certain property values are valid IANA IP Flow Information Export (IPFIX) Entities

274

http-request-headers

certain property values are valid HTTP request header names

275

socket-options

certain property values are valid socket options

276

pdf-doc-info

certain property values are valid PDF Document Information Dictionary keys

301

network-traffic-ports

network-traffic objects contain both src_port and dst_port

Governance

This GitHub public repository ( https://github.com/oasis-open/cti-stix-validator ) was proposed and approved [bis] by the OASIS Cyber Threat Intelligence (CTI) TC as an OASIS Open Repository to support development of open source resources related to Technical Committee work.

While this Open Repository remains associated with the sponsor TC, its development priorities, leadership, intellectual property terms, participation rules, and other matters of governance are separate and distinct from the OASIS TC Process and related policies.

All contributions made to this Open Repository are subject to open source license terms expressed in the BSD-3-Clause License. That license was selected as the declared “Applicable License” when the Open Repository was created.

As documented in “Public Participation Invited”, contributions to this OASIS Open Repository are invited from all parties, whether affiliated with OASIS or not. Participants must have a GitHub account, but no fees or OASIS membership obligations are required. Participation is expected to be consistent with the OASIS Open Repository Guidelines and Procedures, the open source LICENSE designated for this particular repository, and the requirement for an Individual Contributor License Agreement that governs intellectual property.

Maintainers

Open Repository Maintainers are responsible for oversight of this project’s community development activities, including evaluation of GitHub pull requests and preserving open source principles of openness and fairness. Maintainers are recognized and trusted experts who serve to implement community goals and consensus design preferences.

Initially, the associated TC members have designated one or more persons to serve as Maintainer(s); subsequently, participating community members may select additional or substitute Maintainers, per consensus agreements.

Current Maintainers of this Open Repository

About OASIS Open Repositories

Feedback

Questions or comments about this Open Repository’s activities should be composed as GitHub issues or comments. If use of an issue/comment is not possible or appropriate, questions may be directed by email to the Maintainer(s) listed above. Please send general questions about Open Repository participation to OASIS Staff at repository-admin@oasis-open.org and any specific CLA-related questions to repository-cla@oasis-open.org.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

stix2_validator-0.3.0-py2.py3-none-any.whl (109.9 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page