nf_core.utils
Common utility functions for the nf-core python package.
classnf_core.utils.GitHubAPISession
Bases: CachedSession
Class to provide a single session for interacting with the GitHub API for a run. Inherits the requests_cache.CachedSession and adds additional functionality, such as automatically setting up GitHub authentication if we can.
get(url, **kwargs)
Initialise the session if we haven’t already, then call the superclass get method.
lazy_init() → None
Initialise the object.
Only do this when it’s actually being used (due to global import)
log_content_headers(request, post_data=None)
Try to dump everything to the console, useful when things go wrong.
request_retry(url, post_data=None)
Try to fetch a URL, keep retrying if we get a certain return code.
Used in nf-core pipelines sync code because we get 403 errors: too many simultaneous requests See https://github.com/nf-core/tools/issues/911
safe_get(url)
Run a GET request, raise a nice exception with lots of logging if it fails.
setup_github_auth(auth=None)
Try to automatically set up GitHub authentication
pydantic modelnf_core.utils.NFCoreTemplateConfig
Bases: BaseModel
Template configuration schema
Show JSON schema
```json { "title": "NFCoreTemplateConfig", "description": "Template configuration schema", "type": "object", "properties": { "org": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Org" }, "name": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Name" }, "description": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Description" }, "author": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Author" }, "version": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Version" }, "force": { "anyOf": [ { "type": "boolean" }, { "type": "null" } ], "default": true, "title": "Force" }, "outdir": { "anyOf": [ { "type": "string" }, { "format": "path", "type": "string" }, { "type": "null" } ], "default": null, "title": "Outdir" }, "skip_features": { "anyOf": [ { "items": {}, "type": "array" }, { "type": "null" } ], "default": null, "title": "Skip Features" }, "is_nfcore": { "anyOf": [ { "type": "boolean" }, { "type": "null" } ], "default": null, "title": "Is Nfcore" } } } ```fieldauthor: str | None
Pipeline author
fielddescription: str | None
Pipeline description
fieldforce: bool | None
Force overwrite of existing files
fieldis_nfcore: bool | None
Whether the pipeline is an nf-core pipeline.
fieldname: str | None
Pipeline name
fieldorg: str | None
Organisation name
fieldoutdir: str | Path | None
Output directory
- Validated by:
fieldskip_features: list | None
Skip features. See https://nf-co.re/docs/nf-core-tools/pipelines/create for a list of features.
fieldversion: str | None
Pipeline version
validatoroutdir_to_str»
get(item: str, default: Any = None) → Any
_abc_impl= <_abc._abc_data object>
pydantic modelnf_core.utils.NFCoreYamlConfig
Bases: BaseModel
.nf-core.yml configuration file schema
Show JSON schema
```json { "title": "NFCoreYamlConfig", "description": ".nf-core.yml configuration file schema", "type": "object", "properties": { "repository_type": { "anyOf": [ { "enum": [ "pipeline", "modules" ], "type": "string" }, { "type": "null" } ], "default": null, "title": "Repository Type" }, "nf_core_version": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Nf Core Version" }, "org_path": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Org Path" }, "lint": { "anyOf": [ { "$ref": "#/$defs/NFCoreYamlLintConfig" }, { "type": "null" } ], "default": null }, "template": { "anyOf": [ { "$ref": "#/$defs/NFCoreTemplateConfig" }, { "type": "null" } ], "default": null }, "bump_version": { "anyOf": [ { "additionalProperties": { "type": "boolean" }, "type": "object" }, { "type": "null" } ], "default": null, "title": "Bump Version" }, "update": { "anyOf": [ { "additionalProperties": { "anyOf": [ { "type": "string" }, { "type": "boolean" }, { "additionalProperties": { "anyOf": [ { "type": "string" }, { "additionalProperties": { "anyOf": [ { "type": "string" }, { "type": "boolean" } ] }, "type": "object" } ] }, "type": "object" } ] }, "type": "object" }, { "type": "null" } ], "default": null, "title": "Update" } }, "$defs": { "NFCoreTemplateConfig": { "description": "Template configuration schema", "properties": { "org": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Org" }, "name": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Name" }, "description": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Description" }, "author": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Author" }, "version": { "anyOf": [ { "type": "string" }, { "type": "null" } ], "default": null, "title": "Version" }, "force": { "anyOf": [ { "type": "boolean" }, { "type": "null" } ], "default": true, "title": "Force" }, "outdir": { "anyOf": [ { "type": "string" }, { "format": "path", "type": "string" }, { "type": "null" } ], "default": null, "title": "Outdir" }, "skip_features": { "anyOf": [ { "items": {}, "type": "array" }, { "type": "null" } ], "default": null, "title": "Skip Features" }, "is_nfcore": { "anyOf": [ { "type": "boolean" }, { "type": "null" } ], "default": null, "title": "Is Nfcore" } }, "title": "NFCoreTemplateConfig", "type": "object" }, "NFCoreYamlLintConfig": { "description": "schema for linting config in `.nf-core.yml` should cover:\n\n.. code-block:: yaml\n\n files_unchanged:\n - .github/workflows/branch.yml\n modules_config: False\n modules_config:\n - fastqc\n # merge_markers: False\n merge_markers:\n - docs/my_pdf.pdf\n nextflow_config: False\n nextflow_config:\n - manifest.name\n - config_defaults:\n - params.annotation_db\n - params.multiqc_comment_headers\n - params.custom_table_headers\n # multiqc_config: False\n multiqc_config:\n - report_section_order\n - report_comment\n files_exist:\n - .github/CONTRIBUTING.md\n - CITATIONS.md\n template_strings: False\n template_strings:\n - docs/my_pdf.pdf\n nfcore_components: False\n # nf_test_content: False\n nf_test_content:\n - tests/fieldbump_version: dict[str, bool] | None
Disable bumping of the version for a module/subworkflow (when repository_type is modules). See https://nf-co.re/docs/nf-core-tools/modules/bump-versions for more information.
fieldlint: NFCoreYamlLintConfig | None
Pipeline linting configuration, see https://nf-co.re/docs/nf-core-tools/pipelines/lint#linting-config for examples and documentation
fieldnf_core_version: str | None
Version of nf-core/tools used to create/update the pipeline
fieldorg_path: str | None
Path to the organisation’s modules repository (used for modules repo_type only)
fieldrepository_type: Literal[‘pipeline’, ‘modules’] | None
Type of repository
fieldtemplate: NFCoreTemplateConfig | None
Pipeline template configuration
fieldupdate: dict[str, str | bool | dict[str, str | dict[str, str | bool]]] | None
Disable updating specific modules/subworkflows (when repository_type is pipeline). See https://nf-co.re/docs/nf-core-tools/modules/update for more information.
get(item: str, default: Any = None) → Any
model_dump(**kwargs) → dict[str, Any]
!!! abstract “Usage Documentation” : model_dump
Generate a dictionary representation of the model, optionally specifying which fields to include or exclude.
- Parameters:
- mode – The mode in which to_python should run. If mode is ‘json’, the output will only contain JSON serializable types. If mode is ‘python’, the output may contain non-JSON-serializable Python objects.
- include – A set of fields to include in the output.
- exclude – A set of fields to exclude from the output.
- context – Additional context to pass to the serializer.
- by_alias – Whether to use the field’s alias in the dictionary key if defined.
- exclude_unset – Whether to exclude fields that have not been explicitly set.
- exclude_defaults – Whether to exclude fields that are set to their default value.
- exclude_none – Whether to exclude fields that have a value of None.
- exclude_computed_fields – Whether to exclude computed fields. While this can be useful for round-tripping, it is usually recommended tu use the dedicated round_trip parameter instead.
- round_trip – If True, dumped values should be valid as input for non-idempotent types such as Json[T].
- warnings – How to handle serialization errors. False/”none” ignores them, True/”warn” logs errors, “error” raises a [PydanticSerializationError][pydantic_core.PydanticSerializationError].
- fallback – A function to call when an unknown value is encountered. If not provided, a [PydanticSerializationError][pydantic_core.PydanticSerializationError] error is raised.
- serialize_as_any – Whether to serialize fields with duck-typing serialization behavior.
 
- Returns: A dictionary representation of the model.
_abc_impl= <_abc._abc_data object>
pydantic modelnf_core.utils.NFCoreYamlLintConfig
Bases: BaseModel
schema for linting config in .nf-core.yml should cover:
files_unchanged:
    - .github/workflows/branch.yml
modules_config: False
modules_config:
        - fastqc
# merge_markers: False
merge_markers:
        - docs/my_pdf.pdf
nextflow_config: False
nextflow_config:
    - manifest.name
    - config_defaults:
        - params.annotation_db
        - params.multiqc_comment_headers
        - params.custom_table_headers
# multiqc_config: False
multiqc_config:
    - report_section_order
    - report_comment
files_exist:
    - .github/CONTRIBUTING.md
    - CITATIONS.md
template_strings: False
template_strings:
        - docs/my_pdf.pdf
nfcore_components: False
# nf_test_content: False
nf_test_content:
    - tests/<test_name>.nf.test
    - tests/nextflow.config
    - nf-test.configShow JSON schema
```json { "title": "NFCoreYamlLintConfig", "description": "schema for linting config in `.nf-core.yml` should cover:\n\n.. code-block:: yaml\n\n files_unchanged:\n - .github/workflows/branch.yml\n modules_config: False\n modules_config:\n - fastqc\n # merge_markers: False\n merge_markers:\n - docs/my_pdf.pdf\n nextflow_config: False\n nextflow_config:\n - manifest.name\n - config_defaults:\n - params.annotation_db\n - params.multiqc_comment_headers\n - params.custom_table_headers\n # multiqc_config: False\n multiqc_config:\n - report_section_order\n - report_comment\n files_exist:\n - .github/CONTRIBUTING.md\n - CITATIONS.md\n template_strings: False\n template_strings:\n - docs/my_pdf.pdf\n nfcore_components: False\n # nf_test_content: False\n nf_test_content:\n - tests/fieldactions_awsfulltest: bool | None
Lint all required files to run full tests on AWS
fieldactions_awstest: bool | None
Lint all required files to run tests on AWS
fieldactions_nf_test: bool | None
Lint all required files to use GitHub Actions CI
fieldactions_schema_validation: bool | None
Lint GitHub Action workflow files with schema
fieldbase_config: bool | None
Lint base.config file
fieldfiles_exist: bool | list[str] | None
List of files that can not exist
fieldfiles_unchanged: bool | list[str] | None
List of files that should not be changed
fieldincluded_configs: bool | None
Lint for included configs
fieldlocal_component_structure: bool | None
Lint local components use correct structure mirroring remote
fieldmerge_markers: bool | list[str] | None
List of files that should not contain merge markers
fieldmodules_config: bool | list[str] | None
List of modules that should not be changed
fieldmodules_json: bool | None
Lint modules.json file
fieldmodules_structure: bool | None
Lint modules structure
fieldmultiqc_config: bool | list[str] | None
List of MultiQC config options that be changed
fieldnextflow_config: bool | list[str | dict[str, list[str]]] | None
List of Nextflow config files that should not be changed
fieldnf_test_content: bool | list[str] | None
List of nf-test content that should not be changed
fieldnfcore_components: bool | None
Lint all required files to use nf-core modules and subworkflows
fieldnfcore_yml: bool | None
Lint nf-core.yml
fieldpipeline_if_empty_null: bool | None
Lint for ifEmpty(null) statements
fieldpipeline_name_conventions: bool | None
Lint for pipeline name conventions
fieldpipeline_todos: bool | None
Lint for TODOs statements
fieldplugin_includes: bool | None
Lint for nextflow plugin
fieldreadme: bool | list[str] | None
Lint the README.md file
fieldrocrate_readme_sync: bool | None
Lint for README.md and rocrate.json sync
fieldschema_description: bool | None
Check that every parameter in the schema has a description.
fieldschema_lint: bool | None
Lint nextflow_schema.json file
fieldschema_params: bool | None
Lint schema for all params
fieldsystem_exit: bool | None
Lint for System.exit calls in groovy/nextflow code
fieldtemplate_strings: bool | list[str] | None
List of files that can contain template strings
fieldversion_consistency: bool | None
Lint for version consistency
get(item: str, default: Any = None) → Any
_abc_impl= <_abc._abc_data object>
classnf_core.utils.Pipeline(wf_path: Path)
Bases: object
Object to hold information about a local pipeline.
- Parameters: path (str) – The path to the nf-core pipeline directory.
conda_config
The parsed conda configuration file content (environment.yml).
- Type: dict
conda_package_info
The conda package(s) information, based on the API requests to Anaconda cloud.
- Type: dict
nf_config
The Nextflow pipeline configuration file content.
- Type: dict
files
A list of files found during the linting process.
- Type: list
git_sha
The git sha for the repo commit / current GitHub pull-request ($GITHUB_PR_COMMIT)
- Type: str
minNextflowVersion
The minimum required Nextflow version to run the pipeline.
- Type: str
wf_path
Path to the pipeline directory.
- Type: str
pipeline_name
The pipeline name, without the nf-core tag, for example hlatyping.
- Type: str
schema_obj
A PipelineSchema object
- Type: obj
_fp(fn: str | Path) → Path
Convenience function to get full path to a file in the pipeline
_load() → bool
Run core load functions
_load_conda_environment() → bool
Try to load the pipeline environment.yml file, if it exists
list_files() → list[Path]
Get a list of all files in the pipeline
load_pipeline_config() → bool
Get the nextflow config for this pipeline
Once loaded, set a few convenience reference class attributes
classnf_core.utils.SingularityCacheFilePathValidator
Bases: Validator
Validator for file path specified as –singularity-cache-index argument in nf-core pipelines download
_abc_impl= <_abc._abc_data object>
validate(value)
Validate the input.
If invalid, this should raise a ValidationError.
- Parameters:
document – Documentinstance.
nf_core.utils.anaconda_package(dep, dep_channels=None)
Query conda package information.
Sends a HTTP GET request to the Anaconda remote API.
- Parameters:
- dep (str) – A conda package name.
- dep_channels (list) – list of conda channels to use
 
- Raises:
- A LookupError**,** if the connection fails or times out or gives an unexpected status code –
- A ValueError**,** if the package name can not be found (**404**) –
 
nf_core.utils.check_if_outdated(current_version=None, remote_version=None, source_url='https://nf-co.re/tools_version')
Check if the current version of nf-core is outdated
nf_core.utils.check_nextflow_version(minimal_nf_version: tuple[int, int, int, bool], silent=False) → bool
Check the version of Nextflow installed on the system.
- Parameters:
- minimal_nf_version (tuple *[*int , int , int , bool ]) – The minimal version of Nextflow required.
- silent (bool) – Whether to log the version or not.
 
- Returns: True if the installed version is greater than or equal to minimal_nf_version
- Return type: bool
nf_core.utils.custom_yaml_dumper()
Overwrite default PyYAML output to make Prettier YAML linting happy
nf_core.utils.determine_base_dir(directory: Path | str = '.') → Path
nf_core.utils.fetch_remote_version(source_url)
nf_core.utils.fetch_wf_config(wf_path: Path, cache_config: bool = True) → dict
Uses Nextflow to retrieve the the configuration variables from a Nextflow workflow.
- Parameters:
- wf_path (str) – Nextflow workflow file system path.
- cache_config (bool) – cache configuration or not (def. True)
 
- Returns: Workflow configuration settings.
- Return type: dict
nf_core.utils.file_md5(fname)
Calculates the md5sum for a file on the disk.
- Parameters: fname (str) – Path to a local file.
nf_core.utils.get_biocontainer_tag(package, version)
Given a bioconda package and version, looks for Docker and Singularity containers using the biocontaineres API, e.g.: https://api.biocontainers.pro/ga4gh/trs/v2/tools/{tool}/versions/{tool}-{version} Returns the most recent container versions by default. :param package: A bioconda package name. :type package: str :param version: Version of the bioconda package :type version: str
- Raises:
- A LookupError**,** if the connection fails or times out or gives an unexpected status code –
- A ValueError**,** if the package name can not be found (**404**) –
 
nf_core.utils.get_first_available_path(directory: Path | str, paths: list[str]) → Path | None
nf_core.utils.get_nf_version() → tuple[int, int, int, bool] | None
Get the version of Nextflow installed on the system.
nf_core.utils.get_repo_commit(pipeline, commit_id)
Check if the repo contains the requested commit_id, and expand it to long form if necessary.
- Parameters:
- pipeline (str) – GitHub repo username/repo
- commit_id – The requested commit ID (SHA). It can be in standard long/short form, or any length.
 
- Returns: String or None
- Return type: commit_id
nf_core.utils.get_repo_releases_branches(pipeline, wfs)
Fetches details of a nf-core workflow to download.
- Parameters:
- pipeline (str) – GitHub repo username/repo
- wfs – A nf_core.pipelines.list.Workflows() object, where get_remote_workflows() has been called.
 
- Returns: Array of releases, Array of branches
- Return type: wf_releases, wf_branches (tuple)
- Raises: LockupError**,** if the pipeline can not be found. –
nf_core.utils.get_wf_files(wf_path: Path)
Return a list of all files in a directory (ignores .gitigore files)
nf_core.utils.is_file_binary(path)
Check file path to see if it is a binary file
nf_core.utils.is_pipeline_directory(wf_path)
Checks if the specified directory have the minimum required files (‘main.nf’, ‘nextflow.config’) for a pipeline directory
- Parameters: wf_path (str) – The directory to be inspected
- Raises: UserWarning – If one of the files are missing
nf_core.utils.is_relative_to(path1, path2)
Checks if a path is relative to another.
Should mimic Path.is_relative_to which not available in Python < 3.9
path1 (Path | str): The path that could be a subpath path2 (Path | str): The path the could be the superpath
nf_core.utils.load_tools_config(directory: str | Path = '.') → tuple[Path | None,NFCoreYamlConfig| None]
Parse the nf-core.yml configuration file
Look for a file called either .nf-core.yml or .nf-core.yaml
Also looks for the deprecated file .nf-core-lint.yml/yaml and issues a warning that this file will be deprecated in the future
Returns the loaded config dict or False, if the file couldn’t be loaded
nf_core.utils.nested_delitem(d, keys)
Deletes a key from a nested dictionary
- Parameters:
- d (dict) – the nested dictionary to traverse
- keys (list *[*Any ]) – A list of keys to iteratively traverse, deleting the final one
 
nf_core.utils.nested_setitem(d, keys, value)
Sets the value in a nested dict using a list of keys to traverse
- Parameters:
- d (dict) – the nested dictionary to traverse
- keys (list *[*Any ]) – A list of keys to iteratively traverse
- value (Any) – The value to be set for the last key in the chain
 
nf_core.utils.parse_anaconda_licence(anaconda_response, version=None)
Given a response from the anaconda API using anaconda_package, parse the software licences.
Returns: Set of licence types
nf_core.utils.pip_package(dep)
Query PyPI package information.
Sends a HTTP GET request to the PyPI remote API.
- Parameters: dep (str) – A PyPI package name.
- Raises:
- A LookupError**,** if the connection fails or times out –
- A ValueError**,** if the package name can not be found –
 
nf_core.utils.plural_es(list_or_int)
Return a ‘es’ if the input is not one or has not the length of one.
nf_core.utils.plural_s(list_or_int)
Return an s if the input is not one or has not the length of one.
nf_core.utils.plural_y(list_or_int)
Return ‘ies’ if the input is not one or has not the length of one, else ‘y’.
nf_core.utils.poll_nfcore_web_api(api_url: str, post_data: dict | None = None) → dict
Poll the nf-core website API
Takes argument api_url for URL
Expects API response to be valid JSON and contain a top-level ‘status’ key.
nf_core.utils.pretty_nf_version(version: tuple[int, int, int, bool]) → str
nf_core.utils.prompt_pipeline_release_branch(wf_releases: list[dict[str, Any]], wf_branches: dict[str, Any], multiple: bool = False) → tuple[Any, list[str]]
Prompt for pipeline release / branch
- Parameters:
- wf_releases (array) – Array of repo releases as returned by the GitHub API
- wf_branches (array) – Array of repo branches, as returned by the GitHub API
- multiple (bool) – Allow selection of multiple releases & branches (for Seqera Platform)
 
- Returns: Selected release / branch or False if no releases / branches available
- Return type: choice (questionary.Choice or bool)
nf_core.utils.prompt_remote_pipeline_name(wfs)
Prompt for the pipeline name with questionary
- Parameters: wfs – A nf_core.pipelines.list.Workflows() object, where get_remote_workflows() has been called.
- Returns: GitHub repo - username/repo
- Return type: pipeline (str)
- Raises: AssertionError**,** if pipeline cannot be found –
nf_core.utils.rich_force_colors()
Check if any environment variables are set to force Rich to use coloured output
nf_core.utils.run_cmd(executable: str, cmd: str) → tuple[bytes, bytes] | None
Run a specified command and capture the output. Handle errors nicely.
nf_core.utils.set_wd(path: Path) → Generator[None, None, None]
Sets the working directory for this context.
- Parameters: path (Path) – Path to the working directory to be used inside this context.
nf_core.utils.setup_nfcore_cachedir(cache_fn: str | Path) → Path
Sets up local caching for caching files between sessions.
nf_core.utils.setup_nfcore_dir() → bool
Creates a directory for files that need to be kept between sessions
Currently only used for keeping local copies of modules repos
nf_core.utils.setup_requests_cachedir() → dict[str, Path | timedelta | str]
Sets up local caching for faster remote HTTP requests.
Caching directory will be set up in the user’s home directory under a .config/nf-core/cache_* subdir.
Uses requests_cache monkey patching. Also returns the config dict so that we can use the same setup with a Session.
nf_core.utils.sort_dictionary(d: dict) → dict
Sorts a nested dictionary recursively
nf_core.utils.strip_ansi_codes(string, replace_with='')
Strip ANSI colouring codes from a string to return plain text.
From Stack Overflow: https://stackoverflow.com/a/14693789/713980
nf_core.utils.validate_file_md5(file_name, expected_md5hex)
Validates the md5 checksum of a file on disk.
- Parameters:
- file_name (str) – Path to a local file.
- expected (str) – The expected md5sum.
 
- Raises: IOError**,** if the md5sum does not match the remote sum. –
nf_core.utils.wait_cli_function(poll_func: Callable[[], bool], refresh_per_second: int = 20) → None
Display a command-line spinner while calling a function repeatedly.
Keep waiting until that function returns True
- Parameters:
- poll_func (function) – Function to call
- refresh_per_second (int) – Refresh this many times per second. Default: 20.
 
- Returns: None. Just sits in an infinite loop until the function returns True.