Generate de novo assemblies from PacBio read data. Supports single-sample and trio-based assemblies.
Workflow for running de novo assembly using human PacBio whole genome sequencing (WGS) data. Written using Workflow Description Language (WDL). The assembly workflow performs de novo assembly on samples and trios.
Workflow Inputs
Each sample can independently have single-sample de novo assembly run. Additionally, if a trio is provided, trio-based assembly may be run.
Input | Description | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
cohort | A cohort can include one or more samples. Samples need not be related.
| ||||||||||||
samples | Sample information for each sample in the workflow run.
| ||||||||||||
reference | Files associated with the reference genome.
| ||||||||||||
backend | Backend where the workflow will be executed | ||||||||||||
zones | Zones where compute will take place; required if backend is set to ‘AWS’ or ‘GCP’. | ||||||||||||
aws_spot_queue_arn | Queue ARN for the spot batch queue; required if backend is set to ‘AWS’ and | ||||||||||||
aws_on_demand_queue_arn | Queue ARN for the on demand batch queue; required if backend is set to ‘AWS’ and | ||||||||||||
container_registry | Container registry where workflow images are hosted. If left blank, PacBio’s public Quay.io registry will be used. | ||||||||||||
preemptible | If set to |
Workflow Outputs
The output set will depend on whether single-sample or trio-based de novo assembly is run.
Output | Description | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Sample de novo assembly | These files will be output if
| ||||||||||||||
Trio de novo assembly | These files will be output if
|
References
Reference datasets are hosted publicly for use in the pipeline.
Containers
Docker images definitions used by this workflow can be found in the wdl-dockerfiles repository. Images are hosted in PacBio’s quay.io. Docker images used in the workflow are pegged to specific versions by referring to their digests rather than tags.