
Open data
|
Deployed on AWS
Several reference genomes to enable translation of whole human genome sequencing to clinical practice. On 11/12/2020 these data were updated to reflect the most [up to date GIAB release](https://www.nist.gov/programs-projects/genome-bottle).
Overview
Several reference genomes to enable translation of whole human genome sequencing to clinical practice. On 11/12/2020 these data were updated to reflect the most up to date GIAB release .
Features and programs
Open Data Sponsorship Program
This dataset is part of the Open Data Sponsorship Program, an AWS program that covers the cost of storage for publicly available high-value cloud-optimized datasets.
Pricing
This is a publicly available data set. No subscription is required.
How can we make this page better?
We'd like to hear your feedback and ideas on how to improve this page.
Legal
Content disclaimer
Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.
Delivery details
AWS Data Exchange (ADX)
AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.
Open data resources
Available with or without an AWS account.
- How to use
- To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI). Learn more
- Description
- GIAB release data, last updated 11/12/2020.
- Resource type
- S3 bucket
- Amazon Resource Name (ARN)
- arn:aws:s3:::giab
- AWS region
- us-east-1
- AWS CLI access (No AWS account required)
- aws s3 ls --no-sign-request s3://giab/
Resources
Vendor resources
Support
Contact
How to cite
Genome in a Bottle on AWS was accessed on DATE from https://registry.opendata.aws/giab .
License
There are no restrictions on the use of this data. More information on citation is available here .
Similar products
The DRAGEN Complete Suite enables ultra-rapid analysis of Next Generation Sequencing (NGS) data for large data sets, such as whole genomes, exomes, and genes/panels.
This offering enables you to shift on-prem genomics data processing to the AWS Cloud without changing existing HPC application code. This provides a scalable compute and storage solution that allows samples to be processed at higher speed and at higher frequency
NOTE: This deployment requires a specific SageMaker Inference AMI selection (al2-ami-sagemaker-inference-gpu-3-1). Please use the example notebook provided at https://github.com/NVIDIA/nim-deploy/blob/main/cloud-service-providers/aws/sagemaker/aws_marketplace_notebooks/nim-evo2-40b-v2-1-0_aws_marketplace.ipynb for deploying the endpoint.
Evo 2 is a biological foundation model that can interpret and generate DNA sequences across various biological scales: from individual molecules to entire genomes while retaining sensitivity to single-nucleotide changes, enabling zero-shot predictions and complex biological system designs.
10x Genomics is an industry leader in microfluidics devices that tag and label single cells. Their Cell Ranger, Space Ranger, and other powerful visualization software are widely used in NGS and single cell workflows - in tandem with various QC alignment, analysis, and exploratory tools. PTP’s Connected Lab Services offers networking and cloud expertise to ensure that your NGS and single cell workflows are supported, efficient, and scalable in hybrid architecture environments.
CancerVision is a cutting-edge 2-in-1 whole-genome cancer assay offering both somatic (40x) and paired germline (20x) coverage, with ultra-deep targeting (500x) of 600+ clinically relevant cancer genes. It delivers >99% sensitivity and PPV, with robust detection of complex variants including SVs, CNVs, and mutations in non-coding regions. CancerVision also provides key biomarker insights such as TMB, MSI, HRD, mutational signatures, and germline variants. This CAP/CLIA-validated assay offers a fast 2-week turnaround and supports advanced custom analyses, including ecDNA, tumor ploidy, and transposable element detection, at whole-genome resolution. Designed for precision oncology, CancerVision is your comprehensive genomic solution for cancer diagnostics and research.