Overview

Product video
Unstract is a document intelligence platform that converts unstructured documents into structured data your systems can use.
It works in three stages: parsing, extraction, and deployment.
Document Parsing Unstract reads PDFs, scanned images, forms, and documents with mixed layouts, then converts them into clean, machine-readable text.
- Reads PDFs, scanned images, forms, and documents in unseen layouts and supports 300+ languages.
- Preserves tables, columns, and key-value pairs so layout reaches the extraction stage intact.
- Handles low-quality scans and dense files without manual cleanup.
- Reaches up to 99.9% extraction accuracy across supported document types.
Document Data Extraction Teams describe the fields they need in plain language. Unstract uses large language models to interpret the parsed content and return structured output ready for databases, applications, and downstream workflows.
- Define extraction logic in plain language, with no template setup or labeled training data.
- Interprets varied and unseen layouts, so a new vendor format needs no new rules.
- Supports a wide range of LLMs
- Drives 7x token cost savings and a 20x improvement in operational efficiency.
Deployment Once an extraction is defined, you can put it into production as an API your applications call directly, or run it across batches of documents as a pipeline.
- Ship extraction logic as an API your applications call directly.
- Run documents in batches as an ETL pipeline for high-volume processing.
- Add Human Quality Review at checkpoints such as high-value contracts or regulated records.
- Cuts manual work by 90% and reaches 90%+ straight-through processing.
Common use-cases:
- Insurance claims intake
- Loan and mortgage document review
- Bank statement and financial document analysis
- KYC and customer onboarding
- Contract and agreement data extraction
- Patient and healthcare record processing
- Invoice and accounts payable processing
Highlights
- Prompt Studio: no-code environment to write extraction prompts that hold up across document variants, define your output schema, and compare results and cost from multiple LLMs side by side.
- Agentic Prompt Studio: AI agents read your documents, infer the schema, generate prompts, run extractions, and validate accuracy against verified outputs.
- Human in the Loop: route low-confidence or high-stakes fields to reviewers and approvers for verification, keeping control over what flows downstream.
Details
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
Pricing
Vendor refund policy
This is a contract with usage-based pricing. Once the usage credits included in the contract are exhausted, you will be billed based on your actual usage.
How can we make this page better?
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Resources
Vendor resources
Support
Vendor support
If you need assistance or encounter any issues during use, please contact us at support@unstract.com .
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.