AWS Physical AI Blog

Category: Graviton

Training World Models on Scene Semantics, Not Pixels

A different recipe for training robot world models: compose pre-trained AI modules with classical computer vision to extract scene semantics from ordinary monocular video — no domain data, no synthetic frames. Introduction Today’s recipe for training robot AI looks the same almost everywhere: feed a giant neural network billions of pixels paired with text instructions […]

Accelerating OpenCV on Graviton – the COOL framework

Introduction  Computer vision workloads are compute-intensive and expensive, forcing developers to choose between performance and cost when processing millions of images for applications ranging from autonomous vehicles to medical diagnostics. OpenCV (Open Computer Vision) is an open-source library designed to make computer vision and image processing fast, easy to use, and portable. It’s one of […]