AWS Re:Invent Keynote Announcements (2024)

Tags: AI Series, AWS, GenAi

After watching AWS’s 2.5-hour re:Invent keynote, it powerfully reinforces what I’ve been telling my customers and peers: AI success requires so much more than just LLMs and prompt engineering.

A key theme was the concept of building blocks—the idea that complex applications can be broken down into simpler components. This approach has allowed AWS to develop hundreds of services that customers can combine in innovative ways, and this really fits well in the era of generative AI which is revolutionizing industries and where startups are at the forefront of this disruption.

The announcements demonstrate that organizations need a complete ecosystem – from specialized hardware to robust data management to development tools – to truly leverage AI at scale. AWS has been building the full stack of technology required to make AI work in real business environments, and the latest improvements do not disappoint at all!

Sure they introduced Amazon’s new Nova family of foundational models (LLMS), but I think a jewel was AWS’s Automated Reasoning capability in Bedrock, which addresses the critical challenge of hallucinations in AI models. It prevents factual errors and hallucinations in model outputs by mathematically verifying statements against source documentation. Crazy and much needed!

As someone who works with different organizations implementing AI, I’ve seen firsthand how companies often fixate on LLMs while overlooking critical infrastructure needs. This keynote showcases exactly what I’ve been advocating – you need the entire foundation to make AI work effectively.

Announcement list

Amazon Bedrock Updates:

Model Distillation: New capability to create smaller, faster, cheaper models that run 500% faster and cost 75% less
Automated Reasoning Checks: New feature to prevent hallucinations and ensure factual accuracy
Multi-Agent Collaboration: Enhanced support for complex workflows using multiple autonomous agents

Amazon Q Updates:

Q Developer: New autonomous agents for generating unit tests, documentation, and code reviews
Q Business: Integration with QuickSight and support for third-party ISV access to Q index
Q Business Automate: New capability to automate complex workflows across multiple applications

Amazon Nova Models:

Text Models: Nova Micro (text-only)
Multimodal Models: Nova Lite, Pro, and Premier (coming Q1 2024)
Nova Canvas: New image generation model
Nova Reel: New video generation model for 6-second clips (expanding to 2 minutes)
Coming in 2024: Speech-to-speech model (Q1) and multimodal-to-multimodal model (mid-year)

Amazon SageMaker Updates:

Next-generation SageMaker with integrated analytics capabilities
SageMaker Unified Studio: New single development environment for data and AI
SageMaker Lakehouse: New Apache Iceberg compatible data lake
Zero-ETL for Applications: New capability for analyzing data across third-party SaaS applications

Training Infrastructure:

Trainium 2: General availability of AWS’s custom AI chip
Trainium 3: Announced for late 2024, offering 2x more compute and 40% better efficiency
Project Rainier: Massive Trainium 2 cluster being built with Anthropic

Content based on category with timestamps:

Category	Product/Service	Timestamp
Compute & Infrastructure
	EC2	13:20
	Graviton4	17:50
	P6 instances	20:38
	Trainium2 GA	23:23
	Trainium2 Ultra	25:50
	Trainium3	34:38
	Elastic VMware Service	2:08:06
Storage
	S3 Table Buckets	42:33
	S3 Metadata	45:51
Databases
	Aurora D SQL	59:56
	DynamoDB Global Tables Updates	1:01:11
AI/ML Services
	Nova Models	1:48:20
	Nova Canvas	1:53:45
	Nova Reel	1:55:12
	Q Developer autonomous agents	2:01:46
	Q Business with QuickSight	2:25:18
	Q Business APIs	2:27:34
	Q Business Automate	2:31:47
	SageMaker Next Generation	2:39:04
	SageMaker Unified Studio	2:39:39
	SageMaker Lakehouse	2:42:02
Data Management
	Zero-ETL for Applications	2:41:16

Announcement list

Content based on category with timestamps:

Resources: