AWS Updates AI Offerings with Amazon Nova Premier, Llama 4, Anonymous User Q Business Chatbots

Amazon Web Services (AWS) has made a number of AI moves to maintain its position alongside fellow cloud giants Microsoft and Google. New developments include the general availability of Amazon Nova Premier, the company's self-described most capable multimodal foundation model for complex tasks; the fully managed availability in Amazon Bedrock of the first models in the new Llama 4 herd, Llama 4 Scout 17B and Llama 4 Maverick 17B; and anonymous user access for Q Business.

"Customers can now create anonymous Q Business applications to power use cases such as public web site Q&A, documentation portals, and customer self-service experiences, where user authentication is not required and content is publicly available," the company said of the last item in an April 30 post.

[Figure: Some Limitations (source: AWS).]

Amazon Q Business is a generative AI-powered assistant offered as part of AWS's enterprise cloud services. It's designed to help users get fast, secure answers to work-related questions by interacting with company data.

Key features include:

  • Enterprise Search: Connects to internal data sources like Confluence, Salesforce, S3, SharePoint, and more to retrieve relevant answers.
  • Natural Language Interface: Users can ask questions in plain language and receive accurate, contextual responses.
  • Customization: Organizations can tailor the assistant with custom plugins, APIs, and business logic.
  • Security and Privacy: Built on AWS's identity and access control systems, ensuring responses respect data permissions.

The anonymous chat APIs and web experience are available in the US East (N. Virginia), US West (Oregon), Europe (Ireland), and Asia Pacific (Sydney) AWS Regions. For more guidance, the company points to its "Creating an Amazon Q Business application environment for anonymous access" documentation and its "Build public-facing generative AI applications using Amazon Q Business for anonymous users" post.

Amazon Nova Premier

As noted, AWS claims this is its most capable model for complex tasks such as processing long documents, videos, and large codebases, and executing multistep agentic workflows. The company said it's also its most capable teacher model and can be used with Amazon Bedrock Model Distillation to create custom distilled models for specific needs. This refers to knowledge distillation, where a large, powerful model (the teacher) is used to train a smaller, more efficient model (the student).
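
To make the teacher/student idea concrete, here is a minimal sketch of one classic form of knowledge distillation, in which the student is penalized for diverging from the teacher's temperature-softened output distribution. This is a generic textbook illustration, not a description of how Amazon Bedrock Model Distillation works internally; the function names, the logit values, and the temperature of 2.0 are all assumptions made for the example.

```python
import math

def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a higher temperature flattens them."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2."""
    p = softmax(teacher_logits, temperature)  # teacher's soft targets
    q = softmax(student_logits, temperature)  # student's predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2

# Hypothetical logits over three classes (illustrative values only).
teacher = [4.0, 1.0, 0.5]
close_student = [3.5, 1.2, 0.4]  # roughly agrees with the teacher
far_student = [0.5, 1.0, 4.0]    # disagrees with the teacher

print(distillation_loss(teacher, close_student))
print(distillation_loss(teacher, far_student))
```

A student whose outputs track the teacher's gets a loss near zero, while one that disagrees gets a larger loss. In a real training loop, that loss would be minimized by gradient descent over the student's parameters.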

The company said Nova Premier extends the capabilities available from its Amazon Nova understanding models with several key improvements that include:

  • Superior intelligence: The model scores 87.4% on the Massive Multitask Language Understanding (MMLU) benchmark for undergraduate-level knowledge, 82.0% on Math500 for mathematical problems, and 84.6% on the CharXiv benchmark for chart understanding.
  • Improved agentic capabilities: Nova Premier can perform end-to-end actions on behalf of the user, enabling more complex workflows such as Retrieval-Augmented Generation (RAG), function calling, and agentic coding. The model scores 86.3% on SimpleQA with RAG, 63.7% on the Berkeley Function Calling Leaderboard (BFCL), and 42.4% on SWE-bench Verified for software engineering tasks.
  • Longer context: The model offers a context window of one million tokens. This enables analysis of bigger data sets like large codebases, multiple documents and images, documents longer than 400 pages, or 90-minute-long videos.

Nova Premier is available in Amazon Bedrock in US East (N. Virginia), US East (Ohio), and US West (Oregon) through cross-Region inference.

Meta's Llama 4 in Amazon Bedrock

Meta's Llama 4 models, Llama 4 Scout 17B and Llama 4 Maverick 17B, are now available as fully managed, serverless models in Amazon Bedrock, the company said. These advanced multimodal models are designed to handle both text and image inputs, offering enhanced performance and scalability for enterprise applications.

Key features include:

  • Multimodal Capabilities: Both models support native multimodal processing, allowing for seamless integration of text and image data.
  • Mixture-of-Experts (MoE) Architecture: Utilizes MoE to optimize performance and efficiency, activating only relevant subsets of the model for specific tasks.
  • Extended Context Windows:
    • Llama 4 Scout 17B: Supports up to 10 million tokens, facilitating complex tasks like multi-document summarization and extensive codebase analysis.
    • Llama 4 Maverick 17B: Offers a 1 million token context window, suitable for detailed image and text understanding.
  • Language Support: Handles text in 12 languages, including English, French, German, Hindi, Italian, Portuguese, Spanish, Thai, Arabic, Indonesian, Tagalog, and Vietnamese.
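
The Mixture-of-Experts point in the list above, activating only a relevant subset of the model, can be illustrated with a toy top-k router. This is a simplified sketch of the general technique, not Meta's implementation. The four "experts" are stand-in functions, and the router scores and the choice of k=2 are assumptions for illustration.

```python
import math

def top_k_gate(router_scores, k=2):
    """Select the k highest-scoring experts and softmax-normalize their scores."""
    ranked = sorted(range(len(router_scores)),
                    key=lambda i: router_scores[i], reverse=True)
    chosen = ranked[:k]
    peak = max(router_scores[i] for i in chosen)  # for numerical stability
    exps = {i: math.exp(router_scores[i] - peak) for i in chosen}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}

def moe_forward(x, experts, router_scores, k=2):
    """Run only the selected experts and mix their outputs by gate weight."""
    weights = top_k_gate(router_scores, k)
    return sum(w * experts[i](x) for i, w in weights.items())

# Hypothetical experts: simple functions standing in for sub-networks.
experts = [lambda x: x + 1, lambda x: x * 2, lambda x: x - 3, lambda x: x ** 2]
# Hypothetical router scores for one input token.
scores = [0.1, 2.0, -1.0, 2.0]

print(top_k_gate(scores, k=2))
print(moe_forward(3.0, experts, scores, k=2))
```

Only two of the four experts run for this input, which is the source of the efficiency gain: total parameter count can grow while per-token compute stays roughly tied to the number of active experts.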

Meta's Llama 4 models are available in Amazon Bedrock in the US East (N. Virginia) and US West (Oregon) AWS Regions. Users can also access Llama 4 in US East (Ohio) via cross-Region inference.
