How to use AWS Bedrock to access foundational AI models

3 min readNov 16, 2024

AWS Bedrock is a fully managed service from AWS that provides a unified access point for popular foundational models from leading AI companies. These models include Llama, Titan, Claude, Stable Diffusion, Mistral, Command, and Jamba 1.5. Another big plus is that they offer on demand pricing which helps make access to the models affordable without managing infrastructure.

To get started go to the AWS console and select AWS Bedrock then select the model you would wish to enable.

Select a foundational model e.g Llama then click on Request model access

Click on the modify model access button to select the models to enable.

Click on the checkboxes for the models you would wish to have access the and click on next.

Review the models you've enabled and click on submit to accept the EULA.

To access the models programmatically using Python install the AWS Python SDK using the following command

pip install boto3

Ensure you have obtained the credentials to access the AWS SDK programmatically which includes the Access Secret Key and Access Secret ID. In addition, obtain the model ID of the model you wish to prompt that is used in the initialization of AWS Bedrock here. You're all set to go now you can run the code example below.

import json
import boto3
from prompt_toolkit import prompt

# Set up AWS credentials explicitly
aws_access_key_id = 'AWS_ACCESS_KEY_ID' # set access key id
aws_secret_access_key = 'AWS_SECRET_ACCESS_KEY' # set acces key secret 
aws_region = 'us-east-1'  # Adjust to your AWS region


# Create Bedrock client
bedrock = boto3.client(
    'bedrock-runtime',
    region_name=aws_region,
    aws_access_key_id=aws_access_key_id,
    aws_secret_access_key=aws_secret_access_key
)


try:
   
    prompt = ('What is the capital city of America?')
    # Model invocation
    response = bedrock.invoke_model(
        modelId='MODEL_ID',  # Replace with the correct model ID
        body=json.dumps({
            "prompt": prompt,

                "max_gen_len":2048, # Specify the maximum number of tokens to use in the generated response
                "temperature": 0.7, #  Use a lower value to decrease randomness in the response. MAX VALUE IS 1.0
                "top_p": 0.5 #  Use a lower value to ignore less probable options. Set to 0 or 1.0 to disable.

        }),
        accept='application/json',
        contentType='application/json'
    )

    # Read and print the response
    response_body = response['body'].read().decode('utf-8')

    print(response_body)
    parsed = json.loads(response_body)

    print("Response from the model:", parsed["generation"])

except Exception as e:
    print("An error occurred:", e)

For more in-depth information refer to the official documentation here. Happy Coding!

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Written by Isaac Sichangi

9 Followers

29 Following

Product Design | Software Development

No responses yet

Write a response

What are your thoughts?

Also publish to my profile

More from Isaac Sichangi

Isaac Sichangi

HOW TO USE THE OPEN AI API IN VANILLA JAVASCRIPT

Open AI has gained popularity in the last year with the launch of Chat GPT which has made the advanced capabilities of AI accessible to…

May 23, 2023

Isaac Sichangi

DUKAPOS REDEFINING FINTECH IN KENYA

Walking along the streets of Nairobi one is fascinated by the number of micro, small and medium enterprises that line up the road…

Jul 12, 2020

Isaac Sichangi

THE IMPORTANCE OF THE FUNDAMENTALS IN SOFTWARE DEVELOPMENT

2024 is finally here with us and as the year kicks off, I’ve noted a lot of content being released on new tech trends to expect for the…

Jan 14, 2024

Isaac Sichangi

THEORETICAL INTRODUCTION TO DEVOPS

What is DevOps? is a question asked frequently by individuals who hear the term for the first time. There is no real concrete definition of…

Apr 12, 2022

See all from Isaac Sichangi

Recommended from Medium

Building a RAG-Based Conversational AI System on AWS

The Slalom Blog

Amarpreet Singh

Building a RAG-Based Conversational AI System on AWS

Learn how to create a RAG-based AI system with AWS Bedrock agents, blending generative AI, real-time data retrieval, and domain knowledge

Jan 20

Building an Intelligent Customer Service Agent with Amazon Bedrock and a Knowledge Base

Amit Rai

Building an Intelligent Customer Service Agent with Amazon Bedrock and a Knowledge Base

Simplify customer support with AI and a robust knowledge base

Nov 22, 2024

Lists

Business

41 stories183 saves

Natural Language Processing

1977 stories1620 saves

Medium's Huge List of Publications Accepting Submissions

414 stories4678 saves

Staff picks

827 stories1648 saves

Why AWS Bedrock RAG’s Serverless Model Isn’t Truly Pay-Per-Us

MaruAI

Why AWS Bedrock RAG’s Serverless Model Isn’t Truly Pay-Per-Us

This blog is just a rant, born out of frustration after discovering an unexpectedly high bill for AWS Bedrock Knowledge Base (KB) and…

Jan 7

Fine-Tuning Models with Amazon Bedrock: A Step-by-Step Guide

Irina (Xinli) Yu, Ph.D.

Fine-Tuning Models with Amazon Bedrock: A Step-by-Step Guide

Introduction

Oct 30, 2024

Customizing Foundation Models on Amazon Bedrock: A Complete Guide

Sampathkumarbasa

Customizing Foundation Models on Amazon Bedrock: A Complete Guide

Foundation models, with their broad knowledge base and ability to handle various tasks, are key drivers of many AI applications today…

Oct 22, 2024

Meenakshisundaram Thandavarayan

Amazon Bedrock: Inference Options

Gen AI applications path to production and ROI (return in investment) is largely dependent on LLM inference challenges in cost, capacity…

Dec 17, 2024

See more recommendations

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams