RAG App on AWS FAQs

Question 1

What are the key features?

Accepted Answer

Key features include Infrastructure as Code (IaC) with Terraform, Google's Gemini Pro and Embedding model integration, a Streamlit UI with token-based authentication, AWS Lambda for serverless compute, and PostgreSQL RDS with pgvector for vector storage.

Question 2

What is the estimated cost to run this?

Accepted Answer

The estimated cost is around $3/month (~₹250) without the AWS Free Tier, mainly for RDS and NAT Gateway (if active). Costs may vary based on usage.

Question 3

Is this application secure?

Accepted Answer

Yes, it uses secure user management with AWS Cognito, and network architecture ensures sensitive operations are processed in a secure environment. Secure communication is done via API Gateway.

Question 4

What is RAG App on AWS?

Accepted Answer

It's a tool that deploys a complete AWS backend infrastructure for Retrieval-Augmented Generation (RAG) applications. It integrates Google's Gemini Pro and Embedding models with a Streamlit UI for document querying.

Question 5

What AWS services are used?

Accepted Answer

The application leverages AWS Lambda, API Gateway, S3, RDS (PostgreSQL with pgvector), Cognito, and DynamoDB. It uses Terraform for infrastructure provisioning.

RAG App on AWS

About

Key Features

Use Cases