This demo demonstrates intial AVS behavior. This version scrapes webpages relating to the topic: OUTER SPACE.
The Lex bot is currently connected to a knowledge base that uses the Titan Embeddings model to pull data from an s3 bucket populated with wikipedia pages (pdfs) about topics relating to OUTER SPACE. New data can easily be added to this bucket in pretty much any file format. The bot then uses the new Bedrock gen AI features and the Anthropic Claude v2 model to search the that knowledge base for answers to the user’s questions.
Ask any question relating to the topic: OUTER SPACE
Wikipedia pages (pdfs) about topics relating to the topic OUTER SPACE.
Stores data sources. These can be files of any type - pdf, csv, etc. Any data to be fed into AVS will live here.
AWS Bedrock is used to build a functional knowledge base that accesses the data in the S3 bucket.
The Titan Embeddings G1 - Textv1.2 model is used to retrieve text from the documents in the S3 bucket segmenting documents into logical segments (paragraphs & sections). This model can also perform semantic similarity and clustering procedures. It can process up to 8k tokens, generating a 1536-dimensional vector.
The embeddings model relies on Amazon OpenSearch as a Serverless Vector Engine to process the vector it generates.
Ask a Virtual Scientist is ultimately powered by an AWS Lex V2 bot that uses a generative AI intent.