Skip to content

Commit

Permalink
docs: adds initial tutorial files and setup Azure Functions environment
Browse files Browse the repository at this point in the history
  • Loading branch information
glaucia86 committed Apr 9, 2024
1 parent c9c39d0 commit 308effe
Show file tree
Hide file tree
Showing 3 changed files with 119 additions and 0 deletions.
116 changes: 116 additions & 0 deletions docs/tutorial/01-introduction.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,116 @@
# Tutorial - Create a Serverless ChatGPT with RAG using LangChain.js and TypeScript

Welcome to the tutorial _Create a Serverless ChatGPT with RAG using LangChain.js and TypeScript_.

This tutorial will guide you through creating a serverless a ChatGPT and RAG (Retrieval-Augmented Generation) application using **[LangChain.js](https://js.langchain.com/docs/get_started/introduction)**, **[Azure Functions](https://learn.microsoft.com/azure/azure-functions/)**, **[Azure Cosmos DB](https://learn.microsoft.com/azure/cosmos-db/)**, **[Azure Blob Storage](https://learn.microsoft.com/azure/storage/blobs/)**, and **[Azure Static Web Apps](https://learn.microsoft.com/azure/static-web-apps/)**.

The chatbot will be able to answer questions based on a set of enterprise documents uploaded from a fictional company called _Contoso Real Estate_.

The final result of this application can be seen in the following picture:

![ChatGPT with RAG](../../docs/images/demo.gif)

Our goal is to provide you with a hands-on experience building a serverless application using Azure Services and LangChain.js. We will guide you explaining each step of the process, from setting up the environment to deploying the application.

The FrontEnd side of the application will already be provided, so you can focus on the Backend side of the application.

## Prerequisites

You can follow this tutorial in two ways: locally or using Codespaces.

### Locally

If you choose to use a local environment, you will need to install:

- [Node.js](https://nodejs.org/en/download/)
- [TypeScript](https://www.typescriptlang.org/download)
- [Visual Studio Code](https://code.visualstudio.com/download)
- [Azure Functions Core Tools](https://docs.microsoft.com/en-us/azure/azure-functions/functions-run-local?tabs=windows%2Ccsharp%2Cbash)
- [Git](https://git-scm.com/downloads)
- [Azure Developer CLI](https://docs.microsoft.com/en-us/cli/azure/install-azure-cli)

### Codespaces

If you decide to continue using **[Codespaces](https://github.com/features/codespaces)**, you can follow the steps described in the `README.md` file at the root of the project.

> **Note:** If you are using Codespaces, you don't need to install any of the prerequisites mentioned above. Codespaces already has all the necessary tools installed. Codespaces can be used for free for up to 60 hours per month, and this is renewed every month.
## Overview do Projeto

Building AI applications can be complex and time-consuming, but using LangChain.js and Azure serverless technologies allows to greatly simplify the process. This application is a chatbot that uses a set of enterprise documents to generate responses to user queries.

We provide sample data to make this sample ready to try, but feel free to replace it with your own. We use a fictitious company called _Contoso Real Estate_, and the experience allows its customers to ask support questions about the usage of its products. The sample data includes a set of documents that describes its terms of service, privacy policy and a support guide.

## Understanding the project architecture

Understanding the development process is extremely important. The project's architecture should be clear and well-defined. The architecture of the project is shown in the following diagram:

![ChatGPT with RAG](../../docs/images/architecture.drawio.png)

To understand the architecture of the project, let's break it down into its individual components:

1. **Web App:**

- The user interface for the chatbot is a web application built with **[Lit](https://lit.dev/)** (a library for building web components) and hosted on Azure Static Web Apps. It presents a chat interface for users to interact with.
- The code for this component is located in the `packages/webapp` folder.

2. **Serverless API:**

- When a user submits a query through the web app, it is sent via HTTP to an API buitl with Azure Functions.
- This API uses LangChain.js to process the query.
- The API handles the logic of ingesting enterprise documents and generating responses to the chat queries.
- The code for this component we will create later in this tutorial. Which will be located in the `packages/api` folder.

3. **Database:**

- Text extracted from the documents and the vectors generated by LangChain.js are stored in Azure Cosmos DB for MongoDB vCore
- The database allows for the storage and retrieval of text chunks using vector search, which enables quick and relevant responses based on the user's queries.

4. **File Storage:**

- The source documents such as terms of service, privacy policy and support guides for the fictional company Contoso Real Estate are stored in Azure Blob Storage. This is where the PDF documents are uploaded and from where they can be retrieved.

5. **Azure OpenAI Service:**
- This service is where the AI Model, capable of understanding and generating natural language is hosted. This is used to embed text chunks or generate answers based on the vector search from the database.

Now let's examine the application flow based on the application architecture image:

- A user interacts with the chat interface in the web app
- The web app sends the user's query to the Serverless API via HTTP calls
- The Serverless API interacts with Azure OpenAI Service to generate a response, using the data from Azure Cosmos DB.
- If there's a need to reference the original documents, Azure Blob Storage is used to retrieve the PDF documents.
- The generated response is then sent back to the web app and displayed to the user.

The architecture is based on the RAG (Retrieval-Augmented Generation) standard. This technique combines the ability to retrieve information from a database with the ability to generate text from a language model. We will explain this pattern further throughout the tutorial.

## Executing the Project

Now that you understand the project's architecture, let's run it!

Once you have `forked` and `cloned` the project, use the `starter` branch to continue with the tutorial. The `main` branch contains the finished project if you wish to view it!

To execute the project, follow these steps:

1. Install the project dependencies:

```bash
npm install
```

2. To run the project, with only FrontEnd, execute the following command:

```bash
start:webapp
```

> At this point, don't worry about the other scripts in the `package.json` file at the root of the project. They will be used throughout the tutorial.
1. Open your browser and go to `http://localhost:8000`. The application will be displayed, as shown in the image below:

![FrontEnd application](./images/application-webapp.png)

## Next Steps

In the next section, we will start to create the API using Azure Functions. See you there!

**[Next Step: Setting Up the Serverless Environment using Azure Functions](./02-setting-up-azure-functions.md)**
3 changes: 3 additions & 0 deletions docs/tutorial/02-setting-up-azure-functions.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# Setting Up the Serverless Environment using Azure Functions

**todo**
Binary file added docs/tutorial/images/application-webapp.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit 308effe

Please sign in to comment.