DEMO Large Language Models RAG

This repository contains a Jupyter notebook with a working retrieval-augmented generation (RAG) demo.

Setup

You need to set up an OpenSearch database locally. I prefer using Docker.
You need an OpenAI API key with some credits on it.
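
A minimal sketch of how the two prerequisites could be collected before running the notebook. The function name `load_settings` and the variables `OPENSEARCH_HOST` and `OPENSEARCH_PORT` are hypothetical and not taken from the notebook. `OPENAI_API_KEY` is the variable the official OpenAI Python client reads by default, and 9200 is the default OpenSearch HTTP port.

```python
import os


def load_settings(env=None):
    """Collect connection settings from environment variables.

    OPENSEARCH_HOST and OPENSEARCH_PORT are assumed names for this sketch;
    the notebook may configure the connection differently.
    """
    env = os.environ if env is None else env
    api_key = env.get("OPENAI_API_KEY")
    if not api_key:
        # Fail early instead of at the first embedding call.
        raise RuntimeError("Set OPENAI_API_KEY before running the notebook")
    return {
        "opensearch_host": env.get("OPENSEARCH_HOST", "localhost"),
        "opensearch_port": int(env.get("OPENSEARCH_PORT", "9200")),
        "openai_api_key": api_key,
    }
```

Passing a plain dict instead of `os.environ` makes it easy to try the function without touching your real environment.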

In this demo

Setup
Connect to OpenSearch
Connect to OpenAI
Create an index in the OpenSearch database
Extract text from the PDF
Chunk the text
Generate embeddings (from the chunks)
Index the chunks
Experiment with kNN results
Write a query
Get the top 5 results (Retrieval step)
Build the context for the LLM (Augmentation step)
Prompt the LLM (Generation step)
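
The steps above can be sketched with a few small helpers. This is an illustration under stated assumptions, not the notebook's actual code: the field names `text` and `embedding`, the chunk sizes, and the 1536-dimensional vector (the size of OpenAI's `text-embedding-3-small`) are all assumptions. The helpers only build the request bodies and the prompt; the calls to OpenSearch and OpenAI are left out so the sketch runs without either service.

```python
import re

# Assumption: 1536 is the output size of OpenAI's text-embedding-3-small.
EMBEDDING_DIM = 1536


def chunk_text(text, chunk_size=500, overlap=50):
    """Split text into overlapping character windows (chunking step)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    text = re.sub(r"\s+", " ", text).strip()
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last window already reaches the end of the text
        start += chunk_size - overlap
    return chunks


def knn_index_body(dim=EMBEDDING_DIM):
    """Index definition with a k-NN vector field (index creation step).

    The field names "text" and "embedding" are assumed for this sketch.
    """
    return {
        "settings": {"index": {"knn": True}},
        "mappings": {
            "properties": {
                "text": {"type": "text"},
                "embedding": {"type": "knn_vector", "dimension": dim},
            }
        },
    }


def knn_query(query_vector, k=5):
    """Search body that asks for the k nearest chunks (retrieval step)."""
    return {
        "size": k,
        "query": {"knn": {"embedding": {"vector": query_vector, "k": k}}},
    }


def build_prompt(question, hits):
    """Join the retrieved chunks into a numbered context (augmentation step).

    `hits` is expected in the shape of response["hits"]["hits"].
    """
    context = "\n\n".join(
        f"[{i}] {hit['_source']['text']}" for i, hit in enumerate(hits, 1)
    )
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
```

In a full run, the index body and the search body would go to the OpenSearch client, the chunks and the query would be embedded with the OpenAI embeddings endpoint, and the prompt would be sent to a chat model for the generation step.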

Future work

The following list is what comes to mind:

Add more than one document
Add and use metadata: which source does your information come from?
Refactor the code into Python functions
Build a pipeline

About

A walk through the RAG steps, using OpenSearch as a vector database: extract and embed text from a PDF, retrieve with kNN, and generate with OpenAI.
