๐ Gemma Model Document Q&A - Streamlit App
๐ Overview
This project is a Streamlit-based document Q&A system that uses LangChain, FAISS, and Google Generative AI Embeddings to provide accurate answers based on uploaded documents. It leverages vector embeddings for retrieval-augmented generation (RAG) and supports GROQ Llama 3 model for natural language responses.
๐ Key Features
๐ Document Ingestion & Processing:
Uploads PDFs and processes text using PyPDFDirectoryLoader.
Splits documents into chunks using RecursiveCharacterTextSplitter.
๐ Embedding & Retrieval:
Uses Google Generative AI Embeddings to create high-quality vector embeddings.
Stores and retrieves document vectors using FAISS.
๐ค Question Answering:
Queries documents through LangChainโs retrieval chain.
Uses GROQ-powered Llama 3 model for precise responses.
๐ Future Enhancements
โ Integrate OpenAI and Anthropic models for comparison.
โ Add database storage (PostgreSQL) for persistent document retrieval.
โ Improve UI with better document visualization tools.