mirror of https://github.com/QuivrHQ/quivr.git synced 2024-12-12 14:13:22 +03:00

Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, that you can share with users ! Local & Private alternative

ai api chatbot chatgpt database docker frontend html javascript llm openai postgresql privacy rag react rest-api security starred-quivrhq-repo starred-repo typescript vector ycombinator

Go to file

Jacopo Chevallard 285fe5b960 feat: websearch, tool use, user intent, dynamic retrieval, multiple questions (#3424 ) # Description This PR includes far too many new features: - detection of user intent (closes CORE-211) - treating multiple questions in parallel (closes CORE-212) - using the chat history when answering a question (closes CORE-213) - filtering of retrieved chunks by relevance threshold (closes CORE-217) - dynamic retrieval of chunks (closes CORE-218) - enabling web search via Tavily (closes CORE-220) - enabling agent / assistant to activate tools when relevant to complete the user task (closes CORE-224) Also closes CORE-205 ## Checklist before requesting a review Please delete options that are not relevant. - [ ] My code follows the style guidelines of this project - [ ] I have performed a self-review of my code - [ ] I have commented hard-to-understand areas - [ ] I have ideally added tests that prove my fix is effective or that my feature works - [ ] New and existing unit tests pass locally with my changes - [ ] Any dependent changes have been merged ## Screenshots (if appropriate): --------- Co-authored-by: Stan Girard <stan@quivr.app>		2024-10-31 17:57:54 +01:00
.github	feat: websearch, tool use, user intent, dynamic retrieval, multiple questions (#3424 )	2024-10-31 17:57:54 +01:00
.vscode	feat: introducing configurable retrieval workflows (#3227 )	2024-09-23 09:11:06 -07:00
core	feat: websearch, tool use, user intent, dynamic retrieval, multiple questions (#3424 )	2024-10-31 17:57:54 +01:00
docs	chore: docs on quivr-core workflows (#3420 )	2024-10-23 11:12:23 +02:00
examples	feat: websearch, tool use, user intent, dynamic retrieval, multiple questions (#3424 )	2024-10-31 17:57:54 +01:00
.gitignore	docs(fix): fixed warnings from griffe (#3381 )	2024-10-16 02:48:33 -07:00
.pre-commit-config.yaml	feat: introducing configurable retrieval workflows (#3227 )	2024-09-23 09:11:06 -07:00
.python-version	feat(quivr-core): beginning (#3388 )	2024-10-21 00:50:31 -07:00
.readthedocs.yaml	chore(docs): incorrect path in .readthedocs.yaml (#3401 )	2024-10-21 01:15:18 -07:00
.release-please-manifest.json	chore(main): release core 0.0.22 (#3410 )	2024-10-21 08:39:12 -07:00
CHANGELOG.md	chore(main): release 0.0.322 (#3352 )	2024-10-16 01:39:23 -07:00
LICENSE	Update license to include enterprise features (#2653 )	2024-06-10 09:42:14 -07:00
logo.png	[ImgBot] Optimize images (#2568 )	2024-05-09 07:16:31 -07:00
README.md	Update README.md	2024-10-24 00:31:03 -07:00
release-please-config.json	feat(changelog): now default method	2024-10-21 14:32:21 +02:00
requirements-dev.lock	feat(quivr-core): beginning (#3388 )	2024-10-21 00:50:31 -07:00
requirements.lock	feat(quivr-core): beginning (#3388 )	2024-10-21 00:50:31 -07:00
vercel.json	Revert "feat: 🎸 posthog (#1945 )"	2024-01-02 10:23:40 +01:00

README.md

Quivr - Your Second Brain, Empowered by Generative AI

Quivr, helps you build your second brain, utilizes the power of GenerativeAI to be your personal assistant !

Key Features 🎯

Opiniated RAG: We created a RAG that is opinionated, fast and efficient so you can focus on your product
LLMs: Quivr works with any LLM, you can use it with OpenAI, Anthropic, Mistral, Gemma, etc.
Any File: Quivr works with any file, you can use it with PDF, TXT, Markdown, etc and even add your own parsers.
Customize your RAG: Quivr allows you to customize your RAG, add internet search, add tools, etc.
Integrations with Megaparse: Quivr works with Megaparse, so you can ingest your files with Megaparse and use the RAG with Quivr.

We take care of the RAG so you can focus on your product. Simply install quivr-core and add it to your project. You can now ingest your files and ask questions.*

We will be improving the RAG and adding more features, stay tuned!

This is the core of Quivr, the brain of Quivr.com.

Getting Started 🚀

You can find everything on the documentation.

Prerequisites 📋

Ensure you have the following installed:

Python 3.10 or newer

30 seconds Installation 💽

Step 1: Install the package

pip install quivr-core # Check that the installation worked

Step 2: Create a RAG with 5 lines of code

import tempfile

from quivr_core import Brain

if __name__ == "__main__":
    with tempfile.NamedTemporaryFile(mode="w", suffix=".txt") as temp_file:
        temp_file.write("Gold is a liquid of blue-like colour.")
        temp_file.flush()

        brain = Brain.from_files(
            name="test_brain",
            file_paths=[temp_file.name],
        )

        answer = brain.ask(
            "what is gold? asnwer in french"
        )
        print("answer:", answer)

Configuration

Workflows

Basic RAG

Creating a basic RAG workflow like the one above is simple, here are the steps:

Add your API Keys to your environment variables

import os
os.environ["OPENAI_API_KEY"] = "myopenai_apikey"

Quivr supports APIs from Anthropic, OpenAI, and Mistral. It also supports local models using Ollama.

Create the YAML file basic_rag_workflow.yaml and copy the following content in it

workflow_config:
  name: "standard RAG"
  nodes:
    - name: "START"
      edges: ["filter_history"]

    - name: "filter_history"
      edges: ["rewrite"]

    - name: "rewrite"
      edges: ["retrieve"]

    - name: "retrieve"
      edges: ["generate_rag"]

    - name: "generate_rag" # the name of the last node, from which we want to stream the answer to the user
      edges: ["END"]

# Maximum number of previous conversation iterations
# to include in the context of the answer
max_history: 10

# Reranker configuration
reranker_config:
  # The reranker supplier to use
  supplier: "cohere"

  # The model to use for the reranker for the given supplier
  model: "rerank-multilingual-v3.0"

  # Number of chunks returned by the reranker
  top_n: 5

# Configuration for the LLM
llm_config:

  # maximum number of tokens passed to the LLM to generate the answer
  max_input_tokens: 4000

  # temperature for the LLM
  temperature: 0.7

Create a Brain with the default configuration

from quivr_core import Brain

brain = Brain.from_files(name = "my smart brain",
                        file_paths = ["./my_first_doc.pdf", "./my_second_doc.txt"],
                        )

Launch a Chat

brain.print_info()

from rich.console import Console
from rich.panel import Panel
from rich.prompt import Prompt
from quivr_core.config import RetrievalConfig

config_file_name = "./basic_rag_workflow.yaml"

retrieval_config = RetrievalConfig.from_yaml(config_file_name)

console = Console()
console.print(Panel.fit("Ask your brain !", style="bold magenta"))

while True:
    # Get user input
    question = Prompt.ask("[bold cyan]Question[/bold cyan]")

    # Check if user wants to exit
    if question.lower() == "exit":
        console.print(Panel("Goodbye!", style="bold yellow"))
        break

    answer = brain.ask(question, retrieval_config=retrieval_config)
    # Print the answer with typing effect
    console.print(f"[bold green]Quivr Assistant[/bold green]: {answer.answer}")

    console.print("-" * console.width)

brain.print_info()

You are now all set up to talk with your brain and test different retrieval strategies by simply changing the configuration file!

Go further

You can go further with Quivr by adding internet search, adding tools, etc. Check the documentation for more information.

Contributors ✨

Thanks go to these wonderful people:

Contribute 🤝

Did you get a pull request? Open it, and we'll review it as soon as possible. Check out our project board here to see what we're currently focused on, and feel free to bring your fresh ideas to the table!

Partners ❤️

This project would not be possible without the support of our partners. Thank you for your support!

License 📄

This project is licensed under the Apache 2.0 License - see the LICENSE file for details