LightRAG, a lightweight and efficient RAG

October 28, 2024

LightRAG is a new Retrieval-Augmented Generation method that generates faster and more contextually relevant answers than previous RAG models. It combines graph structures to connect related pieces of information during query processing with a dual-level retrieval mechanism that accesses both specific details and broader context.

Developed by researchers from the University of Hong Kong and Beijing University of Posts and Telecommunications, LightRAG is open-source and can be accessed here. See the how to use chapter below for a simple overview.

What is RAG?

Retrieval-Augmented Generation (RAG) is a method that increases the accuracy of LLMs by allowing them to access external data sources beyond their pre-trained knowledge.

RAG combines two main processes: retrieval and generation. When a user submits a query, it searches an external, extensive knowledge database to retrieve relevant documents or information. They are passed to the generation component to create a contextually appropriate response.

Despite their performance, traditional RAG systems often rely on flat data structures, where information is stored in isolated chunks, losing the contextual information and the complex relationships between entities, ideas, or events. This limitation causes the model give fragmented responses.

LightRAG overcomes these limitations by improving both the retrieval and generation phases in RAG, resulting in more detailed and better integrated responses.

Why should you use LightRAG?

LightRAG preserves the relationship between different pieces of information in your data, resulting in better answers. It is also faster and less computationally intensive.

The model introduces several innovative features over previous RAG models:

Graph-enhanced text indexing: It incorporates graph structures into text indexing that helps to create complex relationships between related entities, improving the system’s contextual understanding.
Dual-level retrieval system: LightRAG uses a dual-level retrieval system to handle both low-level (specific, detailed) and high-level (abstract, conceptual) queries. For example, it can address specific questions like “Who wrote Pride and Prejudice?” as well as abstract ones like “How does artificial intelligence influence modern education?”
Incremental update algorithm: The model uses an incremental update algorithm to incorporate the latest information without rebuilding its entire data index. Instead, it selectively indexes only the new or modified content. This approach, which reduces the computational costs, is particularly useful in dynamic environments such as news or real-time analytics, where data changes frequently.

Unlike resource-intensive models, LightRAG is lightweight and efficient, enabling it to process large-scale knowledge bases and generate text quickly, with less computational effort.

The model

The new framework, which is based on RAG, integrates graph structures into text indexing and a dual-level retrieval system, as can be seen in the next picture.

Overall architecture of the LightRAG system (source: paper)

The model is made of two parts: graph-based text indexing and dual-level retrieval. It divides the raw text documents into smaller, manageable parts for efficient retrieval. It further uses an LLM-powered profiling function, P(·), to identify and extract entities and relationships R(·) and builds a knowledge graph that organizes information and highlights connections across documents. For each entity node and relation edge, it generates a text key-value pair (K, V). Finally, the system eliminates duplicate information D(·). To retrieve relevant information, LightRAG generates (K, V) pairs at both detailed and abstract levels.

Detailed level: These are specific search keys focused on smaller, exact parts of documents, allowing precise information retrieval.
Abstract level: These are broader search keys that look at the overall meaning, enabling retrieval that considers the broader connections between different parts.

By using both levels, LightRAG can find relevant information within small document sections and understand the larger, interconnected ideas across documents.

Evaluation

LightRAG was evaluated and compared to similar RAG models, such as NaiveRAG, RQ-RAG, HyDE, and GraphRAG. The evaluation focused on retrieval accuracy, model ablation, response efficiency, and adaptability to new information. The results prove that LightRAG outperforms them, mainly for searching and retrieving information from large-scale databases and complex language contexts.

The following case study compares GraphRAG and LightRAG. GraphRAG is a tool developed by Microsoft that also uses graph-based knowledge to improve document retrieval and text generation. However, it requires more resources than LightRAG and therefore more expensive to run.

The two models were evaluated based on the following qualitative metrics: comprehensiveness, diversity, and empowerment in providing detailed information. LightRAG was the winner across all these dimensions, giving a more detailed answer.

Query	Which are the key metrics for evaluating movie recommendation systems?
GraphRAG	1. Precision […] 2. Recall […] 3.F1 Score […] 4. Mean Average Precision (MAP) […] 5. Root Mean Squared Error (RMSE) […] 6. User Satisfaction Metrics […]
LightRAG	1. Mean Average Precision at K (MAPK) […] 2. Precision and Recall […] 3. Root Mean Squared Error (RMSE) and Mean Squared Error (MSE) […] 4. Area Under the Curve (AUC) […] 5. F-Measure […] 6. User Engagement Metrics […]

Comparison between the baseline method GraphRAG and LightRAG (source: paper)

How to use LightRAG

LightRAG is open-source and you can set it up on your local machine by following these steps:

Install LightRAG directly from the source or via PyPI
Set up your environment. If you’re using OpenAI models, set your API key in the environment.
Prepare your data. Collect the data you want to use, such as text files, PDFs, or other formats. They could be stored locally on your computer or downloaded from external sources. You may need to convert some files into a usable text format.
Initialize LightRAG. Once your environment and data are ready, initialize LightRAG. You can configure LightRAG to work with different models, such as those from Hugging Face or Ollama.
Perform queries. You can now query your documents. If you have multiple texts, LightRAG allows you to insert them in batches or process them in chunks for better efficiency.

For more details, please check the GitHub repository.

Conclusion

LightRAG is an open-source model that builds on the typical RAG architecture but makes the process lighter and more efficient. The key improvements in LightRAG over previous RAG models include a graph-based approach, which enables it to handle complex interdependencies across documents and a dual-level retrieval process that works on both detailed and abstract levels.

These features allow LightRAG to retrieve and process information faster and more precise, contextually-aware than traditional RAG models, which are based on single-level and less structured retrieval methods.