Skip to main content
Open In ColabOpen on GitHub

AwaDB

AwaDB 是一个面向大语言模型应用的 AI 原生数据库,用于嵌入向量的搜索与存储。

您需要使用 langchain-community 安装 pip install -qU langchain-community 才能使用此集成

这个笔记本展示了如何使用与AwaDB相关的功能。

%pip install --upgrade --quiet  awadb
from langchain_community.document_loaders import TextLoader
from langchain_community.vectorstores import AwaDB
from langchain_text_splitters import CharacterTextSplitter
loader = TextLoader("../../how_to/state_of_the_union.txt")
documents = loader.load()
text_splitter = CharacterTextSplitter(chunk_size=100, chunk_overlap=0)
docs = text_splitter.split_documents(documents)
db = AwaDB.from_documents(docs)
query = "What did the president say about Ketanji Brown Jackson"
docs = db.similarity_search(query)
print(docs[0].page_content)
And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.

相似性搜索(带分数)

The returned distance score is between 0-1. 0 is dissimilar, 1 is the most similar

docs = db.similarity_search_with_score(query)
print(docs[0])
(Document(page_content='And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.', metadata={'source': '../../how_to/state_of_the_union.txt'}), 0.561813814013747)

还原创建并添加的数据之前

AwaDB自动持久化添加的文档数据。

如果可以恢复之前创建并添加的表格,您可以这样做:

import awadb

awadb_client = awadb.Client()
ret = awadb_client.Load("langchain_awadb")
if ret:
print("awadb load table success")
else:
print("awadb load table failed")

awadb load table success