AwaDB
AwaDB 是一个面向大语言模型应用的 AI 原生数据库,用于嵌入向量的搜索与存储。
您需要使用 langchain-community 安装 pip install -qU langchain-community 才能使用此集成
这个笔记本展示了如何使用与AwaDB相关的功能。
%pip install --upgrade --quiet awadb
from langchain_community.document_loaders import TextLoader
from langchain_community.vectorstores import AwaDB
from langchain_text_splitters import CharacterTextSplitter
loader = TextLoader("../../how_to/state_of_the_union.txt")
documents = loader.load()
text_splitter = CharacterTextSplitter(chunk_size=100, chunk_overlap=0)
docs = text_splitter.split_documents(documents)
db = AwaDB.from_documents(docs)
query = "What did the president say about Ketanji Brown Jackson"
docs = db.similarity_search(query)
print(docs[0].page_content)
And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.
相似性搜索(带分数)
The returned distance score is between 0-1. 0 is dissimilar, 1 is the most similar
docs = db.similarity_search_with_score(query)
print(docs[0])
(Document(page_content='And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.', metadata={'source': '../../how_to/state_of_the_union.txt'}), 0.561813814013747)
还原创建并添加的数据之前
AwaDB自动持久化添加的文档数据。
如果可以恢复之前创建并添加的表格,您可以这样做:
import awadb
awadb_client = awadb.Client()
ret = awadb_client.Load("langchain_awadb")
if ret:
print("awadb load table success")
else:
print("awadb load table failed")
awadb load table success