Johnsnowlabs

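Use the open-source johnsnowlabs library to access the John Snow Labs ecosystem of enterprise NLP libraries, which includes more than 21,000 enterprise NLP models in over 200 languages. For all 24,000+ models, see the John Snow Labs Models Hub.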

Installation and Setup

```bash
pip install johnsnowlabs
```

To [install enterprise features](https://nlp.johnsnowlabs.com/docs/en/jsl/install_licensed_quick), run:

```python
from johnsnowlabs import nlp

# For more details see https://nlp.johnsnowlabs.com/docs/en/jsl/install_licensed_quick
nlp.install()
```

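You can embed your queries and documents with binaries optimized for gpu, cpu, apple_silicon, or aarch. The cpu binaries are used by default. Once a session has started, you must restart your notebook to switch between GPU and CPU; otherwise the change will not take effect.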
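Because the hardware target is fixed for the lifetime of a session, it can help to decide on it once, before any embedding object is created. The following is a minimal sketch of one way to do that. `pick_hardware_target` and its `has_gpu` parameter are hypothetical names that are not part of johnsnowlabs or LangChain, and the platform checks are a heuristic assumption rather than the library's own detection logic.

```python
import platform

# Hardware target strings used by the examples on this page.
HARDWARE_TARGETS = ("cpu", "gpu", "apple_silicon", "aarch")


def pick_hardware_target(has_gpu: bool = False) -> str:
    """Hypothetical helper: guess a hardware target for the current machine."""
    system = platform.system()
    machine = platform.machine().lower()

    # Apple Silicon Macs report Darwin / arm64.
    if system == "Darwin" and machine == "arm64":
        return "apple_silicon"
    # Other 64-bit ARM machines (assumption: these map to "aarch").
    if machine in ("aarch64", "arm64"):
        return "aarch"
    # GPU availability is not detected here; the caller states it explicitly.
    return "gpu" if has_gpu else "cpu"


target = pick_hardware_target()
print(target)
```

The returned string could then be passed as the second argument to `JohnSnowLabsEmbeddings`, as in the examples on this page.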

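Embed a query with CPU:

```python
from langchain_community.embeddings.johnsnowlabs import JohnSnowLabsEmbeddings

document = "foo bar"
embedding = JohnSnowLabsEmbeddings('embed_sentence.bert')
output = embedding.embed_query(document)
```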

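Embed a query with GPU:

```python
from langchain_community.embeddings.johnsnowlabs import JohnSnowLabsEmbeddings

document = "foo bar"
embedding = JohnSnowLabsEmbeddings('embed_sentence.bert', 'gpu')
output = embedding.embed_query(document)
```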

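Embed a query with Apple Silicon (M1, M2, etc.):

```python
from langchain_community.embeddings.johnsnowlabs import JohnSnowLabsEmbeddings

document = "foo bar"  # embed_query takes a single string
embedding = JohnSnowLabsEmbeddings('embed_sentence.bert', 'apple_silicon')
output = embedding.embed_query(document)
```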

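Embed a query with AARCH:

```python
from langchain_community.embeddings.johnsnowlabs import JohnSnowLabsEmbeddings

document = "foo bar"  # embed_query takes a single string
embedding = JohnSnowLabsEmbeddings('embed_sentence.bert', 'aarch')
output = embedding.embed_query(document)
```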

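Embed documents with CPU:

```python
from langchain_community.embeddings.johnsnowlabs import JohnSnowLabsEmbeddings

documents = ["foo bar", "bar foo"]
embedding = JohnSnowLabsEmbeddings('embed_sentence.bert', 'cpu')
output = embedding.embed_documents(documents)
```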

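Embed documents with GPU:

```python
from langchain_community.embeddings.johnsnowlabs import JohnSnowLabsEmbeddings

documents = ["foo bar", "bar foo"]
embedding = JohnSnowLabsEmbeddings('embed_sentence.bert', 'gpu')
output = embedding.embed_documents(documents)
```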

Embed documents with Apple Silicon (M1, M2, etc.):

```python
from langchain_community.embeddings.johnsnowlabs import JohnSnowLabsEmbeddings

documents = ["foo bar", "bar foo"]
embedding = JohnSnowLabsEmbeddings('embed_sentence.bert', 'apple_silicon')
output = embedding.embed_documents(documents)
```

Embed documents with AARCH:

```python
from langchain_community.embeddings.johnsnowlabs import JohnSnowLabsEmbeddings

documents = ["foo bar", "bar foo"]
embedding = JohnSnowLabsEmbeddings('embed_sentence.bert', 'aarch')
output = embedding.embed_documents(documents)
```

Models are loaded with nlp.load, and the Spark session is started with nlp.start() under the hood.
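`embed_query` returns one vector as a list of floats, and `embed_documents` returns one such vector per document, so the two outputs can be compared directly. The sketch below shows one common way to do this, ranking documents by cosine similarity to the query. The helper names `cosine_similarity` and `rank_documents` are illustrative assumptions, and the toy vectors stand in for real embedding output so that the snippet runs without a Spark session.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_a * norm_b)


def rank_documents(
    query_vec: Sequence[float], doc_vecs: Sequence[Sequence[float]]
) -> List[int]:
    """Return document indices, most similar to the query first."""
    scores = [cosine_similarity(query_vec, d) for d in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)


# Toy vectors standing in for embed_query / embed_documents output.
query_vec = [1.0, 0.0]
doc_vecs = [[0.0, 1.0], [2.0, 0.0], [1.0, 1.0]]
ranking = rank_documents(query_vec, doc_vecs)
print(ranking)  # [1, 2, 0]
```

With real embeddings, `query_vec` would be the result of `embedding.embed_query(...)` and `doc_vecs` the result of `embedding.embed_documents(...)`.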
