學習python庫：elasticsearch-py

時間 2021-08-14

標籤 node python 安全 less elasticsearch ide 函數 url spa 插件欄目 Python 简体版

原文原文鏈接

1、介紹

elasticsearch-py是一個官方提供的low-level的elasticsearch python客戶端庫。爲何說它是一個low-level的客戶端庫呢？由於它只是對elasticsearch的rest API接口作了一層簡單的封裝，所以提供了最大的靈活性，可是於此同時使用起來就不是太方便。相對於這個low-level的客戶端庫，官方還提供了一個high-level的python客戶端庫：elasticsearch-dsl，這個會在另外一篇文章中介紹。node

更多介紹參見官方文檔：https://elasticsearch-py.readthedocs.io/en/master/python

2、安裝

不一樣的elasticsearch版本要求不一樣的客戶端版本，因此安裝的時候須要根據你的elasticsearch來決定，下面是一個簡單的參考：安全

# Elasticsearch 6.x
elasticsearch>=6.0.0,<7.0.0
# Elasticsearch 5.x
elasticsearch>=5.0.0,<6.0.0
# Elasticsearch 2.x
elasticsearch>=2.0.0,<3.0.0

在兼容的大的版本號下儘可能選擇最新的版本。less

pip install elasticsearch elasticsearch

3、API

3.1 API文檔

全部API都儘量緊密的映射原始的rest API。ide

3.1.1 全局選項

某些被客戶端添加的參數可使用在全部的API上。函數

1.ignoreurl

被用戶忽略某些http錯誤狀態碼。spa

from elasticsearch import Elasticsearch
es = Elasticsearch()

# ignore 400 cause by IndexAlreadyExistsException when creating an index
es.indices.create(index='test-index', ignore=400)

# ignore 404 and 400
es.indices.delete(index='test-index', ignore=[400, 404])

2.timeout插件

被用於設置超時時間。

# only wait for 1 second, regardless of the client's default
es.cluster.health(wait_for_status='yellow', request_timeout=1)

3.filter_path

被用於過濾返回值。

es.search(index='test-index', filter_path=['hits.hits._id', 'hits.hits._type'])

3.1.2 Elasticsearch

Elasticsearch是一個low-level客戶端，提供了一個從python到es rest端點的直接映射。這個實例擁有屬性cat、cluster、indices、ingest、nodes、snapshot和tasks，經過他們能夠訪問CatClient、ClusterClient、IndicesClient、IngestClient、NodesClient、SnapshotClient和TasksClient的實例。

elasticsearch類包含了操做elasticsearch許多經常使用方法，例如：get、mget、search、index、bulk、create、delete等，這些方法的具體用法，能夠參考elasticsearch-py的官方文檔。

在執行以上方法以前，首先須要得到一個elasticsearch的實例，而獲取這個實例有兩個方法，一個是給elasticsearch的初始化函數傳遞一個connection class實例，另外一個是給elasticsearch的初始化函數傳遞要鏈接的node的host和port，其實最終這些host、port仍是被傳遞給了connection class。

# create connection to localhost using the ThriftConnection
es = Elasticsearch(connection_class=ThriftConnection)

# connect to localhost directly and another node using SSL on port 443
# and an url_prefix. Note that ``port`` needs to be an int.
es = Elasticsearch([
    {'host': 'localhost'},
    {'host': 'othernode', 'port': 443, 'url_prefix': 'es', 'use_ssl': True},
])