Python模組的使用-- elasticsearch模組

阿新 • • 發佈：2019-01-09

Python操作Elasticsearch

參考整理了一下，當做學習筆記，記錄一下。
安裝模組

pip install elasticsearch  # 6.x版本

Python操作Elasticsearch

建立索引

from elasticsearch import Elasticsearch
 
es = Elasticsearch()
result = es.indices.create(index='news', ignore=400)
print(result)  # {'acknowledged': True, 'index': 'news', 'shards_acknowledged': True} acknowledged 為True表示建立成功

刪除索引

result = es.indices.delete("news", ignore=[400, 404])
print(result) # {'acknowledged': True}

插入一條document

# 法一
es.create("news", "politics", body=content, id=1)

# 法二
 es.index("news", doc_type="politics", body=data)  # ok 可以不用指定id, 引數id預設為隨機建立

更新資料

# 法一
data = {'date': '2018-01-05 12:30:00' 
,
 'title': 'asd123',
 'url': 'http://view.news.qq.com/zt2011/usa_iraq/index.htm'}
result = es.update(index='news', doc_type='politics', body=data, id=1)  # error 
print(result)

data_doc = {'doc': {'date': '2018-01-05 12:30:00',
  'title': 'asd123',
  'url': 'http://view.news.qq.com/zt2011/usa_iraq/index.htm'}}
result = 
 es.update(index='news', doc_type='politics', body=data, id=1)  # ok 
print(result)

# 法二
data = {'date': '2018-01-05 12:30:00',
 'title': 'asd123',
 'url': 'http://view.news.qq.com/zt2011/usa_iraq/index.htm'}  # ok 使用data_doc也ok
es.index(index='news', doc_type='politics', body=data, id=1)

刪除資料

result = es.delete("news", "politics", id='vNaqc2cBE_LRbsBxQ94C')  # ok

查詢資料

# 指定分詞器 安裝一個分詞外掛，這裡使用的是 elasticsearch-analysis-ik
# mapping 資訊中指定了分詞的欄位，指定了欄位的型別 type 為 text，分詞器 analyzer 和 搜尋分詞器 search_analyzer 為 ik_max_word
mapping = {
    'properties': {
        'title': {
            'type': 'text',
            'analyzer': 'ik_max_word',
            'search_analyzer': 'ik_max_word'
        }
    }
}

query = {
    'query': {
        'match': {
            'title': '中國 領事館'
        }
    }
}
es = Elasticsearch()
result = es.search(index='news', doc_type='politics', body=query)
print(result)

查詢並刪除

# 匹配大於21歲的文件
query = {"query": {"range": {"age": {"gt": 21}}}}
es.delete_by_query(index='indexName', body=query, doc_type='typeName')

查詢相關

datas = [{'age': 34,'date': '2018-01-01','sex': '男', 'title': '鏢旗標題啊111，Python ElasticSearch基礎教程'}, {'age': 10, 'date': '2017-05-13', 'sex': '男', 'title': '標題2'},{'age': 24, 'date': '2019-01-01', 'sex': '女', 'title': 'haha'}]

# 查詢所有
query = {"query": {"match_all":{}}}
# 提供boost引數可以修改_score
query = {"query": {"match_all": {"boost": 1.2}}}
# 查詢所有相反操作, 不匹配任何文件
query = {'query': {'match_none': {}}}

## 全文查詢相關
#（1）匹配查詢
query = {"query": {"match" : {"title" : "Python"}}}
# (2) 多匹配查詢
query = {'query': {'multi_match': {'fields': ['title', 'sex'], 'query': '標題 男'}}}
# (3) term
body = {"query":{"term":{"title":"python"}}}  # 查詢title包含"python"的所有資料
es.search(index="bbs",doc_type="user",body=body)

# (4) terms
body = {"query":{"terms":{"title":["python","標題"]}}}
es.search(index="bbs",doc_type="user",body=body)  # 搜尋出title包含"python"或包含"標題"的所有資料

# (5) ids
body = {"query":{"ids":{"type":"user","values":["1", "z9Yrd2cBE_LRbsBxdt7t"]}}}
# 搜尋出id為1或對應id的資料
es.search(index="bbs",doc_type="user",body=body)

# (6) 複合查詢bool
# bool有3類查詢關係,must(都滿足),should(其中一個滿足),must_not(都不滿足)
body = {'query': {'bool': {'should': [{'term': {'title': 'python'}},{'term': {'age': 24}}]}}}
es.search("bbs", doc_type="user", body=body)

# (7) 切片查詢
body = {'from': 2, 'query': {'match_all': {}}, 'size': 4}  # from 從第二條開始查詢 size查詢4條記錄
# (8) 範圍查詢
body = {'query': {'range': {'age': {'gte': 10, 'lte': 32}}}}  # 大於等於10，小於等於32
es.search("bbs", doc_type="user", body=body)
# (9) 字首查詢
body = {'query': {'prefix': {'title': '標'}}}
# (10) 萬用字元查詢
body =  {'query': {'wildcard': {'title': 'python*'}}}
# (11) 排序
body = {'query': {'match_all': {}}, 'sort': {'age': {'order': 'desc'}}}  # 升序asc 降序desc
# (12) 相應過濾
es.search("bbs", "user", filter_path=["hits.hits._id", "hits.hits._source.title"])  # 獲取id和對應的title
es.search("bbs", "user", filter_path=["hits.hits.*"])  # 獲取所有資料
# (13) 執行查詢並獲取查詢匹配數
es.count(index="bbs", doc_type="user")  # {'_shards': {'failed': 0, 'skipped': 0, 'successful': 5, 'total': 5}, 'count': 3}

參考文件：
https://elasticsearch-py.readthedocs.io/en/master/api.html#global-options
https://blog.csdn.net/u013429010/article/details/81746179

Python模組的使用-- elasticsearch模組

Python操作Elasticsearch 參考整理了一下，當做學習筆記，記錄一下。安裝模組 pip install elasticsearch # 6.x版本 Python操作Elasticsearch 建立索引 from elasticsearch import

python的pyserial模組

pyserial是python提供用於進行串列埠通訊的庫源文件：https://pythonhosted.org/pyserial/ 1、安裝pyserial pip install pyserial 2、檢視電腦現連串列埠裝置 import serial.tools.list_ports #檢

python中multiprocessing模組之Pipe管道

原文地址，本文在原文基礎上添加了部分註釋。 multiprocessing.Pipe([duplex]) 方法返回2個連線物件(conn1, conn2),代表管道的兩端,預設duplex為True，是雙向通訊。如果duplex為False，則conn1只能用來接收訊息，conn2只能用來

Python之argparse模組的使用

我們在寫python指令碼的時候，有時候需要在執行的時候傳入引數，而不是寫死在程式裡，這個時候就要用到argparse模組。argparse 是 Python 內建的一個用於命令項選項與引數解析的模組，通過在程式中定義好我們需要的引數，argparse 將會從sys.argv 中解析出這些引數，

Python -- queue佇列模組

一簡單使用 --內建模組哦 import Queuemyqueue = Queue.Queue(maxsize = 10)　　Queue.Queue類即是一個佇列的同步實現。佇列長度可為無限或者有限。可通過Queue的建構函式的可選引數maxsize來設定佇列長度。如果maxsize小於1就

Python使用PyMysql模組報錯：lock wait timeout exceeded; try restarting transactio

呵呵，我只想說：關於這個問題我整了兩個星期，關於這個問題的原因，從網上看到的很多文章全都是說要conn.commit（），但是我在程式裡面已經commit（）了，最後定位到的問題是Pymysql在多執行緒（或多程序下）面會有bug，對，你沒聽錯， Pymysql模組自身的bug造成的：

Python的MongoDB模組PyMongo

介面文件：http://api.mongodb.com/python/current/migrate-to-pymongo3.html#pymongo-2-9http://api.mongodb.com/python/current/api/pymongo/collation.html 優秀部落格：http

python學習筆記(19) 常用模組--OS模組

os.getcwd()　　#獲取當前目錄 os.chdir()　　#開啟目錄，記得加r os.curdir　　#返回當前目錄os.chdir('.') os.pardir　　#獲取當前目錄的父目錄字串名 ('..') os.makedirs() os.removedirs()　　#刪除多個空目錄

python學習筆記(19) 常用模組--sys模組

sys.argv　　#命令列引數list，第一個元素是程式本身路徑，後面跟傳的引數，只能在命令列執行 sys.platform　　#返回系統平臺名稱 sys.version　　#返回python直譯器的版本資訊 sys.exit(n)　　#推出程式，正常退出時exit(0),錯誤退出exit(1) s

Python全棧學習筆記day 20：序列化模組、模組的匯入

一、序列化模組從資料型別 --> 字串的過程：序列化從字串 --> 資料型別的過程：反序列化 json # 通用的序列化格式 # 只有很少的一部分資料型別（數字、字串、列表、字典、元組）能夠通過json轉化成字串 pickl

python學習之模組匯入

作為C++程式設計師，最近因為工作需要，學習了python。第一次接觸指令碼語言，難免有覺得新奇的地方，python程式沒有main()函式，只有主檔案，檔案裡就一條print（）語句也可執行。標準Python是CPython。在python命令列（不是系統命令列！）下，要匯入.py檔案

python 關於fork模組及getpid方法自我理解。

import os print ('process %s'%os.getpid()) #得到當前流程的ID值，假設是876 pid = os.fork() #fork函式用來複製出2個流程。 # 子個流程值為0，父流程返回子流程的ID值，切記父流程自己也有ID值 if pid == 0 :

python程序管理模組

參考：https://www.cnblogs.com/cindy-cindy/p/8031731.html python程序管理的模組:subprocess,multiprocessing subprocess：執行外部的程式，而不是執行python內部編寫的函式。程序之間通過管道進行交流。

python中os模組的作用

簡介 OS模組簡單的來說它是一個Python的系統程式設計的操作模組，可以處理檔案和目錄這些我們日常手動需要做的操作。如果你希望你的程式能夠與平臺無關的話，這個模組是尤為重要的。常用函式和變數 os.sep可以取代作業系統特定的路徑分隔符。windows下為 “\” os.

random模組 time模組的用法 python

1.random()模組的使用 import random x = random.random() y = random.random() print(x,y*10) #random.random()隨機生成一個[0,1）之間的隨機數 m = random.randint(0,10) print

python之os模組的基本使用

import os匯入模組 os模組： os.sep 　　　　　　可以取代作業系統特定的路徑分割符 os.linesep 　　　　字串給出當前平臺使用的行終止符。例如，Windows使用'\r\n'，Linux使用'\n' 而Mac使用'\r'。 os.name 　　

Python面試題----Python 的re模組中match、search、findall、finditer的區別

請簡要說明Python 的re模組中match、search、findall、finditer的區別 re是Python中用於正則表示式相關處理的類，這四個方法都是用於匹配字串的，具體區別如下： match 匹配string 開頭，成功返回Match object

python 安裝 numpy模組

第一步：安裝numpy模組，下載numpy-1.11.3-cp35-none-win_amd64.whl（https://pypi.python.org/pypi/numpy）,將這個下載檔案放到<Python安裝目錄>\Scripts\ 目錄下 nump

Python常用模組——re模組

　　有些人在面臨問題的時候會想：“我知道，我將使用正則表示式來解決這個問題。”這讓他們面臨的問題變成了兩個。　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　—— Jamie Zawinski 首先我們對比一下兩段程式碼處理使用者輸入手機號的不同 1 pho

python 之 Collections模組

官方文件：https://yiyibooks.cn/xx/python_352/library/collections.html 參考：　　https://blog.csdn.net/songfreeman/article/details/50502194 　　https://www.cnblogs.

Python模組的使用-- elasticsearch模組

Python操作Elasticsearch

Python操作Elasticsearch

相關推薦