A Quick Taste of MongoDB: Advanced Queries

阿新 • Published: 2018-12-14

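I. Full-Text Search

1. Creating Indexes

A MongoDB collection can have at most one text index, so drop the existing one before trying a different definition on the same collection.

Create a text index: build a text index on the body key of the texttest collection.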

db.texttest.createIndex( { body : "text" } );

Specify the default language of the index:

db.texttest.createIndex( { body : "text" }, { default_language : "french" } );

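Index several languages: when one collection holds documents in different languages, each document needs a field that identifies its language, such as the lingvo field in the four documents below.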

{ _id : 1, content : "cheese", lingvo : "english" }
{ _id : 2, content : "fromage", lingvo: "french" }
{ _id : 3, content : "queso", lingvo: "spanish" }
{ _id : 4, content : "ost", lingvo: "swedish" }

Build the index using the language given in each document:

db.textExample.createIndex( { content : "text" }, { language_override : "lingvo" } );

Create a compound text index: index both the content and comments fields so that a text search covers both of them.

db.textExample.createIndex( { content : "text", comments : "text" });

Use the wildcard specifier: build the text index over all fields and give the index a name.

db.textExample.createIndex( { "$**": "text" }, { name: "alltextindex" } );

Specify weights: content gets a weight of 10 and comments a weight of 5; any indexed field without an explicit weight defaults to 1.

db.textExample.createIndex( { content : "text", comments : "text"}, { weights : { content: 10, comments: 5, } } );

Create a compound index that mixes text and non-text keys: a text index on content and an ordinary ascending index on username.

db.textExample.createIndex( { content : "text", username : 1 });

2. Running Searches

Text search: search on the stem fish and return the documents whose body matches it.

db.texttest.find({ $text : { $search :"fish" } });

Filter the results: from the documents that match the text search, keep only those whose about key equals food.

db.texttest.find({ $text : { $search : "fish" }, about : "food" });

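Complex search: return the body values of documents whose body matches cook but not lunch. MongoDB first finds every document that matches the search terms and then discards those that contain a negated term.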

db.texttest.find({ $text : { $search : "cook -lunch" } }, {_id:0, body:1});
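The include-then-exclude behaviour can be mimicked in plain JavaScript on an in-memory array. This is only a rough sketch: the helper name simpleTextSearch and the sample documents are assumptions made up for illustration, and it matches whole lowercase words without the stemming, stop-word handling or scoring that the real $text operator performs.

```javascript
// Hypothetical helper: a much-simplified imitation of how $text treats terms.
// Whole-word, case-insensitive matching only; no stemming, stop words or scoring.
function simpleTextSearch(docs, field, search) {
    const terms = search.toLowerCase().split(/\s+/).filter(Boolean);
    const wanted = terms.filter(t => !t.startsWith("-"));
    const negated = terms.filter(t => t.startsWith("-")).map(t => t.slice(1));

    return docs.filter(doc => {
        const words = String(doc[field]).toLowerCase().split(/\W+/);
        // Step 1: keep documents that contain at least one wanted term.
        const matches = wanted.some(t => words.includes(t));
        // Step 2: drop documents that contain any negated term.
        const excluded = negated.some(t => words.includes(t));
        return matches && !excluded;
    });
}

// Made-up sample documents.
const searchSampleDocs = [
    { _id: 1, body: "how to cook dinner" },
    { _id: 2, body: "cook a quick lunch" },
    { _id: 3, body: "lunch ideas" }
];

const searchHits = simpleTextSearch(searchSampleDocs, "body", "cook -lunch");
console.log(searchHits.map(d => d._id));
```

Only the first sample document survives: the second contains the negated term lunch and the third never matched cook in the first place.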

Phrase search: return documents whose body matches the entire string mongodb text search, rather than documents that match the individual words mongodb, text and search.

db.texttest.find({ $text : { $search : "\"mongodb text search\"" } });

Limit the number of documents returned: return a single document.

db.texttest.find({ $text : { $search :"fish" }}).limit(1);

Show only selected fields: display body alone.

db.texttest.find({ $text : { $search :"fish"}}, { _id : 0, body : 1 });

Specify the language used by the text search: give the language name in all lowercase.

db.texttest.find({ $text : { $search :"fish", $language : "french" } });

Use a compound text and non-text index to optimize the query:

db.texttest.createIndex( { about : 1, body : "text" });
db.texttest.find({ $text : { $search : "fish"}, about : "food"}).explain("executionStats").executionStats;

II. Aggregation

db.collection.aggregate( { $group : { _id : "$color" } } );

SQL analogy:

select distinct color from collection;
-- or
select color from collection group by color;

db.collection.aggregate({ $group : { _id : "$color", count : { $sum : 1 } } });

SQL analogy:

select color, count(1) count from collection group by color;
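For readers who think in code rather than SQL, the same grouping can be sketched in plain JavaScript over an in-memory array. The function name groupCountByColor and the sample documents are assumptions for illustration only; the real $group stage runs inside the server and does not guarantee any particular output order.

```javascript
// Made-up sample documents standing in for the collection.
const groupSampleDocs = [
    { color: "red", num: 1 },
    { color: "blue", num: 2 },
    { color: "red", num: 3 }
];

// Rough equivalent of { $group : { _id : "$color", count : { $sum : 1 } } }:
// one output document per distinct color, carrying a running count.
function groupCountByColor(input) {
    const counts = new Map();
    for (const doc of input) {
        counts.set(doc.color, (counts.get(doc.color) || 0) + 1);
    }
    return Array.from(counts, ([color, count]) => ({ _id: color, count: count }));
}

const groupedByColor = groupCountByColor(groupSampleDocs);
console.log(groupedByColor);
```

Each distinct color becomes the _id of one result document, which is exactly the role the group by column plays in the SQL version.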

db.collection.aggregate({ $group : { _id : { color: "$color", transport: "$transport"} , count : { $sum : 1 } } });

SQL analogy:

select color, transport, count(1) 
  from collection 
 group by color, transport;

db.collection.aggregate( 
    [
        { $group : { _id : { color: "$color", transport: "$transport"} , count : { $sum : 1 } } },
        { $limit : 5 }
    ]);

SQL analogy:

select color, transport, count(1) 
  from collection 
 group by color, transport 
 limit 5;

db.collection.aggregate( 
    [
        { $match : { num : { $gt : 500 } } },
        { $group : { _id : { color: "$color", transport: "$transport"} , count : { $sum : 1 } } },
        { $limit : 5 }
    ]);

SQL analogy:

select color, transport, count(1) 
  from collection 
 where num > 500
 group by color, transport 
 limit 5;

db.collection.aggregate( 
    [
        { $group : { _id : { color: "$color", transport: "$transport"} , count : { $sum : 1 } } },
        { $sort : { _id :1 } },
        { $limit : 5 }
    ]);

SQL analogy:

select color, transport, count(1) 
  from collection 
 group by color, transport 
 order by color, transport
 limit 5;

db.collection.aggregate( 
    [
        { $match : { num : { $gt : 500 } } },
        { $group : { _id : { color: "$color", transport: "$transport"} , count : { $sum : 1 } } },
        { $sort : { _id :1 } },
        { $limit : 1 }
    ]);

SQL analogy:

select color, transport, count(1) 
  from collection 
 where num > 500
 group by color, transport 
 order by color, transport
 limit 1;

db.collection.aggregate( { $unwind : "$vegetables" });

SQL analogy:

select collection.*, substring_index(substring_index(vegetables, ',', id),',' ,-1) vegetables
  from collection, nums -- nums is an auxiliary numbers table with a single id column
 where id <= length(vegetables)-length(replace(vegetables,',',''))+1;
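The SQL above has to fake an array with a comma-separated string, so it may be easier to see what $unwind does from a plain JavaScript sketch. The helper name unwindField and the sample documents are assumptions for illustration; the sketch follows the stage's default options, under which documents whose field is missing, null or an empty array produce no output.

```javascript
// Made-up sample documents; only the first has vegetables to unwind.
const unwindSampleDocs = [
    { _id: 1, fruits: "apple", vegetables: ["carrot", "leek"] },
    { _id: 2, fruits: "pear", vegetables: [] }
];

// Rough imitation of { $unwind : "$vegetables" } with default options.
function unwindField(input, field) {
    return input.flatMap(doc => {
        const value = doc[field];
        if (value === undefined || value === null) return []; // missing or null: dropped
        if (!Array.isArray(value)) return [doc];               // non-array value: passed through
        // One output document per array element; an empty array yields nothing.
        return value.map(item => ({ ...doc, [field]: item }));
    });
}

const unwoundDocs = unwindField(unwindSampleDocs, "vegetables");
console.log(unwoundDocs);
```

The first document is emitted twice, once per vegetable, while the second disappears because its array is empty.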

db.collection.aggregate(
    [
        { $unwind : "$vegetables" },
        { $project : { _id: 0, fruits:1, vegetables:1 } }
    ]);

SQL analogy:

select fruits, substring_index(substring_index(vegetables, ',', id),',' ,-1) vegetables
  from collection, nums -- nums is an auxiliary numbers table with a single id column
 where id <= length(vegetables)-length(replace(vegetables,',',''))+1;

db.collection.aggregate(
    [
        { $unwind : "$vegetables" },
        { $project : { _id: 0, fruits:1, veggies: "$vegetables" } }
    ]);

SQL analogy:

select fruits, substring_index(substring_index(vegetables, ',', id),',' ,-1) veggies
  from collection, nums -- nums is an auxiliary numbers table with a single id column
 where id <= length(vegetables)-length(replace(vegetables,',',''))+1;

db.collection.aggregate(
    [
        { $unwind : "$vegetables" },
        { $project : { _id: 0, fruits:1, vegetables:1 } },
        { $skip : 2995 }
    ]);

SQL analogy:

select fruits, substring_index(substring_index(vegetables, ',', id),',' ,-1) vegetables
  from collection, nums -- nums is an auxiliary numbers table with a single id column
 where id <= length(vegetables)-length(replace(vegetables,',',''))+1
 limit 2995, 999999999;

db.collection.aggregate(
    [
        { $unwind : "$vegetables" },
        { $project : { _id: 0, fruits:1, vegetables:1 } },
        { $skip : 2995 },
        { $out : "food" }
    ]);

SQL analogy:

create table food as 
select @a:=@a+1 id, fruits, substring_index(substring_index(vegetables, ',', id),',' ,-1) vegetables
  from collection, (select @a:=0) t, nums -- nums is an auxiliary numbers table with a single id column
 where id <= length(vegetables)-length(replace(vegetables,',',''))+1
 limit 2995, 999999999;

db.prima.aggregate(
    [
        {$lookup: {
            from : "secunda",
            localField : "number",
            foreignField : "number",
            as : "secundaDoc"
         } },
    ]);

SQL analogy:

select prima.*, concat(secunda.c1,secunda.c2,...secunda.cn) secundaDoc
  from prima left join secunda on prima.number = secunda.number;
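Unlike a SQL join, $lookup does not flatten the matching rows into columns; it attaches them as an array under the field named by as. A plain JavaScript sketch of that behaviour follows. The helper name lookupJoin and the sample values are assumptions for illustration, and only simple equality matching on one field is modelled.

```javascript
// Made-up sample data; number 2 has no partner in secundaDocs.
const primaDocs = [
    { number: 1, english: "one" },
    { number: 2, english: "two" }
];
const secundaDocs = [
    { number: 1, ascii: 49 }
];

// Rough imitation of $lookup: every local document is kept (left outer join)
// and receives an array of the foreign documents whose field value is equal.
function lookupJoin(localDocs, foreignDocs, localField, foreignField, as) {
    return localDocs.map(doc => ({
        ...doc,
        [as]: foreignDocs.filter(f => f[foreignField] === doc[localField])
    }));
}

const joinedDocs = lookupJoin(primaDocs, secundaDocs, "number", "number", "secundaDoc");
console.log(JSON.stringify(joinedDocs));
```

The unmatched document is still returned, with an empty secundaDoc array, which is why the stage is compared to a left join.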

db.prima.aggregate(
    [
        {$lookup:{
            from : "secunda",
            localField : "number",
            foreignField : "number",
            as : "secundaDoc" }},
        {$unwind: "$secundaDoc"},
        {$project: {_id : "$number", english:1, ascii:"$secundaDoc.ascii" }}
    ]);

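SQL analogy:

-- $unwind drops documents whose secundaDoc array is empty, so this behaves like an inner join
select prima.number _id, prima.english, secunda.ascii
  from prima inner join secunda on prima.number = secunda.number;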

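III. MapReduce

MongoDB implements this kind of query through two user-defined JavaScript functions, map and reduce. MongoDB runs a query against the specified collection, and every document that matches it is fed into the map function, which is designed to generate key/value pairs. Any key that ends up with multiple values is then passed to the reduce function, which returns an aggregated result for its input. Finally, an optional finalize function can polish how the data is presented.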

[Figure: the Map-Reduce execution flow, from the documentation]
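The flow described above can also be sketched as a small in-memory simulation in plain JavaScript. This is an illustration under stated assumptions, not how the server is implemented: simulateMapReduce and the sample documents are made up, the query step is omitted, and emit is installed as a global so that map can call it the way it does in the mongo shell.

```javascript
// Hypothetical in-memory stand-in for db.collection.mapReduce(map, reduce, options).
function simulateMapReduce(docs, map, reduce, finalize) {
    const groups = new Map();
    // Install a global emit() that collects values per key.
    globalThis.emit = (key, value) => {
        if (!groups.has(key)) groups.set(key, []);
        groups.get(key).push(value);
    };
    // Map phase: each document is bound to "this" inside map.
    for (const doc of docs) map.call(doc);
    // Reduce phase: reduce is only called for keys that have several values.
    const results = [];
    for (const [key, values] of groups) {
        let value = values.length > 1 ? reduce(key, values) : values[0];
        // Optional finalize phase.
        if (finalize) value = finalize(key, value);
        results.push({ _id: key, value: value });
    }
    return results;
}

// Made-up sample documents.
const mrSampleDocs = [
    { color: "black", num: 10 },
    { color: "blue", num: 4 },
    { color: "black", num: 20 }
];

const mrMap = function () { emit(this.color, this.num); };
const mrReduce = function (color, numbers) {
    return numbers.reduce((a, b) => a + b, 0);
};

const mrResults = simulateMapReduce(mrSampleDocs, mrMap, mrReduce);
console.log(mrResults);
```

Here black is reduced to 30, while blue keeps its single emitted value 4 without reduce ever being called for it.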

1. Minimal MapReduce

Define the map function:

var map = function() {
    emit(this.color, this.num);
};

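MongoDB uses the emit function to hand key/value pairs to MapReduce. Here the map function reads the color and num fields of each document and emits num under the key color, so each color ends up associated with an array of num values.

Define an empty reduce function: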

var reduce = function(color, numbers) { };

The reduce function receives the key/value pairs produced by map but does nothing with them.

Run MapReduce:

db.mapreduce.mapReduce(map,reduce,{ out: { inline : 1 } });

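The option { out : { inline : 1 } } returns the results directly to the console instead of writing them to a collection. The output looks similar to the following.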

{
    "results" : [
        {
            "_id" : "black",
            "value" : null
        },
        {
            "_id" : "blue",
            "value" : null
        },
        ...
        {
            "_id" : "yellow",
            "value" : null
        }
    ],
    "timeMillis" : 95,
    "counts" : {
        "input" : 1000,
        "emit" : 1000,
        "reduce" : 55,
        "output" : 11
    },
    "ok" : 1,
}

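The result shows that a separate document was created for each color, with the color used as the document's unique _id. Because the body of the reduce function is empty and returns nothing, value is set to null.

2. Sum

Define a reduce function that sums: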

var reduce = function(color, numbers) {
    return Array.sum(numbers);
};

This reduce function adds up the num values collected for each color.

Run MapReduce and write the results to the collection mrresult:

db.mapreduce.mapReduce(map,reduce,{ out: "mrresult" });

Inspect the result collection:

> db.mrresult.findOne();
{ "_id" : "black", "value" : 45318 }

3. Average

The map function:

var map = function() {
    var value = {
        num : this.num,
        count : 1
    };
    emit(this.color, value);
};

count is a counter; it is set to 1 so that each document is counted exactly once.

The reduce function:

var reduce = function(color, val ) {
    var reduceValue = { num : 0, count : 0};
    for (var i = 0; i < val.length; i++) {
        reduceValue.num += val[i].num;
        reduceValue.count += val[i].count;
    }
    return reduceValue;
};

A simple loop sums num and count. Note that the value returned by the return statement in reduce must have the same structure as the value that map passes to emit.
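The reason for this rule is that MongoDB may call reduce more than once for the same key, feeding the output of an earlier reduce call back in as one element of the values array. The plain JavaScript check below illustrates this with the averaging reduce from this section; the partial arrays are made-up sample data and the variable names are assumptions for illustration.

```javascript
// The averaging reduce function from this section.
const reduceForAverage = function (color, val) {
    var reduceValue = { num: 0, count: 0 };
    for (var i = 0; i < val.length; i++) {
        reduceValue.num += val[i].num;
        reduceValue.count += val[i].count;
    }
    return reduceValue;
};

// Made-up emitted values for one key, split into two batches.
const batchA = [{ num: 1, count: 1 }, { num: 2, count: 1 }];
const batchB = [{ num: 3, count: 1 }];

// Reducing everything at once...
const reducedOnce = reduceForAverage("blue", batchA.concat(batchB));
// ...must give the same answer as reducing the partial results again.
const reducedTwice = reduceForAverage("blue", [
    reduceForAverage("blue", batchA),
    reduceForAverage("blue", batchB)
]);

console.log(JSON.stringify(reducedOnce), JSON.stringify(reducedTwice));
```

Both paths yield num 6 and count 3. Had reduce returned a bare average instead of the { num, count } shape, the second pass would have had nothing sensible to add up.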

The finalize function:

var finalize = function (key, value) {
    value.avg = value.num/value.count;
    return value;
};

The finalize function receives the result from reduce and computes the average.

Run it:

db.mapreduce.mapReduce(map,reduce,{ out: "mrresult", finalize : finalize });

View the result:

> db.mrresult.findOne();
{
    "_id" : "black",
    "value" : {
        "num" : 45318,
        "count" : 91,
        "avg" : 498
    }
}

4. Debugging

(1) Debugging the map function: override the emit function so that it prints what map emits:

var emit = function(key, value) {
    print("emit results - key: " + key + " value: " + tojson(value));
}

Test it with map.apply and a sample document:

> map.apply(db.mapreduce.findOne());
emit results - key: blue value: { "num" : 1, "count" : 1 }

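(2) Debugging the reduce function: first confirm that map and reduce return results in exactly the same format. Then create an array that simulates the array passed to reduce: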

a = [{ "num" : 1, "count" : 1 },{ "num" : 2, "count" : 1 },{ "num" : 3, "count" : 1 }]

Now call the reduce function and display what it returns:

> reduce("blue",a);
{ "num" : 6, "count" : 3 }

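If something goes wrong and it is unclear what is happening inside a function, printjson() can be used to write JSON values to the MongoDB log file. It is a valuable tool when debugging.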
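A variation on the emit override is to collect the emitted pairs in an array instead of printing them, so they can be inspected or compared afterwards. The sketch below is plain JavaScript that runs without a server; the array name capturedPairs and the sample document are assumptions for illustration, and the document literal stands in for db.mapreduce.findOne().

```javascript
const capturedPairs = [];

// Stand-in for the shell's global emit(): records each pair instead of printing it.
globalThis.emit = function (key, value) {
    capturedPairs.push({ key: key, value: value });
};

// The map function from the averaging example.
const mapForAverage = function () {
    var value = { num: this.num, count: 1 };
    emit(this.color, value);
};

// A made-up sample document takes the place of db.mapreduce.findOne().
mapForAverage.apply({ color: "blue", num: 1 });

console.log(JSON.stringify(capturedPairs));
```

The captured array then holds one pair with key blue and value { num : 1, count : 1 }, the same content the printing version shows, but in a form that can be passed straight to reduce for a further check.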
