【MySQL 原理分析】之 Explain & Trace 深入分析全模糊查詢走索引的原理

阿新 • • 發佈：2020-03-09

## 一、背景今天，交流群有一位同學提出了一個問題。看下圖： ![在這裡插入圖片描述](https://img-blog.csdnimg.cn/20200309172555930.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L0hvd2luZnVu,size_16,color_FFFFFF,t_70) 之後，這位同學確實也發了一個全模糊查詢走索引的例子： ![在這裡插入圖片描述](https://img-blog.csdnimg.cn/20200309172606317.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L0hvd2luZnVu,size_16,color_FFFFFF,t_70) 到這我們可以發現，這兩個sql最大的區別是：一個是查詢全欄位（select *），而一個只查詢主鍵（select id）。此時，又有其他同學講了其他方案： ![在這裡插入圖片描述](https://img-blog.csdnimg.cn/20200309172629690.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L0hvd2luZnVu,size_16,color_FFFFFF,t_70) 全文索引這個不用說，那是能讓全模糊查詢走索引的。但是索引覆蓋這個方案，我覺得才是符合背景的： 1、因為提問的背景就是模糊查詢欄位是普通索引，而普通索引只查詢主鍵就能用上覆蓋索引。 2、並且背景中，就是隻查詢主鍵（ID）就顯示用上索引了。 ## 二、資料準備和場景重現 #### 1、準備表和資料：建立 user 表，給 phone 欄位加了個普通索引： ```sql CREATE TABLE `user` ( `id` int(10) unsigned NOT NULL AUTO_INCREMENT, `name` varchar(255) DEFAULT NULL, `age` int(11) DEFAULT NULL, `phone` varchar(11) DEFAULT NULL, PRIMARY KEY (`id`), KEY `index_phone` (`phone`) USING BTREE COMMENT 'phone索引' ) ENGINE=InnoDB AUTO_INCREMENT=200007 DEFAULT CHARSET=utf8; ``` 準備10萬條資料意思意思： ```sql delimiter ; CREATE DEFINER=`root`@`localhost` PROCEDURE `iniData`() begin declare i int; set i=1; while(i<=100000)do insert into user(name,age,phone) values('測試', i, 15627230000+i); set i=i+1; end while; end;; delimiter ; call iniData(); ``` #### 2、執行 SQL ，檢視執行計劃： ```sql explain select * from user where phone like '%156%'; explain select id from user where phone like '%156%'; ``` #### 3、執行結果： | id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra | | ---- | ----------- | ----- | ---------- | ---- | ------------- | ---- | ------- | ---- | ----- | -------- | ----------- | | 1 | SIMPLE | user | | ALL | | | | | 99927 | 11.11 | Using where | | id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra | | ---- | ----------- | ----- | ---------- | ----- | ------------- | ----------- | ------- | ---- | ----- | -------- | ------------------------ | | 1 | SIMPLE | user | | index | | index_phone | 36 | | 99927 | 11.11 | Using where; Using index | 我們可以發現，第二條 SQL 確實是顯示用上了 `index_phone` 索引。但是細心的同學可能會發現：`possible_keys` 竟然為空！有貓膩。。。 > **我這裡先說一下 prossible_keys 和 key 的關係：** > > 1、`possible_keys` 為可能使用的索引，而 `key` 是實際使用的索引； > > 2、正常是： `key` 的索引，必然會包含在 `possible_keys` 中。還有貓膩一點就是：使用索引和不使用索引讀取的行數（rows）竟然是一樣的！ ## 三、驗證和階段性猜想上面講到，`possible_keys` 和 `key` 的關係，那麼我們利用正常的走索引來驗證一下。下面的 SQL，不是全模糊查詢，而是右模糊查詢，保證是一定走索引的，我們分別看看此時 `possible_keys` 和 `key` 的值： ```sql explain select id from user where phone like '156%'; ``` 執行結果： | id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra | | ---- | ----------- | ----- | ---------- | ----- | ------------- | ----------- | ------- | ---- | ----- | -------- | ------------------------ | | 1 | SIMPLE | user | | range | index_phone | index_phone | 36 | | 49963 | 100 | Using where; Using index | 這裡太明顯了： 1、`possible_keys` 裡確實包含了 `key` 裡的索引。 2、並且`rows` 瞬間降到 49963，整整降了一倍，並且 `filtered` 也達到了 100。 #### 階段猜想： 1、首先，`select id from user where phone like '%156%';` 因為**覆蓋索引**而用上了索引 `index_phone`。 2、possible_keys 為 null，證明用不上索引的樹形查詢。很明顯，`select id from user where phone like '%156%';` 即使顯示走了索引，但是讀取行數 **rows** 和 `select * from user where phone like '%156%';` 沒有走索引的 **rows** 是一樣的。 3、那麼，我們可以猜測到，`select id from user where phone like '%156%';` 即使因為覆蓋索引而用上了 `index_phone` 索引，但是卻沒用上樹形查詢，只是正常順序遍歷了索引樹。所以說，其實這兩條 SQL 在表字段不多的情況下，查詢效能應該差不了多少。 ## 四、通過 Trace 分析來驗證 #### 我們分別利用 Trace 分析對於這兩個 SQL 優化器是如何選擇的。 ##### 1、查詢全欄位： ```sql -- 開啟優化器跟蹤 set session optimizer_trace='enabled=on'; select * from user where phone like '%156%'; -- 檢視優化器追蹤 select * from information_schema.optimizer_trace; ``` 下面我們只看 TRACE 就行了： ```json { "steps": [ { "join_preparation": { "select#": 1, "steps": [ { "expanded_query": "/* select#1 */ select `user`.`id` AS `id`,`user`.`name` AS `name`,`user`.`age` AS `age`,`user`.`phone` AS `phone` from `user` where (`user`.`phone` like '%156%')" } ] } }, { "join_optimization": { "select#": 1, "steps": [ { "condition_processing": { "condition": "WHERE", "original_condition": "(`user`.`phone` like '%156%')", "steps": [ { "transformation": "equality_propagation", "resulting_condition": "(`user`.`phone` like '%156%')" }, { "transformation": "constant_propagation", "resulting_condition": "(`user`.`phone` like '%156%')" }, { "transformation": "trivial_condition_removal", "resulting_condition": "(`user`.`phone` like '%156%')" } ] } }, { "substitute_generated_columns": { } }, { "table_dependencies": [ { "table": "`user`", "row_may_be_null": false, "map_bit": 0, "depends_on_map_bits": [ ] } ] }, { "ref_optimizer_key_uses": [ ] }, { "rows_estimation": [ { "table": "`user`", "table_scan": { "rows": 99927, "cost": 289 } } ] }, { "considered_execution_plans": [ { "plan_prefix": [ ], "table": "`user`", "best_access_path": { "considered_access_paths": [ { "rows_to_scan": 99927, "access_type": "scan", // 順序掃描 "resulting_rows": 99927, "cost": 20274, "chosen": true } ] }, "condition_filtering_pct": 100, "rows_for_plan": 99927, "cost_for_plan": 20274, "chosen": true } ] }, { "attaching_conditions_to_tables": { "original_condition": "(`user`.`phone` like '%156%')", "attached_conditions_computation": [ ], "attached_conditions_summary": [ { "table": "`user`", "attached": "(`user`.`phone` like '%156%')" } ] } }, { "refine_plan": [ { "table": "`user`" } ] } ] } }, { "join_execution": { "select#": 1, "steps": [ ] } } ] } ``` ##### 2、只查詢主鍵 ```sql set session optimizer_trace='enabled=on'; select id from user where phone like '%156%'; -- 檢視優化器追蹤 select * from information_schema.optimizer_trace; ``` 下面我們繼續只看 TRACE 就行了： ```json { "steps": [ { "join_preparation": { "select#": 1, "steps": [ { "expanded_query": "/* select#1 */ select `user`.`id` AS `id` from `user` where (`user`.`phone` like '%156%')" } ] } }, { "join_optimization": { "select#": 1, "steps": [ { "condition_processing": { "condition": "WHERE", "original_condition": "(`user`.`phone` like '%156%')", "steps": [ { "transformation": "equality_propagation", "resulting_condition": "(`user`.`phone` like '%156%')" }, { "transformation": "constant_propagation", "resulting_condition": "(`user`.`phone` like '%156%')" }, { "transformation": "trivial_condition_removal", "resulting_condition": "(`user`.`phone` like '%156%')" } ] } }, { "substitute_generated_columns": { } }, { "table_dependencies": [ { "table": "`user`", "row_may_be_null": false, "map_bit": 0, "depends_on_map_bits": [ ] } ] }, { "ref_optimizer_key_uses": [ ] }, { "rows_estimation": [ { "table": "`user`", "table_scan": { "rows": 99927, "cost": 289 } } ] }, { "considered_execution_plans": [ { "plan_prefix": [ ], "table": "`user`", "best_access_path": { "considered_access_paths": [ { "rows_to_scan": 99927, "access_type": "scan", // 順序掃描 "resulting_rows": 99927, "cost": 20274, "chosen": true } ] }, "condition_filtering_pct": 100, "rows_for_plan": 99927, "cost_for_plan": 20274, "chosen": true } ] }, { "attaching_conditions_to_tables": { "original_condition": "(`user`.`phone` like '%156%')", "attached_conditions_computation": [ ], "attached_conditions_summary": [ { "table": "`user`", "attached": "(`user`.`phone` like '%156%')" } ] } }, { "refine_plan": [ { "table": "`user`" } ] } ] } }, { "join_execution": { "select#": 1, "steps": [ ] } } ] } ``` 好了，到這裡我們可以發現，在 Trace 分析裡面，都沒顯示優化器為這兩個 SQL 實際選擇了什麼索引，而只是顯示了都是用了 **順序掃描** 的方式去查詢資料。可能唯一不同點就是：一個使用了主鍵索引的全表掃描，而另外一個是使用了普通索引的全表掃描；**但是兩個都沒用上樹形查詢，也就是沒用上 B+Tree 的特性來提升查詢效能。** ## 六、最後總結 1、當全模糊查詢的 SQL 只查詢主鍵作為結果集時，因為覆蓋索引，會用上查詢欄位對應的索引。 2、即使用上了索引，但是卻沒用上樹形查詢的特性，只是正常的順序遍歷。 3、而正常的全表掃描也是主鍵索引的順序遍歷，所以說，其實這兩者的效能其實是差不

【MySQL 原理分析】之 Explain & Trace 深入分析全模糊查詢走索引的原理

【MySQL 原理分析】之 Explain & Trace 深入分析全模糊查詢走索引的原理

【Java併發程式設計】之十六：深入Java記憶體模型——happen-before規則及其對DCL的分析（含程式碼）

【MySQL 原理分析】之 Trace 分析 order by 的索引原理

【MYSQL學習筆記02】MySQL的高階應用之Explain（完美詳細版，看這一篇就夠了）

【MySQL 線上 BUG 分析】之多表同欄位異常：Column ‘xxx’ in field list is ambiguous

轉：【Java並發編程】之十六：深入Java內存模型——happen-before規則及其對DCL的分析（含代碼）

【外貿推廣科普】主流外貿推廣渠道優缺點分析及實操之展會推廣

【外貿推廣科普】主流外貿推廣渠道優缺點分析及實操之平臺推廣

【MYSQL筆記3】MYSQL過程式資料庫物件之儲存過程的呼叫、刪除和修改

【MYSQL筆記3】MYSQL過程式數據庫對象之存儲過程的調用、刪除和修改

mysql之explain詳解（分析索引的最佳使用）

【IOS遊戲開發】之IPA破解原理

【源碼學習】之requirejs

【安裝Python環境】之“安裝 setuptools ”時出現的問題以及解決辦法

轉：【Java並發編程】之十七：深入Java內存模型—內存操作規則總結

【閑聊產品】之六：拍板的人

【MySQL集群】——Java程序連接MySQL集群

【軟件project】之第五、六章總結

【MySQL報錯】ERROR 1558 (HY000): Column count of mysql.user is wrong. Expected 43, found 39.

【Python算法】哈希存儲、哈希表、散列表原理

【MySQL 原理分析】之 Explain & Trace 深入分析全模糊查詢走索引的原理

相關推薦