字串中判斷存在的幾種模式和效率(string.contains、string.IndexOf、Regex.Match)

阿新 • • 發佈：2018-12-04

　通常情況下，我們判斷一個字串中是否存在某值常常會用string.contains，其實判斷一個字串中存在某值的方法有很多種，最常用的就是前述所說的string.contains，相對來說比較常用的還有string.IndexOf和Regex.Match。直接上程式碼，後面在說些什麼吧，通常情況下功能的實現最重要，作者的話，只對有心者有效。

using System;
using System.Collections.Generic;
using System.Linq;
using System.Text;
using System.Text.RegularExpressions;

namespace ExistsInString
{
    class Program
    {
        static void Main(string[] args)
        {
            string str0 = "|456|";
            string str1 = "|444|";
            string str2 = "|111|222|333|444|555|666|777|888|999|000|";

            //------------------------------------------
            //String.Contains方法

            if (str2.Contains(str0))
                Console.WriteLine("String.Contains->true");
            else
                Console.WriteLine("String.Contains->false");

            if (str2.Contains(str1))
                Console.WriteLine("String.Contains->true");
            else
                Console.WriteLine("String.Contains->false");

            //------------------------------------------
            //String.IndexOf方法
            int val1 = str2.IndexOf(str0);//不存在返回-1
            Console.WriteLine("String.IndexOf(no exists)->" + val1);
            int val2 = str2.IndexOf(str1);//存在返回str1首字元所在str2中的位置(>=0)
            Console.WriteLine("String.IndexOf(exists)->" + val2);

            //------------------------------------------
            //正則匹配方法
            if (Regex.Match(str2, "[|]456[|]").Success)
                Console.WriteLine("Regex.Match(no exists)->true");
            else
                Console.WriteLine("Regex.Match(no exists)->false");

            if (Regex.Match(str2, "[|]444[|]").Success)
                Console.WriteLine("Regex.Match(exists)->true");
            else
                Console.WriteLine("Regex.Match(exists)->false");

            Console.ReadKey();

            /*
             *如果上述三種方式都處理大量資料，效率如何呢？
             *以下迴圈六組資料說明 
             */

            int loopCount = (int)10e6;
            DateTime lasttime = DateTime.Now;
            DateTime nowtime = DateTime.Now;

            for (int loop = 1; loop < 7; loop++)
            {
                Console.WriteLine("\r\nloop " + loop + " >>>>>>>");

                //------------------------------------------
                //String.Contains方法

                //no exists
                lasttime = DateTime.Now;
                for (int i = 0; i < loopCount; i++)
                    if (str2.Contains(str0)) { };
                nowtime = DateTime.Now;
                TimeSpan tsStrConNoExists = nowtime - lasttime;

                //exists
                lasttime = DateTime.Now;
                for (int i = 0; i < loopCount; i++)
                    if (str2.Contains(str1)) { };
                nowtime = DateTime.Now;
                TimeSpan tsStrConExists = nowtime - lasttime;


                //------------------------------------------
                //String.IndexOf方法

                //no exists
                lasttime = DateTime.Now;
                for (int i = 0; i < loopCount; i++)
                    if (str2.IndexOf(str0) >= 0) { };//上述已經提到不存在返回-1，存在返回一個非負整數，這裡為什麼不用 == -1 ，而是用了 >= 0 ，這是一個值得深思的問題？
                nowtime = DateTime.Now;
                TimeSpan tsStrIndNoExists = nowtime - lasttime;

                //exists
                lasttime = DateTime.Now;
                for (int i = 0; i < loopCount; i++)
                    if (str2.IndexOf(str1) >= 0) { };
                nowtime = DateTime.Now;
                TimeSpan tsStrIndExists = nowtime - lasttime;

                //------------------------------------------
                //Regex.Match方法

                //no exists
                Regex Reg0 = new Regex("[|]456[|]");
                lasttime = DateTime.Now;
                for (int i = 0; i < loopCount; i++)
                    if (Reg0.Match(str2).Success) { };
                nowtime = DateTime.Now;
                TimeSpan tsStrRegNoExists = nowtime - lasttime;

                //exists
                Regex Reg1 = new Regex("[|]444[|]");
                lasttime = DateTime.Now;
                for (int i = 0; i < loopCount; i++)
                    if (Reg1.Match(str2).Success) { };
                nowtime = DateTime.Now;
                TimeSpan tsStrRegExists = nowtime - lasttime;

                Console.WriteLine("no exists >>>");
                Console.WriteLine("tsStrConNoExists = " + tsStrConNoExists.Milliseconds);
                Console.WriteLine("tsStrIndNoExists = " + tsStrIndNoExists.Milliseconds);
                Console.WriteLine("tsStrRegNoExists = " + tsStrRegNoExists.Milliseconds);
                Console.WriteLine("exists >>>");
                Console.WriteLine("tsStrConExists = " + tsStrConExists.Milliseconds);
                Console.WriteLine("tsStrIndExists = " + tsStrIndExists.Milliseconds);
                Console.WriteLine("tsStrRegExists = " + tsStrRegExists.Milliseconds);
            }

            Console.ReadKey();
        }
    }
}

輸入結果：

String.Contains->false
String.Contains->true
String.IndexOf(no exists)->-1
String.IndexOf(exists)->12
Regex.Match(no exists)->false
Regex.Match(exists)->true

loop 1 >>>>>>>
no exists >>>
tsStrConNoExists = 796
tsStrIndNoExists = 687
tsStrRegNoExists = 171
exists >>>
tsStrConExists = 484
tsStrIndExists = 234
tsStrRegExists = 796

loop 2 >>>>>>>
no exists >>>
tsStrConNoExists = 46
tsStrIndNoExists = 671
tsStrRegNoExists = 234
exists >>>
tsStrConExists = 546
tsStrIndExists = 437
tsStrRegExists = 734

loop 3 >>>>>>>
no exists >>>
tsStrConNoExists = 62
tsStrIndNoExists = 875
tsStrRegNoExists = 171
exists >>>
tsStrConExists = 609
tsStrIndExists = 562
tsStrRegExists = 781

loop 4 >>>>>>>
no exists >>>
tsStrConNoExists = 78
tsStrIndNoExists = 921
tsStrRegNoExists = 218
exists >>>
tsStrConExists = 609
tsStrIndExists = 640
tsStrRegExists = 828

loop 5 >>>>>>>
no exists >>>
tsStrConNoExists = 156
tsStrIndNoExists = 268
tsStrRegNoExists = 265
exists >>>
tsStrConExists = 609
tsStrIndExists = 578
tsStrRegExists = 890

loop 6 >>>>>>>
no exists >>>
tsStrConNoExists = 109
tsStrIndNoExists = 46
tsStrRegNoExists = 546
exists >>>
tsStrConExists = 625
tsStrIndExists = 609
tsStrRegExists = 953

測試結果中不難發現，如果strA中不包括strB，使用strA.Contains(strB)更優；反之，如果strA中包括strB，使用strA.IndexOf(strB)更優。（Regex.Match在此方法中貌似沒有體現出任何優勢，它更適用於模糊匹配）

具體要使用string.Contains，或是string.IndexOf要看形勢。

之前有看過string下很多方法實現的程式碼（微軟的，非他人），string.Contains是基於string.IndexOf上的一個方法，使用string.Contains的時候，會呼叫

string.IndexOf，按原理，使用string.IndexOf的效率是要高於string.Contains的，但是這個測試結果讓我大跌眼鏡，應該是我在上述程式碼中使用的判斷語句造成的這種非理想的測試結果，按照個人的意願，還是希望多使用string.IndexOf。

其實一次微小的改變在當前可能影響不了什麼，但是在日積月累中，它的優勢就顯而易見了。想要快速變得比他人更強，不需要多麼費勁，只需要每天多做一點點（千分之一）

一年之後：（1 + 0.001）365 = 1.44倍

十年之後（1 + 0.001）3650 = 38.4倍

字串中判斷存在的幾種模式和效率(string.contains、string.IndexOf、Regex.Match)

字串中判斷存在的幾種模式和效率(string.contains、string.IndexOf、Regex.Match)

Java中去除字串中空格的幾種方法

map遍歷的幾種方式和效率問題

JAVA for迴圈的幾種寫法和效率

python中常見的幾種正則表示式的使用（re.split、re.sub、re.match與re.search）

JavaScript中創建對象的幾種模式

獲得元素的幾種方法,和dom中常用的事件

Cesium中的幾種座標和相互轉換

HTML中呼叫JavaScript的幾種情況和規範寫法

Web開發中前端路由實現的幾種方式和適用場景

Unity中常用的幾種設計模式

php中常見的幾種設計模式

JAVA中常用的幾種設計模式--單例

Spring容器中定義Bean幾種初始化方法和銷燬方法

推薦系統中常見的幾種相似度計算方法和其適用資料

Spring容器中的Bean幾種初始化方法和銷燬方法的先後順序

ASP.NET程式中Session儲存的幾種模式

設計模式——抽象工廠模式及在jdk中的應用+幾種工廠模式的比較

RN中使用fetch進行網路請求的幾種場景和姿勢

Linux中Vim編輯器三種模式和命令

字串中判斷存在的幾種模式和效率(string.contains、string.IndexOf、Regex.Match)

相關推薦