寫高併發程式時慎用strncpy和sprintf

阿新 • • 發佈：2019-01-05

分享一下最近做程式優化的一點小心得：在寫高併發交易程式碼時要謹慎使用strncpy和sprintf。

下面詳細介紹一下這樣說的原因及建議實踐：

1 慎用strncpy因為它的副作用極大

我們平時使用strncpy防止字串拷貝時溢位，常常這樣寫

char 
 buf[1024] = {0};

char str[16] = "hello";

strncpy(buf, sizefo(buf), str);

這樣寫當然沒問題，但有些人不知道的是：strncpy一行程式碼執行時是往buf寫了sizeof(buf) = 1024個位元組，而不是直觀以為的strlen(str) + 1 = 6個字元。

也就是說我們為了複製6個字元卻寫了1024個位元組，多了不少額外消耗。如果這個函式被頻繁呼叫，會導致系統性能出現不少損失。

因為呼叫strncpy(dest, n, str)時，函式首先將字元從源緩衝區str逐個複製到目標緩衝區dest，直到拷貝了n碰上\0。

緊接著，strncpy函式會往buf填充\0字元直到寫滿n個字元。

所以我才會說上面的程式碼strncpy才會寫了1024個位元組。

可以做一個小實驗：

看上面程式碼及輸出結果，我們可以知道在執行strncpy之前dest是用'1'填充的，但在執行strncpy後，前面幾個字元變成hello，後面的字元全變成\0;

我個人的解決方法是寫一個巨集專用於往字元陣列拷貝的，與大家分享一下，拋磚引玉。

// 靜態斷言  從vc拷貝過來(_STATIC_ASSERT) 稍微修改了一下 
// 原來是typedef char __static_assert_t[ (expr) ]
// 現在是typedef char __static_assert_t[ (expr) - 1 ] 
// 原因是gcc支援0字元陣列
//TODO: 這裡在win上編譯有警告 有待優化 另外在linux巨集好像不起作用 原因待查。暫時只有在win編譯程式碼可以用
#ifndef _STATIC_ASSERT_RCC
#   ifdef __GNUC__
#       define _STATIC_ASSERT_RCC(expr) typedef char __static_assert_t[ (expr) - 1 ]
#   else
#       define _STATIC_ASSERT_RCC(expr) do { typedef char __static_assert_t[ (expr) ]; } while (0)
#   endif
#endif
 
//將src複製到字元陣列arr 保證不會越界並且末尾肯定會加\0
//_STATIC_ASSERT_RCC這裡作用是防止有人傳字串指標進來
#define strncpy2arr(arr, src) do { \
    char *dest_ = arr; \
    size_t n = strnlen(src, sizeof(arr) - 1); \
    _STATIC_ASSERT_RCC(sizeof(arr) != sizeof(char *)); \
    memcpy(dest_, src, n); \
    dest_[n] = '\0'; \
} while (0)
 
 
#ifdef WIN32
int main(int argc, char *argv[])
{
    char dest[16];
    char *src = "hello                                    222";
    int i = 0;
 
    for (i = 0; i < sizeof(dest); ++i)
    {
        dest[i] = '1';
    }
 
    printf("before strncpy\n");
    for (i = 0; i < sizeof(dest); ++i)
    {
        printf("%d ", dest[i]);
    }
    printf("\n");
 
    strncpy2arr(dest, src);
    printf("after strncpy\n");
    for (i = 0; i < sizeof(dest); ++i)
    {
        printf("%d ", dest[i]);
    }
    printf("\n");
 
    strncpy(dest, src, sizeof(dest));
    printf("after strncpy\n");
    for (i = 0; i < sizeof(dest); ++i)
    {
        printf("%d ", dest[i]);
    }
    printf("\n");
    
 
    return 0;
   
    //return CompressPerformanceTestMain(argc, argv);
}
#endif

2 慎用sprintf，因為它的效率比你想象的低

之前我一直沒注意到sprintf效率低的問題，直到有一次使用callgrind對程式進行效能分析時，發現有相當大的資源消耗在sprintf上面，我才有所警覺。

為此，我寫了一點測試程式碼，對常用的函式做了一下基準測試，結果如下：

測試內容	耗時（us）
for迴圈賦值40億次	13023889
呼叫簡單函式40億次	16967986
呼叫memset函式4億次（256個位元組）	6932237
呼叫strcpy函式4億次（12個位元組）	3239218
呼叫memcpy函式4億次（12個位元組）	3239201
呼叫strcmp函式4億次（12個位元組）	2500568
呼叫memcmp函式4億次（12個位元組）	2668378
呼叫strcpy函式4億次（74個位元組）	4951085
呼叫memcpy函式4億次（74個位元組）	4950890
呼叫strcmp函式4億次（74個位元組）	5551391
呼叫memcmp函式4億次（74個位元組）	3840448
呼叫sprintf函式8千萬次（約27個位元組）	21398106
呼叫scanf函式8千萬次（約27個位元組）	36158749
呼叫fwrite函式8千萬次	5913579
呼叫fprintf函式8千萬次	24806837
呼叫fread函式8千萬次	3182704
呼叫fscanf函式8千萬次	18739442
呼叫WriteLog函式20萬次（15個位元組）	4873746
呼叫WriteLog函式20萬次（47個位元組）	4846449
呼叫WriteLog函式20萬次（94個位元組）	4950448

1us = 1000ms

圖示：scanf/printf系列函式耗時是其它常見字串操作函式的10倍以上，甚至比io操作還耗時

測試程式碼見這裡：

#define TEST_LOG_INF NULL, __FILE__, __LINE__

 

#ifdef WIN32

 

#define WriteLog lazy_log_output

#define LOG_ERROR NULL, __FILE__, __LINE__

#define LOG_KEY NULL, __FILE__, __LINE__

#define sleep(n) Sleep(100 * n)

 

 

int gettimeofday(struct timeval *tv, struct timezone *tz)

{

    SYSTEMTIME wtm;

    GetLocalTime(&wtm);

    tv->tv_sec = (long)(wtm.wDayOfWeek * 24 * 3600 + wtm.wHour * 3600 + wtm.wMinute * 60 + wtm.wSecond);

    tv->tv_usec = wtm.wMilliseconds * 1000;

 

    return 0;

}

 

void InitLog(const char *logname)

{

 

}

#endif

 

struct timeval  begTimes = {0}, endTims = {0};

void beginTimer()

{

    gettimeofday(&begTimes, NULL);

}

 

int g_nSleepSec = 10;

void stopTimer(char *userdata, const char *file, int fileno, int nSleepFlag)

{

    size_t totalTranTimes;

    gettimeofday(&endTims, NULL);

    totalTranTimes = (size_t)(endTims.tv_sec - begTimes.tv_sec) * 1000000 + (endTims.tv_usec - begTimes.tv_usec); 

 

#ifdef WIN32

    WriteLog(userdata, file, fileno, "== == end == == == totalTranTimes %lu us", (unsigned long) totalTranTimes);

#else

    WriteLog(2, file, fileno, "== == end == == == totalTranTimes %lu us", (unsigned long) totalTranTimes);

#endif

 

    if (nSleepFlag)

    {

        WriteLog(LOG_ERROR, "sleep");

        sleep(g_nSleepSec);

    }

    else

    {

        beginTimer();

    }

}

 

void PerformanceTestLog(char *userdata, const char *file, int fileno, const char *log)

{

    stopTimer(userdata, file, fileno, 1);

#ifdef WIN32

    WriteLog(userdata, file, fileno, "== == beg == == == %s", log);

#else

    WriteLog(2, file, fileno, "== == beg == == == %s", log);

#endif

    beginTimer();

}

 

int func(int argc, char *argv[], char *tmp)

{

    tmp[argc] = '1';

 

    return 0;

}

 

//基準測試

int BaseTest(unsigned long nTimes)

{

    unsigned long i = 0;

    char tmp[256], t1[64], t2[64], t3[64];

    int nTmp;

    const char *strWriten;

 

    nTimes *= 100000; //40億

    WriteLog(LOG_KEY, "BaseTest %lu", nTimes);

 

    beginTimer();

    PerformanceTestLog(TEST_LOG_INF, "test for");

    for (i = 0; i < nTimes; ++i)

    {

        i = i; 

    }

 

    PerformanceTestLog(TEST_LOG_INF, "test call func");

    for (i = 0; i < nTimes; ++i)

    {

        func(1, NULL, tmp);

    }

 

    stopTimer(TEST_LOG_INF, 0);

    nTimes /= 10; //4億

    WriteLog(LOG_KEY, "BaseTest %lu", nTimes);

 

    PerformanceTestLog(TEST_LOG_INF, "test memset");

    for (i = 0; i < nTimes; ++i)

    {

        memset(tmp, 0, sizeof(tmp));

    }

 

    PerformanceTestLog(TEST_LOG_INF, "test strcpy");

    for (i = 0; i < nTimes; ++i)

    {

        strcpy(tmp, "test strcpy");

    }

 

    PerformanceTestLog(TEST_LOG_INF, "test memcpy");

    for (i = 0; i < nTimes; ++i)

    {

        memcpy(tmp, "test strcpy", sizeof("test strcpy"));

    }

 

 

    PerformanceTestLog(TEST_LOG_INF, "test strcmp");

    for (i = 0; i < nTimes; ++i)

    {

        if (0 == strcmp(tmp, "test strcpy"))

        {

            i = i;

        }

    }

 

    PerformanceTestLog(TEST_LOG_INF, "test memcmp");

    for (i = 0; i < nTimes; ++i)

    {

        if (0 == memcmp(tmp, "test strcpy", sizeof("test strcpy")))

        {

            i = i;

        }

    }

 

    PerformanceTestLog(TEST_LOG_INF, "test strcpy1");

    for (i = 0; i < nTimes; ++i)

    {

        strcpy(tmp, "test strcpy  test strcpy  test strcpy  test strcpy test strcpytest strcpy");

    }

 

    PerformanceTestLog(TEST_LOG_INF, "test memcpy1");

 

    for (i = 0; i < nTimes; ++i)

    {

        memcpy(tmp, "test strcpy  test strcpy  test strcpy  test strcpy test strcpytest strcpy", 

            sizeof("test strcpy  test strcpy  test strcpy  test strcpy test strcpytest strcpy"));

    }

 

 

    PerformanceTestLog(TEST_LOG_INF, "test strcmp1");

    for (i = 0; i < nTimes; ++i)

    {

        if (0 == strcmp(tmp, "test strcpy  test strcpy  test strcpy  test strcpy test strcpytest strcpy"))

        {

            i = i;

        }

    }

 

    PerformanceTestLog(TEST_LOG_INF, "test memcmp1");

    for (i = 0; i < nTimes; ++i)

    {

        if (0 == memcmp(tmp, "test strcpy  test strcpy  test strcpy  test strcpy test strcpytest strcpy", 

            sizeof("test strcpy  test strcpy  test strcpy  test strcpy test strcpytest strcpy")))

        {

            i = i;

        }

    }

 

    stopTimer(TEST_LOG_INF, 0);

    nTimes /= 5; //8千萬

    WriteLog(LOG_KEY, "BaseTest %lu", nTimes);

 

    PerformanceTestLog(TEST_LOG_INF, "test sprintf");

    for (i = 0; i < nTimes; ++i)

    {

        sprintf(tmp, "thiis %s testing %d", "sprintf", i);

    }

 

    PerformanceTestLog(TEST_LOG_INF, "test sscanf");

    for (i = 0; i < nTimes; ++i)

    {

        sscanf(tmp, "%s %s %s %d", t1, t2, t3, &nTmp);

    }

 

    {

        FILE *fp;

        int nStr;

        PerformanceTestLog(TEST_LOG_INF, "fopen");

        fp = fopen("performancetest.txt", "w");

        strWriten = "this is testing write\n";

        nStr = strlen(strWriten);

 

        PerformanceTestLog(TEST_LOG_INF, "test write file");

        for (i = 0; i < nTimes; ++i)

        {

            fwrite(strWriten, 1, nStr, fp);

        }

 

        PerformanceTestLog(TEST_LOG_INF, "fflush");

        fflush(fp);

 

        PerformanceTestLog(TEST_LOG_INF, "test fprintf file");

        for (i = 0; i < nTimes; ++i)

        {

            //太過簡單的fprintf好像會被自動優化成fwrite，即使沒開優化選項

            //例如 fprintf(fp, "%s", "strWriten");

            fprintf(fp, "%s %d\n", "strWriten", i);

        }

 

        PerformanceTestLog(TEST_LOG_INF, "fclose");

        fclose(fp);

    }

 

    {

        FILE *fp;

        int nStr;

        PerformanceTestLog(TEST_LOG_INF, "fopen 1");

        fp = fopen("performancetest.txt", "r");

 

        nStr = strlen(strWriten);

        PerformanceTestLog(TEST_LOG_INF, "test read file");

        for (i = 0; i < nTimes; ++i)

        {

            fread(tmp, 1, nStr, fp);

            tmp[nStr] = '\0';

        }

 

        PerformanceTestLog(TEST_LOG_INF, "test fscanf file");

        tmp[0] = t1[0] = '\0';

        for (i = 0; i < nTimes; ++i)

        {

            fscanf(fp, "%s %s", tmp, t1);

        }

 

        PerformanceTestLog(TEST_LOG_INF, "fclose");

        fclose(fp);

    }

    fclose(fopen("performancetest.txt", "w"));

    

    nTimes /= 400; //20萬

    WriteLog(LOG_KEY, "BaseTest %lu", nTimes);

 

    PerformanceTestLog(TEST_LOG_INF, "WriteLog 1");

    for (i = 0; i < nTimes; ++i)

    {

        WriteLog(LOG_ERROR, "this is loging");

    }

 

    PerformanceTestLog(TEST_LOG_INF, "WriteLog 2");

    for (i = 0; i < nTimes; ++i)

    {

        WriteLog(LOG_ERROR, "this is loging  this is loging  this is loging");

    }

 

 

    PerformanceTestLog(TEST_LOG_INF, "WriteLog 3");

    for (i = 0; i < nTimes; ++i)

    {

        WriteLog(LOG_ERROR, "this is loging  this is loging  this is loging  this is loging  this is loging this is loging");

    }

 

    stopTimer(TEST_LOG_INF, 0);

 

    return 0;

}

從基準測試結果可以知道，sprintf系列函式效率是比較低的，是我們常見的字串操作函式的1/10以下。

我個人的解決方案是sprintf該用還是用，但有些情況不是特別必要用的情況，用自己寫一些小函式代替。例如下面這個巨集是用來代替sprintf(buf, "%02d", i)的

//sprintf比較慢 這裡需要寫一些簡單的字串組裝函式
//這個是代替%02d的（但不會新增\0結尾）顧名思義，傳入的值需要保證0 <= vallue < 100
//再次提醒注意，這裡為了方便呼叫，不會新增\0! 不會新增\0! 不會新增\0!
#define itoaLt100Ge0(value, buff_output) do \
{\
    int value_ = (int)(value);\
    char *buff_output_ = (buff_output);\
    if ((value_) >= 10) { int nDigit_ = value_ / 10; buff_output_[0] = '0' + nDigit_; buff_output_[1] = '0' + (value_ - nDigit_ * 10); }\
    else { buff_output_[0] = '0'; buff_output_[1] = '0' + (value_);  } \
} while (0)

總結一下就是：高併發交易需要慎用strncpy和sprintf，因為不恰當使用它們可能會成為程式效能瓶頸。

如果大家有啥想法，歡迎分享，我是黃詞輝，一個程式設計師 ^_^

寫高併發程式時慎用strncpy和sprintf

分享一下最近做程式優化的一點小心得：在寫高併發交易程式碼時要謹慎使用strncpy和sprintf。 &nbs

高併發系統設計與時間和空間的平衡

高併發系統設計與時間和空間的平衡高可用上文我們已經講過了，可當前網際網路時代，怎麼少的了高併發呢？高併發和高可用一樣，已經變成各個系統的標配了，如果你的系統QPS沒有個大幾千上萬，都不好意思跟人打招呼，雖然可能每天的呼叫量不超過100。

Android進階系列-手寫高併發網路訪問框架

一個專案，訪問網路那是必須的。現在開源的網路框架很多。比如最開始的HeepClient，Volley，xUtils，最近很火的okhttp，還有例如retrofit，okGo這些都是很不錯的框架。但是畢竟是別人寫的。出了什麼問題都不好查詢。這裡自己封裝了一個網路框架，記錄

高併發程式設計：Volatile關鍵字和Atomic類

在接觸併發程式設計之前我對volatile關鍵字是沒有什麼映像的，這個關鍵字解決了什麼問題呢？讓我們先來看一個示例： public class UseVolatitle extends Thread { private boolean isrunning

高併發伺服器---基礎----IO模式和IO多路複用

轉自：https://www.cnblogs.com/zingp/p/6863170.html 閱讀目錄 1 基礎知識回顧 2 I/O模式 3 事件驅動程式設計模型 4 select/poll/epoll的區別及其Python示例　　網路程式設計裡常聽到阻塞IO、

MQ高併發量時的調優引數設定說明

高可用（主從）與負載均衡架構圖訊息傳送中的接收Topic訂閱結果訊息佇列URL地址、訊息接收佇列URL地址、訊息代理的傳送與接收佇列URL地址以及訊息轉發器傳送的Topic結果訊息佇列URL地址，均需設定為Failover 地址。由於訊息佇列元件ActiveMQ是

高併發訪問時如何確保伺服器端session過多而造成記憶體溢位致使伺服器宕機的方法之一

使用者登入後所在登入頁面中設定一個隱藏的iframe標籤。該子頁面會每隔10s中向報告一次線上訊息。程式碼如下： …… <divclass="response"> <iframesrc="response.html"></iframe>

mysql 讀寫高併發大資料表優化

1.更新頻繁儘量使用innode引擎，支援行級鎖，降低鎖粒度，提高併發量。 2.考慮使用mysql 主從做讀寫分離，可以利用主庫更新，從庫進行查詢。分擔資料庫壓力，提高併發。 3.考慮使用reids

高併發rpc時如何connect（非阻塞）

方案一：之前有設計中轉伺服器，用於轉發redis、url等訊息。在這裡面，專門開執行緒負責套接字的連線與重連。使用阻塞等待式的方式直到連線真正連上，效率低下。程式碼如下： bool Conne

Java高併發--等待執行緒結束和謙讓

針對本格專題我們主要討論join()和yield()這兩個方法。一、等待執行緒結束如果我們想要在一個執行緒中獲取到另外一個執行緒的處理結果，那麼這個時候我們該怎麼辦呢？最好的方式當然就是等待另一個執行緒的結束後再來執行當前執行緒，這個時候就該我們的join()方法上場

mysql insert into (高併發插入時出現的問題) 解決

筆者最近工作中，遇到了一個問題就是筆者在給使用者新增虛擬資源的時候出現了資源表中出現了uid 重複如果按照程式碼梳理應該不會發生這種情況，但是抽獎程式在高量的併發下出現了使用者id 重複程式程式碼： $badge_data = DB::connection(

如何實現千萬級的高併發程式

之前看了文章： http://www.csdn.net/article/2013-05-16/2815317-The-Secret-to-10M-Concurrent-Connections 得到如下結論：（1）對於提高併發效能來說，主要降低核心的負擔，儘可能的

【Java併發基礎】利用面向物件的思想寫好併發程式

前言下面簡單總結學習Java併發的筆記，關於如何利用面向物件思想寫好併發程式的建議。面向物件的思想和併發程式設計屬於兩個領域，但是在Java中這兩個領域卻可以融合到一起。在Java語言中，面向物件程式設計的思想能夠讓併發程式設計變得更加簡單。下面將從封裝共享變數、識別共享變數間的約束條件和制定併發訪問策略三

發個無聊時寫的俄羅斯方塊（分為SDL和Qt兩個版本）

app deb fcm cnn 無聊線程 dac tutorial spi 6213-ChineseZodiac(map) 多線程問題【CF472G】【XSY2112】DesignTutorial壓位大家都開始C++0x了,我也來湊熱鬧,今天的主題是《調侃rvalue

大規模分散式應用之海量資料和高併發解決方案總結視訊教程網盤

大規模分散式應用之海量資料和高併發解決方案總結視訊教程網盤 39套Java架構師，高併發，高效能，高可用，分散式，叢集，電商，快取，微服務，微信支付寶支付，公眾號開發，java8新特性，P2P金融專案，程式設計，功能設計，資料庫設計，第三方支付，web安全，效能調優，設計模式，資料結構，併發程式

SimpleDateFormat時間格式化高併發、多執行緒時出現問題

SimpleDateFormat是是 Java 中一個非常常用的類，該類用來對日期字串進行解析和格式化輸出，但是DateFormat 和 SimpleDateFormat 類不都是執行緒安全的，在生產環境的多執行緒或高併發情況使用 format() 和 parse() 方法，會出現很多問題：

高併發程式設計：執行緒安全和ThreadLocal

執行緒安全的概念：當多個執行緒訪問某一個類（物件或方法）時，這個類始終都能表現出正確的行為，那麼這個類（物件或方法）就是執行緒安全的。執行緒安全說的可能比較抽象，下面就以一個簡單的例子來看看什麼是執行緒安全問題。 public class MyThread impleme

Java面試題-生產者和消費者-高併發

面試題：寫一個固定容量同步容器，擁有put和get方法，以及getCount方法，能夠支援，2個生產者執行緒及10個消費者執行緒的阻塞呼叫（經常問！） -------------------------------------------------------------------

高併發程式設計 volatile 和加鎖解決快取不一致

因為程式執行都在cpu中，但是如果沒有快取記憶體，cpu大部分的時間都用來了讀取記憶體的資料。從而Cpu有快取記憶體，在執行指令前，會把相關需要的資料提前拷貝到cpu，運算完成後在刷回記憶體裡。快取記憶體主要提前快取資料到cpu，等cpu運算完成後把結果返回給主存

Web大規模高併發請求和搶購的原理及解決方案

電商的秒殺和搶購，對我們來說，都不是一個陌生的東西。然而，從技術的角度來說，這對於Web系統是一個巨大的考驗。當一個Web系統，在一秒鐘內收到數以萬計甚至更多請求時，系統的優化和穩定至關重要。這次我們會關注秒殺和搶購的技術實現和優化，同時，從技術層面揭開，為什麼我們總是不容易搶到火車票的原因？&nb

寫高併發程式時慎用strncpy和sprintf

相關推薦