python 實現在無序陣列中找到中位數方法

阿新 • • 發佈：2020-03-04

一、問題描述

1、求一個無序陣列的中位數，（若陣列是偶數，則中位數是指中間兩個數字之和除以2，若陣列是奇數，則中位數是指最中間位置。要求：不能使用排序，時間複雜度儘量低

2、例如：

lists = [3,2,1,4],中位數為 = (2+3)/2 = 2.5
lists = [3,2],中位數為 2

3、演算法思想：

利用快速排序思想（但是並不是全部使用）：任意挑選一個元素，以該元素為key，劃分陣列為兩個部分，如果左側陣列長度剛好為(n-1)/2,那麼key就為中位數，若左側陣列長度 < (n-1)/2 ，那麼中位數點在右側，反之，中位數在左側。然後進入相應的一側繼續尋找中位

平均時間複雜度為O(n)

二、程式

class Solution(object):
 
 def findmedian(self,lists):
  if not lists or len(lists) == 0:
   return []
 
  n = len(lists)
  if n % 2 == 0:
   a = self.partition(lists,n/2,n-1)
   b = self.partition(lists,n/2-1,n-1)
   mid = (lists[a]+lists[b])/ (2 * 1.0)
   return mid
  else:
   mid = self.partition(lists,n-1)
   return lists[mid]
 
 def partition(self,lists,k,start,end):
 
  key = lists[start]
  left,right = start,end
 
  while left < right:
   while left < right and lists[right] > key:
    right = right - 1
   lists[left] = lists[right]
 
   while left < right and lists[left] < key:
    left = left + 1
   lists[right] = lists[left]
 
  lists[left] = key
 
  if left == k:
   return left
  elif left > k:
   return self.partition(lists,left-1)
  else:
   return self.partition(lists,left+1,end)
 
 
if __name__ == "__main__":
 
 sol = Solution()
 lists = [2,5,4,9,3,6,8,7,1]
 # lists = [1,2]
 
 
 data = sol.findmedian(lists)
 print("中位數 = %s" % data)

知識補充：python streaming 實現某個欄位排序

一，hadoop streaming預設情況

1，在hadoop streaming的預設情況下，是以\t作為分隔符的，標準輸入時，每行的第一個\t之前的內容作為key，第一個\t之後的內容作為value。注意，如果一個\t字元都沒有，那麼整行作為key。

2，streaming的一些引數如下：

-D stream.map.output.field.separator ：設定map輸出中key和value的分隔符
-D stream.num.map.output.key.fields ：設定map程式分隔符的位置，該位置之前的部分作為key，之後的部分作為value

-D map.output.key.field.separator : 設定map輸出中key內部的分割符
-D num.key.fields.for.partition : 指定分桶時，key按照分隔符切割後，其中用於分桶key所佔的列數（配合-partitioner org.apache.hadoop.mapred.lib.KeyFieldBasedPartitioner 使用）
-D stream.reduce.output.field.separator：設定reduce輸出中key和value的分隔符
-D stream.num.reduce.output.key.fields：設定reduce程式分隔符的位置

二，python streaming 實現某個欄位的排序

1，輸入資料: cat data.txt (中間是tab鍵）

11 2
11 3
11 4 1
11 1

11 12 22

2，streaming程式如下：

vim sorted.sh

#!/bin/bash

export CURRENT=/home/chunhe.liao/hadoop_streaming/sort

/usr/local/hadoop-2.6.3/bin/hadoop jar /usr/local/hadoop-2.6.3/share/hadoop/tools/lib/hadoop-streaming-2.6.3.jar \
-D stream.map.output.field.separator='\t' \
-D stream.num.map.output.key.fields=3 \
-D mapreduce.job.output.key.comparator.class=org.apache.hadoop.mapreduce.lib.partition.KeyFieldBasedComparator \
-D mapreduce.partition.keycomparator.options=-k3,3nr \  # 按照第三列逆序排列,可以根據想要的第幾段來選擇。
-input "/user/test/inputdata/datas3/data.txt" \
-output "/user/test/streaming/sorted_20180711" \
-mapper "python mapper.py" \
-reducer "python reducer.py" \
-file "$CURRENT/mapper.py" \
-file "$CURRENT/reducer.py"

(2) mapper.py

# -*- coding: utf-8 -*-
import sys
 
for line in sys.stdin:
 line = line.strip()
 print('{0}'.format(line))

(3) reducer.py

# -*- coding: utf-8 -*-
import sys
 
for line in sys.stdin:
 line = line.strip()
 print("{0}".format(line))

執行命令：

bash sorted.sh

執行結果：

hdfs dfs -cat /user/test/streaming/sorted_20180711/part-00000

11 12 22
11 3
11 2
11 4 1
11 1

以上這篇python 實現在無序陣列中找到中位數方法就是小編分享給大家的全部內容了，希望能給大家一個參考，也希望大家多多支援我們。

python 實現在無序陣列中找到中位數方法

Python實現將MySQL資料庫表中的資料匯出生成csv格式檔案的方法

Python實現影象去噪方式(中值去噪和均值去噪)

python實現提取str字串/json中多級目錄下的某個值

Python 實現Excel自動化辦公《中》

python實現解析markdown文件中的圖片，並且儲存到本地~

Python 實現將陣列/矩陣轉換成Image類

python實現12306登入並儲存cookie的方法示例

Python實現棧和佇列的簡單操作方法示例

使用python實現迴文數的四種方法小結

python實現生成Word、docx檔案的方法分析

Python的bit_length函式來二進位制的位數方法

python實現文法左遞迴的消除方法

利用Python實現斐波那契數列的方法例項

Python實現敏感詞過濾的4種方法

python 實現單例模式的5種方法

python實現快速檔案格式批量轉換的方法

python 實現ping測試延遲的兩種方法

利用Python實現Shp格式向GeoJSON的轉換方法

python 實現在無序陣列中找到中位數方法

找出無序陣列中位數的方法

python 實現在無序陣列中找到中位數方法

相關推薦