*187. Repeated DNA Sequences (hashmap, one for loop)(difference between subsequence & substring)

阿新 • Published: 2018-06-05


All DNA is composed of a series of nucleotides abbreviated as A, C, G, and T, for example: "ACGAATTCCG". When studying DNA, it is sometimes useful to identify repeated sequences within the DNA.

Write a function to find all the 10-letter-long sequences (substrings) that occur more than once in a DNA molecule.

Example:

Input: s = "AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT"

Output: ["AAAAACCCCC", "CCCCCAAAAA"]

Solution 1: count how often each 10-letter substring occurs in a HashMap, then return every substring whose count is greater than 1.

import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class Solution {
    // Find all 10-letter-long substrings that occur more than once in a DNA string.
    public List<String> findRepeatedDnaSequences(String s) {
        // A string of length n has n - k + 1 substrings of length k
        // (and n + (n-1) + ... + 1 substrings over all lengths).
        List<String> res = new ArrayList<>();
        Map<String, Integer> map = new HashMap<>();
        int n = s.length();
        int k = 10;
        if (n < k) return res;
        for (int i = 0; i <= n - k; i++) { // e.g. n = 11, k = 10: i takes the values 0 and 1
            String sub = s.substring(i, i + k);
            // Increment the count, starting from 0 the first time sub is seen.
            map.put(sub, map.getOrDefault(sub, 0) + 1);
        }
        // Collect every substring that was counted more than once.
        for (Map.Entry<String, Integer> entry : map.entrySet()) {
            if (entry.getValue() > 1) {
                res.add(entry.getKey());
            }
        }
        return res;
    }
}

Solution 2: use two HashSets and rely on the fact that a set rejects duplicates: add() returns false when the element is already present.

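import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public List<String> findRepeatedDnaSequences(String s) {
    Set<String> seen = new HashSet<>();
    Set<String> repeated = new HashSet<>();
    for (int i = 0; i + 9 < s.length(); i++) {
        String ten = s.substring(i, i + 10);
        // add() returns true the first time ten is seen; false means it is a repeat.
        if (!seen.add(ten)) {
            repeated.add(ten);
        }
    }
    return new ArrayList<>(repeated);
}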
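
To check the two-set approach against the example above, here is a minimal runnable sketch. The class name RepeatedDnaDemo and the main harness are assumptions added for illustration and are not part of the original post; the method body follows Solution 2. HashSet does not guarantee iteration order, so the result is compared as a set rather than as an ordered list.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical demo class (name chosen for illustration only).
class RepeatedDnaDemo {
    static List<String> findRepeatedDnaSequences(String s) {
        Set<String> seen = new HashSet<>();
        Set<String> repeated = new HashSet<>();
        for (int i = 0; i + 10 <= s.length(); i++) {
            String ten = s.substring(i, i + 10);
            // add() returns false when ten is already in seen.
            if (!seen.add(ten)) {
                repeated.add(ten);
            }
        }
        return new ArrayList<>(repeated);
    }

    public static void main(String[] args) {
        String s = "AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT";
        Set<String> got = new HashSet<>(findRepeatedDnaSequences(s));
        Set<String> expected =
                new HashSet<>(Arrays.asList("AAAAACCCCC", "CCCCCAAAAA"));
        // Compare as sets because the order of the returned list is unspecified.
        System.out.println(got.equals(expected)); // prints "true"
    }
}
```

Both solutions visit n - 9 windows and store up to that many 10-character strings, so the choice between them is mostly a matter of taste: the map keeps exact counts, while the two sets only record whether a window has been seen before.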

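Subsequence vs. substring

Subsequence: any subset of the characters, kept in their original order but not necessarily adjacent. A string of length n has 2^n subsequences (counting the empty one).

Substring: a contiguous run of characters. A string of length n has n + (n-1) + (n-2) + ... + 1 = n(n+1)/2 non-empty substrings.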
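
The two counts can be checked by brute force on a short string. The sketch below is illustrative only: the class and method names are assumptions, and the bitmask enumeration is practical only for small n (it assumes n is below 31).

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper class for illustration; not part of the original post.
class SubstringVsSubsequence {
    // Every non-empty contiguous substring: one per (start, end) pair.
    static List<String> substrings(String s) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i < s.length(); i++) {
            for (int j = i + 1; j <= s.length(); j++) {
                out.add(s.substring(i, j));
            }
        }
        return out;
    }

    // Every subsequence, including the empty one: bit i of mask
    // decides whether character i is kept or skipped.
    static List<String> subsequences(String s) {
        int n = s.length();
        List<String> out = new ArrayList<>();
        for (int mask = 0; mask < (1 << n); mask++) {
            StringBuilder sb = new StringBuilder();
            for (int i = 0; i < n; i++) {
                if ((mask & (1 << i)) != 0) {
                    sb.append(s.charAt(i));
                }
            }
            out.add(sb.toString());
        }
        return out;
    }

    public static void main(String[] args) {
        String s = "ACGT"; // n = 4
        System.out.println(substrings(s).size());           // 10 = 4 * 5 / 2
        System.out.println(subsequences(s).size());         // 16 = 2^4
        System.out.println(subsequences(s).contains("AG")); // true: A and G keep their order
        System.out.println(substrings(s).contains("AG"));   // false: A and G are not adjacent
    }
}
```

The problem asks for substrings, which is why a single loop over the n - 9 starting positions is enough.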
