KMP (brute-force matching)

http://poj.org/problem?id=3080

Blue Jeans

Time Limit: 1000MS		Memory Limit: 65536K
Total Submissions: 23415		Accepted: 10349

Description

The Genographic Project is a research partnership between IBM and The National Geographic Society that is analyzing DNA from hundreds of thousands of contributors to map how the Earth was populated.

As an IBM researcher, you have been tasked with writing a program
that will find commonalities amongst given snippets of DNA that can be
correlated with individual survey information to identify new genetic
markers.

A DNA base sequence is noted by listing the nitrogen bases in the
order in which they are found in the molecule. There are four bases:
adenine (A), thymine (T), guanine (G), and cytosine (C). A 6-base DNA
sequence could be represented as TAGACC.

Given a set of DNA base sequences, determine the longest series of bases that occurs in all of the sequences.

Input

Input to this problem will begin with a line containing a single integer n indicating the number of datasets. Each dataset consists of the following components:

A single positive integer m (2 <= m <= 10) indicating the number of base sequences in this dataset.
m lines each containing a single base sequence consisting of 60 bases.

Output

For each dataset in the input, output the longest base subsequence common to all of the given base sequences. If the longest common subsequence is less than three bases in length, display the string "no significant commonalities" instead. If multiple subsequences of the same longest length exist, output only the subsequence that comes first in alphabetical order.

Sample Input

3

2

GATACCAGATACCAGATACCAGATACCAGATACCAGATACCAGATACCAGATACCAGATA

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

3

GATACCAGATACCAGATACCAGATACCAGATACCAGATACCAGATACCAGATACCAGATA

GATACTAGATACTAGATACTAGATACTAAAGGAAAGGGAAAAGGGGAAAAAGGGGGAAAA

GATACCAGATACCAGATACCAGATACCAAAGGAAAGGGAAAAGGGGAAAAAGGGGGAAAA

3

CATCATCATCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC

ACATCATCATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

AACATCATCATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT

Sample Output

no significant commonalities
AGATAC
CATCATCAT

Source

South Central USA 2006

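Problem summary: given several base sequences of length 60, find the longest substring common to all of them. If several longest common substrings exist, output the lexicographically smallest one.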

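Approach: brute force. Enumerate every substring of the first sequence and check whether it occurs in all the other sequences, using either KMP or the strstr function for the check.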
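The strstr variant of this approach can be sketched as follows. This is a minimal sketch rather than the code submitted for this post: the function name `longestCommon` and the use of `std::string` and `std::vector` are assumptions made for illustration, and the function returns an empty string where the judge expects "no significant commonalities".

```cpp
#include <cstring>
#include <string>
#include <vector>

// Hypothetical helper (not from the original post): returns the
// lexicographically smallest longest substring shared by all sequences,
// or "" if no common substring of length >= 3 exists.
// Assumes seqs is non-empty (the problem guarantees m >= 2).
std::string longestCommon(const std::vector<std::string>& seqs)
{
    const std::string& first = seqs[0];
    std::string best;
    // Lengths are tried in increasing order, so a candidate is never
    // shorter than the current best.
    for (size_t len = 3; len <= first.size(); ++len) {
        for (size_t i = 0; i + len <= first.size(); ++i) {
            std::string cand = first.substr(i, len);
            bool inAll = true;
            for (size_t j = 1; j < seqs.size() && inAll; ++j) {
                // strstr returns NULL when cand does not occur in seqs[j].
                inAll = std::strstr(seqs[j].c_str(), cand.c_str()) != NULL;
            }
            if (!inAll) continue;
            // Longer wins; on equal length the alphabetically smaller wins.
            if (cand.size() > best.size() || cand < best) best = cand;
        }
    }
    return best;
}
```

Called on the three sequences of the second sample dataset, this should return AGATAC, matching the sample output. With at most 10 sequences of 60 bases each, the plain strstr scan is fast enough that KMP is optional here.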

#include <cstdio>
#include <cstring>

const int LEN = 60;   // every base sequence has exactly 60 bases
const int MAXM = 10;  // at most 10 sequences per dataset

// Buffer sizes are an assumption; the problem only fixes m <= 10 and length 60.
char a[MAXM][LEN + 5], b[LEN + 5], str[LEN + 5];
int nxt[LEN + 5];     // named nxt rather than next to avoid clashing with std::next

// Build the KMP failure table for the pattern b[0..len-1], with nxt[0] = -1.
void getnext(const char *b, int len, int *nxt)
{
    nxt[0] = -1;
    int j = 0, k = -1;
    while (j < len)
    {
        if (k == -1 || b[j] == b[k])
        {
            k++;
            j++;
            nxt[j] = k;
        }
        else
        {
            k = nxt[k];
        }
    }
}

int main()
{
    int n;
    scanf("%d", &n);
    while (n--)
    {
        int m;
        memset(b, '\0', sizeof(b));
        memset(str, '\0', sizeof(str));
        scanf("%d", &m);
        for (int i = 0; i < m; i++)
        {
            scanf("%64s", a[i]);
        }
        int ans = 0;  // length of the best common substring found so far
        // Try every length x >= 3 and every start position i in the first sequence.
        for (int x = 3; x <= LEN; x++)
        {
            for (int i = 0; i <= LEN - x; i++)
            {
                memcpy(b, a[0] + i, x);
                b[x] = '\0';
                getnext(b, x, nxt);
                bool flag = true;  // does b occur in every other sequence?
                for (int j = 1; j < m && flag; j++)
                {
                    // KMP search for b inside a[j]
                    int ii = 0, k = 0;
                    while (ii < LEN && k < x)
                    {
                        if (k == -1 || a[j][ii] == b[k])
                        {
                            k++;
                            ii++;
                        }
                        else
                        {
                            k = nxt[k];
                        }
                    }
                    flag = (k == x);
                }
                if (!flag)
                    continue;
                if (ans < x)
                {
                    ans = x;
                    strcpy(str, b);
                }
                // On equal length, keep the lexicographically smaller sequence.
                // I assumed it was the first one to appear, which cost me a long run of WAs.
                else if (strcmp(b, str) < 0)
                {
                    strcpy(str, b);
                }
            }
        }
        if (ans == 0)
            printf("no significant commonalities\n");
        else
            printf("%s\n", str);
    }
    return 0;
}
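To see what getnext computes and how the matching loop uses it, here is a small standalone sketch. The names `failureTable` and `kmpFind` are hypothetical and only mirror the logic of the code above, using `std::string` and `std::vector` for brevity.

```cpp
#include <string>
#include <vector>

// Hypothetical re-statement of getnext: returns a table of size p.size() + 1
// with table[0] = -1, following the same convention as the code above.
std::vector<int> failureTable(const std::string& p)
{
    std::vector<int> nxt(p.size() + 1, 0);
    nxt[0] = -1;
    int j = 0, k = -1;
    const int len = static_cast<int>(p.size());
    while (j < len) {
        if (k == -1 || p[j] == p[k]) {
            ++k;
            ++j;
            nxt[j] = k;   // longest proper border of p[0..j-1]
        } else {
            k = nxt[k];   // fall back to a shorter border
        }
    }
    return nxt;
}

// Hypothetical helper: index of the first occurrence of pat in text, or -1.
int kmpFind(const std::string& text, const std::string& pat)
{
    std::vector<int> nxt = failureTable(pat);
    const int n = static_cast<int>(text.size());
    const int m = static_cast<int>(pat.size());
    int i = 0, k = 0;
    while (i < n && k < m) {
        if (k == -1 || text[i] == pat[k]) {
            ++i;
            ++k;
        } else {
            k = nxt[k];   // shift the pattern without moving i backwards
        }
    }
    return k == m ? i - m : -1;
}
```

For the pattern AGATAC the table works out to -1 0 0 1 0 1 0, and searching for it in GATACTAGATAC finds the match starting at index 6.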
