dp-(LCS 基因匹配)
| Time Limit: 1000MS | Memory Limit: 10000K | |
| Total Submissions: 19885 | Accepted: 11100 |
Description
A human gene can be identified through a series of
time-consuming biological experiments, often with the help of computer
programs. Once a sequence of a gene is obtained, the next job is to
determine its function.
One of the methods for biologists to use in determining the function
of a new gene sequence that they have just identified is to search a
database with the new gene as a query. The database to be searched
stores many gene sequences and their functions – many researchers have
been submitting their genes and functions to the database and the
database is freely accessible through the Internet.
A database search will return a list of gene sequences from the database that are similar to the query gene.
Biologists assume that sequence similarity often implies
functional similarity. So, the function of the new gene might be
one of the functions that the genes from the list have. To exactly
determine which one is the right one another series of biological
experiments will be needed.
Your job is to make a program that compares two genes and determines
their similarity as explained below. Your program may be used as a part
of the database search if you can provide an efficient one.
Given two genes AGTGATG and GTTAG, how similar are they? One of the methods to measure the similarity
of two genes is called alignment. In an alignment, spaces are inserted, if necessary, in appropriate positions of
the genes to make them equally long and score the resulting genes according to a scoring matrix.
For example, one space is inserted into AGTGATG to result in
AGTGAT-G, and three spaces are inserted into GTTAG to result in
–GT--TAG. A space is denoted by a minus sign (-). The two genes
are now of equal
length. These two strings are aligned:
AGTGAT-G
-GT--TAG
In this alignment, there are four matches, namely, G in the second
position, T in the third, T in the sixth, and G in the eighth. Each
pair of aligned characters is assigned a score according to the
following scoring matrix.

denotes that a space-space match is not allowed. The score
of the alignment above is (-3)+5+5+(-2)+(-3)+5+(-3)+5=9.
Of course, many other alignments are possible. One is shown below (a
different number of spaces are inserted into different positions):
AGTGATG
-GTTA-G
This alignment gives a score of (-3)+5+5+(-2)+5+(-1) +5=14. So,
this one is better than the previous one. As a matter of fact, this one
is optimal since no other alignment can have a higher score. So, it is
said that the
similarity of the two genes is 14.
Input
input consists of T test cases. The number of test cases ) (T is
given in the first line of the input file. Each test case consists of
two lines: each line contains an integer, the length of a gene, followed
by a gene sequence. The length of each gene sequence is at least one
and does not exceed 100.
Output
Sample Input
2
7 AGTGATG
5 GTTAG
7 AGCTATT
9 AGCTTTAAA
Sample Output
14
21 题目大意 : 给出一个基因匹配表格 , 里面有一些数值 。
设 dp[i][j] 为 s1 取第 i 个字符, s2 取第 j 个字符的最大值,决定dp[i][j] 最优的情况有 三种, 类似于最长公共子序列的三种情况。
1 . s1 取第 i 个字母, s2 取 '-' , temp1 = dp[i-1][j] + score[a[i]]['-'];
2 . s2 取第 j 个字母, s1 取 '-' , temp2 = dp[i][j-1] + score['-'][b[j]];
3 . s1 取第 i 个字母 , s2 取第 j 个字母 , temp3 = dp[i-1][j-1] + score[a[i]][b[j]]; 则 dp[i][j] = max ( temp1, temp2, temp3 ); 还有初始化问题 : dp[0][0] = 0;
dp[i][0] = dp[i-1][0] + score[a[i]]['-'];
dp[0][j] = dp[0][j-1] + score['-'][b[j]]; 代码示例 :
/*
* Author: ry
* Created Time: 2017/9/3 8:00:06
* File Name: 1.cpp
*/
#include <iostream>
#include <cstdio>
#include <cstdlib>
#include <cstring>
#include <cmath>
#include <algorithm>
#include <string>
#include <vector>
#include <stack>
#include <queue>
#include <set>
#include <time.h>
using namespace std;
const int mm = 1e6+5;
#define ll long long ll t_cnt;
void t_st(){t_cnt=clock();}
void t_ot(){printf("you spent : %lldms\n", clock()-t_cnt);}
//开始t_st();
//结束t_ot(); int dp[150][150];
int score['T'+1]['T'+1]; void intial () {
score['A']['A'] = 5;
score['C']['C'] = 5;
score['G']['G'] = 5;
score['T']['T'] = 5;
score['A']['C'] = score['C']['A'] = -1;
score['A']['G'] = score['G']['A'] = -2;
score['A']['T'] = score['T']['A'] = -1;
score['A']['-'] = score['-']['A'] = -3;
score['C']['G'] = score['G']['C'] = -3;
score['C']['T'] = score['T']['C'] = -2;
score['C']['-'] = score['-']['C'] = -4;
score['G']['T'] = score['T']['G'] = -2;
score['G']['-'] = score['-']['G'] = -2;
score['T']['-'] = score['-']['T'] = -1;
} int MAX (int x, int y, int z) {
int k = (x>y?x:y);
return z>k?z:k;
} int main() {
int t;
int len1, len2;
char a[105], b[105]; intial();
cin >> t;
getchar();
while ( t-- ){
memset (dp, 0, sizeof(dp));
scanf("%d %s", &len1, a);
scanf("%d %s", &len2, b); for (int i = len1; i > 0; i--)
a[i] = a[i-1];
for (int i = len2; i > 0; i--)
b[i] = b[i-1]; dp[0][0] = 0;
for (int i = 1; i <= len1; i++)
dp[i][0] = dp[i-1][0] + score[a[i]]['-'];
for (int j = 1; j <= len2; j++)
dp[0][j] = dp[0][j-1] + score['-'][b[j]]; for (int i = 1; i <= len1; i++)
for (int j = 1; j <= len2; j++){
int temp1 = dp[i-1][j] + score[a[i]]['-'];
int temp2 = dp[i][j-1] + score['-'][b[j]];
int temp3 = dp[i-1][j-1] + score[a[i]][b[j]];
dp[i][j] = MAX (temp1, temp2, temp3);
} printf ("%d\n", dp[len1][len2]);
} return 0;
}
dp-(LCS 基因匹配)的更多相关文章
- 【线型DP】【LCS】洛谷P4303 [AHOI2006]基因匹配
P4303 [AHOI2006]基因匹配 标签(空格分隔): 考试题 nt题 LCS优化 [题目] 卡卡昨天晚上做梦梦见他和可可来到了另外一个星球,这个星球上生物的DNA序列由无数种碱基排列而成(地球 ...
- BZOJ 1264: [AHOI2006]基因匹配Match( LCS )
序列最大长度2w * 5 = 10w, O(n²)的LCS会T.. LCS 只有当a[i] == b[j]时, 才能更新答案, 我们可以记录n个数在第一个序列中出现的5个位置, 然后从左往右扫第二个序 ...
- bzoj 1264 [AHOI2006]基因匹配Match(DP+树状数组)
1264: [AHOI2006]基因匹配Match Time Limit: 10 Sec Memory Limit: 162 MBSubmit: 793 Solved: 503[Submit][S ...
- bzoj1264 [AHOI2006]基因匹配Match 树状数组+lcs
1264: [AHOI2006]基因匹配Match Time Limit: 10 Sec Memory Limit: 162 MBSubmit: 1255 Solved: 835[Submit][ ...
- BZOJ 1264: [AHOI2006]基因匹配Match 树状数组+DP
1264: [AHOI2006]基因匹配Match Description 基因匹配(match) 卡卡昨天晚上做梦梦见他和可可来到了另外一个星球,这个星球上生物的DNA序列由无数种碱基排列而成(地球 ...
- 【BZOJ1264】[AHOI2006]基因匹配Match DP+树状数组
[BZOJ1264][AHOI2006]基因匹配Match Description 基因匹配(match) 卡卡昨天晚上做梦梦见他和可可来到了另外一个星球,这个星球上生物的DNA序列由无数种碱基排列而 ...
- 基因匹配(bzoj 1264)
Description 基因匹配(match) 卡卡昨天晚上做梦梦见他和可可来到了另外一个星球,这个星球上生物的DNA序列由无数种碱基排列而成(地球上只有4种),而更奇怪的是,组成DNA序列的每一种碱 ...
- BZOJ1264: [AHOI2006]基因匹配Match
1264: [AHOI2006]基因匹配Match Time Limit: 10 Sec Memory Limit: 162 MBSubmit: 541 Solved: 347[Submit][S ...
- [AHOI2006]基因匹配
题目描述 卡卡昨天晚上做梦梦见他和可可来到了另外一个星球,这个星球上生物的DNA序列由无数种碱基排列而成(地球上只有4种),而更奇怪的是,组成DNA序列的每一种碱基在该序列中正好出现5次!这样如果一个 ...
- bzoj 1264: [AHOI2006]基因匹配Match
1264: [AHOI2006]基因匹配Match Description 基因匹配(match) 卡卡昨天晚上做梦梦见他和可可来到了另外一个星球,这个星球上生物的DNA序列由无数种碱基排列而成(地球 ...
随机推荐
- 扶桑号战列舰 (单调栈+线段树区间更新懒惰标记 or 栈)
传送门 •题目描述 题目描述 众所周知,一战过后,在世界列强建造超无畏级战列舰的竞争之中,旧日本海军根据“个舰优越主义”,建造了扶桑级战列舰,完工时为当时世界上武装最为强大的舰只. 同时,扶桑号战列舰 ...
- asp dotnet core 从 Frp 获取用户真实 IP 地址
我在本地开一个服务,然后通过 Frp 让小伙伴可以在外网访问我的 API 连接,但是直接通过 RemoteIp 拿到的是本地的地址.本文告诉小伙伴如何通过 Frp 可以拿到用户的真实 IP 地址 我写 ...
- PowerShell 通过 WMI 获取系统安装软件
本文告诉大家如何通过 WMI 获取系统安装的软件 通过 Win32_Product 可以获取系统安装的软件 Get-WmiObject Win32_Product | Format-List Capt ...
- 小白学 Python 爬虫(29):Selenium 获取某大型电商网站商品信息
人生苦短,我用 Python 前文传送门: 小白学 Python 爬虫(1):开篇 小白学 Python 爬虫(2):前置准备(一)基本类库的安装 小白学 Python 爬虫(3):前置准备(二)Li ...
- 2016湖南省赛 I Tree Intersection(线段树合并,树链剖分)
2016湖南省赛 I Tree Intersection(线段树合并,树链剖分) 传送门:https://ac.nowcoder.com/acm/contest/1112/I 题意: 给你一个n个结点 ...
- 阿里云“网红"运维工程师白金:做一个平凡的圆梦人
他是阿里云的一位 P8 运维专家,却很有野心得给自己取花名“辟拾(P10)”:他没有华丽的履历,仅凭着 26 年的热爱与坚持,一步一个脚印踏出了属于自己的技术逆袭之路:他爱好清奇,练就了能在 20 秒 ...
- [Vue源码]一起来学Vue模板编译原理(一)-Template生成AST
本文我们一起通过学习Vue模板编译原理(一)-Template生成AST来分析Vue源码.预计接下来会围绕Vue源码来整理一些文章,如下. 一起来学Vue双向绑定原理-数据劫持和发布订阅 一起来学Vu ...
- 一篇长文说 git 基础
版本管理在产品级开发中是非常重要的一个部分,它涉及到团队协作,且影响到产品最终的发布.上线以及测试环节,当前最流行的版本控制系统是 git.git 内容非常多,本文尽量克制地来介绍 git 的基础内容 ...
- ansible批量部署nginx
1.1 将nginx的安装包,和安装脚本copy到客户端,并安装脚本 vim /ansible/roles/nginx/tasks/main.yml - name: copy nginx_tar_gz ...
- 基于ambassador实现K8S灰度发布
为什么需要灰度发布 灰度发布(又名金丝雀发布)是指在黑与白之间,能够平滑过渡的一种发布方式.在其上可以进行A/B testing,即让一部分用户继续用产品特性A,一部分用户开始用产品特性B,如果用户对 ...