POJ3415 Common Substrings —— 后缀数组 + 单调栈公共子串个数

题目链接：https://vjudge.net/problem/POJ-3415

Common Substrings

Time Limit: 5000MS		Memory Limit: 65536K
Total Submissions: 12240		Accepted: 4144

Description

A substring of a string T is defined as:

T(i, k)=T_iT_i₊₁...T_i+k_-1, 1≤i≤i+k-1≤|T|.

Given two strings A, B and one integer K, we define S, a set of triples (i, j, k):

S = {(i, j, k) | k≥K, A(i, k)=B(j, k)}.

You are to give the value of |S| for specific A, B and K.

Input

The input file contains several blocks of data. For each block, the first line contains one integer K, followed by two lines containing strings A and B, respectively. The input file is ended by K=0.

1 ≤ |A|, |B| ≤ 10⁵
1 ≤ K ≤ min{|A|, |B|}
Characters of A and B are all Latin letters.

Output

For each case, output an integer |S|.

Sample Input

2

aababaa

abaabaa

1

xx

xx

0

Sample Output

22

5

Source

POJ Monthly--2007.10.06, wintokk

题意：

给出两个字符串，求有多少对长度不小于k的公共子串，子串相同但位置不同也单独算作一对。

题解：

1.将两个字符串拼接在一起，中间用分隔符隔开，得到新串。并且需要记录每个位置上的字符（后缀）属于哪一个字符串。

2.求出新串的后缀数组。可知sa[i]和sa[j]的最长公共前缀为：min（height[k]）i+1<=k<=j。

3.根据第二点，可以枚举sa数组，当遇到A串时，就先放着，当遇到B串时，就往前统计与所有A串的最长公共前缀，假如为len，那么就能增加len-k+1个公共前缀了。由于是按着sa的顺序枚举下去的，所以对于在B串下面的A串是没有统计到的，所以需要二次统计：把A串当成B串， B串当成A串，然后再进行统计，方可无遗漏。

4.往前统计时需要用到单调栈。

代码如下：

 #include <iostream>

 #include <cstdio>

 #include <cstring>

 #include <algorithm>

 #include <vector>

 #include <cmath>

 #include <queue>

 #include <stack>

 #include <map>

 #include <string>

 #include <set>

 using namespace std;

 typedef long long LL;

 const int INF = 2e9;

 const LL LNF = 9e18;

 const int MOD = 1e9+;

 const int MAXN = 2e5+;

 int id[MAXN];

 int r[MAXN], sa[MAXN], Rank[MAXN], height[MAXN];

 int t1[MAXN], t2[MAXN], c[MAXN];

 bool cmp(int *r, int a, int b, int l)

 {

     return r[a]==r[b] && r[a+l]==r[b+l];

 }

 void DA(int str[], int sa[], int Rank[], int height[], int n, int m)

 {

     n++;

     int i, j, p, *x = t1, *y = t2;

     for(i = ; i<m; i++) c[i] = ;

     for(i = ; i<n; i++) c[x[i] = str[i]]++;

     for(i = ; i<m; i++) c[i] += c[i-];

     for(i = n-; i>=; i--) sa[--c[x[i]]] = i;

     for(j = ; j<=n; j <<= )

     {

         p = ;

         for(i = n-j; i<n; i++) y[p++] = i;

         for(i = ; i<n; i++) if(sa[i]>=j) y[p++] = sa[i]-j;

         for(i = ; i<m; i++) c[i] = ;

         for(i = ; i<n; i++) c[x[y[i]]]++;

         for(i = ; i<m; i++) c[i] += c[i-];

         for(i = n-; i>=; i--) sa[--c[x[y[i]]]] = y[i];

         swap(x, y);

         p = ; x[sa[]] = ;

         for(i = ; i<n; i++)

             x[sa[i]] = cmp(y, sa[i-], sa[i], j)?p-:p++;

         if(p>=n) break;

         m = p;

     }

     int k = ;

     n--;

     for(i = ; i<=n; i++) Rank[sa[i]] = i;

     for(i = ; i<n; i++)

     {

         if(k) k--;

         j = sa[Rank[i]-];

         while(str[i+k]==str[j+k]) k++;

         height[Rank[i]] = k;

     }

 }

 int Stack[MAXN][], top;

 LL cal(int k, int len, int flag)

 {

     LL sum = , tmp = ;

     top = ;

     for(int i = ; i<=len; i++)

     {

         if(height[i]<k)

             tmp = top = ;

         else

         {

             int cnt = ;

             if(id[sa[i-]]==flag)

                 tmp += height[i]-k+, cnt++;

             while(top> && height[i]<=Stack[top-][])

             {

                 tmp -= 1LL*Stack[top-][]*(Stack[top-][]-height[i]);

                 cnt += Stack[top-][];

                 top--;

             }

             Stack[top][] = height[i];

             Stack[top++][] = cnt;

             if(id[sa[i]]!=flag)

                 sum += tmp;

         }

     }

     return sum;

 }

 char str[MAXN];

 int main()

 {

     int k;

     while(scanf("%d",&k)&&k)

     {

         int len = ;

         scanf("%s", str);

         int LEN = strlen(str);

         for(int j = ; j<LEN; j++)

         {

             r[len] = str[j];

             id[len++] = ;

         }

         r[len] = '$';

         id[len++] = ;

         scanf("%s", str);

         LEN = strlen(str);

         for(int j = ; j<LEN; j++)

         {

             r[len] = str[j];

             id[len++] = ;

         }

         r[len] = ;

         DA(r,sa,Rank,height,len,);

         cout<< cal(k,len,)+cal(k,len,) <<endl;

     }

 }

POJ3415 Common Substrings —— 后缀数组 + 单调栈公共子串个数的更多相关文章

POJ3415 Common Substrings(后缀数组单调栈)
借用罗穗骞论文中的讲解: 计算A 的所有后缀和B 的所有后缀之间的最长公共前缀的长度,把最长公共前缀长度不小于k 的部分全部加起来.先将两个字符串连起来,中间用一个没有出现过的字符隔开.按height ...
poj 3415 Common Substrings 后缀数组+单调栈
题目链接题意:求解两个字符串长度大于等于k的所有相同子串对有多少个,子串可以相同,只要位置不同即可:两个字符串的长度不超过1e5; 如 s1 = "xx" 和 s2 = &qu ...
poj 3415 Common Substrings——后缀数组+单调栈
题目:http://poj.org/problem?id=3415 因为求 LCP 是后缀数组的 ht[ ] 上的一段取 min ,所以考虑算出 ht[ ] 之后枚举每个位置作为右端的贡献. 一开始想 ...
poj 3415 Common Substrings —— 后缀数组+单调栈
题目:http://poj.org/problem?id=3415 先用后缀数组处理出 ht[i]: 用单调栈维护当前位置 ht[i] 对之前的 ht[j] 取 min 的结果,也就是当前的后缀与之前 ...
poj3415 Common Substrings (后缀数组+单调队列)
Time Limit: 5000MS Memory Limit: 65536K Total Submissions: 9414 Accepted: 3123 Description A sub ...
SPOJ - SUBST1 New Distinct Substrings —— 后缀数组单个字符串的子串个数
题目链接:https://vjudge.net/problem/SPOJ-SUBST1 SUBST1 - New Distinct Substrings #suffix-array-8 Given a ...
【BZOJ-3238】差异后缀数组 + 单调栈
3238: [Ahoi2013]差异 Time Limit: 20 Sec Memory Limit: 512 MBSubmit: 1561 Solved: 734[Submit][Status] ...
BZOJ_3879_SvT_后缀数组+单调栈
BZOJ_3879_SvT_后缀数组+单调栈 Description (我并不想告诉你题目名字是什么鬼) 有一个长度为n的仅包含小写字母的字符串S,下标范围为[1,n]. 现在有若干组询问,对于每一个 ...
BZOJ_3238_[Ahoi2013]差异_后缀数组+单调栈
BZOJ_3238_[Ahoi2013]差异_后缀数组+单调栈 Description Input 一行,一个字符串S Output 一行,一个整数,表示所求值 Sample Input cacao ...

随机推荐

【开发总结】—— BABYLON 3D开发常见问题及解决方法
前言:组内同事们根据长时间的Babylon.js开发实践,一起将项目开发中遇到的问题及解决方法做了一个梳理. ios [最好] 关闭离线缓存—— 解决添加了反射的mesh 丢失的问题不要使用 pos ...
void f(int(&p)[3]){} 和void f(int(*p)[3]){}的差别
#include<iostream> using namespace std; void f(int(&p)[3]){ cout<<p[0]<& ...
java导出excel不须要额外jar包
眼下我知道的在java中导出Excel能够用poi或在jsp的文件头改变输出流. 以下再介绍一种就用java基础包导出的Excel.导出的格式形如: 源代码例如以下: package csvExcel ...
Oracle 复制表创建的sql语句
http://cache.baiducontent.com/c?m=9d78d513d99e01fc09b3c3690d67c0161343f0652ba1d4020ed08449e3732b4250 ...
Vue.js 很好，但会比 Angular 或 React 更好吗？
文章转自:http://www.oschina.net/translate/vuejs-is-good-but-is-it-better-than-angular-or-rea Vue.js 是一个用 ...
Linux 编译ffmpeg 生成ffplay
本来主要介绍linux环境下如何编译ffmpeg使之生成ffplay.编译总是离不开源码的版本,以及编译环境下:编译环境Ubutun 16.04 ,ffmpeg 版本3.4.2.如何下载ffmpeg ...
为备考二级C语言做的代码练习---辅导资料《C语言经典编程282例》--(1)
因为二级考试的时候用的C语言编译器是VC++6.0 真是日了狗了用这个编译器这是我第2个C编译器吧,第一个用的是啊哈C编译器..第二个是VS++6.0 然后在win下用VS2013感觉挺不错的毕 ...
centos部署Python环境
在centos上部署Python之前,我们需要先配置开发环境. 1.安装Python依赖的开发工具包 gcc自然少不了,可以直接用“Development Tools”: yum grouplist ...
安装 r 里的 igraph 报错
转载来源:http://genek.tv/article/40 1186 0 0 安装 r 里的 igraph 报错: foreign-graphml.c: In function ‘igraph_w ...
Linux进入单用户模式
有时候配置linux的过程中,因为一些误操作导致系统初始化时堵塞或挂起而无法进入系统,原因往往是因为配置文件设置错误,部分文件被误删之类.遇到这种情况一般新手的做法就是重装(虚拟机不装白不装),但在实 ...

POJ3415 Common Substrings —— 后缀数组 + 单调栈 公共子串个数

POJ3415 Common Substrings —— 后缀数组 + 单调栈 公共子串个数的更多相关文章

随机推荐

热门专题

POJ3415 Common Substrings —— 后缀数组 + 单调栈公共子串个数

POJ3415 Common Substrings —— 后缀数组 + 单调栈公共子串个数的更多相关文章