hdu 4641 K-string SAM的O(n^2)算法以及 SAM+并查集优化

链接：http://acm.hdu.edu.cn/showproblem.php?pid=4641

题意：有一个长度为n(n < 5e4)的字符串，Q（Q<=2e5）次操作；操作分为：在末尾插入一个字符ch和查询不同子串出现次数不小于K的数量；

思路1:SAM在线求解；

对于每次找到将一个字符x插入到SAM之后，我们知道pre[p]所含有的Tx的后缀字符串数目为step[pre[np]]个，那么只需要每次插入之后更新下这些字符串出现的次数cnt即可；

由于Right(fa)与Right(r)没有交集（max(fa) = min(r) - 1）,所以需要一直递推到root，但是root不能计算，因为root并没有表示后缀，只是一个init状态；

还有一点就是在拷贝q的信息到nq中时，主要把cnt的信息也拷贝过去；

由于数据较弱。。当出现5e4长度均为a的字符串，2e5次插入操作；这个算法复杂度将达到O(T*n*Q)；

（因为每插入一个字符，都需要更新5e4次父节点，这时上面的flag优化没什么卵用。。）

 #include<iostream>

 #include<cstdio>

 #include<cstring>

 using namespace std;

 #define maxn 100007

 #define SIGMA_SIZE 26

 struct SAM{

     int sz,tot,last,k;

     int g[maxn<<][SIGMA_SIZE],pre[maxn<<],step[maxn<<];

     int vs[maxn<<],cnt[maxn<<];

     void newNode(int s){

         step[++sz] = s;

         pre[sz] = ;

         vs[sz] = cnt[sz] = ;

         memset(g[sz],,sizeof(g[sz]));

     }

     void init(){

         tot = ;

         sz = ; last = ;

         newNode();

     }

     int idx(char ch){return ch - 'a';}

     void Insert(char ch){

         newNode(step[last]+);

         int v = idx(ch), p = last, np = sz;

         while(p && !g[p][v])

             g[p][v] = np,p = pre[p];    //知道找到Right集合中包含x的边的祖宗节点

         if(p){

             int q = g[p][v];

             if(step[q] == step[p] + )

                 pre[np] = q;

             else{

                 newNode(step[p]+);

                 int nq = sz;             //nq替换掉q节点

                 for(int i = ;i < SIGMA_SIZE;i++)

                     g[nq][i] = g[q][i];

                 cnt[nq] = cnt[q];     //**

                 pre[nq] = pre[q];

                 pre[np] = pre[q] = nq;

                 while(p && g[p][v] == q)

                     g[p][v] = nq,p = pre[p];

             }

         }

         else pre[np] = ;

         for(int aux = np;aux !=  && !vs[aux];aux = pre[aux]){

             if(++cnt[aux] >= k){

                 tot += step[aux] - step[pre[aux]];

                 vs[aux] = true;    //该父节点的子串已经加到tot中

             }

         }

         last = np;

     }

 }SA;

 char str[maxn];

 int main()

 {

     int n,Q;

     while(scanf("%d%d%d",&n,&Q,&SA.k) == ){

         scanf("%s",str);

         SA.init();

         int len = strlen(str);

         for(int i = ;i < len;i++){

             SA.Insert(str[i]);

         }

         int op;

         char ch[];

         while(Q--){

             scanf("%d",&op);

             if(op & ){

                 scanf("%s",ch);

                 SA.Insert(ch[]);

             }

             else printf("%d\n",SA.tot);

         }

     }

 }

思路2：SAM离线+并查集优化

将操作全部插入到SAM并存储之后，先进行拓扑排序；

1.为什么要进行拓扑排序？

因为拓扑的目的是为了使得SAM分层，即之后可以使用后缀数组基数排序的思想得到每个节点状态的|Right|即包含的子节点个数；

思路1由于是在线算法，并不需要知道一个节点的所有子节点（在线+1）；

2.并查集优化哪里？ <=> 如何逆向删除末尾加入的字符？

删除字符其实就是在Insert时存储下来每个字符对应的节点id，之后用并查集Find(p)来得到每次删除时，实际上该节点已经转移到哪个祖先节点的思想；

并且删除有两次，一次是开始就小于K次，就一次删到大于K次，这时该节点由于一条路径被删了，更改之后看是否也小于K次，循环即可；

时间复杂度为O(T*(n+m))

代码参考：JeraKrs

 #include<iostream>

 #include<cstdio>

 #include<cstring>

 using namespace std;

 #define maxn 250007

 #define SIGMA_SIZE 26

 int ans[maxn],op[maxn];

 char str[maxn];

 int N,Q,K;

 struct SAM{

     int sz,last;

     int g[maxn<<][SIGMA_SIZE], pre[maxn<<], step[maxn<<];

     int cnt[maxn<<], pos[maxn<<], id[maxn<<];

     int f[maxn<<], sub[maxn<<];

     int Find(int x){ return f[x] == x? x: f[x] = Find(f[x]); }

     void init(){

         sz = ;last = ;

         newNode();

     }

     void newNode(int s){

         pre[++sz] = ;

         step[sz] = s;

         memset(g[sz],,sizeof(g[sz]));

     }

     int idx(char ch){ return ch - 'a'; }

     void Insert(char ch);

     void topoSort();

     void getCnt();

     void solve(int Q,int *op,int K);

 }SA;

 void SAM::Insert(char ch){

     newNode(step[last] + );

     int v = idx(ch), np = sz, p = last;

     id[N] = np;

     while(p && !g[p][v]){

         g[p][v] = np;

         p = pre[p];

     }

     if(p){

         int q = g[p][v];

         if(step[q] == step[p] + )

             pre[np] = q;

         else{

             newNode(step[p] + );

             int nq = sz;

             for(int i = ;i < SIGMA_SIZE;i++)

                 g[nq][i] = g[q][i];

             pre[nq] = pre[q];

             pre[q] = pre[np] = nq;

             while(p && g[p][v] == q)

                 g[p][v] = nq, p = pre[p];

         }

     }

     else pre[np] = ;

     last = np;

 }

 void SAM::topoSort(){

     for(int i = ; i <= sz; i++) cnt[i] = ;

     for(int i = ; i <= sz; i++) cnt[step[i]]++;

     for(int i = ; i <= sz; i++) cnt[i] += cnt[i-];

     for(int i = ; i <= sz; i++) pos[cnt[step[i]]--] = i;

 }

 void SAM::getCnt(){

     for(int i = ; i <= sz; i++) cnt[i] = ;

     for(int p = ,i = ; i < N;i++){

         int v = idx(str[i]);

         p = g[p][v];

         cnt[p] = ;   //必须是后缀才能赋值root为0

     }

     for(int i = sz; i; i--){

         int p = pos[i];

         cnt[pre[p]] += cnt[p];

     }

 }

 void SAM::solve(int Q,int *op,int K){

     long long ret = ;

     for(int i = ; i <= sz;i++){

         int p = pos[i];

         if(cnt[p] >= K) ret += step[p] - step[pre[p]];

     }

     for(int i = ;i <= sz;i++) f[i] = i, sub[i] = ;

     for(int i = Q; i; i--){

         if(op[i] == ) ans[i] = ret;

         else{

             int p = id[N--];

             int fp = Find(p);

             while(fp && cnt[fp] < K){

                 p = f[fp] = pre[fp];    //更新

                 fp = Find(p);           //压缩

             }

             if(fp == ) continue;

             sub[fp]++;

             while(fp && cnt[fp] - sub[fp] < K){ //由于单调性 cnt[fp] >= K 是一定成立的

                 ret -= step[fp] - step[pre[fp]];

                 p = f[fp] = pre[fp];

                 sub[pre[fp]] += sub[fp];

                 fp = Find(p);

             }

         }

     }

 }

 int main()

 {

     while(scanf("%d%d%d",&N,&Q,&K) == ){

         scanf("%s",str);

         SA.init();

         for(int i = ; i < N; i++)

             SA.Insert(str[i]);

         char aux[];

         for(int i = ;i <= Q; i++){

             scanf("%d",op + i);

             if(op[i] & ){

                 scanf("%s",aux);

                 str[N++] = aux[];

                 SA.Insert(aux[]);

             }

         }

         str[N] = '\0';

         SA.topoSort();

         SA.getCnt();

         SA.solve(Q,op,K);

         for(int i = ;i <= Q;i++)

             if(op[i] == ) printf("%d\n",ans[i]);

     }

 }

hdu 4641 K-string SAM的O(n^2)算法以及 SAM+并查集优化的更多相关文章

hdu 4641K-string SAM的O(n^2)算法以及 SAM+并查集优化
转载:http://www.cnblogs.com/hxer/p/5675149.html 题意:有一个长度为n(n < 5e4)的字符串,Q(Q<=2e5)次操作:操作分为:在末尾插入一 ...
hdu 1233(还是畅通project)（prime算法，克鲁斯卡尔算法）（并查集，最小生成树）
还是畅通project Time Limit: 4000/2000 MS (Java/Others) Memory Limit: 65536/32768 K (Java/Others) Tota ...
K - Find them, Catch them POJ - 1703 （带权并查集）
题目链接: K - Find them, Catch them POJ - 1703 题目大意:警方决定捣毁两大犯罪团伙:龙帮和蛇帮,显然一个帮派至少有一人.该城有N个罪犯,编号从1至N(N<= ...
HDU 1198 Farm Irrigation （并查集优化，构图）
本题和HDU畅通project类似.仅仅只是畅通project给出了数的连通关系, 而此题须要自己推断连通关系,即两个水管能否够连接到一起,也是本题的难点所在. 记录状态.不断combine(),注意 ...
HDU OJ 5326 Work( 2015多校联合训练第3场) 并查集
题目连接:戳ME #include <iostream> #include <cstdio> #include <cstring> using namespace ...
hdu 并查集分类（待续）
hdu 1829 A Bug's Life 题目大意: 给你n个动物,输入m行a,b,表示a和b应该是异性的,要你判断是否有同性恋. 并查集中,1到n代表应性别,n+1到2n代表一个性别,合并一下,判 ...
K：Union-Find（并查集）算法
相关介绍: 并查集的相关算法,是我见过的,最为之有趣的算法之一.并查集是一种树型的数据结构,用于处理一些不相交集合(Disjoint Sets)的合并及查询问题.其相关的实现代码较为简短,实现思想也 ...
hdu 1598 (并查集加贪心) 速度与激情
题目传送门:http://acm.hdu.edu.cn/showproblem.php?pid=1598 一道带有贪心思想的并查集所以说像二分,贪心这类基础的要掌握的很扎实才行. 用结构体数组储存公 ...
HDU 4641 K-string 后缀自动机并查集
http://acm.hdu.edu.cn/showproblem.php?pid=4641 https://blog.csdn.net/asdfgh0308/article/details/4096 ...

随机推荐

标签切换JS代码
//标签切换 var nav = $('.index-nav'); var content = $('.index-nav-content li'); function hoverNav ($eleA ...
Python刷票小脚本——网络人气奖？不好意思，我要了
零.前言最近参加微软的kinect大赛,报名之后发现有一个网络投票,票数最多的项目可以得到网络人气奖. 这种事,必然是要搞一搞! 说干就干. 说明:由于本人过于懒惰,所以就不截图了,让大家失望了! ...
JMS - Exceptions
The JMSException JMS defines JMSException as the root class for exceptions thrown by JMS methods. JM ...
和阿文一起学H5-- H5排版八大套路
二.中心型三.倾斜型四.三角形 5.全图形 6.渐变型 7.蒙版型 \ 8.骨骼型实例
asp.net MVC中使用Html.Checkbox提示该字符串未被识别为有效的布尔值错误的解决方法
在asp.net MVC中使用Html.CheckBox提交后出现该字符串未被识别为有效的布尔值错误,或从类型“System.String”到类型“System.Boolean”的参数转换失败. 错误 ...
【转】用capability 特征加强Linux系统安全
用capability 特征加强Linux系统安全摘要:传统UNIX系统的访问控制模型非常简单——普通用户对超级用户.在这种模型中,一个进程或者帐户要么只有很小的权限,要么具有全部的系统权限.显然, ...
Commons-logging + Log4j
一.Commons-logging能帮我们做什么? 1.提供一个统一的日志接口,简单了操作,同时避免项目与某个日志实现系统紧密耦合 2.自动选择适当的日志实现系统 a.classpath下查找comm ...
TFS2012常见问题及解答
1.删除workItem工作项(包括Bug,用户场景,任务等) 需要利用到witadmin工具,目录在cd %programfiles%\Microsoft Visual Studio 11.0\Co ...
第九篇、Swift的基本使用
1.访问权限 /* 1> internal : 内部的 1. 默认情况下所有的类&属性&方法的访问权限都是internal 2. 在本模块(项目/包/target)中可以访问 2 ...
iOS - 字典(NSDictionary)
1. 字典类型的常用处理 //---------------不可变字典 //1.字典的创建 NSArray *array1 = [NSArray arrayWithObjects:@"zha ...

hdu 4641 K-string SAM的O(n^2)算法 以及 SAM+并查集优化

hdu 4641 K-string SAM的O(n^2)算法 以及 SAM+并查集优化的更多相关文章

随机推荐

热门专题

hdu 4641 K-string SAM的O(n^2)算法以及 SAM+并查集优化

hdu 4641 K-string SAM的O(n^2)算法以及 SAM+并查集优化的更多相关文章