POJ 2408 - Anagram Groups

题目链接：http://poj.org/problem?id=2408

World-renowned Prof. A. N. Agram's current research deals with large anagram groups. He has just found a new application for his theory on the distribution of characters in English language texts. Given such a text, you are to find the largest anagram groups.

A text is a sequence of words. A word w is an anagram of a word v if and only if there is some permutation p of character positions that takes w to v. Then, w and v are in the same anagram group. The size of an anagram group is the number of words in that group. Find the 5 largest anagram groups.

Input
The input contains words composed of lowercase alphabetic characters, separated by whitespace(or new line). It is terminated by EOF. You can assume there will be no more than 30000 words.

Output
Output the 5 largest anagram groups. If there are less than 5 groups, output them all. Sort the groups by decreasing size. Break ties lexicographically by the lexicographical smallest element. For each group output, print its size and its member words. Sort the member words lexicographically and print equal words only once.

Sample Input
undisplayed
trace
tea
singleton
eta
eat
displayed
crate
cater
carte
caret
beta
beat
bate
ate
abet

Sample Output
Group of size 5: caret carte cater crate trace .
Group of size 4: abet bate beat beta .
Group of size 4: ate eat eta tea .
Group of size 1: displayed .
Group of size 1: singleton .

题解：

用字典树把每个组都hash成一个数字 $x$，然后把组内的字符串全部存在编号为 $x$ 的vector内。

然后对vector进行排序（实际上是对vector的编号进行排序），再输出前五项就好了，注意相同的单词虽然计数，但是输出时只输出一次。

AC代码：

#include<iostream>

#include<string>

#include<algorithm>

#include<vector>

#include<cstring>

using namespace std;

const int maxn=3e4+;

string str;

int idx[maxn];

vector<string> g[maxn];

bool cmp(int i,int j)

{

    if(g[i].size()!=g[j].size())

        return g[i].size()>g[j].size();

    else if(g[i].size() && g[j].size())

        return g[i][]<g[j][];

}

namespace Trie

{

    const int SIZE=maxn*;

    int sz,tot;

    struct TrieNode{

        int ed;

        int nxt[];

    }trie[SIZE];

    void init(){sz=, tot=;}

    int insert(const string& s)

    {

        int p=;

        for(int i=;i<s.size();i++)

        {

            int ch=s[i]-'a';

            if(!trie[p].nxt[ch]) trie[p].nxt[ch]=++sz;

            p=trie[p].nxt[ch];

        }

        return trie[p].ed?trie[p].ed:(trie[p].ed=++tot);

    }

};

int main()

{

    ios::sync_with_stdio();

    cin.tie(), cout.tie();

    Trie::init();

    while(cin>>str)

    {

        string tmp=str;

        sort(tmp.begin(),tmp.end());

        int t=Trie::insert(tmp);

        g[t].push_back(str);

    }

    for(int i=;i<=Trie::tot;i++) idx[i]=i;

    sort(idx+,idx+Trie::tot+,cmp);

    for(int i=;i<= && g[idx[i]].size()>;i++)

    {

        vector<string>& v=g[idx[i]];

        cout<<"Group of size "<<v.size()<<": ";

        sort(v.begin(),v.end());

        v.erase(unique(v.begin(),v.end()),v.end());

        for(int k=;k<v.size();k++) cout<<v[k]<<' ';

        cout<<".\n";

    }

}

POJ 2408 - Anagram Groups - [字典树]的更多相关文章

poj 2408 Anagram Groups(hash)
id=2408" target="_blank" style="">题目链接:poj 2408 Anagram Groups 题目大意:给定若干 ...
poj 2408 Anagram Groups
Description World-renowned Prof. A. N. Agram's current research deals with large anagram groups. He ...
POJ 2001 Shortest Prefixes(字典树）
题目地址:POJ 2001 考察的字典树,利用的是建树时将每个点仅仅要走过就累加.最后从根节点開始遍历,当遍历到仅仅有1次走过的时候,就说明这个地方是最短的独立前缀.然后记录下长度,输出就可以. 代码 ...
poj 1204 Word Puzzles(字典树)
题目链接:http://poj.org/problem?id=1204 思路分析:由于题目数据较弱,使用暴力搜索:对于所有查找的单词建立一棵字典树,在图中的每个坐标,往8个方向搜索查找即可: 需要注意 ...
poj 1056 IMMEDIATE DECODABILITY 字典树
题目链接:http://poj.org/problem?id=1056 思路: 字典树的简单应用,就是判断当前所有的单词中有木有一个是另一个的前缀,直接套用模板再在Tire定义中加一个bool类型的变 ...
POJ 1816 - Wild Words - [字典树+DFS]
题目链接: http://poj.org/problem?id=1816 http://bailian.openjudge.cn/practice/1816?lang=en_US Time Limit ...
nyoj 163 Phone List(动态字典树<trie>) poj Phone List (静态字典树<trie>)
Phone List 时间限制:1000 ms | 内存限制:65535 KB 难度:4 描述 Given a list of phone numbers, determine if it i ...
poj 2503:Babelfish（字典树，经典题，字典翻译）
Babelfish Time Limit: 3000MS Memory Limit: 65536K Total Submissions: 30816 Accepted: 13283 Descr ...
poj 2513 连接火柴字典树+欧拉通路好题
Colored Sticks Time Limit: 5000MS Memory Limit: 128000K Total Submissions: 27134 Accepted: 7186 ...

随机推荐

windows下vbs脚本隐藏控制台
每次想写python代码时,都需要打开IDE进行编写,并且需要创建许多小文件.如果使用jupyter就能够直接书写.但是jupyter需要手动通过控制台打开,这不够方便.通过把jupyter note ...
Socket网络编程--简单Web服务器(4)
上一小节已经实现了对图片的传输,接下来就是判断文件是否为js,css,png等格式.我们增加一个函数用于判断格式 int WebServer::get_filetype(char *type,char ...
Clustered Shading架构实现步骤
最终决定越过Forward+,一步到位,直接调整至更先进的Clustered架构.步骤如下: 里程碑1:以CPU方式实现Light Culling,旨在理念验证,并与D3D10兼容里程碑2:以GPU ...
[转]深刻理解Python中的元类(metaclass)以及元类实现单例模式
使用元类深刻理解Python中的元类(metaclass)以及元类实现单例模式在看一些框架源代码的过程中碰到很多元类的实例,看起来很吃力很晦涩:在看python cookbook中关于元类创建单例 ...
html网页采集
UI_Less.pas: unit UI_Less; interface uses Windows, Classes, Messages, Forms, MsHtml, Urlmon, ActiveX ...
【emWin】例程二十八：窗口对象——Menu
简介: MENU 小工具可用于创建若干种菜单.每个菜单项代表一个应用程序命令或子菜单.MENU 可水平显示和/ 或垂直显示.菜单项可使用分隔符进行分组.水平菜单和垂直菜单均支持分隔符.选择一个菜单 ...
关于Stm32定时器+ADC+DMA进行AD采样的实现
Stm32的ADC有DMA功能这都毋庸置疑,也是我们用的最多的!然而,如果我们要对一个信号(比如脉搏信号)进行定时采样(也就是隔一段时间,比如说2ms),有三种方法: 1.使用定时器中断每隔一定时间进 ...
(笔记)Linux内核学习(二)之进程
一进程与线程进程就是处于执行期的程序,包含了独立地址空间,多个执行线程等资源. 线程是进程中活动的对象,每个线程都拥有独立的程序计数器.进程栈和一组进程寄存器. 内核调度的对象是线程而不是进程.对 ...
bootstrap入门基础
1.字体 text-left text-center text-right text-lowercase 小写 text-uppercase 大写 text-capitalize 首字母大写 2.表格 ...
windows系统下，express构建的node项目中，如何用debug控制调试日志
debug是一款控制日志输出的库,可以在开发调试环境下打开日志输出,生产环境下关闭日志输出.这样比console.log方便多了,console.log只有注释掉才能不输出. debug库还可以根据d ...

POJ 2408 - Anagram Groups - [字典树]

POJ 2408 - Anagram Groups - [字典树]的更多相关文章

随机推荐

热门专题