UVA 12686 Trending Topic

Trending Topic

Time limit: 1.000 seconds

Imagine you are in the hiring process for a company whose principal activity is the analysis of information in the Web. One of the tests consists in writing a program for maintaining up to date a set of trending topics. You will be hired depending on the eﬃciency of your solution. They provide you with text from the most active blogs. The text is organised daily and you have to provide the sorted list of the N most frequent words during the last 7 days, when asked.

INPUT

Each input ﬁle contains one test case. The text corresponding to a day is delimited by tag <text>. Queries of top N words can appear between texts corresponding to two diﬀerent days. A top N query appears as a tag like <top 10 />. In order to facilitate you the process of reading from input, the number always will be delimited by white spaces, as in the sample.

Notes:

• All words are composed only of lowercase letters of size at most 20.

• The maximum number of diﬀerent words that can appear is 20000.

• The maximum number of words per day is 20000.

• Words of length less than four characters are considered of no interest.

• The number of days will be at most 1000.

• 1 ≤ N ≤ 20

OUTPUT

The list of N most frequent words during the last 7 days must be shown given a query. Words must appear in decreasing order of frequency and in alphabetical order when equal frequency. There must be shown all words whose counter of appearances is equal to the word at position N. Even if the amount of words to be shown exceeds N.

SAMPLE INPUT

<text> imagine you are in the hiring process of a company whose main business is analyzing the information that appears in the web </text>

<text> a simple test consists in writing a program for maintaining up to date a set of trending topics </text>

<text> you will be hired depending on the efficiency of your solution </text>

<text> they provide you with a file containing the text corresponding to a highly active blog </text>

<text> the text is organized daily and you have to provide the sorted list of the n most frequent words during last week when asked </text>

<text> each input file contains one test case the text corresponding to a day is delimited by tag text </text>

<text> the query of top n words can appear between texts corresponding to two different days </text>

<text> blah blah blah blah blah blah blah blah blah please please please </text>

SAMPLE OUTPUT
<top 5>

analyzing 1

appears 1

business 1

company 1

consists 1

date 1

depending 1

efficiency 1

hired 1

hiring 1

imagine 1

information 1

main 1

maintaining 1

process 1

program 1

simple 1

solution 1

test 1

that 1

topics 1

trending 1

whose 1

will 1

writing 1

your 1

</top>

<top 3>

text 4

corresponding 3

file 2

provide 2

test 2

words 2

</top>

<top 3>

blah 9

text 4

corresponding 3

please 3

</top>

解题：极其蛋疼狗血的破题目。

 #include <iostream>

 #include <cstdio>

 #include <cstring>

 #include <cmath>

 #include <algorithm>

 #include <climits>

 #include <vector>

 #include <queue>

 #include <cstdlib>

 #include <string>

 #include <set>

 #include <map>

 #include <stack>

 #define LL long long

 #define pii pair<int,int>

 #define INF 0x3f3f3f3f

 using namespace std;

 struct cao{

     string word;

     int cnt;

     cao(string x = "*",int y = ){

         word = x;

         cnt = y;

     }

 };

 map<string,int>mp;

 string tmp;

 vector<string>v[];

 cao nima[];

 bool cmp(const cao &x,const cao &y){

     if(x.cnt == y.cnt) return x.word < y.word;

     return x.cnt > y.cnt;

 }

 int main() {

     mp.clear();

     int n = ,i,j,ask,tot;

     while(cin>>tmp){

         if(tmp == "<text>"){

             j = n%;

             for(i = ; i < v[j].size(); i++) mp[v[j][i]]--;

             v[j].clear();

             while(cin>>tmp){

                 if(tmp == "</text>") break;

                 if(tmp.length() >= ){

                     mp[tmp]++;

                     v[j].push_back(tmp);

                 }

             }

             n++;

         }else{

             cin>>ask>>tmp;

             tot = ;

             for(map<string,int>::iterator it = mp.begin();it != mp.end(); it++)

                 nima[tot++] = cao(it->first,it->second);

             sort(nima,nima+tot,cmp);

             printf("<top %d>\n",ask);

             for(i = ,j = ; i < ask; i++)

                 printf("%s %d\n",nima[i].word.c_str(),nima[i].cnt);

             for(j = i,i--; j < tot && nima[j].cnt == nima[i].cnt; j++)

                  printf("%s %d\n",nima[j].word.c_str(),nima[j].cnt);

             puts("</top>");

         }

     }

     return ;

 }

UVA 12686 Trending Topic的更多相关文章

SWERC13 Trending Topic
map暴力. .. Imagine you are in the hiring process for a company whose principal activity is the analys ...
USER STORIES AND USE CASES - DON’T USE BOTH
We’re in Orlando for a working session as part of the Core Team building BABOK V3 and over dinner th ...
[转载]Three Trending Computer Vision Research Areas，从CVPR看接下来几年的CV的发展趋势
As I walked through the large poster-filled hall at CVPR 2013, I asked myself, “Quo vadis Computer V ...
UVA 607 二十二 Scheduling Lectures
Scheduling Lectures Time Limit:3000MS Memory Limit:0KB 64bit IO Format:%lld & %llu Submi ...
UVa 10006 - Carmichael Numbers
UVa 10006 - Carmichael Numbers An important topic nowadays in computer science is cryptography. Some ...
Kafka 如何读取offset topic内容 (__consumer_offsets)
众所周知,由于Zookeeper并不适合大批量的频繁写入操作,新版Kafka已推荐将consumer的位移信息保存在Kafka内部的topic中,即__consumer_offsets topic,并 ...
Kafka如何创建topic？
Kafka创建topic命令很简单,一条命令足矣:bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-facto ...
Kafka0.8.2.1删除topic逻辑
前提条件: 在启动broker时候开启删除topic的开关,即在server.properties中添加: delete.topic.enable=true 命令: bin/kafka-topics ...
[bigdata] kafka基本命令 -- 迁移topic partition到指定的broker
版本 0.9.2 创建topic bin/kafka-topics.sh --create --topic topic_name --partition 6 --replication-factor ...

随机推荐

bzoj 1050: [HAOI2006]旅行comf【枚举+并查集】
m是5000,就想到了直接枚举比例具体做法是是先把边按照边权从小到大排序,然后先枚举最小边权,再枚举最大边权,就是从最小边权里一个一个加进并查集里,每次查st是否联通,联通则退出,更新答案 #inc ...
SpringBoot集成MybatisPlus解决Mapper文件修改后动态刷新的问题
很多人在使用SpringBoot集成Mybatis或者MybatisPlus的时候在查询复杂的情况下会写mapper文件,虽然说MyBatisPlus提供了常用的增删查改,但还是难以应付复杂的查询.关 ...
SeasLog的日志
https://github.com/Neeke/SeasLog/blob/master/README_zh.md 中文文档地址, 这是一个很好用的记录日志扩展,可以用于项目之中
$Hdu1381\ Crazy\ Search$
前置芝士 :string 的基本用法 string s = "hello world" ; string tmp(s,0,5) ; cout << tmp <& ...
mysql-5.7.20基本用法
第1章安装mysql-5.7.20 1.1 系统环境 [root@mysql ~]# cat /etc/redhat-release CentOS Linux release 7.2.1511 (C ...
JAVAEE 是什么，如何获取各种规范jar包及各种规范的jar包源码
1.什么是JAVA EE JAVA EE是由一系列规范组成的,规范是由JCP制定的,并且提供了参考实现.规范(Specification)是一系列接口,不包含具体实现有以下常见的JAVA EE实现, ...
Visual Studio 生成项目时脚本执行
项目属性 - 生成事件项目生成前:预先生成事件命令行: 项目生成后:后期生成事件命令行:例:copy $(ProjectDir)Reference\sms_cfg.ini $(TargetDir) ...
MySql学习笔记(1)-安装
一.安装环境操作系统:win7 64ibt MySql版本:5.7.10.0 二.安装过程 1.点击Add 2.选择需要安装的组件 3.Excute 4.Next 5.Next 6.进入服务器配置 ...
配置JDK、tomcat及Java Web项目部署
一.JDK的安装 (1)下载安装JDK: 这个就不用说了,直接官网下载jdk安装即可.http://www.oracle.com/technetwork/java/javaee/downloads/i ...
Redis系列(九)--几道面试题
这里只是一点面试题,想了解更多,可以查看本人的Redis系列:https://www.cnblogs.com/huigelaile/category/1461895.html 1.Redis和Memc ...

UVA 12686 Trending Topic

UVA 12686 Trending Topic的更多相关文章

随机推荐

热门专题