《Cracking the Coding Interview》——第18章：难题—

2014-04-29 02:27

题目：找出10亿个数中最小的100万个数，假设内存可以装得下。

解法1：内存可以装得下？可以用快速选择算法得到无序的结果。时间复杂度总体是O(n)级别，但是常系数不小。

代码：

 // 18.6 Find the smallest one million number among one billion numbers.

 // Suppose one billion numbers can fit in memory.

 // I'll use quick selection algorithm to find them. This will return an unsorted result.

 // Time complexity is O(n), but the constant factor may be massive. I don't quite like this algorithm.

 #include <algorithm>

 #include <iostream>

 #include <vector>

 using namespace std;

 const int CUT_OFF = ;

 int medianThree(vector<int> &v, int ll, int rr)

 {

     int mm = (ll + rr) / ;

     if (v[ll] > v[mm]) {

         swap(v[ll], v[mm]);

     }

     if (v[ll] > v[rr]) {

         swap(v[ll], v[rr]);

     }

     if (v[mm] > v[rr]) {

         swap(v[mm], v[rr]);

     }

     swap(v[mm], v[rr - ]);

     return v[rr - ];

 }

 void quickSelect(vector<int> &v, int ll, int rr, int k)

 {

     // reference from "Data Structure and Algorithm Analysis in C" by Mark Allen Weiss.

     int pivot;

     int i, j;

     if (ll + CUT_OFF <=    rr) {

         pivot = medianThree(v, ll, rr);

         i = ll;

         j = rr - ;

         while (true) {

             while (v[++i] < pivot);

             while (v[--j] > pivot);

             if (i > j) {

                 break;

             }

             swap(v[i], v[j]);

         }

         swap(v[i], v[rr - ]);

         if (k < i) {

             return quickSelect(v, ll, i - , k);

         } else if (k > i) {

             return quickSelect(v, i + , rr, k);

         }

     } else {

         for (i = ll; i <= rr; ++i) {

             for (j = i + ; j <= rr; ++j) {

                 if (v[i] > v[j]) {

                     swap(v[i], v[j]);

                 }

             }

         }

     }

 }

 int main()

 {

     vector<int> v;

     vector<int> res;

     int n, k;

     int i;

     int k_small, count;

     while (cin >> n >> k && (n >  && k > )) {

         v.resize(n);

         for (i = ; i < n; ++i) {

             cin >> v[i];

         }

         // find the kth smallest number

         // this will change the order of elements

         quickSelect(v, , n - , k - );

         k_small = v[k - ];

         count = k;

         for (i = ; i < n; ++i) {

             if (v[i] < k_small) {

                 --count;

             }

         }

         for (i = ; i < n; ++i) {

             if (v[i] < k_small) {

                 res.push_back(v[i]);

             } else if (v[i] == k_small && count > ) {

                 res.push_back(v[i]);

                 --count;

             }

         }

         cout << '{';

         for (i = ; i < k; ++i) {

             i ? (cout << ' '),  : ;

             cout << res[i];

         }

         cout << '}' << endl;

         v.clear();

         res.clear();

     }

     return ;

 }

解法2：如果要求结果也是有序的，那可以用最大堆得到有序结果。时间复杂度是O(n * log(m))级别，思路和代码相比快速选择算法都更简单，不过效率低了些。

代码：

 // 18.6 Find the smallest one million number among one billion numbers.

 // Suppose one billion numbers can fit in memory.

 // I'll use a max heap, which runs in O(n * log(k)) time, returns a sorted result.

 #include <iostream>

 #include <queue>

 #include <vector>

 using namespace std;

 template <class T>

 struct myless {

     bool operator () (const T &x, const T &y) {

         return x < y;

     };

 };

 int main()

 {

     int val;

     int n, k;

     int i;

     // max heap

     priority_queue<int, vector<int>, myless<int> > q;

     vector<int> v;

     while (cin >> n >> k && (n >  && k > )) {

         k = k < n ? k : n;

         for (i = ; i < k; ++i) {

             cin >> val;

             q.push(val);

         }

         for (i = k; i < n; ++i) {

             cin >> val;

             if (q.top() > val) {

                 q.pop();

                 q.push(val);

             }

         }

         while (!q.empty()) {

             v.push_back(q.top());

             q.pop();

         }

         reverse(v.begin(), v.end());

         cout << '{';

         for (i = ; i < k; ++i) {

             i ? (cout << ' '),  : ;

             cout << v[i];

         }

         cout << '}' << endl;

         v.clear();

     }

     return ;

 }

《Cracking the Coding Interview》——第18章：难题——题目6的更多相关文章

Cracking the coding interview 第一章问题及解答
Cracking the coding interview 第一章问题及解答不管是不是要挪地方,面试题具有很好的联系代码总用,参加新工作的半年里,做的大多是探索性的工作,反而代码写得少了,不高兴,最 ...
《Cracking the Coding Interview》读书笔记
<Cracking the Coding Interview>是适合硅谷技术面试的一本面试指南,因为题目分类清晰,风格比较靠谱,所以广受推崇. 以下是我的读书笔记,基本都是每章的课后习题解 ...
Cracking the coding interview
写在开头最近忙于论文的开题等工作,还有阿里的实习笔试,被虐的还行,说还行是因为自己的水平或者说是自己准备的还没有达到他们所需要人才的水平,所以就想找一本面试的书<Cracking the co ...
Cracking the coding interview目录及资料收集
前言 <Cracking the coding interview>是一本被许多人极力推荐的程序员面试书籍, 详情可见:http://www.careercup.com/book. 第六版 ...
Cracking the Coding Interview（Trees and Graphs）
Cracking the Coding Interview(Trees and Graphs) 树和图的训练平时相对很少,还是要加强训练一些树和图的基础算法.自己对树节点的设计应该不是很合理,多多少少 ...
Cracking the Coding Interview（Stacks and Queues）
Cracking the Coding Interview(Stacks and Queues) 1.Describe how you could use a single array to impl ...
二刷Cracking the Coding Interview（CC150第五版）
第18章---高度难题 1,-------另类加法.实现加法. 另类加法参与人数:327时间限制:3秒空间限制:32768K 算法知识视频讲解题目描述请编写一个函数,将两个数字相加.不得使用+或 ...
《Cracking the Coding Interview》——第18章：难题——题目13
2014-04-29 04:40 题目:给定一个字母组成的矩阵,和一个包含一堆单词的词典.请从矩阵中找出一个最大的子矩阵,使得从左到右每一行,从上到下每一列组成的单词都包含在词典中. 解法:O(n^3 ...
《Cracking the Coding Interview》——第18章：难题——题目12
2014-04-29 04:36 题目:最大子数组和的二位扩展:最大子矩阵和. 解法:一个维度上进行枚举,复杂度O(n^2):另一个维度执行最大子数组和算法,复杂度O(n).总体时间复杂度为O(n^3 ...
《Cracking the Coding Interview》——第18章：难题——题目11
2014-04-29 04:30 题目:给定一个由‘0’或者‘1’构成的二维数组,找出一个四条边全部由‘1’构成的正方形(矩形中间可以有‘0’),使得矩形面积最大. 解法:用动态规划思想,记录二维数组 ...

随机推荐

Windows计算下载文件的SHA256 MD5 SHA1
引用自 http://blog.163.com/licanli2082@126/blog/static/35748686201284611330/ certutil -hashfile yourfil ...
Apache2.4 authz_core_module模块使用
Description: Core Authorization Status: Base Moduledentifier: authz_core_module Sourceile: mod_authz ...
java中string类型转换成map
背景:有时候string类型的数据取出来是个很标准的key.value形式,通过Gson的可以直接转成map 使用方式: Gson gson = new Gson(); Map<String, ...
类似LCS,构成目标单词(POJ2192)
题目链接:http://poj.org/problem?id=2192 解题报告: 1.类似最长公共子序列,dp[i][j]表示用s1前i个字符和s2前j个字符来构成目标单词的一部分,是否成功 2.状 ...
SSH连接linux时，长时间不操作就断开的解决方案（增强版）
1.第一次尝试失败修改/etc/ssh/sshd_config文件, 找到 ClientAliveInterval 0 ClientAliveCountMax 3 并将注释符号("#&qu ...
P1909 买铅笔
题目描述 P老师需要去商店买n支铅笔作为小朋友们参加NOIP的礼物.她发现商店一共有 33种包装的铅笔,不同包装内的铅笔数量有可能不同,价格也有可能不同.为了公平起见,P老师决定只买同一种包装的铅笔 ...
简单使用hibernate(idea中使用)
首先创建一个maven项目创建成功后,进行创建数据库的表 CREATE TABLE BOOK( ID INT AUTO_INCREMENT PRIMARY KEY, NAME ), NUMBER i ...
SpringBoot学习11：springboot异常处理方式1(自定义异常页面)
SpringBoot 默认的处理异常的机制:SpringBoot 默认的已经提供了一套处理异常的机制.一旦程序中出现了异常 SpringBoot 会向/error 的 url 发送请求.在 sprin ...
Java分享笔记：泛型机制的程序演示
package packA; import java.util.*; public class GenericDemo { public static void main(String[] args) ...
【例题收藏】◇例题·I◇ Snuke's Subway Trip
◇例题·I◇ Snuke's Subway Trip 题目来源:Atcoder Regular 061 E题(beta版) +传送门+ 一.解析 (1)最短路实现由于在同一家公司的铁路上移动是不花费 ...

《Cracking the Coding Interview》——第18章：难题——题目6

《Cracking the Coding Interview》——第18章：难题——题目6的更多相关文章

随机推荐

热门专题