Problem:

Given n points on a 2D plane, find the maximum number of points that lie on the same straight line.

Suppose that the structure Point is already defined in as following:

/**
* Definition for a point.
* struct Point {
* int x;
* int y;
* Point() : x(0), y(0) {}
* Point(int a, int b) : x(a), y(b) {}
* };
*/

Solution:

First idea coming to my mind is brute-force method which can be done in O(n^2) time...however, after I thinked hard and did some research, what I have is still O(n^2) solution. The brute-force solution is very straightforward, by given n points, there are at most n^2 different lines. To sovle the problem, we just check all possible lines and count the number of points on it. The naive approach is in O(n^3) time. However, if we check the lines of every pair of points and use a hash table to store the lines, we can do it in O(n^2) time.

Also, there is another way to solve the problem by converting the 2D-plane to its dual space. In a word, map the (points, lines)-space into a (lines, points) space. The n points are converted into n lines in the dual space, and the intersection of lines corresponds to the lines passing through those points in the original space. Therefore, instead of solving the problem "find the line pass through most points by given n points", we are asked to solve the problem "find the intersection points that most lines pass through by given n lines". The bad news is that, the time complexity to solve this new problem seems to be same as O(n^2)...

And I also tried to solve the problem from a algorithm view, use Dynamic Programming. Let M[n] be the solution to the problem with points P1, P2, ..., Pn. Then we have the recursive function as follows:

  M[n] = n, if n <= 2

  M[n] = max( BestSolutionPassingPoint(Pn, {P1...Pn-1}), M[n-1] ), otherwise

However, this DP is just a very simple divide-and-conquer idea. Suppose we have points P1, ..., Pn, there are only two non-overlapped cases: (1) The solution line passes through the point Pn; or (2) The solution lines does not pass through the point Pn. If case (1), then we only need to find the number of points lie on the line that also passes through Pn (the BestSolutionPassingPoint function). This can be done in O(n^2) time by using hash technique but much easier than brute-force method (I will explain it later). If case (2), removing the Pn does not change anything, which means the solution for P1,...,Pn-1 is equal to that for P1,...,Pn.

Finally, I decided to use the DP approach for two reasons:

1. The line representation and hashing lines. The key of the algorithm is to represent the line passing through two points and hash the line. Let (x1,y1) and (x2,y2) be two points, then we can represent a line by kx+y=d. However, this representation has a drawback that it is not able to cover the case of x1==x2. (Another representation is to use ax+by=c, but personally I do not like it since it requires gcd and more values to hash.) Therefore, the pair (k,d) is unique to a line, so it can be used as the hash key. However, in DP approach, all lines we concern are passing through a same point. Then we considered this point as an origin, then all lines can represented as y=kx or x=c for lines parallel to the x-axis. So we can use the value of k only as the hash key. Note that the line x=c should be considered seperately.

2. Duplicate points. We have to consider the case that there exist same points in the point set. If we use brute-force approach, we need to scan the points and count the duplicates first. For DP approach, it becomes much easier to handle the duplicates. To calculate BestSolutionPassingPoint(P, S), if there exist same point to P in S, lets say Q. What we are going to do is just to add 1 to every count since all lines must pass through Q since they passes through P.

The C++ code should like:

/**
* Definition for a point.
* struct Point {
* int x;
* int y;
* Point() : x(0), y(0) {}
* Point(int a, int b) : x(a), y(b) {}
* };
*/
#include <unordered_map> class Solution {
public:
static const int EPSILON = 1000000; // float precision purpose
std::unordered_map<int, int> lines; // We use the unordered_map, key is the slope of the line and value is the count of points
int find_sub_max_points(vector<Point> &points, int last) {
lines.clear(); // Clear the hash table
int num_same_x = 0; // count the number of points with same x-coordinate of Point[last]
int num_same_point = 1;
const int x0 = points[last].x;
const int y0 = points[last].y;
int x,y,tmp;
int res = 0;
for (int i = last-1; i >= 0; i--) {
x = points[i].x;
y = points[i].y;
if (x == x0) {
if (y == y0) num_same_point++;
else if(++num_same_x > res)
res = num_same_x;
}
else {
tmp = int( (y-y0) * EPSILON ) / (x-x0);
if (++lines[tmp] > res) res = lines[tmp];
/* the line above is equal to following lines
std::unordered_map<int,int>::const_iterator got = lines.find (tmp);
if (got == lines.end())
lines[tmp] = 1;
else
lines[tmp]++;
if (lines[tmp]>res) res = lines[tmp];
*/
}
}
return res+num_same_point; // Point[last] itself should be counted
}
int maxPoints(vector<Point> &points) {
int n = points.size();
if (n <= 2) return n;
int res = 2;
int tmp = 0;
for(int i=2; i<n; i++) {
tmp = find_sub_max_points(points, i);
if (tmp > res) res = tmp;
}
return res;
}
};

C++ basics:

In C++, we can use <unordered_map> as hash table. And there is a very quick way to do the following update:

if the key is not in contained in the hash table, then set the key with value 1;

otherwise, add 1 to the key's value.

This udpate can be done in one line (see code) since the HashTable[key] is set to 0 if the key does not exist.

【LeetCode OJ】Max Points on a Line的更多相关文章

  1. 【leetcode】Max Points on a Line

    Max Points on a Line 题目描述: Given n points on a 2D plane, find the maximum number of points that lie ...

  2. 【LeetCode OJ】Interleaving String

    Problem Link: http://oj.leetcode.com/problems/interleaving-string/ Given s1, s2, s3, find whether s3 ...

  3. 【LeetCode OJ】Reverse Words in a String

    Problem link: http://oj.leetcode.com/problems/reverse-words-in-a-string/ Given an input string, reve ...

  4. 【leetcode】Max Points on a Line(hard)☆

    Given n points on a 2D plane, find the maximum number of points that lie on the same straight line. ...

  5. 【leetcode刷题笔记】Max Points on a Line

    Given n points on a 2D plane, find the maximum number of points that lie on the same straight line. ...

  6. 【LeetCode OJ】Balanced Binary Tree

    Problem Link: http://oj.leetcode.com/problems/balanced-binary-tree/ We use a recursive auxilar funct ...

  7. 【LeetCode OJ】Best Time to Buy and Sell Stock III

    Problem Link: http://oj.leetcode.com/problems/best-time-to-buy-and-sell-stock-iii/ Linear Time Solut ...

  8. 【LeetCode OJ】Binary Tree Maximum Path Sum

    Problem Link: http://oj.leetcode.com/problems/binary-tree-maximum-path-sum/ For any path P in a bina ...

  9. 【LeetCode OJ】Longest Consecutive Sequence

    Problem Link: http://oj.leetcode.com/problems/longest-consecutive-sequence/ This problem is a classi ...

随机推荐

  1. Hbase0.98的环境搭建

    0 安装前提: jDK7 ,hadoop1.1.2 1 下载与hadoop对应的hbase版本: http://mirror.bit.edu.cn/apache/hbase/hbase-0.98.12 ...

  2. Touch ID集成

    作者感言 这个国庆由于种种原因, 过的不太安稳, 搬家, 办证, 东跑西跑, 忙的压根就不像是在过节....不过算了, 挑最后一天写写博文.最后:如果你有更好的建议或者对这篇文章有不满的地方, 请联系 ...

  3. ECC中的CRM UI端摆弄

    前段时间想搞CRM了,可是公司没有环境,就去ECC直接试试事务码,结果竟然可以打开网页...兴奋之余又去看了一下CRM里的一些CLASS,结果很多都是没有的.沮丧! 后来想想,只能用UI的框架,挂WD ...

  4. hadoop2.0 和1.0的区别

    1. Hadoop 1.0中的资源管理方案Hadoop 1.0指的是版本为Apache Hadoop 0.20.x.1.x或者CDH3系列的Hadoop,内核主要由HDFS和MapReduce两个系统 ...

  5. Objective-C:Foundation框架-常用类-NSMutableString

    NSString是不可变的,不能删除字符或修改字符,它有一个子类NSMutableString,为可变字符串. NSMutableString的两种创建方法: - (id) initWithCapac ...

  6. linux 安装 apache

    1.  系统基本信息 CentOS  6.4   内存2G  硬盘 200G   cpu 4核  (cat /proc/cpuinfo |grep 'processor'|wc -l  查看cpu核数 ...

  7. java之socket

    socket系java中网络编程的内容. 1客户端: package com.wtd.socket; import java.io.BufferedReader; import java.io.IOE ...

  8. C#重启系统代码

    C#重启窗体代码 System.Diagnostics.Process.Start(System.Reflection.Assembly.GetExecutingAssembly().Location ...

  9. Adriod—— DVM

    Android 运行环境主要指的虚拟机技术——Dalvik.Android中的所有Java程序都是运行在Dalvik VM上的.Android上的每个程序都有自己的线程,DVM只执行.dex的Dalv ...

  10. FOJ 2105 Digits Count

    题意:对一串数字进行抑或某数,和某数,或某数,统计某区间和的操作. 思路:因为化成二进制就4位可以建4颗线段树,每颗代表一位二进制. and 如果该为是1  直接无视,是0则成段赋值为0: or  如 ...