Huffman Coding 哈夫曼编码

作者：jostree 转载请注明出处 http://www.cnblogs.com/jostree/p/4096079.html

使用优先队列实现，需要注意以下几点：

1.在使用priority_queue时，内部需要存储哈夫曼树节点的指针，而不能是节点。因为构建哈夫曼树时，需要把其左右指针指向孩子，而如果储存的是节点，那么孩子的地址是会改变的。同理节点应当使用new在内存中开辟，而不能使用vector，原因是vector在数组大小为2整数次幂时，大小会倍增，开辟新数组并把老数组的数字copy过去，从而也会导致地址变化。

2.优先队列对指针的排列，需要额外写一个比较函数来比较指针指向的节点的大小。bool operator () (wcnode * node1, wcnode * node2) return node1->lessthan(node2);并在定义优先队列时使用这种方法： priority_queue <wcnode*, vector<wcnode*>, compare> 第一个参数是节点类型，第二个参数是优先队列的储存结构，第三个参数是比较函数。

3.C++在写入文件时，由于只能按字节写入，因此需要把8个bit位转化为一个字节，最后不足8位用0补齐，并记录文件总bit数，便于解码。然后写入文件。另写入二进制文件可以使用ofstream out("output.txt",std::ofstream::binary);

4.哈夫曼编码信息包括每种字符的映射，和该文件的总bit数。

其代码如下：

 #include <cstdio>

 #include <cstdlib>

 #include <iostream>

 #include <cstring>

 #include  <fstream>

 #include  <queue>

 #include  <map>

 #include  <vector>

 using namespace std;

 class compare;

 class wcnode

 {

     public:

         friend class compare;

         char word;

         int count;

         wcnode* left;

         wcnode* right;

         bool lessthan (const wcnode *w)const

         {

             return count > w->count;

         }

         wcnode(char w='\0', int c=, wcnode* l=NULL, wcnode * r=NULL)

         {

             word = w; count = c; left = l; right = r;

         }

 };

 class compare

 {

     public:

         bool operator () (wcnode * node1, wcnode * node2)

         {

             return node1->lessthan(node2);

         }

 };

 void preorder(wcnode *head, vector<bool> rec, map<char, vector<bool> > & res)

 {

     if( head->left == NULL && head->right == NULL )

     {

         res[head->word] = rec;

         return;

     }

     vector<bool> l = rec;

     l.push_back();

     vector<bool> r = rec;

     r.push_back();

     if(head->left != NULL) preorder(head->left, l, res);

     if(head->right != NULL) preorder(head->right, r, res);

 }

 map<char, vector<bool> > encode(map<char, int> &wordcount)

 {

     map<char, vector<bool> > res;

     priority_queue <wcnode*, vector<wcnode*>, compare> pq;

     map<char, int>::iterator t;

     wcnode *tmp;

     wcnode *t1, *t2, *t3;

     for( t = wordcount.begin() ; t != wordcount.end() ; t++ )

     {

         tmp = new wcnode();

         tmp->word = t->first;

         tmp->count = t->second;

         pq.push(tmp);

     }

     while( pq.size() >  )

     {

         t1 = pq.top();

         pq.pop();

         t2 = pq.top();

         pq.pop();

         t3 = new wcnode();

         t3->count = t1->count + t2->count;

         t3->left = t1;

         t3->right = t2;

         pq.push(t3);

     }

     wcnode *huffmanhead = pq.top();

     vector<bool> rec;

     preorder(huffmanhead, rec, res);

     map<char, vector<bool>  >::iterator it;

     for( it = res.begin() ; it != res.end() ; it++ )

     {

         cout<<it->first<<":";

         for( int i = ; i < it->second.size() ; i++ )

         {

             cout<<it->second[i];

         }

         cout<<", ";

     }

     return res;

 }

 void output(string s, string passage, map<char, vector<bool> > res)

 {

     ofstream out(s.c_str());

     vector<bool> bit;

     for( int i =  ; i < passage.size() ; i++ )

     {

         vector<bool> tmp = res[passage[i]];

         for( int i =  ; i < tmp.size(); i++ )

         {

             bit.push_back(tmp[i]);

         }

     }

     char outputchar = ;

     for( int i =  ; i < bit.size() ; i++ )

     {

         if( i %  ==  )

         {

            out.write(&outputchar, sizeof(outputchar));

            outputchar = ;

         }

         outputchar = outputchar + bit[i];

         outputchar = outputchar * ;

     }

     if( outputchar !=  )

     {

         out.write(&outputchar, sizeof(outputchar));

     }

     out.close();

 }

 int main(int argc, char *argv[])

 {

     char tmp;

     ifstream in("Aesop_Fables.txt");

     map <char, int> wordcount;

     map <char, vector<bool> > res;

     string passage;

     while( in.get(tmp) )

     {

         passage += tmp;

         if( wordcount.count(tmp) ==   )

         {

             wordcount[tmp] = ;

         }

         else

         {

             wordcount[tmp]++;

         }

     }

     res = encode(wordcount);

     output("outAesop.txt", passage, res);

     in.close();

 }

Huffman Coding 哈夫曼编码的更多相关文章

Huffuman Coding (哈夫曼编码)
哈夫曼编码(Huffman Coding),又称霍夫曼编码,是一种编码方式,哈夫曼编码是可变字长编码(VLC)的一种.Huffman于1952年提出一种编码方法,该方法完全依据字符出现概率来构造异字头 ...
哈夫曼(Huffman)树+哈夫曼编码
前天acm实验课,老师教了几种排序,抓的一套题上有一个哈夫曼树的题,正好之前离散数学也讲过哈夫曼树,这里我就结合课本,整理一篇关于哈夫曼树的博客. 主要摘自https://www.cnblogs.co ...
霍夫曼编码（Huffman Coding）
霍夫曼编码(Huffman Coding)是一种编码方法,霍夫曼编码是可变字长编码(VLC)的一种. 霍夫曼编码使用变长编码表对源符号(如文件中的一个字母)进行编码,其中变长编码表是通过一种评估来源符 ...
哈夫曼编码(Huffman coding)的那些事,(编码技术介绍和程序实现)
前言哈夫曼编码(Huffman coding)是一种可变长的前缀码.哈夫曼编码使用的算法是David A. Huffman还是在MIT的学生时提出的,并且在1952年发表了名为<A Metho ...
哈夫曼编码的理解(Huffman Coding)
哈夫曼编码(Huffman Coding),又称霍夫曼编码,是一种编码方式,可变字长编码(VLC)的一种.Huffman于1952年提出一种编码方法,该方法完全依据字符出现概率来构造异字头的平均长度最 ...
赫夫曼\哈夫曼\霍夫曼编码 (Huffman Tree)
哈夫曼树给定n个权值作为n的叶子结点,构造一棵二叉树,若带权路径长度达到最小,称这样的二叉树为最优二叉树,也称为哈夫曼树(Huffman Tree).哈夫曼树是带权路径长度最短的树,权值较大的结点离 ...
哈夫曼树（Huffman Tree）与哈夫曼编码
哈夫曼树(Huffman Tree)与哈夫曼编码(Huffman coding)
哈夫曼（huffman）树和哈夫曼编码
哈夫曼树哈夫曼树也叫最优二叉树(哈夫曼树) 问题:什么是哈夫曼树? 例:将学生的百分制成绩转换为五分制成绩:≥90 分: A,80-89分: B,70-79分: C,60-69分: D,<60 ...
（转载）哈夫曼编码（Huffman）
转载自:click here 1.哈夫曼编码的起源: 哈夫曼编码是 1952 年由 David A. Huffman 提出的一种无损数据压缩的编码算法.哈夫曼编码先统计出每种字母在字符串里出现的频率, ...

随机推荐

淘宝IP地址查询
官方网址:http://ip.taobao.com/index.php 相关文章: http://www.cnblogs.com/zetee/p/3482085.html http://www.cnb ...
cocos2d-x中本地推送消息
作者:HU 转载请注明,原文链接:http://www.cnblogs.com/xioapingguo/p/4038277.html IOS下很简单: 添加一条推送 void PushNotific ...
ServletContextListener 启动SPRING加载数据到缓存的应用
java 代码 public class LoadTreeForXML implements ServletContextListener { public void contextInitia ...
xcopy拷贝判断是否成功 robocopy排除子目录
xcopy \\172.16.22.65\server\*.* C:\Inetpub\wwwroot\Server /h /r /s /yif %errorlevel% neq 0 echo copy ...
JSON数据格式以及与后台交互数据转换实例
/* 作者:烟大阳仔时间:20131013 介绍:主要了解一下json的格式,看看数据是怎么存储的 */ <!DOCTYPE html PUBLIC "-//W3C//DTD HTM ...
[React Native] Reusable components with required propType
In this React Native lesson, we will be creating a reusable Badge component. The component will also ...
iOS开发——适配篇&iOS9适配
iOS9适配 1. Demo1_iOS9网络适配_ATS:改用更安全的HTTPS [摘要]iOS9把所有的http请求都改为https了:iOS9系统发送的网络请求将统一使用TLS 1.2 SSL.采 ...
java复习1 java简单介绍
在学校的时候.学JAVA学的模棱两可,半知半解.工作以后给我带来了非常大的困扰,所以我须要在学一遍.如今就開始吧... . java[1]是一种能够撰写跨平台应用软件的面向对象的程序设计语言,是由Su ...
VC6.0设置选项解读(转)
其实软件调试还是一个技术熟练过程,得慢慢自己总结,可以去搜索引擎查找一些相关的文章看看,下边是一篇关于VC6使用的小文章,贴出来大家看看: 大家可能一直在用VC开发软件,但是对于这个编译器却未必很了解 ...
信号之system函数
在http://www.cnblogs.com/nufangrensheng/p/3512291.html中已经有了一个system函数的实现,但是该版本并不执行任何信号处理.POSIX.1要求sys ...

Huffman Coding 哈夫曼编码

Huffman Coding 哈夫曼编码的更多相关文章

随机推荐

热门专题