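Q: How do I convert a Jupyter notebook to a PDF document?

A: I tried several Python packages and none of them worked, including xhtml2pdf.

The official documentation says pandoc is another option, but installing a dreaded LaTeX distribution plus pandoc felt like too much trouble.

Fortunately, I found an open-source alternative: the wkhtmltopdf program.

I wrote a Python script that calls wkhtmltopdf on the command line, and that did the job. It is exactly what I was hoping for: simple and elegant.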

Introduction to wkhtmltopdf

wkhtmltopdf is a single prebuilt executable (written in C++).

The basic invocation is:

"c:\Program Files\bin\wkhtmltopdf.exe" https://github.com/mementum/backtrader/blob/master/docs2/signal_strategy/signal_strategy.rst signal_strategy.pdf

Loading pages (1/6)
Counting pages (2/6)
Resolving links (4/6)
Loading headers and footers (5/6)
Printing pages (6/6)
Done

C:\Documents and Settings\Administrator\duanqs\strategy_study>dir *.pdf

 Volume in drive C is 160GB_XP
 Volume Serial Number is EC5F-C44B

 Directory of C:\Documents and Settings\Administrator\duanqs\strategy_study

2017-04-17  14:47           120,295 signal_strategy.pdf
2017-04-17  13:32           597,111 backtest.pdf
               2 File(s)        717,406 bytes
               0 Dir(s)  19,999,031,296 bytes free

Try it on the command line first. If you need anything else, run wkhtmltopdf --help to look up the available options.
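The same call can be made from Python. This is only a minimal sketch, assuming Python 3 and that wkhtmltopdf lives at the path used in the command above; the helper names build_command and html_to_pdf are my own and are not part of wkhtmltopdf:

```python
import subprocess

# Assumed install location, copied from the command above; adjust to your machine.
WKHTMLTOPDF = r"c:\Program Files\bin\wkhtmltopdf.exe"


def build_command(source, output, options=()):
    """Return the argument list: executable, options, input page, output file."""
    return [WKHTMLTOPDF, *options, source, output]


def html_to_pdf(source, output, options=()):
    """Run wkhtmltopdf and return its exit code (0 means success)."""
    # Passing a list of arguments avoids quoting problems with spaces in paths.
    return subprocess.call(build_command(source, output, options))
```

For example, html_to_pdf("page.html", "page.pdf", ["--outline"]) would run the converter on a local file. Building the argument list separately makes it easy to print and check the command before anything is executed.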

For a very long page, you can use this command:

wkhtmltopdf.exe http://passport.yupsky.com/ac/register e:\yupskyreg.pdf -H --outline

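Here:

-H shows the extended help (long form: --extended-help).

--outline adds an outline panel on the left side of the PDF. This is the default setting.

You can also pass several input pages in one call, separated by spaces. They all go into the single output file named last.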
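Here is a sketch of what that looks like from Python. It assumes wkhtmltopdf is on the PATH and that, as in its usage synopsis, every argument before the last is an input page and the last one is the output file, so all pages end up in one PDF. combine_command and combine_to_pdf are hypothetical helper names:

```python
import subprocess


def combine_command(pages, output, exe="wkhtmltopdf"):
    """Build one wkhtmltopdf call: all input pages first, the output PDF last."""
    if not pages:
        raise ValueError("need at least one input page")
    return [exe, *pages, output]


def combine_to_pdf(pages, output):
    """Render several pages into a single PDF and return the exit code."""
    return subprocess.call(combine_command(pages, output))
```

If you would rather have one PDF per page, call the converter once per input in a loop instead.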

Python script (a wrapper that runs the wkhtmltopdf command line):


# coding: utf-8
'''
IPython/Jupyter Problems saving notebook as PDF - Stack Overflow
http://stackoverflow.com/questions/29156653/ipython-jupyter-problems-saving-notebook-as-pdf

This Python script has a GUI to select, with the file explorer, an IPython Notebook you want to convert to PDF.
The approach with wkhtmltopdf is the only approach I found that works well and provides high-quality PDFs.
Other approaches described here are problematic: syntax highlighting does not work or graphs are messed up.

You'll need to install wkhtmltopdf: http://wkhtmltopdf.org/downloads.html
and nbconvert:
    pip install nbconvert
    # OR
    conda install nbconvert
'''
# Script adapted from CloudCray
# Original Source: https://gist.github.com/CloudCray/994dd361dece0463f64a
# 2016-06-29
# This will create both an HTML and a PDF file

import subprocess
import os

try:  # Python 2, as in the original script
    from Tkinter import Tk
    from tkFileDialog import askopenfilename
except ImportError:  # Python 3 renamed these modules
    from tkinter import Tk
    from tkinter.filedialog import askopenfilename

WKHTMLTOPDF_PATH = "C:/Program Files/wkhtmltopdf/bin/wkhtmltopdf"  # or wherever you keep it


def export_to_html(filename):
    # Step 1: notebook -> HTML. Current installs spell this command
    # "jupyter nbconvert" instead of "ipython nbconvert".
    cmd = 'ipython nbconvert --to html "{0}"'
    subprocess.call(cmd.format(filename), shell=True)
    return filename.replace(".ipynb", ".html")


def convert_to_pdf(filename):
    # Step 2: HTML -> PDF with wkhtmltopdf; every path is quoted for the shell.
    cmd = '"{0}" "{1}" "{2}"'.format(WKHTMLTOPDF_PATH, filename, filename.replace(".html", ".pdf"))
    subprocess.call(cmd, shell=True)
    return filename.replace(".html", ".pdf")


def export_to_pdf(filename):
    fn = export_to_html(filename)
    return convert_to_pdf(fn)


def main():
    print("Export IPython notebook to PDF")
    print("    Please select a notebook:")

    Tk().withdraw()  # keep the root window from appearing
    x = askopenfilename()  # show an "Open" dialog box (it starts in the current folder) and return the selected path
    x = str(x.split("/")[-1])  # keep only the file name, so run the script from the notebook's folder
    print(x)

    if not x:
        print("No notebook selected.")
        return 0
    else:
        fn = export_to_pdf(x)
        print("File exported as:\n\t{0}".format(fn))
        return 1


if __name__ == "__main__":
    main()
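One fragile spot in the script is that it derives the output names with str.replace, which rewrites every ".ipynb" or ".html" found in the path, not just the extension. Below is a small sketch of an alternative, assuming Python 3. output_names is a hypothetical helper and not part of the original script:

```python
from pathlib import Path


def output_names(notebook):
    """Return the (html, pdf) paths that sit next to the given notebook."""
    nb = Path(notebook)
    if nb.suffix != ".ipynb":
        raise ValueError("not a notebook: {0}".format(notebook))
    # with_suffix swaps only the final extension and leaves the rest untouched.
    return nb.with_suffix(".html"), nb.with_suffix(".pdf")
```

With a folder named my.ipynb.backup, for instance, str.replace would also rename the folder part of the path, while with_suffix only touches the file extension.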

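For the record, here is how my attempt with xhtml2pdf went.

After installing it and writing a script, the run kept getting stuck, mainly inside the html5lib package.

The exceptions involved:

inputstream

CSS parser

and so on.

I could not get it to work, so I gave up on it.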

Install xhtml2pdf, then replace the html5lib version that pip pulled in (0.999999999) with version 1.0b8.

Here is the log:

C:\Documents and Settings\Administrator>pip install xhtml2pdf
Collecting xhtml2pdf
Downloading xhtml2pdf-0.0.6.zip (120kB)
100% |████████████████████████████████| 122kB 467kB/s
Collecting html5lib (from xhtml2pdf)
Using cached html5lib-0.999999999-py2.py3-none-any.whl
Collecting pyPdf2 (from xhtml2pdf)
Downloading PyPDF2-1.26.0.tar.gz (77kB)
100% |████████████████████████████████| 81kB 10kB/s
Requirement already satisfied: Pillow in d:\anaconda2\lib\site-packages (from xhtml2pdf)
Collecting reportlab>=2.2 (from xhtml2pdf)
Downloading reportlab-3.4.0-cp27-cp27m-win32.whl (2.1MB)
100% |████████████████████████████████| 2.1MB 261kB/s
Collecting webencodings (from html5lib->xhtml2pdf)
Downloading webencodings-0.5.1-py2.py3-none-any.whl
Requirement already satisfied: setuptools>=18.5 in d:\anaconda2\lib\site-packages (from html5lib->xhtml2pdf)
Requirement already satisfied: six in d:\anaconda2\lib\site-packages (from html5lib->xhtml2pdf)
Requirement already satisfied: pip>=1.4.1 in d:\anaconda2\lib\site-packages (from reportlab>=2.2->xhtml2pdf)
Requirement already satisfied: packaging>=16.8 in d:\anaconda2\lib\site-packages (from setuptools>=18.5->html5lib->xhtml2pdf)
Requirement already satisfied: appdirs>=1.4.0 in d:\anaconda2\lib\site-packages (from setuptools>=18.5->html5lib->xhtml2pdf)
Requirement already satisfied: pyparsing in d:\anaconda2\lib\site-packages (from packaging>=16.8->setuptools>=18.5->html5lib->xhtml2pdf)
Building wheels for collected packages: xhtml2pdf, pyPdf2
Running setup.py bdist_wheel for xhtml2pdf ... done
Stored in directory: C:\Documents and Settings\Administrator\Local Settings\Application Data\pip\Cache\wheels\ec\eb\db\13a2be9c15f492c65086709a69042924ebfb7aa4c4cc7284f1
Running setup.py bdist_wheel for pyPdf2 ... done
Stored in directory: C:\Documents and Settings\Administrator\Local Settings\Application Data\pip\Cache\wheels\86\6a\6a\1ce004a5996894d33d93e1fb1b67c30973dc945cc5875a1dd0
Successfully built xhtml2pdf pyPdf2
Installing collected packages: webencodings, html5lib, pyPdf2, reportlab, xhtml2pdf
Successfully installed html5lib-0.999999999 pyPdf2-1.26.0 reportlab-3.4.0 webencodings-0.5.1 xhtml2pdf-0.0.6

C:\Documents and Settings\Administrator>pip install html5lib==1.0b8
Collecting html5lib==1.0b8
Downloading html5lib-1.0b8.tar.gz (889kB)
100% |████████████████████████████████| 890kB 311kB/s
Requirement already satisfied: six in d:\anaconda2\lib\site-packages (from html5lib==1.0b8)
Building wheels for collected packages: html5lib
Running setup.py bdist_wheel for html5lib ... done
Stored in directory: C:\Documents and Settings\Administrator\Local Settings\Application Data\pip\Cache\wheels\d4\d1\0b\a6b6f9f204af55c9bb8c97eae2a78b690b7150a7b850bb9403
Successfully built html5lib
Installing collected packages: html5lib
Found existing installation: html5lib 0.999999999
Uninstalling html5lib-0.999999999:
Successfully uninstalled html5lib-0.999999999
Successfully installed html5lib-1.0b8

C:\Documents and Settings\Administrator>
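When a dependency gets swapped like this, it helps to confirm which version Python actually sees before running xhtml2pdf again. This is a minimal sketch, assuming Python 3.8 or newer for importlib.metadata (the session above used Python 2.7, where pip show html5lib gives the same information). installed_version is a hypothetical helper:

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of a package, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Report the two packages involved in the log above.
for name in ("html5lib", "xhtml2pdf"):
    print(name, installed_version(name))
```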
