There are many instances when you want to take the screenshot of a webpage from within a php script or the command line. On ubuntu there are several ways to do it and most of them produce a webkit , gecko or khtml rendered screenshot image.
Some of the methods require X session to open a window and take screenshots. So VNC can be used to run in on servers for example. To Setup a VNC server on Ubuntu read this article.
1. wkhtmltopdf
Url :
http://code.google.com/p/wkhtmltopdf/
wkhtmltopdf is a command line utility that converts html to pdf using webkit rendering engine.
Install :
sudo apt-get install wkhtmltopdf
Usage :
1 |
$ wkhtmltopdf www.google.com google.pdf |
2 |
Loading page (1/2) |
3 |
Printing pages (2/2) |
4 |
Done |
Now the google.pdf file should be there in your home directory.
2. wkhtmltoimage
Url :
http://code.google.com/p/wkhtmltopdf/
Docs :
http://madalgo.au.dk/~jakobt/wkhtmltoxdoc/wkhtmltoimage_0.10.0_rc2-doc.html
It uses the webkit rendering engine.
Download from : http://code.google.com/p/wkhtmltopdf/downloads/list
Extract the archive.
Usage :
1 |
$ ./wkhtmltoimage-amd64 http: //www.google.com google.png
|
It should create google.png in home directory with the screenshot of www.google.com
Other options :
Quality – Controls the quality/compression of the generation image. Default is 94
1 |
$ ./wkhtmltoimage-amd64 --quality 50 http: //www.google.com google.png
|
Disable images
1 |
$ ./wkhtmltoimage-amd64 --no-images http: //www.google.com google.png
|
Disable javascript
1 |
$ ./wkhtmltoimage-amd64 --disable-javascript http: //www.google.com google.png
|
Crop the screenshot
1 |
$ ./wkhtmltoimage-amd64 --crop-h 300 --crop-w 300 --crop-x 0 --crop-y 0 http: //www.google.com google.png
|
Advantages :
1. Can automatically determine the height of the page to take full page screenshots unlike most other utilities.
Disadvantages :
1. Fails many times due to unknown reasons with an error saying “Painter not active”.
2. Cannot render cufon fonts and flash animations. Sometimes it even fails in jquery animations which take long time to load.
3. cutycapt
Url :
http://cutycapt.sourceforge.net/
Install :
sudo apt-get install subversion libqt4-webkit libqt4-dev g++ cutycapt
Usage :
1 |
$ cutycapt --url=http: //www.google.com/ --out=google.png
|
It should create a google.png file in home directory which would have the screenshot of www.google.com
3. khtml2png
khtml2png uses the konqueror rendering engine to create screenshots of web pages.
Download : http://khtml2png.sourceforge.net/index.php?page=download
Install :
1. sudo apt-get install kdelibs4-dev zlib1g-dev g++ cmake
2. Extract the khtml2png archive.
3. ./configure
4. make
5. sudo checkinstall (this will create a deb file and install it , so that it can easily uninstalled later).
Usage :
1 |
$ khtml2png2 --width 800 --height 600 http: //www.google.com/ google.png
|
2 |
kbuildsycoca running... |
3 |
DCOP Cleaning up dead connections. |
This would create a google.png in home directory with the screenshot of www.google.com.
This method requires a running X session. To run it from VNC use as :
Start vnc server :
1 |
$ vncserver :12 |
2 |
New 'enlightened-desktop:12 (enlightened)' desktop is enlightened-desktop:12
|
3 |
Starting applications specified in /home/enlightened/.vnc/xstartup |
4 |
Log file is /home/enlightened/.vnc/enlightened-desktop:12. log
|
Run the program in the vnc server :
1 |
$ DISPLAY=:12 khtml2png2 --width 1024 --height 768 http://www.google.com/ google.png |
2 |
Xlib: extension "XInputExtension" missing on display ":12" .
|
3 |
Failed to get list of devices |
4 |
Xlib: extension "XInputExtension" missing on display ":12" .
|
5 |
Failed to get list of devices |
6 |
kbuildsycoca running... |
Use with xvfb :
1 |
$ xvfb-run --server-args= "-screen 0, 1024x768x24" khtml2png2 --width 800 --height 600 http://www.google.com/ google.png
|
4. PyWebShot
Url :
http://www.coderholic.com/pywebshot-generate-website-thumbnails-using-python/
https://github.com/coderholic/PyWebShot
Pywebshot uses python bindings embedded mozilla ( http://www.mozilla.org/unix/gtk-embedding.html )
Install :
1. sudo apt-get install python-gtkmozembed
2. Download pywebshot from https://github.com/coderholic/PyWebShot
Usage :
1 |
$ python pywebshot.py www.google.com -t 1024x768& lt ;br /& gt ;<br />Loading www.google.com... saved as www.google.com.png
|
It should create a www.google.com.png in the directory which has the screenshot of size 1024 x 768.
This method requires a running X session , since it opens a window to load the url and then save its screenshot.
So VNC can be used like this :
start vnc server :
1 |
$ vncserver :12 |
2 |
New 'enlightened-desktop:12 (enlightened)' desktop is enlightened-desktop:12
|
3 |
Starting applications specified in /home/enlightened/.vnc/xstartup
|
4 |
Log file is /home/enlightened/.vnc/enlightened-desktop:12.log
|
Run pywebshot :
1 |
$ DISPLAY=:12 python pywebshot.py www.google.com -t 1024x768 |
2 |
Loading www.google.com... saved as www.google.com.png |
Advantages
1. Can succesfully render cufon fonts, complex/long loading jquery animations.
Disadvantages
1. Cannot automatically determine page height to take full page screenshot.
Workaround: If dimensions are available then the dimensions can be specified as the screensize and using such parameters with a virtual monitor can give full page screenshots.
2. Opens up an annoying browers every time on the desktop.
Workaround: Use xvfb.
5. python-webkit2png
Url :
1. http://www.blogs.uni-osnabrueck.de/rotapken/2008/12/03/create-screenshots-of-a-web-page-using-python-and-qtwebkit/
2. https://github.com/AdamN/python-webkit2png/
Install :
1. sudo apt-get install python-qt4 libqt4-webkit
2. Download package from github and extract the archive.
Usage :
1 |
$ python webkit2png.py -o google.png http://www.google.com/ |
It should create a google.png file in the directory with the screenshot of www.google.com
Use with xvfb :
On a webserver since Xorg is not running , xvfb can bed used to take the screenshot.
1 |
$ xvfb-run --server-args= "-screen 0, 1024x768x24" python webkit2png.py -o google.png http://www.google.com/
|
Advantages
1. Can succesfully render cufon fonts, complex/long loading jquery animations.
Disadvantages
1. Fails at CSS3 fonts.
2. Cannot automatically determine page height to take full page screenshot.
Workaround: If dimensions are available then the dimensions can be specified as the screensize and using such parameters with a virtual monitor can give full page screenshots
3. Opens up an annoying browers every time on the desktop
Workaround: Use xvfb,vnc.
The above tools and techniques can be used on a webserver with a language like PHP.