
Enabling Imagick to Read or Manipulate PDF Files

ImageMagick is a popular tool for manipulating image files. Languages such as PHP and Node.js provide libraries built on top of it; in PHP, that library is the Imagick extension. A common use case is generating a thumbnail from an image or PDF file.

On Debian or Ubuntu, we can install the PHP Imagick extension by running the following command.

apt install php-imagick

Then, we can verify the installation by running this command.

php -m | grep imagick
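
The same check can also be done from within PHP. The following is a minimal sketch rather than part of the original setup; the function name imagickPdfStatus is made up for illustration. It relies on extension_loaded() and Imagick::queryFormats(). A registered PDF coder only means that ImageMagick knows the format; it does not by itself show that reading a PDF will succeed.

```php
<?php
// Hypothetical helper: report whether the imagick extension is loaded
// and whether ImageMagick lists a coder named PDF.
function imagickPdfStatus(): array
{
    $loaded = extension_loaded('imagick');
    // The Imagick class is only touched when the extension is loaded.
    $pdfCoder = $loaded && in_array('PDF', Imagick::queryFormats('PDF'), true);

    return ['extension' => $loaded, 'pdf_coder' => $pdfCoder];
}

var_dump(imagickPdfStatus());
```

If extension is false when the script runs through a web server but true on the command line, the two may be using different php.ini files.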

Suppose we want to generate a thumbnail image for a PDF file in PHP. We can use the following script.

<?php
$im = new Imagick();
$im->setResolution(50, 50); // set the reading resolution before reading the file
$im->readImage('file.pdf[0]'); // read only the first page of the PDF file (index 0)
// Flatten the layers onto the background to avoid transparency problems.
// (flattenImages() did this in the past but is deprecated.)
$im = $im->mergeImageLayers(Imagick::LAYERMETHOD_FLATTEN);
$im->setImageFormat('png');
$im->writeImage('file.png');

If we want a specific size for the thumbnail, we can run the following script after flattening the image.

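<?php
// Continues from the script above: $im holds the flattened image.
$imageprops = $im->getImageGeometry();
$width = $imageprops['width'];
$height = $imageprops['height'];
$baseHeight = 500;
$baseWidth = 250;
if ($width > $height) {
    // Landscape page: fix the height and scale the width proportionally.
    $newHeight = $baseHeight;
    $newWidth = (int) ceil($baseHeight * $width / $height);
} else {
    // Portrait or square page: fix the width and scale the height proportionally.
    $newWidth = $baseWidth;
    $newHeight = (int) ceil($baseWidth * $height / $width);
}
$im->resizeImage($newWidth, $newHeight, Imagick::FILTER_LANCZOS, 0.9, true);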
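
The final argument passed to resizeImage() (true) turns on best-fit mode, which keeps the aspect ratio while fitting the image inside the requested box. The arithmetic behind such a fit can be sketched in plain PHP. fitWithin is a hypothetical helper, not an Imagick function, and Imagick's own rounding may differ slightly from this sketch.

```php
<?php
// Hypothetical helper: scale ($width, $height) so that it fits inside
// ($maxWidth, $maxHeight) while keeping the aspect ratio.
function fitWithin(int $width, int $height, int $maxWidth, int $maxHeight): array
{
    if ($width <= 0 || $height <= 0) {
        throw new InvalidArgumentException('Image dimensions must be positive.');
    }
    // The smaller of the two ratios keeps both sides inside the box.
    $scale = min($maxWidth / $width, $maxHeight / $height);

    return [
        'width'  => max(1, (int) round($width * $scale)),
        'height' => max(1, (int) round($height * $scale)),
    ];
}

print_r(fitWithin(1000, 500, 250, 500)); // landscape page: 250 x 125
print_r(fitWithin(500, 1000, 250, 500)); // portrait page: 250 x 500
```

The resulting values could be passed to resizeImage() as the column and row counts, in which case the best-fit flag is no longer needed.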

If we run the thumbnail script, PHP may throw an error that looks like this:

ImagickException: not authorized `file.pdf' @ error/constitute.c/ReadImage/412

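The error is caused by ImageMagick's security policy, which restricts PDF files. ImageMagick hands PDF and PostScript files to Ghostscript for interpretation, and a crafted file can make the interpreter perform harmful actions. We can configure which operations are allowed for specific file types by editing the /etc/ImageMagick-6/policy.xml file (the directory name depends on the installed ImageMagick version). More detail is available in the ImageMagick security policy documentation. Find the following declarations and modify them as required. In this example, the line that denies access to PDF is commented out and a line that grants read and write access is added.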

<policy domain="coder" rights="none" pattern="EPS" />
<!--policy domain="coder" rights="none" pattern="PDF" /-->
<policy domain="coder" rights="none" pattern="XPS" />
<policy domain="coder" rights="read|write" pattern="PDF,PS" />
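
After changing the policy, a quick way to confirm the result is to attempt a read and report what happens. This is a minimal sketch; canReadPdf is a hypothetical helper and file.pdf is the same example file used earlier. A long-running PHP process such as PHP-FPM or Apache may need a restart before it picks up the new policy.

```php
<?php
// Hypothetical helper: try to read the first page of a PDF and describe the outcome.
function canReadPdf(string $path): string
{
    if (!extension_loaded('imagick')) {
        return 'imagick extension is not loaded';
    }
    if (!is_readable($path)) {
        return "cannot read $path";
    }
    try {
        $im = new Imagick();
        $im->readImage($path . '[0]'); // first page only
        $im->clear();
        return 'ok';
    } catch (ImagickException $e) {
        // A "not authorized" message here points back at policy.xml.
        return 'failed: ' . $e->getMessage();
    }
}

echo canReadPdf('file.pdf'), PHP_EOL;
```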

If we use PHP in a Windows environment, a few extra steps are needed to make the script work.

First, we need to download the Imagick DLL files from the PECL site. The package contains many DLL files: not only php_imagick.dll but also other libraries that the extension depends on. We can put the files in a subdirectory of the PHP ext directory, for example ext/imagick.

Because this is not the standard directory for PHP extensions, we load the extension using an absolute path in php.ini.

[Imagick]
extension="C:\path\to\php\ext\imagick\php_imagick.dll"

Next, we add the directory that holds the supporting Imagick DLL files, C:\path\to\php\ext\imagick, to the PATH environment variable.

After the previous step, if we run the program, PHP still throws an error with a message that looks like this:

PDFDelegateFailed `The system cannot find the file specified. ' @ error/pdf.c/ReadPDFImage/801

The error is caused by the absence of Ghostscript. As the last step, we need to download and install Ghostscript from the Ghostscript website. Then, we must add the bin directory of the Ghostscript installation folder to the PATH environment variable. If we use the 64-bit version, we also need to make a copy of gswin64.exe and rename the copy to gs.exe, because Imagick looks for gs.exe.
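
To confirm that PHP can see the Ghostscript executable, we can search the directories listed in PATH. This is a minimal sketch in plain PHP; findOnPath is a hypothetical helper, and the list of file names is only an assumption based on the names mentioned in this post.

```php
<?php
// Hypothetical helper: return the first matching executable found on PATH, or null.
function findOnPath(array $names, ?string $path = null): ?string
{
    $path = $path ?? (getenv('PATH') ?: '');
    foreach (explode(PATH_SEPARATOR, $path) as $dir) {
        if ($dir === '') {
            continue;
        }
        foreach ($names as $name) {
            $candidate = rtrim($dir, '/\\') . DIRECTORY_SEPARATOR . $name;
            if (is_file($candidate)) {
                return $candidate;
            }
        }
    }
    return null;
}

// gs.exe is the name Imagick looks for on Windows; gs is the usual name on Linux.
$gs = findOnPath(['gs.exe', 'gs']);
echo $gs === null ? 'Ghostscript not found on PATH' : "Found $gs", PHP_EOL;
```

If the script runs under a web server, the PATH it sees may differ from the one in our terminal, so it is worth running this check in the same environment as the application.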
