How to Use OCR to Make PDF Image Text Searchable with cURL, Tutorial

Share this page

Why Use OCR to make Searchable PDF with cURL?

The pdfRest OCR PDF API Tool is a powerful resource for converting scanned documents and images into searchable and extractable text in PDF files. This tutorial will show you how to send an API call to OCR PDF with cURL, a command-line tool for transferring data using various network protocols.

Imagine you have a large number of scanned documents that you need to make searchable for easy retrieval. Using the OCR PDF API, you can automate this process, making it efficient and less prone to errors compared to manual data entry. This can be particularly useful for businesses dealing with large volumes of paperwork, such as legal firms or medical offices.

OCR PDF with cURL Code Example

curl -X POST "https://api.pdfrest.com/pdf-with-ocr-text" \
  -H "Accept: application/json" \
  -H "Content-Type: multipart/form-data" \
  -H "Api-Key: xxxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx" \
  -F "file=@/path/to/file" \
  -F "output=example_out"

Source: GitHub

Breaking Down the Code

Let's break down the provided cURL command to understand how it works:

curl -X POST "https://api.pdfrest.com/pdf-with-ocr-text"

This line initiates a POST request to the /pdf-with-ocr-text endpoint.

-H "Accept: application/json"

This header tells the server that the client expects the response to be in JSON format.

-H "Content-Type: multipart/form-data"

This header specifies that the content type of the request is multipart/form-data, which is necessary for file uploads.

-H "Api-Key: xxxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx"

This header includes the API key, which is required for authentication. Replace xxxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx with your actual API key.

-F "file=@/path/to/file"

This line specifies the file to be uploaded. Replace /path/to/file with the actual path to the file you want to upload.

-F "output=example_out"

This line specifies the output file name. In this case, the output will be named example_out.

Beyond the Tutorial

In this tutorial, you learned how to make an API call to the pdfRest OCR PDF endpoint using cURL. This allows you to convert scanned documents into searchable PDFs efficiently.

To explore more functionalities, you can demo all of the pdfRest API Tools in the API Lab. For more detailed information, refer to the API Reference Guide.

Note: This is an example of a multipart API call. Code samples using JSON payloads can be found at GitHub.