MyBotBox

Pulse

Extract text from documents using Pulse OCR

Overview

Integrate Pulse into your workflow to extract text from PDF documents, images, and Office files. The block accepts a document by URL or by upload, and can optionally chunk the extracted content and return layout analysis.

Setup

  1. Add the Pulse block to your workflow
  2. Enter your Pulse API key
  3. Upload a document or provide a URL

Configuration

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| apiKey | string | Yes | Pulse API key |
| document | file/URL | Yes | Document to extract (PDF, images, DOCX, PPTX, XLSX; max 50MB) |
| pages | string | No | Page range (e.g., 1-3,5) |
| chunking | string | No | Chunking strategy: semantic, header, page, or recursive |
| chunkSize | number | No | Max characters per chunk |
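
The parameters above can be checked before a workflow runs. The following is a minimal sketch, not part of Pulse or MyBotBox: the helper names `parse_page_range` and `validate_config` are hypothetical, and the page-range grammar (1-based pages, inclusive ranges, comma-separated) is an assumption inferred from the single example `1-3,5`.

```python
from typing import Any

# Strategy names taken from the Configuration table.
CHUNKING_STRATEGIES = {"semantic", "header", "page", "recursive"}


def parse_page_range(pages: str) -> list[int]:
    """Expand a string such as "1-3,5" into a sorted list of page numbers.

    Assumption: pages are 1-based and ranges are inclusive. The exact
    grammar Pulse accepts is not documented here.
    """
    result: set[int] = set()
    for part in pages.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
            if start < 1 or end < start:
                raise ValueError(f"invalid page range: {part!r}")
            result.update(range(start, end + 1))
        else:
            page = int(part)
            if page < 1:
                raise ValueError(f"invalid page number: {part!r}")
            result.add(page)
    return sorted(result)


def validate_config(config: dict[str, Any]) -> list[str]:
    """Return a list of problems found in a (hypothetical) config dict."""
    problems: list[str] = []
    for required in ("apiKey", "document"):
        if not config.get(required):
            problems.append(f"missing required parameter: {required}")

    chunking = config.get("chunking")
    if chunking is not None and chunking not in CHUNKING_STRATEGIES:
        problems.append(f"unknown chunking strategy: {chunking}")

    chunk_size = config.get("chunkSize")
    if chunk_size is not None:
        is_number = isinstance(chunk_size, (int, float)) and not isinstance(chunk_size, bool)
        if not is_number or chunk_size <= 0:
            problems.append("chunkSize must be a positive number")

    pages = config.get("pages")
    if pages is not None:
        if not isinstance(pages, str):
            problems.append("pages must be a string")
        else:
            try:
                parse_page_range(pages)
            except ValueError as exc:
                problems.append(f"pages: {exc}")
    return problems
```

For example, `validate_config({"apiKey": "k", "document": "https://example.com/a.pdf", "pages": "1-3,5"})` returns an empty list, while an unrecognized `chunking` value produces one entry describing the problem.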

Tools

pulse_parser

Extracts text and structure from documents using Pulse OCR.

Output

| Parameter | Type | Description |
| --- | --- | --- |
| markdown | string | Extracted content in markdown format |
| page_count | number | Number of pages |
| job_id | string | Unique job identifier |
| bounding_boxes | json | Bounding box layout information |
| html | string | HTML content if requested |
| structured_output | json | Structured output if schema provided |
| chunks | json | Chunked content if chunking enabled |
| figures | json | Extracted figures if enabled |
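
A downstream step will usually need to tell which of these fields were actually returned. The sketch below assumes the tool output arrives as a plain dictionary keyed by the parameter names above, and that `markdown`, `page_count`, and `job_id` are always present while the remaining fields may be absent or null. Neither assumption is confirmed by this page, and `describe_result` is a hypothetical helper.

```python
from typing import Any

# Field names come from the Output table. Which ones are always present
# is an assumption based on the "if ..." wording of the descriptions.
ALWAYS_PRESENT = ("markdown", "page_count", "job_id")
OPTIONAL = ("bounding_boxes", "html", "structured_output", "chunks", "figures")


def describe_result(result: dict[str, Any]) -> dict[str, Any]:
    """Summarize a pulse_parser result without assuming nested shapes."""
    missing = [name for name in ALWAYS_PRESENT if name not in result]
    if missing:
        raise KeyError(f"result is missing fields: {', '.join(missing)}")
    return {
        "job_id": result["job_id"],
        "page_count": result["page_count"],
        "markdown_chars": len(result["markdown"]),
        # Only report optional fields that carry a value.
        "optional_fields": [n for n in OPTIONAL if result.get(n) is not None],
    }


# Hypothetical sample value, for illustration only.
sample = {
    "markdown": "# Title\n\nBody",
    "page_count": 1,
    "job_id": "job-123",
    "html": None,
}
summary = describe_result(sample)
print(summary["optional_fields"])
```

The helper deliberately avoids inspecting the contents of `chunks`, `figures`, or `bounding_boxes`, because their internal structure is not described here.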

Supported Formats

  • PDF documents
  • Images (JPEG, PNG)
  • Microsoft Office files (DOCX, PPTX, XLSX)
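
A document can be pre-checked against this list and the 50MB limit before it is sent. This is a minimal sketch under stated assumptions: the file type is inferred from the file extension only, `.jpg` is treated as a JPEG extension alongside `.jpeg`, and "50MB" is read as 50 × 1024 × 1024 bytes. How Pulse itself detects formats or measures size is not specified here, and both function names are hypothetical.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions derived from the Supported Formats list (".jpg" is an assumption).
SUPPORTED_EXTENSIONS = {".pdf", ".jpg", ".jpeg", ".png", ".docx", ".pptx", ".xlsx"}

# Assumption: "50MB" means 50 MiB; the service may count bytes differently.
MAX_FILE_SIZE_BYTES = 50 * 1024 * 1024


def is_supported_document(name_or_url: str) -> bool:
    """Check the extension of a file name or of the path part of a URL."""
    path = urlparse(name_or_url).path or name_or_url
    return PurePosixPath(path).suffix.lower() in SUPPORTED_EXTENSIONS


def is_within_size_limit(size_bytes: int) -> bool:
    """Return True for a non-empty file at or below the assumed limit."""
    return 0 < size_bytes <= MAX_FILE_SIZE_BYTES
```

An extension check is only a quick filter; a file with a supported extension can still be rejected if its contents do not match.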

Notes

  • Category: tools
  • Type: pulse
  • Maximum file size: 50MB