The PDF should be: - either streamed or parsed externally before being sent to the LLM - or converted into structured text chunks (not raw base64) The agent should receive: - structured text - or extracted segments - NOT a raw base64 PDF representation

crewai - 💡(How to fix) Fix [BUG] input_files (PDFFile) are passed as base64 via read_file tool, causing context overflow and inconsistent LLM behavior [1 pull requests]

crewai2026-05-26 12:58:47

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

When using input_files with PDFFile (or File), CrewAI does not appear to handle the file as a native file input at the provider level.

Instead, the file is processed via the read_file tool and its content is returned as a binary/base64 representation. This content is then indirectly injected into the agent execution context.

As a result:

The PDF is effectively treated as inline binary data (base64)
The LLM context becomes extremely large
Responses become inconsistent or fail due to context overflow
The same file is re-processed during agent execution via tools

This makes PDFFile unreliable for large or even medium-sized documents.

Root Cause

This issue blocks any production usage of PDF ingestion in multi-step CrewAI flows, because:

Fix Action

Fixed

Fixed by PR: Fix #5930: Extract text from PDFs in ReadFileTool instead of returning base64 (https://github.com/crewAIInc/crewAI/pull/5932)

Code Example

from crewai import Agent, Task, Crew
from crewai_files import PDFFile

agent = Agent(
    role="Document Analyst",
    goal="Extract structured information from PDFs",
    backstory="Expert in document analysis",
    llm="gpt-4o-mini",
)

task = Task(
    description="""
    Read the PDF document {doc}
    Extract the main sections and summarize them precisely.
    """,
    expected_output="Structured list of sections",
    agent=agent,
)

crew = Crew(agents=[agent], tasks=[task], verbose=True)

result = crew.kickoff(
    input_files={
        "doc": PDFFile(source="./src/test_crewai_files/pdfs/sample.pdf")
    }
)

print(result)

---

from crewai import Agent, Task, Crew
from crewai_files import PDFFile

agent = Agent(
    role="Document Analyst",
    goal="Extract structured information from PDFs",
    backstory="Expert in document analysis",
    llm="gpt-4o-mini",
)

task = Task(
    description="""
    Read the PDF document {doc}
    Extract the main sections and summarize them precisely.
    """,
    expected_output="Structured list of sections",
    agent=agent,
)

crew = Crew(agents=[agent], tasks=[task], verbose=True)

result = crew.kickoff(
    input_files={
        "doc": PDFFile(source="./src/test_crewai_files/pdfs/sample.pdf")
    }
)

print(result)

---

Tool Execution Started (#3)

Tool: read_file
Args: {'file_name': 'doc'}

Tool Execution Completed (#3)

Tool Completed
Tool: read_file
Output: [Binary file: sample.pdf (application/pdf)]
Base64:
...

RAW_BUFFERClick to expand / collapse

Description

When using input_files with PDFFile (or File), CrewAI does not appear to handle the file as a native file input at the provider level.

Instead, the file is processed via the read_file tool and its content is returned as a binary/base64 representation. This content is then indirectly injected into the agent execution context.

As a result:

The PDF is effectively treated as inline binary data (base64)
The LLM context becomes extremely large
Responses become inconsistent or fail due to context overflow
The same file is re-processed during agent execution via tools

This makes PDFFile unreliable for large or even medium-sized documents.

Steps to Reproduce

Create a minimal CrewAI setup:

from crewai import Agent, Task, Crew
from crewai_files import PDFFile

agent = Agent(
    role="Document Analyst",
    goal="Extract structured information from PDFs",
    backstory="Expert in document analysis",
    llm="gpt-4o-mini",
)

task = Task(
    description="""
    Read the PDF document {doc}
    Extract the main sections and summarize them precisely.
    """,
    expected_output="Structured list of sections",
    agent=agent,
)

crew = Crew(agents=[agent], tasks=[task], verbose=True)

result = crew.kickoff(
    input_files={
        "doc": PDFFile(source="./src/test_crewai_files/pdfs/sample.pdf")
    }
)

print(result)

Run the flow: crewai run
Observe agent execution logs with verbose=True

Expected behavior

The PDF should be:

either streamed or parsed externally before being sent to the LLM
or converted into structured text chunks (not raw base64)

The agent should receive:

structured text
or extracted segments
NOT a raw base64 PDF representation

Screenshots/Code snippets

from crewai import Agent, Task, Crew
from crewai_files import PDFFile

agent = Agent(
    role="Document Analyst",
    goal="Extract structured information from PDFs",
    backstory="Expert in document analysis",
    llm="gpt-4o-mini",
)

task = Task(
    description="""
    Read the PDF document {doc}
    Extract the main sections and summarize them precisely.
    """,
    expected_output="Structured list of sections",
    agent=agent,
)

crew = Crew(agents=[agent], tasks=[task], verbose=True)

result = crew.kickoff(
    input_files={
        "doc": PDFFile(source="./src/test_crewai_files/pdfs/sample.pdf")
    }
)

print(result)

Operating System

Ubuntu 24.04

Python Version

3.12

crewAI Version

v.1.14.4

crewAI Tools Version

v.1.14.4

Virtual Environment

Venv

Evidence

Verbose output evidence

Tool Execution Started (#3)

Tool: read_file
Args: {'file_name': 'doc'}

Tool Execution Completed (#3)

Tool Completed
Tool: read_file
Output: [Binary file: sample.pdf (application/pdf)]
Base64:
...

Possible Solution

None

Additional context

This issue blocks any production usage of PDF ingestion in multi-step CrewAI flows, because:

context size grows linearly with file size
multiple tasks re-trigger file expansion
sequential workflows amplify token explosion

A safer architecture would:

load file once
extract structured representation once
reuse extracted representation across tasks without re-injecting raw binary

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

FAQ

Expected behavior

The PDF should be:

either streamed or parsed externally before being sent to the LLM
or converted into structured text chunks (not raw base64)

The agent should receive:

structured text
or extracted segments
NOT a raw base64 PDF representation

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

crewai - 💡(How to fix) Fix [BUG] input_files (PDFFile) are passed as base64 via read_file tool, causing context overflow and inconsistent LLM behavior [1 pull requests]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fixed

Code Example

Description

Steps to Reproduce

Expected behavior

Screenshots/Code snippets

Operating System

Python Version

crewAI Version

crewAI Tools Version

Virtual Environment

Evidence

Verbose output evidence

Possible Solution

Additional context

FAQ

Expected behavior

Still need to ship something?

TRENDING