LlamaParse

Parse documents (PDFs, images, spreadsheets, presentations — 130+ formats) into LLM-ready text, markdown, and structured data using the LlamaParse API.

Prerequisites

- Python package: llama-cloud>=1.0 (pip install llama-cloud)
API key: Set LLAMA_CLOUD_API_KEY environment variable. Get one at https://cloud.llamaindex.ai

Verify setup:

CODEBLOCK0

Quick Start

CODEBLOCK1

Core Concepts

Tiers (required — choose one)

Tier	Use Case	Cost
INLINECODE3	Maximum accuracy, complex layouts, charts	Highest
INLINECODE4

Always specify both tier and version. Use version="latest" for dev, or a date string like "2026-01-08" for production reproducibility.

Output Views (expand parameter)

Request one or more in the expand list:

- markdown — Structured markdown with headings, lists, tables. Best for RAG/LLM pipelines.
text — Clean flattened text per page. Good for search/retrieval.
items — Structured tree of page elements (headers, paragraphs, tables, figures) with bounding boxes. Use for layout-aware processing.
metadata — Document metadata.
images_content_metadata — Image/screenshot metadata with presigned URLs.

Access results: result.markdown.pages[i].markdown, result.text.pages[i].text, INLINECODE19

Output Options

Control markdown rendering:

CODEBLOCK2

Processing Options

CODEBLOCK3

Custom Prompts (Agentic Parsing Instructions)

Guide the parser like an LLM — useful for extracting specific data or transforming output:

CODEBLOCK4

Common Workflows

Parse a single document

Use scripts/parse_document.py:

CODEBLOCK5

Batch parse a folder

Use scripts/batch_parse.py:

CODEBLOCK6

Extract tables from a document

Request items in expand, then filter for table items:

CODEBLOCK7

Extract chart data

Enable specialized chart parsing, then pull table rows from the chart page:

CODEBLOCK8

Download page screenshots

CODEBLOCK9

API Reference

For complete API details, see references/api-reference.md.

External Service & Security

This skill uses the LlamaParse API (https://cloud.llamaindex.ai), a cloud document parsing service by LlamaIndex.

- API key required: You must set the LLAMA_CLOUD_API_KEY environment variable. Get a key at https://cloud.llamaindex.ai.
Data sent externally: Documents are uploaded to the LlamaParse API for server-side parsing. Parsed results are returned to your local machine.
No other network calls: The scripts only communicate with api.cloud.llamaindex.ai. Screenshot downloads use presigned URLs from the same service.
Scripts are reference utilities: scripts/parse_document.py and scripts/batch_parse.py are helper scripts meant to be run manually by the user. They are not executed automatically by the skill.

Tips

- Request only the expand views you need — more views = larger response + higher latency.
Use agentic_plus tier with specialized_chart_parsing for documents with charts/graphs.
For production, pin a specific version date instead of "latest".
Use semaphore-based concurrency for batch parsing to respect rate limits.
The items view provides bounding boxes (b_box) for each element — useful for spatial analysis.

LlamaParse

使用LlamaParse API将文档（PDF、图片、电子表格、演示文稿等130+格式）解析为适用于LLM的文本、Markdown和结构化数据。

前置条件

- Python包： llama-cloud>=1.0（pip install llama-cloud）
API密钥： 设置LLAMACLOUDAPI_KEY环境变量。在 https://cloud.llamaindex.ai 获取密钥

验证配置：

bash
pip install llama-cloud>=1.0
export LLAMACLOUDAPI_KEY=llx-...

快速开始

python
from llama_cloud import AsyncLlamaCloud
import asyncio

async def parsedocument(filepath: str):
client = AsyncLlamaCloud() # 使用LLAMACLOUDAPI_KEY环境变量
file = await client.files.create(file=file_path, purpose=parse)
result = await client.parsing.parse(
file_id=file.id,
tier=agentic,
version=latest,
expand=[markdown, text],
)
return result

result = asyncio.run(parse_document(document.pdf))
print(result.markdown.pages[0].markdown)

核心概念

层级（必选——选择一项）

层级	使用场景	成本
agentic_plus	最高精度，复杂布局，图表	最高
agentic

始终同时指定tier和version。开发环境使用version=latest，生产环境可复现性使用日期字符串如2026-01-08。

输出视图（expand参数）

在expand列表中请求一个或多个：

- markdown — 包含标题、列表、表格的结构化Markdown。最适合RAG/LLM流水线。
text — 每页的纯文本。适合搜索/检索。
items — 页面元素（标题、段落、表格、图形）的结构化树，包含边界框。适用于布局感知处理。
metadata — 文档元数据。
imagescontentmetadata — 包含预签名URL的图像/截图元数据。

访问结果：result.markdown.pages[i].markdown、result.text.pages[i].text、result.items.pages[i].items

输出选项

控制Markdown渲染：

python
output_options={
markdown: {
tables: {
outputtablesas_markdown: True, # 或False使用HTML表格
},
},
imagestosave: [screenshot], # 保存页面截图
}

处理选项

python
processing_options={
ignore: {ignorediagonaltext: True},
ocr_parameters: {languages: [en]}, # OCR语言提示
specializedchartparsing: agentic_plus, # 将图表提取为结构化数据
}

自定义提示（代理解析指令）

像指导LLM一样引导解析器——适用于提取特定数据或转换输出：

python
from llamacloud.types.parsingcreate_params import (
ProcessingOptions, ProcessingOptionsAutoModeConfiguration,
ProcessingOptionsAutoModeConfigurationParsingConf
)

result = await client.parsing.parse(
file_id=file.id,
tier=agentic,
version=latest,
expand=[markdown],
processing_options=ProcessingOptions(
automodeconfiguration=[ProcessingOptionsAutoModeConfiguration(
parsing_conf=ProcessingOptionsAutoModeConfigurationParsingConf(
custom_prompt=仅从该收据中提取价格和总额。
)
)]
),
)

常见工作流

解析单个文档

使用scripts/parse_document.py：

bash
python scripts/parse_document.py document.pdf --tier agentic --output markdown,text

批量解析文件夹

使用scripts/batch_parse.py：

bash
python scripts/batch_parse.py ./documents/ --tier agentic --max-concurrent 5

从文档中提取表格

在expand中请求items，然后过滤表格项：

python
for page in result.items.pages:
for item in page.items:
if hasattr(item, rows): # 表格项
print(f第{page.page_number}页的表格：{len(item.rows)}行)
# 可使用item.csv、item.html、item.md

提取图表数据

启用专门的图表解析，然后从图表页面提取表格行：

python
result = await client.parsing.parse(
file_id=file.id,
tier=agentic_plus,
version=latest,
processingoptions={specializedchartparsing: agenticplus},
expand=[items],
)

下载页面截图

python
import httpx, re

result = await client.parsing.parse(
file_id=file.id, tier=agentic, version=latest,
outputoptions={imagesto_save: [screenshot]},
expand=[imagescontentmetadata],
)

for img in result.imagescontentmetadata.images:
if img.presignedurl and re.match(r^page\d+\.jpg$, img.filename):
async with httpx.AsyncClient() as http:
resp = await http.get(img.presigned_url)
with open(img.filename, wb) as f:
f.write(resp.content)

API参考

完整API详情请参阅references/api-reference.md。

外部服务与安全

本技能使用LlamaParse API（https://cloud.llamaindex.ai），这是LlamaIndex提供的云端文档解析服务。

- 需要API密钥： 您必须设置LLAMACLOUDAPIKEY环境变量。在 https://cloud.llamaindex.ai 获取密钥。
外部发送数据： 文档会上传到LlamaParse API进行服务端解析。解析结果返回至您的本地机器。
无其他网络调用： 脚本仅与api.cloud.llamaindex.ai通信。截图下载使用同一服务的预签名URL。
脚本为参考工具： scripts/parsedocument.py和scripts/batch_parse.py是辅助脚本，供用户手动运行。它们不会由技能自动执行。

提示

- 仅请求您需要的expand视图——更多视图意味着更大的响应和更高的延迟。
对于包含图表/图形的文档，使用agenticplus层级配合specializedchartparsing。
生产环境请固定特定version日期，而非使用latest。
批量解析时使用基于信号量的并发控制以遵守速率限制。
items视图为每个元素提供边界框（bbox）——适用于空间分析。

llamaparseLlamaParse文档解析

llamaparse

LlamaParse

Prerequisites

Quick Start

Core Concepts

Tiers (required — choose one)

Output Views (expand parameter)

Output Options

Processing Options

Custom Prompts (Agentic Parsing Instructions)

Common Workflows

Parse a single document

Batch parse a folder

Extract tables from a document

Extract chart data

Download page screenshots

API Reference

External Service & Security

Tips

LlamaParse

前置条件

快速开始

核心概念

层级（必选——选择一项）

输出视图（expand参数）

输出选项

处理选项

自定义提示（代理解析指令）

常见工作流

解析单个文档

批量解析文件夹

从文档中提取表格

提取图表数据

下载页面截图

API参考

外部服务与安全

提示

标签

通过对话安装

方式一：安装 SkillHub 和技能

方式二：设置 SkillHub 为优先技能安装源

通过命令行安装

下载

相关推荐

self-improvement

self-improvement

self-improvement

self-improvement