Skip to main content
s3

Connect Amazon S3 to ZenSearch

Index any file type from your S3 buckets. PDFs, documents, images, and data files become searchable through AI-powered semantic search.

What ZenSearch indexes from Amazon S3

Any File Type

PDFs, Word documents, spreadsheets, plain text, HTML, Markdown, images, and more. ZenSearch parses 50+ file formats stored in your S3 buckets.

IAM Role Authentication

Authenticate with IAM roles, access keys, or instance profiles. No need to share long-lived credentials — use your existing AWS IAM policies.

S3-Compatible Storage

Works with any S3-compatible storage including RustFS, DigitalOcean Spaces, Backblaze B2, and Wasabi. Same connector, any S3-compatible endpoint.

Prefix Filtering

Index specific prefixes (folders) within a bucket. Exclude directories containing logs, backups, or raw data that should not be searchable.

Large-Scale Indexing

Designed to handle buckets with millions of objects. Pagination, parallel processing, and incremental sync keep indexing efficient at any scale.

Metadata Preservation

S3 object metadata, tags, and content types are captured and used to enrich search results with additional context about each file.

Up and running in three steps

1
Connect

Authenticate your account and select the resources to index. ZenSearch handles OAuth, API keys, and enterprise SSO.

2
Index

ZenSearch parses, chunks, and vectorizes your content. Incremental sync keeps everything up to date automatically.

3
Search

Your team searches across all connected sources with AI-powered semantic search, chat, and agents.

Questions your team can finally answer

Once Amazon S3 is connected, your team can ask natural-language questions and get cited answers instantly.

Find the data processing agreement template we use for EU clients

Searches across legal document prefixes in your S3 bucket and finds the DPA template, even if the filename is an opaque ID.

What does the latest audit report say about our access controls?

Locates the audit report PDF in your compliance bucket, parses its content, and extracts the access control findings section.

Where is the training dataset documentation for the recommendation model?

Finds README and documentation files in your ML artifacts bucket and returns the dataset description, schema, and versioning information.

Ready to search your Amazon S3 data?

Connect Amazon S3 in minutes. No credit card required.

Start Free