May 5, 2025
2 min read

400 Million Pages Later: Introducing Pulse Ultra

400 Million Pages Later: Introducing Pulse Ultra

When we launched Pulse's first extraction model last year, we made a promise to improve enterprises’ data ingestion pipelines with maximal accuracy. Today, with Pulse Ultra, we're taking a massive leap toward that vision.

After processing over 400 million pages across some of the world's most demanding enterprises including investment firms, tech giants, and leading AI startups, we've learned exactly where traditional OCR breaks down. Complex tables. Multi-column layouts. Mixed file formats. Handwritten notes buried in scanned PDFs. These aren't edge cases; they're the reality of enterprise documents.

The Problem with "Good Enough"

Most document extraction tools plateau at 80-90% accuracy. For consumer applications, that might work. But when you're processing millions of financial statements, quarterly reports, or investment memorandums, those missing 10-20% aren't just data points. They're critical business decisions left on the table.

Enterprises using Pulse are handling 10x more document volume with the same resources, reducing processing time from days to minutes while driving up accuracy. Existing datasets are now providing exponentially more value in downstream applications.

Building Ultra

Over the past year, our engineering team has been quietly rebuilding our extraction engine from the ground up. The result is Pulse Ultra, a new internal architecture trained for the most optimal accuracy to latency ratio. Learn more about our approach to document intelligence.

Here's what makes it different:

  • 80% faster processing with a lighter architecture that also improves accuracy
  • Intelligent adaptation that automatically switches between reasoning and standard extraction based on document complexity
  • Visual document understanding that captures nuanced details like formatting, styling, and text color, critical context that other tools miss
  • Zero migration overhead, it's live for all Pulse customers starting today

Early Results Are Transformative

Our enterprise customers are already seeing remarkable improvements:

  • Dramatic improvements in document workflows across finance, healthcare, and manufacturing
  • Minimized manual verification for standard document types

We plan to also release Pulse’s benchmark this month, a fully open-sourced evaluation suite consisting of 10,000+ annotated documents with complex layouts and failure modes to contribute to the OCR community.

Ready to upgrade your document extraction? Pulse Ultra is live today for all Pulse customers. New to Pulse? Book a demo or reach out directly at hello@trypulse.ai.