Get 10k free credits when you signup for LlamaParse!

LlamaIndex Newsletter 2026-03-31

Hi there, Llama Enthusiasts! 🦙

Welcome to this week's edition of the LlamaIndex newsletter! We're excited to share major breakthroughs in document parsing, including intelligent table extraction that goes beyond basic OCR, revolutionary improvements to Word document processing, and the launch of LiteSearch - a fully local document retrieval system. Plus, we've got exciting integrations with Gemini's Live API and comprehensive guides for legal discovery use cases.

Ready to get started with LlamaParse?

Explore our free and paid plans today.

🎉 Join Us in San Francisco

  • Celebrate our move to the 'AI Waterfront'! We're hosting a pregame event for First Thursdays on April 2nd at our new office on 2nd Street. Come meet our team, grab food and drinks, and connect with the community. Space is limited, so RSVP early!

🤩 The Highlights

  • Transform Your Document Processing with Intelligent Table Extraction: Our comprehensive deep dive explains how modern OCR for tables reconstructs spatial relationships, preserves header hierarchies, and ensures data integrity across complex documents. Learn the three core phases and see real-world applications from invoice processing to lab results. Read the complete guide.
  • LiteSearch: Fully Local Document Retrieval System: Our OSS engineer built a high-performance, local-first retrieval pipeline using LiteParse, demonstrating how to assemble open tools for parsing, chunking, embeddings, and vector storage with zero external dependencies. Check out the repository and explore LiteParse docs.
  • Revolutionary Word Document Processing: We've solved the counterintuitive challenge of .docx parsing by mapping Word XML table elements to their correct page positions. This breakthrough significantly improves quality for tables with rich formatting, merged cells, and nested structures. Read the full writeup.

☁️ LlamaParse

  • Voice-Powered Document Assistant with Gemini Live API: We built a demo integrating Gemini 3.1's Live API with LiteParse for a TUI-based voice assistant that can parse documents through spoken commands and read back results in real time. Explore the GitHub repo and check out LiteParse docs.
  • Visual Citations with Bounding Boxes: New guide showing how to use LiteParse for visual citations using bounding box extraction and page screenshots to associate text with page elements. Learn more about visual citations.
  • Smart Financial Assistant with Google: Collaborative blog with Google showing how to build a financial assistant using LlamaParse and Gemini 3, including VLM-enabled agentic OCR for accurate text and table extraction. Read the blog and explore the repo.
  • Legal Discovery Document Processing: Comprehensive guide for handling difficult scans, degraded documents, and complex legal discovery use cases with vision models and custom parsing instructions. Read the full blog.

✨ Community

  • GDPR Breach Report Automation: Congratulations to contest winner @zubeensyed for building an agentic AI workflow that automates GDPR breach report structuring, mapping incident reports to standardized schemas aligned with Article 33 requirements. Read about the solution and watch the walkthrough.

Related articles

PortableText [components.type] is missing "undefined"

Start building your first document agent today

PortableText [components.type] is missing "undefined"