What is the difference between indexing and scanning?

Scanning converts a physical document into a digital image file. Indexing adds structured metadata — such as document type, date, grantor, or parcel number — to that image so it can be searched and retrieved. Scanning without indexing leaves you with image files that are difficult to find later.

Who is responsible for document indexing in a county office?

In most county offices, the clerk or recorder's office handles indexing for recorded documents. Some offices also manage indexing for court records, permits, or vital records. The specific responsibility depends on state law and local office structure.

What metadata fields are typically indexed?

Common index fields include document type, recording date, book and page number, grantor and grantee names, legal description, parcel number, and instrument number. The exact fields depend on the document category and state requirements.

Can indexing be automated?

Yes. Modern document-indexing tools use optical character recognition (OCR) and AI to extract fields from scanned images automatically. These tools still require human review for exceptions and edge cases, but they can significantly reduce the manual labor involved in high-volume indexing work.

What Is Document Indexing?

A practical overview for county clerks, recorders, and records managers.

Definition

Document indexing is the process of assigning structured metadata to a document so it can be found later. In a county office, that typically means tagging a recorded deed, court filing, or permit with fields like document type, recording date, names of parties, and legal descriptions.

Without indexing, a scanned document is just an image file sitting in a folder. With indexing, it becomes a searchable record that staff, attorneys, title companies, and the public can retrieve quickly.

Why it matters for county offices

County clerks and recorders handle high volumes of documents every day — deeds, mortgages, liens, plats, court orders, and more. Each of these documents needs to be recorded, stored, and made available for retrieval, sometimes decades later.

Accurate indexing is what makes retrieval possible. When a title company needs to verify ownership history or a citizen requests a copy of a recorded document, the index is how staff find it. Poorly indexed records lead to longer search times, duplicate requests, and errors that can have legal consequences.

How indexing traditionally works

In many offices, indexing is a manual process. Staff receive a document — either physically at a counter or digitally through an e-recording system — and key in the relevant fields by hand. This typically involves:

Reading the document to identify its type
Entering names, dates, legal descriptions, and reference numbers into a records management system
Assigning a book and page or instrument number
Verifying the entry against the original document

This process is time-consuming, especially for backfile projects where offices need to index years or decades of previously scanned but unindexed records.

Modern approaches to indexing

Newer tools use optical character recognition (OCR) and machine learning to extract index fields from scanned documents automatically. Instead of a clerk reading every page and typing in each field, the software identifies the relevant data and populates the index record.

These tools don't eliminate human review entirely. Most workflows include a validation step where staff review flagged exceptions — documents where the software's confidence is low or where the extracted data needs correction. But they can significantly reduce the number of keystrokes per document and speed up high-volume indexing work.

Backfile vs. day-forward indexing

Backfile indexing refers to the process of going back through previously scanned or stored documents and adding index data to them. This is common when an office migrates to a new system or when older records were scanned without metadata.

Day-forward indexing is the ongoing process of indexing new documents as they are received. Offices that adopt automated indexing tools often start with day-forward workflows and then tackle backfile projects as time and budget allow.

What good indexing looks like

Effective indexing is consistent, accurate, and follows a defined standard. Key characteristics include:

Standardized field names and formats across document types
Clear rules for how names, dates, and legal descriptions are entered
A review process for catching errors before records are finalized
Support for both current and historical naming conventions

Whether indexing is done manually or with automated tools, the goal is the same: make every recorded document findable by the people who need it.

Disclaimer: This guide is educational in nature. It is not legal advice, records-retention advice, or a substitute for consulting with your office's legal counsel or state records management agency. Always verify requirements against your state's specific laws and retention schedules.

Frequently Asked Questions

Related Guides

Managing Public Records Requests

How to handle open-records requests from intake through fulfillment.

Read guide

Document Management Systems for County Offices

What a DMS does, how to evaluate one, and what to consider before adopting.

Read guide

Retention, Metadata, and Search

How retention, metadata extraction, auto-tagging, and search work together.

Read guide