Nonsearchable Content


– Dean Sappey, President of DocsCorp, says:

Nonsearchable content is an ongoing problem for businesses, and it is growing every year. DocsCorp research has revealed that, on average, more than 20% of documents in a content repository are “invisible” to search technology. These documents are often profiled as a result of ingestion of legacy or litigation documents, saving emails with attachments, mobile technology and employee workarounds that bypass the OCR’ing process. When a company’s search technology fails to find every possible data source, it impacts the bottom line, workplace efficiency, regulatory compliance, and productivity.

We have announced the release of contentCrawler 2.1, the newest version of our integrated analysis, processing and reporting software, which addresses the serious issue of nonsearchable content in enterprise content management systems.

contentCrawler intelligently examines image-based documents in a content repository and converts them to searchable PDFs, making them available to search technologies for indexing. The contentCrawler 2.1 release includes several usability and performance enhancements. It is now being used by a wide variety of organizations, such as Marshall Dennehey, Cuatrecasas Gonçalves Pereira, Hugh James and the Law Society of British Columbia, which rely on contentCrawler’s versatile, automated end-to-end process.

Pedro Monteiro, Support at Truewind-Chiron, welcomed the upgrade, saying, “I upgraded directly with no problems. Former documents were transferred to the new version with no problems at all. This new version is faster, more informative and much lighter process wise to the server. Documents are assessed and saved 10x faster comparing to the old version we had.”

contentCrawler is not a search engine. It intelligently evaluates documents and OCRs those that need it, to ensure that all documents in your content repositories are 100% searchable. If a PDF or an image document in your files has been saved without OCR’ing, contentCrawler will find it and fix it, without changing the original document or doubling your repository size.
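
To make the "find it" step concrete, here is a minimal sketch of how a triage pass of this kind might decide which documents still need OCR. It is an illustration only, not contentCrawler's actual logic: the `Document` type, the `needs_ocr` and `triage` functions, the extension list and the `min_chars` threshold are all assumptions made for this example, and the text extraction itself is assumed to have been done by some other tool.

```python
import os
from dataclasses import dataclass

# Hypothetical list of image formats that never carry a text layer.
IMAGE_EXTENSIONS = {".tif", ".tiff", ".png", ".jpg", ".jpeg", ".bmp"}


@dataclass
class Document:
    name: str
    # Text that an external extractor (assumed, not shown) found in the file;
    # an empty string means no text layer was found.
    extracted_text: str


def needs_ocr(doc: Document, min_chars: int = 1) -> bool:
    """Return True if the document looks image-only and should be OCR'd."""
    ext = os.path.splitext(doc.name)[1].lower()
    if ext in IMAGE_EXTENSIONS:
        return True  # plain images are never searchable as-is
    if ext == ".pdf":
        # A PDF with no meaningful extractable text is treated as a scan.
        return len(doc.extracted_text.strip()) < min_chars
    return False  # other formats are assumed to be text-based already


def triage(docs):
    """Return the names of documents that should be queued for OCR."""
    return [d.name for d in docs if needs_ocr(d)]


if __name__ == "__main__":
    repository = [
        Document("scan.tif", ""),
        Document("contract.pdf", "This agreement is made between..."),
        Document("fax.pdf", "   "),
        Document("memo.docx", ""),
    ]
    print(triage(repository))
```

In this sketch the original files are only inspected, never modified, which mirrors the point above that the source document is left unchanged.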

Many firms are discovering that they can shift the effort of OCR’ing documents from their employees to contentCrawler’s automated process, which not only saves time but also covers more document sources reliably. contentCrawler makes more data available while taking the OCR burden off employees, increasing productivity while ensuring 100% searchable content.