For Earth Day, Recycle Those Stacks of Paper and Instantly Find the Forest Through the Trees

Home
News
Accounting/Banking
Technology
Featured Article

Written by Scott Koegler

Staying Secure on a Shoestring Budget: Cybersecurity for Small Businesses
As a small business, you face the same cyber threats as large enterprises. However, unlike the larger companies, you likely don’t have the same IT budgets to leverage sophisticated security solutions. Regardless of the size of your business, implementing cybersecurity best practices is crucial to safeguard sensitive customer data, financial information, intellectual property, and your reputation. Follow these practical tips… Read more...
Popular Article

Written by SBN Editors

The Top Five Laptops For Small Business
Today we are going to focus on a few of the best laptops for small… Read more...

Written by SBN Editors

DriveSavers Data Recovery Tips for Small Business Owners
As a small business owner you have a million projects on your mind at any… Read more...
Most Recent

Staying Secure on a Shoestring Budget: Cybersecurity for Small Businesses

DIY Tips for Improving Your Small Business Website on a Budget

The Role of Internet of Things (IoT) in Small Business Operations

Tips for Effective Data Backup and Recovery in Small Businesses

The Benefits of Voice Search Optimization for Small Business Websites

Responsive Web Design for Small Businesses

Cybersecurity for Small Businesses: Protecting Your Digital Assets
Operations
Featured Article

Written by Successful Business News

Navigating Supply Chain Disruptions: Tactics for Operational Resilience
Explore strategies for navigating supply chain disruptions with insights into operational resilience, including risk mitigation, digital technology adoption, and collaborative partnerships. Read more...
Popular Article

Written by SBN Editors

Manage Your Fleet: 4 Tools to Track Your Vehicles' Performance
If you own a business, you should get fleet vehicles. These fleet vehicles are the… Read more...

Written by SBN Editors

Have iPad, Will Track Inventory: iPad apps to help manage inventory
Inventory management can be a constant source of stress for the small business owner. However,… Read more...
Most Recent

Navigating Supply Chain Disruptions: Tactics for Operational Resilience

Streamlining Business Operations: Leveraging Technology for Efficiency

The Intersection of Remote Work and Employee Productivity: Balancing Flexibility and Efficiency

The Role of Operational Efficiency in Scaling Businesses

Navigating Supply Chain Disruptions: Best Practices for Business Operations

The Rise of Sustainable Business Practices in Operations

Navigating the Challenges of Remote Team Management
HR/Benefits
Featured Article

Written by Successful Business News

Navigating the New Era of Employee Benefits: Key Trends and Opportunities
Explore the evolving landscape of employee benefits with key trends in personalized packages, technology integration, and DEI initiatives. Learn how companies can innovate to attract and retain top talent. Read more...
Popular Article

Written by SBN Editors

Non-Traditional Employee Benefits
Have you ever thought about providing your employees with some… Read more...

Written by Tami Kamin Meyer

Job Turnover Can Be Worrisome for Employers
The good news: the “quit rate,” or the percentage of… Read more...
Most Recent

Navigating the New Era of Employee Benefits: Key Trends and Opportunities

Navigating the Rise of Remote Work Benefits in Modern Organizations

The Rise of Flexible Work Arrangements: A Win-Win for Employers and Employees

Navigating the Remote Work Trend: Innovations in Employee Benefits

The Rise of Hybrid Work Models: Balancing Flexibility and Productivity

Methods for Building a Strong and Reliable Team

Hiring is Complicated - Simplify With These Practices
Legal
Featured Article

Written by Deborah Huyett

Do You Need A Registered Agent For Your Business?
You probably know that starting a business requires an innovative idea, a solid business plan, and a funding source to launch. But, do you also have the Registered Agent ready to go? Do you need a Registered Agent? Read more...
Popular Article

Written by Deborah Huyett

Do You Need A Registered Agent For Your Business?
You probably know that starting a business requires an innovative idea, a solid business plan,… Read more...

Written by Danielle Loughnane

Creating a Promissory Note
Your friend gets laid off of work and is unable to pay her rent for… Read more...
Most Recent

Do You Need A Registered Agent For Your Business?

Creating a Promissory Note

5 Ways Businesses Can Avoid Becoming Ensnared In An Ethical Lapse

Mediate, don’t litigate

Contemplating legal templates for your small business

Estate planning matters for small business owners

Cloud Storage And Client Confidentiality: A Perfect Match Or A Perfect Storm?
Lifestyle
Sales/Marketing
White Papers
Subscribe!

Estimated reading time: 3 minutes, 57 seconds

For Earth Day, Recycle Those Stacks of Paper and Instantly Find the Forest Through the Trees ^Featured

Monday, Apr 03 2023

Operations

Written by Elizabeth Thede

font size decrease font size increase font size
Print
Email
"Hold the Green"

For Earth Day, Recycle Those Stacks of Paper and Instantly Find the Forest Through the Trees

"Hold the Green"

Yes, there is a connection between these two seemingly unrelated items. The missing steps are scanning, OCR, PDFs and enterprise search. This article will fill in the blanks, leaving you all set for Earth Day.

Before you toss those stacks of paper into the recycling bin, you’ll want to scan them. Scanning takes a picture of the pages. While your picture may be worth 1,000 words, you can’t do much with it other than look at it. To take an image of the word forest on the page and turn it into something you can do more with than just gaze at requires OCR, or optical character recognition.

An application like Adobe Acrobat will OCR the text from the scanned image, turning the text into something that you can copy, paste and otherwise work with. Now suppose you are scanning a copy of a memo. OCR will take the main typed text and digitally store that. But what if there are some notes that someone may have scribbled in the margins of one of the pages?

The gold standard—or should I say the green standard—for combining text and images is “searchable image” PDF. The format superimposes the OCR’ed text onto the original image of the page. Plus you can add on metadata. Let’s say the memo you are digitizing came from a Project Waterways binder. Even if Project Waterways appears nowhere on the memo itself, you can add a metadata element containing that phrase and that too will be part of the “searchable image” PDF.

The last step is to install an enterprise search engine. An enterprise search engine isn’t a span-the-internet search engine like Google. Rather, it is an application like dtSearch® that goes deep into your organization’s own data to retrieve anything anywhere—in the full-text and metadata—that matches your query. For “searchable image” PDFs, a search will show an OCR “hit” overlaying the original image including items like margin notes.

Enterprise search can instantly span terabytes only after first indexing the data. The index isn’t like a reference book index; rather it is just an internal guide holding each word and number in the data and its location for the sole purpose of enabling instant search. To get the search engine to build its index, all you need to do is point to the folders and the like to index, and the search engine will do everything else.

The search engine will automatically recognize and index PDFs, both “searchable image” and otherwise. And it can also automatically recognize and support other formats like Microsoft Word, Excel, PowerPoint, OneNote and Access files; web-based formats; compressed formats like ZIP or RAR; and even emails plus nested attachments. For example, if you have an email with a compressed attachment that includes a PDF and an Excel spreadsheet with a Word document embedded inside, enterprise search can automatically index and search the whole thing.

It's not just instant individual queries that enterprise search enables but also concurrent network or web-based queries as well. Online, search can proceed in a stateless matter, making multithreaded text retrieval fully scalable without affecting search speed. Search features encompass over 25 different full-text and metadata word, phrase and number-oriented options, so everyone can find the forest, the trees as well as the woodland creatures.

A catalog of all of the search features, relevancy-ranking and sorting options is beyond the scope of this article. But I’ll end with 3 search tips relating to “searchable image” PDFs specifically.

Search tip #1: before you make these old stacks of paper instantly searchable by the entire office, do a quick indexed search for credit card numbers to make sure that these do not appear in the newly digitized collection. The search engine can flag valid credit cards that appear in the OCR’ed PDFs – or anywhere else across indexed data.

Search tip #2: turn on fuzzy searching to a low level when you search through OCR’ed PDFs to sift through any minor OCR errors. For example, if the word activate is mis-OCR’ed as actiwate, a fuzziness level of 1 would pick that up in a search for activate. Fuzzy searching is also very helpful for searching emails, where mistypings can be common.

Search tip #3: Glancing at a collection of files in the folder system, it is impossible to distinguish “searchable image” PDFs from “image only” PDFs. The latter are not full-text searchable; just the filename and metadata are searchable. The search engine can flag “image only” PDFs when it builds its index. If you find “image only” PDFs, just run them through Adobe Acrobat to turn them into “searchable image” PDFs.

Elizabeth Thede dtSearch Corp

Read 1825 times

Rate this item

(0 votes)

More in this category: « How to Create an Effective Business Plan for Your Small Business Techniques For Effectively Communicating With Clients And Stakeholders »

Register

For Earth Day, Recycle Those Stacks of Paper and Instantly Find the Forest Through the Trees ^Featured

Most Read

SBN

Visit other PMG Sites:

Register

For Earth Day, Recycle Those Stacks of Paper and Instantly Find the Forest Through the Trees Featured

Most Read

SBN

Visit other PMG Sites:

For Earth Day, Recycle Those Stacks of Paper and Instantly Find the Forest Through the Trees ^Featured