    An Overview of Optical Character Recognition Processes at Canon Research America

    Date
    1996
    Author
    Evans, Colin
    Abstract
    Optical character recognition (OCR) is the process of converting a bitmap image into a text document. Character recognition has many useful applications, such as eliminating the need to store paper copies of records and allowing fast text searches on a large archive of scanned and stored documents. At Canon Research America, the Personal Imaging Computer System (PICS) project aims to develop a document storage system consisting of an integrated laser printer, greyscale scanner, and Pentium computer system running Windows NT. The system allows a document to be scanned and stored in a database either as a compressed image or as an OCR-processed text document. The document can then be retrieved from the PICS unit across a network using a special client program, or stored in the database until it is needed, at which time it can be printed out again. The goal of the PICS project is to develop a system that reduces the number of paper documents that need to be stored -- a "paperless office solution" -- and that also offers a method of storing large amounts of text in a format that is easy to search and index. One of the central systems that makes PICS possible is character recognition. The issues involved in developing a high-accuracy OCR system are broad, since the process draws on image processing and manipulation, machine learning, and natural language processing. This paper presents a general description of the optical character recognition process, followed by an overview of specific algorithms that have been researched at Canon Research America.
    URI
    http://hdl.handle.net/10920/25284
    Collections
    • Computer Science Senior Integrated Projects [236]
