


 Method for graph-based table recognition

Details
Inventors: Rahgozar, M. Armon; Cooperman, Robert;
Assignee: Xerox Corporation (Stamford, CT)
Primary Examiner: Chang; Jon
Assistant Examiner:
Attorney, Agent or Firm: Basch; Duane C.

The present invention is a method for bottom-up recognition of tables within a document. The method is based on the paradigm of graph rewriting. First, the document image is transformed into a layout graph whose nodes and edges represent document entities and their interrelations, respectively. This graph is subsequently rewritten using a set of rules designed from a priori document knowledge and general formatting conventions. The resulting graph provides a logical view of the document content. It can be parsed to provide general format analysis information.

DETAILED DESCRIPTION
The present invention is directed to a method for document structure recognition based on a graph rewriting paradigm.
Document recognition is a process by which the information regarding the organization of the document content, i.e., its structure, is extracted from the document image.
The structure identifies document entity types (e.g., paragraphs, figures and tables), their properties (e.g., the number of columns in a table), and their interrelations (e.g., a figure is above a caption).
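As a minimal sketch of what such a structure might look like in code, the following Python fragment records entity types, per-entity properties, and labeled interrelations. All names here (`EntityType`, `Entity`, `Relation`, `DocumentStructure`, `build_example`) are hypothetical and chosen for illustration; the source does not prescribe any particular representation.

```python
from dataclasses import dataclass, field
from enum import Enum


class EntityType(Enum):
    """Entity types named in the text, plus a caption for the example."""
    PARAGRAPH = "paragraph"
    FIGURE = "figure"
    TABLE = "table"
    CAPTION = "caption"


@dataclass
class Entity:
    name: str
    kind: EntityType
    properties: dict = field(default_factory=dict)  # e.g. {"columns": 3}


@dataclass
class Relation:
    source: str
    label: str   # e.g. "above"
    target: str


@dataclass
class DocumentStructure:
    entities: dict = field(default_factory=dict)   # name -> Entity
    relations: list = field(default_factory=list)  # list of Relation

    def add_entity(self, entity):
        self.entities[entity.name] = entity

    def relate(self, source, label, target):
        # Only allow relations between entities that have been registered.
        if source not in self.entities or target not in self.entities:
            raise KeyError("both ends of a relation must be known entities")
        self.relations.append(Relation(source, label, target))

    def related(self, source, label):
        # Names of all entities reachable from `source` via `label`.
        return [r.target for r in self.relations
                if r.source == source and r.label == label]


def build_example():
    # Hypothetical page: a figure above its caption, and a 3-column table.
    doc = DocumentStructure()
    doc.add_entity(Entity("fig1", EntityType.FIGURE))
    doc.add_entity(Entity("cap1", EntityType.CAPTION))
    doc.add_entity(Entity("tab1", EntityType.TABLE, {"columns": 3}))
    doc.relate("fig1", "above", "cap1")
    return doc
```

Calling `build_example().related("fig1", "above")` returns the caption's name, which mirrors the "a figure is above a caption" interrelation mentioned in the text.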
Although optical character recognition (OCR) is an inherent part of this process, it is not intended as an aspect of the present invention.
The present invention is directed to document structure recognition beyond a character level.
Indeed, the methodology presented here is equivalently applicable to any of a number of document types, including documents described by a Page Description Language (PDL) such as PostScript™ or Xerox Interpress™.
Graphs are known to be powerful tools for document structure analysis.
They provide a compact computational abstraction for representing the complex multidimensional information embedded in a document structure.
In this abstraction, document entities and the inter-relations between these entities can be represented by elements of graphs, namely graph nodes and links respectively.
Manipulation of these nodes and links results in a change in the topology of the graph that consequently provides a new interpretation of the document structure.
As a result, the tools for graph manipulation become the basis for a computational document structure analysis framework.
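The idea that manipulating nodes and links yields a new interpretation of the document can be illustrated with a small Python sketch. It assumes a toy layout graph of word nodes joined by `left_of` and `above` links, and a single hypothetical rewrite rule that merges two horizontally adjacent nodes into one `line` node. The class, the edge labels, and the rule are all assumptions made for illustration; they are not the rule set of the invention.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    ident: str
    kind: str   # "word" or "line" in this toy example
    text: str


class LayoutGraph:
    def __init__(self):
        self.nodes = {}     # ident -> Node
        self.edges = set()  # (source ident, label, target ident)

    def add_node(self, ident, kind, text):
        self.nodes[ident] = Node(ident, kind, text)

    def add_edge(self, source, label, target):
        self.edges.add((source, label, target))


def merge_adjacent(graph, left, right):
    """Hypothetical rule: replace two nodes joined by left_of with one line node."""
    a, b = graph.nodes.pop(left), graph.nodes.pop(right)
    merged = Node(f"{a.ident}+{b.ident}", "line", f"{a.text} {b.text}")
    graph.nodes[merged.ident] = merged
    rewired = set()
    for source, label, target in graph.edges:
        # Redirect every link that touched the old nodes to the merged node.
        source = merged.ident if source in (left, right) else source
        target = merged.ident if target in (left, right) else target
        if source != target:  # links between the two merged nodes are dropped
            rewired.add((source, label, target))
    graph.edges = rewired
    return merged.ident


def rewrite_until_stable(graph):
    """Apply the merge rule until no left_of link remains; return the step count."""
    steps = 0
    while True:
        match = next(((s, t) for s, label, t in sorted(graph.edges)
                      if label == "left_of"), None)
        if match is None:
            return steps
        merge_adjacent(graph, *match)
        steps += 1


def build_example():
    # Three words on one text line, with a fourth word below the first.
    graph = LayoutGraph()
    for ident, text in [("w1", "Total"), ("w2", "cost"), ("w3", "42"), ("p1", "Notes")]:
        graph.add_node(ident, "word", text)
    graph.add_edge("w1", "left_of", "w2")
    graph.add_edge("w2", "left_of", "w3")
    graph.add_edge("w1", "above", "p1")
    return graph
```

After `rewrite_until_stable(build_example())`, the three word nodes have collapsed into a single line node that still carries the `above` link to `p1`: the topology has changed, and with it the reading of the page, from isolated words to a text line above another entity.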
The use of graphs for document recognition has classically been limited to academic and experimental systems due to the computational complexity of graph manipulation.
A notable exception is the work of Fahmy and Blostein, "A Graph Grammar Programming Style for Recognition of Music Notation," Machine Vision and Applications, Vol. 6 (1993).
This work provides a comprehensive computational framework for document recognition




