Information Retrieval From Image Databases | Page 9 | IJRSR Journal-UGC guidelines journal

SUBJECT AREA

Life Sciences

Biology
Botany
Zoology
Microbiology
Biotechnology
More→

Health Science

Dermatology
Medicine and Dentistry
Dentistry
Endocrinology
Immunology
Nursing and Health Professions
More→

Physical Sciences and Engineering

Physics
Kinetics
Mechanics
Electromagnetics
Mathematics
Thermodynamic
More→

Social Science and Humanities

Physical Education
Economics
Social Sciences
Arts and Humanities
Business Management
Management
Accounting
More→

Information Retrieval From Image Databases

Research Article

Seema Yadav., Saurabhkumar Jain., Priya Bhanushali and Tejinder Kaur

DOI:

xxx-xxxxx-xxxx

Subject:

science

KeyWords:

Document image analysis, Segmentation, Tesseract Engine, OpenCV, Indexing, Preprocessing.

Abstract:

As vast amount of digital image data is getting archived by the advanced libraries, there is a requirement for an ef cient search methodologies to make them accessible according to client's data requirement. For their retrieval, it is imperative to recognize their contents. Current technologies for optical character recognition (OCR) and document analysis do not handle such documents adequately because of the recognition errors. Due to these challenges, computer is unable to recognize the characters while reading them. In this paper, we propose and effective word image matching scheme that achieves high performance in the presence of noise in image, degradation and word form-variants. Initially, each image in image-database is pre-processed. In the next step find contour method is used to detect blobs which are further passed in tesseract engine. Tesseract segments the characters from the image and stores in character database. Each word in the database is used to index a given set of images. During retrieval, the query word presented to the system is matched with characters in the database and all images containing instances of the query word are retrieved and presented to the user. Using this approach, our method is able to successfully handle images with different font styles, size and heavily touching characters. From the experimental results on the variety of image database it is observed that the extraction of text from the images is mostly accurate and indexing of words based on the position is working perfectly

Certificate Request Form

6830.pdf

Monthly archive