Probabilistic Indexing by Alejandro Héctor Toselli (.PDF)

File Size: 13.2 MB

Probabilistic Indexing for Information Search and Retrieval in Large Collections of Handwritten Text Images by Alejandro Héctor Toselli, Joan Puigcerver, Enrique Vidal
Requirements: .PDF reader, 13.2 MB
Overview: This book provides a comprehensive presentation of a recently introduced framework, named “probabilistic indexing” (PrIx), for searching text in large collections of document images and other related applications. It fosters the development of new search engines for effective information retrieval from manuscripts which, however, lack the electronic text (transcripts) that would typically be required for such search and retrieval tasks. The book is structured into 11 chapters and three appendices. The first two chapters briefly outline the necessary fundamentals and state of the art in pattern recognition, statistical decision theory, and handwritten text recognition. Chapter 3 presents approaches for indexing (as opposed to “spotting”) each region of a handwritten text image which is likely to contain a word. Next, Chapter 4 describes models adopted for handwritten text in images, namely hidden Markov models, convolutional and recurrent neural networks and language models, and provides full details of weighted finite-state transducer (WFST) concepts and methods, needed in further chapters of the book. Chapter 5 explains the set of techniques and algorithms developed to generate image probabilistic indexes which allow for fast search and retrieval of textual information in the indexed images. This book is written for researchers and (post-)graduate students in pattern recognition and information retrieval.
Genre: Non-Fiction > Tech & Devices

Free Download links:

https://tbit.to/083kbrfts5ds.html

https://katfile.com/kpxeofs7lna3/Probabilistic_Indexing_for_Information_Search.pdf.html