Laboratoire Informatique, Image et Interaction (L3i)

facilityLa Rochelle, France

Research output, citation impact, and the most-cited recent papers from Laboratoire Informatique, Image et Interaction (L3i). Aggregated across the NobleBlocks index of 300M+ scholarly works.

Total works

1.1K

Citations

13.4K

h-index

i10-index

338

Also known as

Laboratoire Informatique, Image et InteractionLaboratoire Informatique, Image et Interaction (L3i)

Top-cited papers from Laboratoire Informatique, Image et Interaction (L3i)

Survey of Post-OCR Processing Approaches

Thi Tuyet Haï Nguyen, Adam Jatowt, Mickaël Coustaty, Antoine Doucet

2021· ACM Computing Surveys252doi:10.1145/3453476

Optical character recognition (OCR) is one of the most popular techniques used for converting printed documents into machine-readable ones. While OCR engines can do well with modern text, their performance is unfortunately significantly reduced on historical materials. Additionally, many texts have already been processed by various out-of-date digitisation techniques. As a consequence, digitised texts are noisy and need to be post-corrected. This article clarifies the importance of enhancing quality of OCR results by studying their effects on information retrieval and natural language processing applications. We then define the post-OCR processing problem, illustrate its typical pipeline, and review the state-of-the-art post-OCR processing approaches. Evaluation metrics, accessible datasets, language resources, and useful toolkits are also reported. Furthermore, the work identifies the current trend and outlines some research directions of this field.

Underwater color constancy: enhancement of automatic live fish recognition

Majed Chambah, Dahbia Semani, Arnaud Renouf, Pierre Courtellemont +1 more

2003· Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE139doi:10.1117/12.524540

We present in this paper some advances in color restoration of underwater images, especially with regard to the strong and non uniform color cast which is typical of underwater images. The proposed color correction method is based on ACE model, an unsupervised color equalization algorithm. ACE is a perceptual approach inspired by some adaptation mechanisms of the human visual system, in particular lightness constancy and color constancy. A perceptual approach presents a lot of advantages: it is unsupervised, robust and has local filtering properties, that lead to more effective results. The restored images give better results when displayed or processed (fish segmentation and feature extraction). The presented preliminary results are satisfying and promising.

A Job Market Signaling Scheme for Incentive and Trust Management in Vehicular Ad Hoc Networks

Nadia Haddadou, Abderrezak Rachedi, Yacine Ghamri‐Doudane

2014· IEEE Transactions on Vehicular Technology109doi:10.1109/tvt.2014.2360883

In collaborative wireless networks with low infrastructure, the presence of misbehaving nodes can have a negative impact on network performance. In particular, we are interested in dealing with this nasty presence in road safety applications, based on vehicular ad hoc networks (VANETs). In this paper, we consider as harmful the presence of malicious nodes, which spread false and forged data, and selfish nodes, which cooperate only for their own benefit. To deal with this, we propose a distributed trust model <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="TeX">$(DTM^{2})$</tex-math></inline-formula> , which is adapted from the job market signaling model. <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="TeX">$DTM^{2} $</tex-math></inline-formula> is based on allocating <bold xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><i>credits</i></b> to nodes and securely managing these credits. To motivate selfish nodes to cooperate more, our solution establishes the cost of reception to access data, forcing them to earn credits. Moreover, to detect and exclude malicious nodes, <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="TeX">$DTM^{2} $</tex-math></inline-formula> requires the cost of sending, using signaling values inspired from economics and based on the node's behavior so that the more malicious a node is, the higher its sending cost, thus limiting their participation in the network. Similarly, rewards are given to nodes whose sent messages are considered truthful and that paid a sending cost considered correct. The latter is a guarantee for the receivers about the truthfulness of the message since, in the case of message refusal, the source node is not rewarded, despite its payment. We validated <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="TeX">$DTM^{2} $</tex-math></inline-formula> via a theoretical study using Markov chains and with a set of simulations in both urban and highway scenarios. Both theoretical and simulation results show that <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="TeX">$DTM^{2} $</tex-math></inline-formula> excludes from the network 100% of malicious nodes without causing any false-positive detection. Moreover, our solution guarantees a good ratio of reception, even in the presence of selfish nodes.

Handbook of Robust Low-Rank and Sparse Matrix Decomposition

Thierry Bouwmans, Necdet Serhat Aybat, El-Hadi Zahzah

201697doi:10.1201/b20190

Handbook of Robust Low-Rank and Sparse Matrix Decomposition: Applications in Image and Video Processing shows you how robust subspace learning and tracking by decomposition into low-rank and sparse matrices provide a suitable framework for computer vision applications. Incorporating both existing and new ideas, the book conveniently gives you one-stop access to a number of different decompositions, algorithms, implementations, and benchmarking techniques. Divided into five parts, the book begins with an overall introduction to robust principal component analysis (PCA) via decomposition into low-rank and sparse matrices. The second part addresses robust matrix factorization/completion problems while the third part focuses on robust online subspace estimation, learning, and tracking. Covering applications in image and video processing, the fourth part discusses image analysis, image denoising, motion saliency detection, video coding, key frame extraction, and hyperspectral video processing. The final part presents resources and applications in background/foreground separation for video surveillance. With contributions from leading teams around the world, this handbook provides a complete overview of the concepts, theories, algorithms, and applications related to robust low-rank and sparse matrix decompositions. It is designed for researchers, developers, and graduate students in computer vision, image and video processing, real-time architecture, machine learning, and data mining.

LRSLibrary: Low-Rank and Sparse Tools for Background Modeling and Subtraction in Videos

Andrews Sobral, Thierry Bouwmans, El-Hadi Zahzah

201694doi:10.1201/b20190-24

International audience

Online Stochastic Tensor Decomposition for Background Subtraction in Multispectral Video Sequences

Andrews Sobral, Sajid Javed, Soon Ki Jung, Thierry Bouwmans +1 more

201580doi:10.1109/iccvw.2015.125

International audience

Deep Statistical Analysis of OCR Errors for Effective Post-OCR Processing

Thi-Tuyet-Hai Nguyen, Adam Jatowt, Mickaël Coustaty, Nhu-Van Nguyen +1 more

201967doi:10.1109/jcdl.2019.00015

Post-OCR is an important processing step that follows optical character recognition (OCR) and is meant to improve the quality of OCR documents by detecting and correcting residual errors. This paper describes the results of a statistical analysis of OCR errors on four document collections. Five aspects related to general OCR errors are studied and compared with human-generated misspellings, including edit operations, length effects, erroneous character positions, real-word vs. non-word errors, and word boundaries. Based on the observations from the analysis we give several suggestions related to the design and implementation of effective OCR post-processing approaches.

Spatio-temporal Trajectory Analysis of Mobile Objects Following the same Itinerary

Laurent Étienne, Thomas Devogèle, Alain Bouju

2021· HAL (Le Centre pour la Communication Scientifique Directe)67

International audience

Background Subtraction via Superpixel-Based Online Matrix Decomposition with Structured Foreground Constraints

Sajid Javed, Seon Ho Oh, Andrews Sobral, Thierry Bouwmans +1 more

201560doi:10.1109/iccvw.2015.123

International audience

Double-constrained RPCA based on saliency maps for foreground detection in automated maritime surveillance

Andrews Sobral, Thierry Bouwmans, El-Hadi Zahzah

201560doi:10.1109/avss.2015.7301753

International audience

Impact of OCR Errors on the Use of Digital Libraries: Towards a Better Access to Information

Guillaume Chiron, Antoine Doucet, Mickaël Coustaty, Muriel Visani +1 more

201758doi:10.1109/jcdl.2017.7991582

Digital collections are increasingly used for a variety of purposes. In Europe only, we can conservatively estimate that tens of thousands of users consult digital libraries daily. The usages are often motivated by qualitative and quantitative research. However, caution must be advised as most digitized documents are indexed through their OCRed version, which is far from perfect, especially for ancient documents. In this paper, we aim to estimate the impact of OCR errors on the use of a major online platform: The Gallica digital library from the National Library of France. It accounts for more than 100M OCRed documents and receives 80M search queries every year. In this context, we introduce two main contributions. First, an original corpus of OCRed documents composed of 12M characters along with the corresponding gold standard is presented and provided, with an equal share of English- and French-written documents. Next, statistics on OCR errors have been computed thanks to a novel alignment method introduced in this paper. Making use of all the user queries submitted to the Gallica portal over 4 months, we take advantage of our error model to propose an indicator for predicting the relative risk that queried terms mismatch targeted resources due to OCR errors, underlining the critical extent to which OCR quality impacts on digital library access.

A Comparative Study for Classification of Skin Cancer

Tri Cong Pham, Giang Son Tran, Thi Phuong Nghiem, Antoine Doucet +2 more

201957doi:10.1109/icsse.2019.8823124

International audience

A System Based on Intrinsic Features for Fraudulent Document Detection

Romain Bertrand, Petra Gomez‐Krämer, Oriol Ramos Terrades, Patrick Franco +1 more

201356doi:10.1109/icdar.2013.29

Paper documents still represent a large amount of information supports used nowadays and may contain critical data. Even though official documents are secured with techniques such as printed patterns or artwork, paper documents suffer from a lack of security. However, the high availability of cheap scanning and printing hardware allows non-experts to easily create fake documents. As the use of a watermarking system added during the document production step is hardly possible, solutions have to be proposed to distinguish a genuine document from a forged one. In this paper, we present an automatic forgery detection method based on document's intrinsic features at character level. This method is based on the one hand on outlier character detection in a discriminant feature space and on the other hand on the detection of strictly similar characters. Therefore, a feature set is computed for all characters. Then, based on a distance between characters of the same class, the character is classified as a genuine one or a fake one.

New binary linear programming formulation to compute the graph edit distance

Julien Lerouge, Zeina Abu-Aisheh, Romain Raveaux, Pierre Héroux +1 more

2017· Pattern Recognition51doi:10.1016/j.patcog.2017.07.029

In this paper, a new binary linear programming formulation for computing the exact Graph Edit Distance (GED) between two graphs is proposed. A fundamental strength of the formulations lies in their genericity since the GED can be computed between directed or undirected fully attributed graphs. Moreover, a continuous relaxation of the domain constraints in the formulation provides an efficient lower bound approximation of the GED. A complete experimental study that compares the proposed formulations with six state-of-the-art algorithms is provided. By considering both the accuracy of the proposed solution and the efficiency of the algorithms as performance criteria, the results show that none of the compared methods dominate the others in the Pareto sense. In general, our formulation converges faster to optimality while being able to scale up to match the largest graphs in our experiments. The relaxed formulation leads to an accurate approach that is 12% more accurate than the best approximate method of our benchmark.

Comic Characters Detection Using Deep Learning

Nhu-Van Nguyen, Christophe Rigaud, Jean-Christophe Burie

201750doi:10.1109/icdar.2017.290

Comic characters detection has been an interesting area in comic analysis as it not only allows more efficient indexation and retrieval for comic books but also yields an adequate understanding of comics so as to help better creating the digital form of comic books. In recent years, several methods that have been proposed to extract/detect characters from comics, have given reasonable performance. However, they always use their datasets to evaluate the methods without comparing with other works or experimenting on a standard dataset. In this work, we take advantage of the recent and significant development of deep learning to apply it to comic character detection. We use the latest object detection deep networks to train the comic characters detector based on our proposed dataset. By experimenting on our proposed dataset and also on available datasets from previous works, we have found that this method significantly outperforms existing methods. We believe that this state-of-the-art approach can be considered as a reliable baseline method to compare and better understand future detection techniques.

Simple Triplet Loss Based on Intra/Inter-Class Metric Learning for Face Verification

Zuheng Ming, Joseph Chazalon, Muhammad Muzzamil Luqman, Muriel Visani +1 more

201748doi:10.1109/iccvw.2017.194

International audience

Neural Machine Translation with BERT for Post-OCR Error Detection and Correction

Thi Tuyet Haï Nguyen, Adam Jatowt, Nhu-Van Nguyen, Mickaël Coustaty +1 more

202047doi:10.1145/3383583.3398605

The quality of OCR has a direct impact on information access, and an indirect impact on the performance of natural language processing applications, making fine-grained (e.g., semantic) information access even harder. This work proposes a novel post-OCR approach based on a contextual language model and neural machine translation, aiming to improve the quality of OCRed text by detecting and rectifying erroneous tokens. This new technique obtains results comparable to the best-performing approaches on English datasets of the competition on post-OCR text correction in ICDAR 2017/2019.

Local Binary Patterns for Document Forgery Detection

Francisco Cruz, Nicolas Sidère, Mickaël Coustaty, Vincent Poulain d’Andecy +1 more

201747doi:10.1109/icdar.2017.202

Document forgery is an increasing problem for both the public administration and private companies. It represents substantial losses in time and economical resources. Classical solutions to this problem such as watermarks or other integrated security patterns can not be applied in general for any unknown incoming document due to the large variability on types of documents. In that scenario it is important to resort to forensic techniques to seek and analyze inconsistencies on the intrinsic features of the document image. In this paper we present a classification-based approach for forgery detection. We use uniform Local Binary Patterns (LBP) to capture discriminant texture features that are common on forged regions. Besides, we combine multiple descriptors from neighboring regions to model contextual information. Results using Support Vector Machines (SVM) for patch classification show that we are able to detect several types of forgeries in a wide range of types of documents.

An Analysis of the Performance of Named Entity Recognition over OCRed Documents

Ahmed Hamdi, Axel Jean-Caurant, Nicolas Sidère, Mickaël Coustaty +1 more

201944doi:10.1109/jcdl.2019.00057

The use of digital libraries requires an easy accessibility to documents which is strongly impacted by the quality of document indexing. Named entities are among the most important information to index digital documents. According to a recent study, 80% of the top 500 queries sent to a digital library portal contained at least one named entity. However most digitized documents are indexed through their OCRed version which includes numerous errors that may hinder the access to them. Named Entity Recognition (NER) is the task that aims to locate important names in a given text and to categorize them into a set of predefined classes (person, location, organization). This paper aims to estimate the performance of NER systems through OCRed data. It exhaustively iscusses NER errors arising from OCR errors; we studied the correlation between NER accuracy and OCR error rates and estimated the cost of character insertion, deletion and substitution in named entities. Results show that even if he OCR engine does contaminate named entities with errors, NER systems can overcome this issue and correctly recognize some of them.

A history of eye gaze tracking

Abdallahi Ould Mohamed, Matthieu Perreira Da Silva, Vincent Courboulay

2007· HAL (Le Centre pour la Communication Scientifique Directe)39

Rapport Interne

Search all NobleBlocks papers mentioning “Laboratoire Informatique, Image et Interaction (L3i)” →