FOSDEM '10 is a free and non-commercial event organized by the community, for the community. Its goal is to provide Free and Open Source developers a place to meet. No registration necessary.

   
Speakers
Joaquim Rocha
Schedule
Day Saturday
Room H.1309
Start time 17:15
End time 18:00
Duration 00:45
Info
Event type Podium
Track GNOME
Language English
OCRFeeder

OCRFeeder is a document layout analysis and optical character recognition system that I wrote for my Master's Thesis project.

Like it says on its website, given the images it will automatically outline its contents, distinguish between what's graphics and text and perform OCR over the latter. It generates multiple formats being its main one ODT.

I think this is currently the most complete and user friendly OCR application for GNU/Linux out there and, of course, I wrote it to be used mainly with GNOME, featuring a GUI written in PyGTK and respecting, as far as I could, the GNOME User Interface Guidelines.

I would like to present how the application works on the inside, for example the page segmentation algorithm I created for it, etc. I think this would be interest for the GNOME community and general attendants of the GNOME Dev room at FOSDEM.