Batch OCR for many PDF files (not already OCRed)? [closed]

I use Google Desktop Search (I'm on Vista), and not all of the PDF files in my archive folder are recognized. That is expected, because "PDF files containing scanned images" are not indexed ( http://desktop.google.com/support/bin/answer.py?hl=en&answer=90651 ).

So I would like to OCR the many PDF files of mine that have not been OCRed yet. My goal: I give the program a folder, and it searches the subfolders on its own for the PDF files that need to be converted into OCRed PDF files.
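To make the goal concrete, here is a rough Python sketch of just the folder-scanning part. It is only an illustration under stated assumptions: the function names are made up for this example, and the test for "not already OCRed" is a crude guess (a PDF whose raw bytes never mention `/Font` probably contains only scanned images). That guess can be wrong, for instance when font references sit inside compressed object streams. The OCR step itself is not shown.

```python
import os
from pathlib import Path


def has_text_layer(pdf_path: Path) -> bool:
    """Crude heuristic (an assumption, not a reliable test): a PDF with
    no /Font reference in its raw bytes probably has no text layer."""
    return b"/Font" in pdf_path.read_bytes()


def find_pdfs_needing_ocr(root: str) -> list:
    """Walk root and all of its subfolders and return the PDF files
    that appear to lack a text layer."""
    candidates = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            # Match the extension case-insensitively (.pdf, .PDF, ...).
            if name.lower().endswith(".pdf"):
                path = Path(dirpath) / name
                if not has_text_layer(path):
                    candidates.append(path)
    return candidates
```

Each path returned would then be handed to whatever PDF-to-PDF OCR tool ends up being used.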

Note: in the past, when a PDF file was password-protected, I removed the password with another (paid) batch tool: verypdf.com "pwdremover" http://www.verypdf.com/pwdremover/

Any ideas (nothing too expensive)?

What I have already tried: FineReader 6 Pro on XP back in the day, but it had no batch processor included... Paperfile (paperfile.net), which uses Tesseract ( http://code.google.com/p/tesseract-ocr/ ). But its OCR only goes from PDF to text, not from PDF to PDF! There is also another project: http://code.google.com/p/ocropus/

Thanks in advance ;)

ocr pdf
