Commit Graph

  • 49205462a3 Fix bug, missing __init__.py in demo main Eric Ihli 2020-12-28 13:58:29 -0800
  • f77c6854f8 Add requests as dependency for demo Eric Ihli 2020-10-19 11:47:55 -0700
  • 8be7972bc7 Add 0.2.3 to dist Eric Ihli 2020-10-19 11:30:26 -0700
  • b7c6034331 Update README and add README.txt to setup.py Eric Ihli 2020-10-19 11:29:31 -0700
  • f404b6e673 Update README Eric Ihli 2020-10-19 09:43:16 -0700
  • 1aae198e39 Update readme and add dist for 0.2.2 Eric Ihli 2020-10-19 09:35:47 -0700
  • 248fc827cc Add README and demo Eric Ihli 2020-10-19 09:27:24 -0700
  • 7e5516eb5d Update README in setup.py Eric Ihli 2020-10-17 05:36:40 -0700
  • 5a34d0a845 Whitespace change in README Eric Ihli 2020-10-17 05:27:34 -0700
  • df50db1fbd Strip whitespace when reading ocr for csv Eric Ihli 2020-10-15 10:39:26 -0700
  • 01406752d4 Update docs to describe ocr_image defaults Eric Ihli 2020-10-14 21:33:03 -0700
  • 3b31888a55 Include tesseract traineddata files Eric Ihli 2020-10-14 21:07:25 -0700
  • 7b103723af Allow tesseract params to be passed into OSD Eric Ihli 2020-04-28 08:50:34 -0700
  • bc32d59253 Add tip for quickly creating training data Eric Ihli 2020-04-28 08:49:55 -0700
  • 7ad4c0d4dc Fix bug relating to directory of pdf Eric Ihli 2020-04-28 08:49:18 -0700
  • 449ee015d3 Update license and setup.py Eric Ihli 2020-04-27 11:47:48 -0700
  • 1156eafc5c Return sorted image paths from pdf_to_images Eric Ihli 2020-04-27 10:04:55 -0700
  • 99beaaa2d1 Make ocr_image return/print path of text file Eric Ihli 2020-04-26 18:29:04 -0700
  • 6359b86e42 Move main of extract_cells to __init__.py Eric Ihli 2020-04-25 15:27:57 -0700
  • 962abb7a02 Move `main` to __init__ for extract_tables Eric Ihli 2020-04-25 15:19:46 -0700
  • 85f864cd17 Return value from main rather than print Eric Ihli 2020-04-25 15:13:01 -0700
  • 37483148c8 Fix typo Eric Ihli 2020-04-25 13:00:27 -0700
  • 0ac2e885c1 Fix link to documentation Eric Ihli 2020-04-25 12:58:48 -0700
  • 8e9bc0e0a0 Fix typo Eric Ihli 2020-04-25 12:32:14 -0700
  • 075e265d05 Add README Eric Ihli 2020-04-25 12:30:53 -0700
  • 0420f97bd6 Update exported html Eric Ihli 2020-04-25 12:21:10 -0700
  • eb4e3d81b7 Clarify and expand on content around code Eric Ihli 2020-04-25 12:18:14 -0700
  • 6891fc9990 Add example image and csv output Eric Ihli 2020-04-24 12:52:05 -0700
  • 4eca593944 Remove unused files, finish refactor of structure Eric Ihli 2020-04-24 11:41:11 -0700
  • b911f87126 Refactor extract_cells into module Eric Ihli 2020-04-24 10:32:15 -0700
  • b9f088cf92 Refactor table extraction into module Eric Ihli 2020-04-24 10:02:44 -0700
  • 98ef6ffd85 Refactor utilities to modules Eric Ihli 2020-04-24 09:27:56 -0700
  • bea192678e Fix bug picking up noise in detecting contours Eric Ihli 2020-04-24 08:10:46 -0700
  • 54511b9a1f Fix bugs and improve accuracy Eric Ihli 2020-04-23 20:03:43 -0700
  • aa900de4e7 Use cleaner filenames for intermediate files Eric Ihli 2020-04-23 09:24:44 -0700
  • e49fffa5a7 Add module for outputting csv from parsed table Eric Ihli 2020-04-14 10:42:58 -0700
  • de398f73c2 Add ocr_image module Eric Ihli 2020-04-14 08:06:30 -0700
  • f77425fd9e Remove misnamed module Eric Ihli 2020-04-14 08:06:21 -0700
  • 96497d7327 Add doc for shell script to parse text from table Eric Ihli 2020-04-14 08:05:42 -0700
  • 32c62fd773 Add script to ocr individual cells Eric Ihli 2020-04-11 18:48:17 -0700
  • 396782051e Remove egg info from git tracking Eric Ihli 2020-04-11 18:14:16 -0700
  • 78e9cdb3f5 Add gitignore, rename modules, remove unused code Eric Ihli 2020-04-11 18:11:24 -0700
  • 8546902e64 Fix bug, html-image-size helper had no results Eric Ihli 2020-04-10 14:11:14 -0700
  • 28bcdbd4f7 Initial commit Eric Ihli 2020-04-10 13:52:29 -0700