burathar
5 years ago
1 changed files with 26 additions and 0 deletions
@ -0,0 +1,26 @@ |
|||||||
|
# Invoice-Extractor |
||||||
|
This is a purpose-built project to extract the tabluar data from Dutch KPN Mobile invoices. |
||||||
|
It is heavily relying on the [PDFQuery](https://github.com/jcushman/pdfquery) package. |
||||||
|
|
||||||
|
## Compatibility |
||||||
|
This package has been tested on invoices dating from 2016 until 2019, but will probably work with older and more recent invoices. |
||||||
|
|
||||||
|
## Installation |
||||||
|
At this moment this package does not exist in PyPI, and has to be put into the python package directory manually. For windows this is usually `C:Program Files (x86)\Python<version>\Lib\` |
||||||
|
|
||||||
|
## Usage |
||||||
|
|
||||||
|
It is possible to extract the data from just one pdf file, or a directory containing compatable pdf files. |
||||||
|
|
||||||
|
### Usage example |
||||||
|
```python3 |
||||||
|
import invoice_extractor |
||||||
|
|
||||||
|
pdf_file = <somefilepath> |
||||||
|
pfd_directory = <somefilepath> |
||||||
|
output_directory = <somepath> |
||||||
|
|
||||||
|
|
||||||
|
invoice_extractor.extract(pfd_file, output_directory) |
||||||
|
invoice_extractor.extract(pfd_directory, output_directory) |
||||||
|
``` |
Loading…
Reference in new issue