Extract content #1

tcr · 2012-12-27T18:59:26Z

Using ps2ascii content can be extracted in the form of text, images, and fills. Probably need to align this information onto a grid with a certain fineness:

Spindrift
.extract([grid fineness]) // grid fineness creates 

Group
.commands() => [<command>, <command>]
.bound(l, b, r, t) => group()
.rows() => [<group>, <group>] // tries to automatically determine "rows" of elements
.columns() => [<group>, <group>] // tries to automatically determine "columns" of elements
.text() // plaintext
.images() // images

The text was updated successfully, but these errors were encountered:

Fixing missing 'end' event in the steram API

Add API for getting number of pages in a PDF

Merging "get num pages" from another fork

Update README.md because page() actually does not exist

tcr closed this as completed Dec 28, 2012

tcr reopened this Dec 28, 2012

cboulanger pushed a commit that referenced this issue May 14, 2017

Merge pull request #1 from MindXco/master

2e53f91

Fixing missing 'end' event in the steram API

cboulanger pushed a commit that referenced this issue May 14, 2017

Merge pull request #1 from picnichealth/get-num-pages

6859ba5

Add API for getting number of pages in a PDF

cboulanger pushed a commit that referenced this issue May 14, 2017

Merge pull request #1 from picnichealth/master

d3f78c6

Merging "get num pages" from another fork

cboulanger pushed a commit that referenced this issue Oct 25, 2017

Merge pull request #1 from mfkenson/fix_misleading_readme

84f7aad

Update README.md because page() actually does not exist

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extract content #1

Extract content #1

tcr commented Dec 27, 2012

Extract content #1

Extract content #1

Comments

tcr commented Dec 27, 2012