Comments (6)
Hi @modesty - how do I install the package at this particular commit?
from pdf2json.
please test with 2b47fcd
from pdf2json.
seems like a new version with the fix needs to be published to npm
from pdf2json.
I've worked around this by creating file src/@types/pdf2json/index.d.ts
with the below contents:
Contents
// Copied and adapted from https://github.com/modesty/pdf2json/blob/master/pdfparser.d.ts
declare module 'pdf2json' {
declare class PDFParser extends EventEmitter {
constructor(context?: unknown, needRawText?: number, password?: string);
parseBuffer(buffer: Buffer, verbosity?: number): void;
loadPDF(pdfFilePath: string, verbosity?: number): Promise<void>;
createParserStream(): ParserStream;
getRawTextContent(): string;
on<K extends keyof EventMap>(eventName: K, listener: EventMap[K]): this;
}
type EventMap = {
pdfParser_dataError: (errMsg: Record<'parserError', Error>) => void;
pdfParser_dataReady: (pdfData: Output) => void;
readable: (meta: Output['Meta']) => void;
data: (data: Output['Pages'][number] | null) => void;
};
declare class ParserStream {
//TODO
}
interface Output {
Transcoder: string;
Meta: Record<string, unknown>;
Pages: Page[];
}
interface Page {
Width: number;
Height: number;
HLines: Line[];
VLines: Line[];
Fills: Fill[];
Texts: Text[];
Fields: Field[];
Boxsets: Boxset[];
}
interface Fill {
x: number;
y: number;
w: number;
h: number;
oc?: string;
clr?: number;
}
interface Line {
x: number;
y: number;
w: number;
l: number;
oc?: string;
clr?: number;
}
interface Text {
x: number;
y: number;
w: number;
sw: number;
A: 'left' | 'center' | 'right';
R: TextRun[];
oc?: string;
clr?: number;
}
interface TextRun {
T: string;
S: number;
TS: [number, number, 0 | 1, 0 | 1];
RA?: number;
}
interface Boxset {
boxes: Box[];
id: {
Id: string;
EN?: number;
};
}
interface Field {
id: {
Id: string;
EN?: number;
};
style: number;
TI: number;
AM: number;
TU: string;
x: number;
y: number;
w: number;
h: number;
T: {
Name: 'alpha' | 'link';
TypeInfo: object;
};
}
interface Box {
x: number;
y: number;
w: number;
h: number;
oc?: string;
clr?: number;
}
interface Box {
id: {
Id: string;
EN?: number;
};
T: {
Name: string;
TypeInfo?: object;
};
x: number;
y: number;
w: number;
h: number;
TI: number;
AM: number;
checked?: boolean;
style: number;
}
export default PDFParser;
}
from pdf2json.
Hello.
When the fix will be published?
from pdf2json.
published in 3.1.2
from pdf2json.
Related Issues (20)
- Node.js Server got stuck when parsing specific PDF while it is working for other PDFs HOT 2
- fields with periods are truncated HOT 1
- TypeError: pdf2json_1.default is not a constructor HOT 3
- The interface for `Line` is missing the `l` property HOT 1
- ENOENT: no such file or directory - util.js HOT 10
- How to detect the HLines correctly?
- Without a ToUnicode CMap, PDF viewers can't map glyphs to Unicode values -> rely on pdf.js?? HOT 2
- Characters coming as NULL HOT 1
- Is the auto-merge broken text blocks capability active in the last stable version (3.0.5)? HOT 3
- Cloudflare Worker issue with Could not resolve "fs/promises" HOT 5
- something is colossaly f***** up in the exports for version 3.1.0 HOT 2
- ENOENT: no such file or directory, open '/var/task/../package.json' HOT 2
- no such file or directory - pkinfo.js HOT 1
- PDF parse, edit and retain HOT 1
- Cannot compile project with 3.1.3 HOT 5
- All PDF Parsing fails after upgrading from v3.1.2 to v3.1.3
- unexpected space HOT 7
- Parser NO SPACE - "pdf2json": "^3.1.3", HOT 1
- FATAL ERROR: JavaScript heap out of memory HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pdf2json.