Parser #8

venkatr123 · 2018-07-17T07:46:07Z

For resumes in different formats (doc, pdf, etc.) I am getting the errors below:

{ Error: Error for type: [[ application/pdf ]], file: [[ d:\code4goal-resume-parser-master/public/2016_JRM_Resume.pdf ]], extractor for type exists, but failed to initialize. Message: INFO: 'pdftotext' does not appear to be installed, so textract will be unable to extract PDFs.
at extract (d:\code4goal-resume-parser-master\node_modules\textract\lib\extract.js:147:15)
at Timeout._onTimeout (d:\code4goal-resume-parser-master\node_modules\textract\lib\extract.js:155:7)
at ontimeout (timers.js:466:11)
at tryOnTimeout (timers.js:304:5)
at Timer.listOnTimeout (timers.js:267:5) typeNotFound: true }
Error: antiword read of file named [[ Abhilash_Reddy - Copy.doc ]] failed: Error: Command failed: antiword -m UTF-8.txt "d:\code4goal-resume-parser-master/public/Abhilash_Reddy - Copy.doc"
'antiword' is not recognized as an internal or external command,
operable program or batch file.

Is there an extractor that handles all formats?

likerRr · 2018-07-19T21:25:27Z

Not sure. Moreover, some of the extractors could be out of date, because code4goal-resume-parser itself hasn't received any updates for a long time. Sorry.

nrsharma11 · 2019-08-07T08:45:33Z

I have resolved the issue related to "pdftotext", but I am still facing the "antiword read of file named" error.
Can you please suggest a fix or help in this regard?
Thanks in advance.

likerRr · 2019-08-08T07:27:41Z

Great to hear! Can you send a PR with a fix?
Do you have any issues with other doc files? Or only with that one?

nrsharma11 · 2019-08-08T07:35:30Z

To resolve the "pdftotext" error, I downloaded the Xpdf tools from here and copied pdftotext.exe into the Windows folder.

Yes, I am facing the issue with all .doc files; it keeps saying "antiword read of file named". Interestingly, when I save the same file with the ".docx" extension, it is processed and I get the results. So maybe it has something to do with .doc files.
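
To illustrate why .doc and .docx behave differently here, below is a minimal sketch of which external tool each format appears to need. The mapping is an assumption based on the error messages in this thread (pdftotext for PDF, antiword for .doc) and on .docx being processed without any extra tool; `REQUIRED_TOOL` and `requiredTool` are hypothetical names, not part of textract or the parser.

```javascript
const path = require('path');

// Assumed mapping, inferred from the errors in this thread:
// PDF needs pdftotext, .doc needs antiword, .docx needs no external tool.
const REQUIRED_TOOL = {
  '.pdf': 'pdftotext',
  '.doc': 'antiword',
  '.docx': null,
};

// Hypothetical helper: returns the external tool a file needs,
// null if none is needed, or undefined if the extension is not in the table.
function requiredTool(fileName) {
  const ext = path.extname(fileName).toLowerCase();
  return REQUIRED_TOOL[ext];
}

console.log(requiredTool('2016_JRM_Resume.pdf'));        // pdftotext
console.log(requiredTool('Abhilash_Reddy - Copy.doc'));  // antiword
console.log(requiredTool('Abhilash_Reddy - Copy.docx')); // null
```

If this mapping holds, the .doc failures would go away once antiword is installed and reachable, the same way the PDF error went away once pdftotext.exe was.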

I am also interested in getting the LinkedIn profile from a public profile URL. I tried, but I am not getting results; it shows blank nodes in the JSON. Please have a look below:
"linkedin": { "positions": { "past": [ ], "current": { "title": "", "company": "", "description": "", "period": "" } }, "languages": [ ], "skills": [ ], "educations": [ ], "volunteering": [ ], "volunteeringOpportunities": [ ] }

likerRr · 2019-08-08T08:18:42Z

Since the parser was made, LinkedIn's HTML or API could have changed. Sorry, I don't support this project for now and can't have a look. There is a fork of my project with lots of issues fixed. Can you try it and see if your issues are fixed?

nrsharma11 · 2019-08-08T09:25:26Z

Okay. Thanks for your reply.

fadiajabeen · 2020-02-07T08:44:45Z

I placed pdftotext.exe in the root folder, i.e. the folder app.js is run from, and now it's working.
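
Placing the .exe next to app.js probably works because, on Windows, the current directory is searched before the directories on PATH when a command is launched. A minimal sketch for checking this before parsing anything follows; `findExecutable` and its options are hypothetical and not part of the parser or textract.

```javascript
const fs = require('fs');
const path = require('path');

// Hypothetical helper: look for an executable roughly the way cmd.exe does,
// i.e. in the current directory first, then in each directory on PATH.
// Returns the full path of the first match, or null if nothing is found.
function findExecutable(name, options = {}) {
  const cwd = options.cwd || process.cwd();
  const pathVar = options.pathVar !== undefined ? options.pathVar : (process.env.PATH || '');
  const exts = options.exts ||
    (process.platform === 'win32' ? ['.exe', '.cmd', '.bat', ''] : ['']);
  const dirs = [cwd, ...pathVar.split(path.delimiter).filter(Boolean)];

  for (const dir of dirs) {
    for (const ext of exts) {
      const candidate = path.join(dir, name + ext);
      if (fs.existsSync(candidate) && fs.statSync(candidate).isFile()) {
        return candidate;
      }
    }
  }
  return null;
}

// Report the two tools named in the error messages above.
for (const tool of ['pdftotext', 'antiword']) {
  const found = findExecutable(tool);
  console.log(tool + ': ' + (found || 'not found'));
}
```

Running this from the same folder as app.js should show whether each tool is reachable. If antiword is reported as not found, that would match the "'antiword' is not recognized" error for .doc files.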
