sites that failed to parse correctly #3

Open
mitechie opened this issue May 7, 2012 · 11 comments

mitechie commented May 7, 2012

Leaving this open, as we hope to keep adding sites that fail to parse along with the bug fixes that correct them.

http://shrinik.blogspot.com/2012/05/brief-introduction-of-test-automation.html


mitechie commented May 7, 2012

http://eli.thegreenplace.net/2012/05/07/automating-boring-testing-activities-with-tox/

The conversion seems to mess up the rendering hard core.


mitechie commented May 7, 2012

http://www.osnews.com/story/25916/The_Android_Sooner_the_ill-fated_prototype_killed_by_the_iPhone

Shouldn't get the extra content at the bottom with comments/etc.


mitechie commented May 7, 2012

http://www.markshuttleworth.com/archives/1126

Shouldn't get the comment form at the bottom; it's debatable whether the comments themselves should come along for the ride.


mitechie commented May 9, 2012


mitechie commented May 9, 2012

http://www.antipope.org/charlie/blog-static/2012/05/bubble-20.html

pulls a comment as the post content

Now just pulls in the whole page including all comments.


mitechie commented

http://newartisans.com/2011/04/letter-to-the-fsf/

No content

  • Working in 0.1.10


mitechie commented Jul 2, 2012


craigmaloney commented

The L.A. Times article appears to be parsing: http://r.bmark.us/u/2cabf797b26f4a

mitechie commented

Thanks for checking +craigmaloney, but if you look, there are 4 paragraphs of text in the article and only one makes it into the readable parsed version. I guess I should update that to make it clearer that it parses, but does it poorly.


2 participants