OK, I think the bookmarklet/text scraper needs a little tweaking. I just tried Pocket's bookmarklet vs. Instapaper's on the "How Microsoft Fought True Open Standards" article that is also on the Hacker News front page. Pocket misidentified the headline, picking up the name of the blog instead. In fact, the title of the article/blog post doesn't appear anywhere in Pocket's scrape.
Link to the article: http://blogs.computerworlduk.com/open-enterprise/2012/04/how...
Edit: It also didn't do well with Bruce Schneier's Crypto-Gram Newsletter at http://www.schneier.com/crypto-gram-1204.html
In both of these instances they simply needed to get the content of the <title> tag in the HTML. I would think (careful, here be dragons!) that more often than not the <title> tag should be a reliable way of getting the title of the story.
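To make the <title> idea concrete, here is a rough sketch in Python using only the standard library's html.parser. I have no idea how Pocket's scraper actually works, so the names TitleParser and extract_title are made up for illustration, and a real scraper would presumably also need to deal with encodings, missing titles, and sites that pad the title with the blog name.

```python
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Hypothetical helper: collects the text of the first <title> element."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self._done = False
        self._parts = []

    def handle_starttag(self, tag, attrs):
        # Only the first <title> counts; ignore any later ones.
        if tag == "title" and not self._done:
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._done = True

    def handle_data(self, data):
        # Text may arrive in several chunks, so accumulate it.
        if self._in_title:
            self._parts.append(data)

    @property
    def title(self):
        # Collapse runs of whitespace and newlines into single spaces.
        text = " ".join("".join(self._parts).split())
        return text or None


def extract_title(html):
    """Return the page's <title> text, or None if there isn't one."""
    parser = TitleParser()
    parser.feed(html)
    parser.close()
    return parser.title
```

Calling extract_title(page_source) on the fetched HTML would give a candidate headline, and returning None when there is no <title> leaves room to fall back to whatever heuristic the scraper uses today.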