Thought I'd jump in here. Yes, parsing sentences are actually quite difficult.
Disclaimer: I am currently working on a text summarization startup, http://Summary.io
There are a few different approaches to parsing sentences, and only a few giant NLP libraries. What I have found the best in 95% of use cases to to write custom (RegEx) rules.
Attempting a sentence such as:
And then Mr. Bean (http://www.mrbean.com/?index) said to Col. Sanders, "Holy moly sentence extraction is hard!"
You have lots of little things like making sure you take the full quote and disregard periods in surnames, http links, etc. Parsing just by periods, question marks, and exclamtion points are going to lead to a lot of problems.
Disclaimer: I am currently working on a text summarization startup, http://Summary.io
There are a few different approaches to parsing sentences, and only a few giant NLP libraries. What I have found the best in 95% of use cases to to write custom (RegEx) rules.
Attempting a sentence such as: And then Mr. Bean (http://www.mrbean.com/?index) said to Col. Sanders, "Holy moly sentence extraction is hard!"
You have lots of little things like making sure you take the full quote and disregard periods in surnames, http links, etc. Parsing just by periods, question marks, and exclamtion points are going to lead to a lot of problems.